Deep Learning and Explainable Artificial Intelligence

A special issue of Computers (ISSN 2073-431X).

Deadline for manuscript submissions: 30 September 2024 | Viewed by 11075

Special Issue Editor


E-Mail Website
Guest Editor
School of Mechanical Engineering, Purdue University, 585 Purdue Mall, West Lafayette, IN 47907, USA
Interests: predictive maintenance; heath monitoring for ground and aerial vehicles; data analytics; AI; innovation; nonlinear systems analysis and synthesis; adaptation; estimation; filtering; control; general artificial intelligence
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Breakthroughs in 'deep learning' via use of intermediate features in multilayer 'neural networks' and generative adversarial networks using neural networks as generative and discriminative models combined with the massive increase in computing power of GPU chips have resulted in the widespread popularity and use of 'artificial intelligence' in the past decade. The apostrophes in the previous sentence are inserted on purpose to remind the reader that learning, in the biological sense, that improves survival outcomes via biological nervous systems or intelligent decisions improving energy and resource availability are far away from what current software can hope to achieve. The purpose of this Special Issue is to bridge this gap: to develop explanations and an understanding of functioning AI/ML methods, and to develop AI/ML methods that generate outcomes with predictable properties when fed with data satisfying certain conditions.

Thus, it is hoped that the Special Issue will stimulate AI that will increase efficiencies while not compromising safety, trust, fairness, predictability, and reliability when applied to systems with large energy use such as power, water, transport, or financial grids, law and government policy. As a first step towards this goal of transparency of AI algorithms, we seek papers that document the methods so that:

  1. The results are reproducible, at least in the statistical sense;
  2. Algorithms are provided in a common language of sequences of vector matrix algebra operations, which also underlies much deep learning;
  3. Conditions satisfied by data inputs, objective functions of optimization or curve fitting are explicitly listed;
  4. The propagation of data uncertainty to algorithmic outcomes is documented through sensitivity analysis or Monte Carlo simulations.

Potential issues of interest include the following: while there is no repeatability in general in the training of weights in deep learning or most neural networks, there is repeatability in approximating functions or decision boundaries for similar sets of input data. Such results also exist in adaptive control where there is asymptotic tracking without the convergence of parameter estimates. Similarly, a ChatGPT-like AI needs to maintain the consistency of its conclusions, provided the inputs remain consistent. The use of AI in the law can have, for example, quantifiable goals such as the prompt compensation of the victim and long-term reformation of the criminal to higher levels of productivity rather than classical legal outcomes of punishment or retribution, which are subjective. Can a chess or GO GAN handle some level of randomness in the rules of the game? 

Dr. Kartik B. Ariyur
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Computers is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 1800 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Published Papers (7 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

Jump to: Review

34 pages, 7324 KiB  
Article
The Explainability of Transformers: Current Status and Directions
by Paolo Fantozzi and Maurizio Naldi
Computers 2024, 13(4), 92; https://doi.org/10.3390/computers13040092 - 04 Apr 2024
Viewed by 640
Abstract
An increasing demand for model explainability has accompanied the widespread adoption of transformers in various fields of applications. In this paper, we conduct a survey of the existing literature on the explainability of transformers. We provide a taxonomy of methods based on the [...] Read more.
An increasing demand for model explainability has accompanied the widespread adoption of transformers in various fields of applications. In this paper, we conduct a survey of the existing literature on the explainability of transformers. We provide a taxonomy of methods based on the combination of transformer components that are leveraged to arrive at the explanation. For each method, we describe its mechanism and survey its applications. We find out that attention-based methods, both alone and in conjunction with activation-based and gradient-based methods, are the most employed ones. A growing attention is also devoted to the deployment of visualization techniques to help the explanation process. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

22 pages, 1949 KiB  
Article
A Low-Cost Deep-Learning-Based System for Grading Cashew Nuts
by Van-Nam Pham, Quang-Huy Do Ba, Duc-Anh Tran Le, Quang-Minh Nguyen, Dinh Do Van and Linh Nguyen
Computers 2024, 13(3), 71; https://doi.org/10.3390/computers13030071 - 08 Mar 2024
Viewed by 958
Abstract
Most of the cashew nuts in the world are produced in the developing countries. Hence, there is a need to have a low-cost system to automatically grade cashew nuts, especially in small-scale farms, to improve mechanization and automation in agriculture, helping reduce the [...] Read more.
Most of the cashew nuts in the world are produced in the developing countries. Hence, there is a need to have a low-cost system to automatically grade cashew nuts, especially in small-scale farms, to improve mechanization and automation in agriculture, helping reduce the price of the products. To address this issue, in this work we first propose a low-cost grading system for cashew nuts by using the off-the-shelf equipment. The most important but complicated part of the system is its “eye”, which is required to detect and classify the nuts into different grades. To this end, we propose to exploit advantages of both the YOLOv8 and Transformer models and combine them in one single model. More specifically, we develop a module called SC3T that can be employed to integrate into the backbone of the YOLOv8 architecture. In the SC3T module, a Transformer block is dexterously integrated into along with the C3TR module. More importantly, the classifier is not only efficient but also compact, which can be implemented in an embedded device of our developed cashew nut grading system. The proposed classifier, called the YOLOv8–Transformer model, can enable our developed grading system, through a low-cost camera, to correctly detect and accurately classify the cashew nuts into four quality grades. In our grading system, we also developed an actuation mechanism to efficiently sort the nuts according to the classification results, getting the products ready for packaging. To verify the effectiveness of the proposed classifier, we collected a dataset from our sorting system, and trained and tested the model. The obtained results demonstrate that our proposed approach outperforms all the baseline methods given the collected image data. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

22 pages, 6565 KiB  
Article
Bus Driver Head Position Detection Using Capsule Networks under Dynamic Driving Conditions
by János Hollósi, Áron Ballagi, Gábor Kovács, Szabolcs Fischer and Viktor Nagy
Computers 2024, 13(3), 66; https://doi.org/10.3390/computers13030066 - 03 Mar 2024
Viewed by 945
Abstract
Monitoring bus driver behavior and posture in urban public transport’s dynamic and unpredictable environment requires robust real-time analytics systems. Traditional camera-based systems that use computer vision techniques for facial recognition are foundational. However, they often struggle with real-world challenges such as sudden driver [...] Read more.
Monitoring bus driver behavior and posture in urban public transport’s dynamic and unpredictable environment requires robust real-time analytics systems. Traditional camera-based systems that use computer vision techniques for facial recognition are foundational. However, they often struggle with real-world challenges such as sudden driver movements, active driver–passenger interactions, variations in lighting, and physical obstructions. Our investigation covers four different neural network architectures, including two variations of convolutional neural networks (CNNs) that form the comparative baseline. The capsule network (CapsNet) developed by our team has been shown to be superior in terms of efficiency and speed in facial recognition tasks compared to traditional models. It offers a new approach for rapidly and accurately detecting a driver’s head position within the wide-angled view of the bus driver’s cabin. This research demonstrates the potential of CapsNets in driver head and face detection and lays the foundation for integrating CapsNet-based solutions into real-time monitoring systems to enhance public transportation safety protocols. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

19 pages, 1275 KiB  
Article
Leveraging Positive-Unlabeled Learning for Enhanced Black Spot Accident Identification on Greek Road Networks
by Vasileios Sevetlidis, George Pavlidis, Spyridon G. Mouroutsos and Antonios Gasteratos
Computers 2024, 13(2), 49; https://doi.org/10.3390/computers13020049 - 08 Feb 2024
Viewed by 1386
Abstract
Identifying accidents in road black spots is crucial for improving road safety. Traditional methodologies, although insightful, often struggle with the complexities of imbalanced datasets, while machine learning (ML) techniques have shown promise, our previous work revealed that supervised learning (SL) methods face challenges [...] Read more.
Identifying accidents in road black spots is crucial for improving road safety. Traditional methodologies, although insightful, often struggle with the complexities of imbalanced datasets, while machine learning (ML) techniques have shown promise, our previous work revealed that supervised learning (SL) methods face challenges in effectively distinguishing accidents that occur in black spots from those that do not. This paper introduces a novel approach that leverages positive-unlabeled (PU) learning, a technique we previously applied successfully in the domain of defect detection. The results of this work demonstrate a statistically significant improvement in key performance metrics, including accuracy, precision, recall, F1-score, and AUC, compared to SL methods. This study thus establishes PU learning as a more effective and robust approach for accident classification in black spots, particularly in scenarios with highly imbalanced datasets. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

17 pages, 2043 KiB  
Article
EfficientNet Ensemble Learning: Identifying Ethiopian Medicinal Plant Species and Traditional Uses by Integrating Modern Technology with Ethnobotanical Wisdom
by Mulugeta Adibaru Kiflie, Durga Prasad Sharma, Mesfin Abebe Haile and Ramasamy Srinivasagan
Computers 2024, 13(2), 38; https://doi.org/10.3390/computers13020038 - 29 Jan 2024
Viewed by 1384
Abstract
Ethiopia is renowned for its rich biodiversity, supporting a diverse variety of medicinal plants with significant potential for therapeutic applications. In regions where modern healthcare facilities are scarce, traditional medicine emerges as a cost-effective and culturally aligned primary healthcare solution in developing countries. [...] Read more.
Ethiopia is renowned for its rich biodiversity, supporting a diverse variety of medicinal plants with significant potential for therapeutic applications. In regions where modern healthcare facilities are scarce, traditional medicine emerges as a cost-effective and culturally aligned primary healthcare solution in developing countries. In Ethiopia, the majority of the population, around 80%, and for a significant proportion of their livestock, approximately 90% continue to prefer traditional medicine as their primary healthcare option. Nevertheless, the precise identification of specific plant parts and their associated uses has posed a formidable challenge due to the intricate nature of traditional healing practices. To address this challenge, we employed a majority based ensemble deep learning approach to identify medicinal plant parts and uses of Ethiopian indigenous medicinal plant species. The primary objective of this research is to achieve the precise identification of the parts and uses of Ethiopian medicinal plant species. To design our proposed model, EfficientNetB0, EfficientNetB2, and EfficientNetB4 were used as benchmark models and applied as a majority vote-based ensemble technique. This research underscores the potential of ensemble deep learning and transfer learning methodologies to accurately identify the parts and uses of Ethiopian indigenous medicinal plant species. Notably, our proposed EfficientNet-based ensemble deep learning approach demonstrated remarkable accuracy, achieving a significant test and validation accuracy of 99.96%. Future endeavors will prioritize expanding the dataset, refining feature-extraction techniques, and creating user-friendly interfaces to overcome current dataset limitations. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

16 pages, 767 KiB  
Article
Constructing the Bounds for Neural Network Training Using Grammatical Evolution
by Ioannis G. Tsoulos, Alexandros Tzallas and Evangelos Karvounis
Computers 2023, 12(11), 226; https://doi.org/10.3390/computers12110226 - 05 Nov 2023
Viewed by 1875
Abstract
Artificial neural networks are widely established models of computational intelligence that have been tested for their effectiveness in a variety of real-world applications. These models require a set of parameters to be fitted through the use of an optimization technique. However, an issue [...] Read more.
Artificial neural networks are widely established models of computational intelligence that have been tested for their effectiveness in a variety of real-world applications. These models require a set of parameters to be fitted through the use of an optimization technique. However, an issue that researchers often face is finding an efficient range of values for the parameters of the artificial neural network. This paper proposes an innovative technique for generating a promising range of values for the parameters of the artificial neural network. Finding the value field is conducted by a series of rules for partitioning the original set of values or expanding it, the rules of which are generated using grammatical evolution. After finding a promising interval of values, any optimization technique such as a genetic algorithm can be used to train the artificial neural network on that interval of values. The new technique was tested on a wide range of problems from the relevant literature and the results were extremely promising. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

Review

Jump to: Research

28 pages, 710 KiB  
Review
A Systematic Review of Using Machine Learning and Natural Language Processing in Smart Policing
by Paria Sarzaeim, Qusay H. Mahmoud, Akramul Azim, Gary Bauer and Ian Bowles
Computers 2023, 12(12), 255; https://doi.org/10.3390/computers12120255 - 07 Dec 2023
Viewed by 3109
Abstract
Smart policing refers to the use of advanced technologies such as artificial intelligence to enhance policing activities in terms of crime prevention or crime reduction. Artificial intelligence tools, including machine learning and natural language processing, have widespread applications across various fields, such as [...] Read more.
Smart policing refers to the use of advanced technologies such as artificial intelligence to enhance policing activities in terms of crime prevention or crime reduction. Artificial intelligence tools, including machine learning and natural language processing, have widespread applications across various fields, such as healthcare, business, and law enforcement. By means of these technologies, smart policing enables organizations to efficiently process and analyze large volumes of data. Some examples of smart policing applications are fingerprint detection, DNA matching, CCTV surveillance, and crime prediction. While artificial intelligence offers the potential to reduce human errors and biases, it is still essential to acknowledge that the algorithms reflect the data on which they are trained, which are inherently collected by human inputs. Considering the critical role of the police in ensuring public safety, the adoption of these algorithms demands careful and thoughtful implementation. This paper presents a systematic literature review focused on exploring the machine learning techniques employed by law enforcement agencies. It aims to shed light on the benefits and limitations of utilizing these techniques in smart policing and provide insights into the effectiveness and challenges associated with the integration of machine learning in law enforcement practices. Full article
(This article belongs to the Special Issue Deep Learning and Explainable Artificial Intelligence)
Show Figures

Figure 1

Back to TopTop