PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach

Alsharif, Roaa; Al-Issa, Yazan; Alqudah, Ali Mohammad; Qasmieh, Isam Abu; Mustafa, Wan Azani; Alquran, Hiam

doi:10.3390/electronics10232949

Open AccessArticle

PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach

by

Roaa Alsharif

^1,2,

Yazan Al-Issa

³

,

Ali Mohammad Alqudah

^4,*

,

Isam Abu Qasmieh

⁴,

Wan Azani Mustafa

^5,*

and

Hiam Alquran

^4,6

¹

College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Jeddah 22384, Saudi Arabia

²

King Abdullah International Medical Research Center, Jeddah 22384, Saudi Arabia

³

Department of Computer Engineering, Yarmouk University, Irbid 21163, Jordan

⁴

Department of Biomedical Systems and Informatics Engineering, Yarmouk University, Irbid 21163, Jordan

⁵

Faculty of Electrical Engineering Technology, Campus Pauh Putra, Universiti Malaysia Perlis, Arau 02000, Malaysia

⁶

Department of Biomedical Engineering, Jordan University of Science and Technology, Irbid 22110, Jordan

^*

Authors to whom correspondence should be addressed.

Electronics 2021, 10(23), 2949; https://doi.org/10.3390/electronics10232949

Submission received: 14 October 2021 / Revised: 4 November 2021 / Accepted: 8 November 2021 / Published: 26 November 2021

(This article belongs to the Topic Machine and Deep Learning)

Download

Browse Figures

Versions Notes

Abstract

:

Pneumonia is an inflammation of the lung parenchyma that is caused by a variety of infectious microorganisms and non-infective agents. All age groups can be affected; however, in most cases, fragile groups are more susceptible than others. Radiological images such as Chest X-ray (CXR) images provide early detection and prompt action, where typical CXR for such a disease is characterized by radiopaque appearance or seemingly solid segment at the affected parts of the lung due to inflammatory exudate formation replacing the air in the alveoli. The early and accurate detection of pneumonia is crucial to avoid fatal ramifications, particularly in children and seniors. In this paper, we propose a novel 50 layers Convolutional Neural Network (CNN)-based architecture that outperforms the state-of-the-art models. The suggested framework is trained using 5852 CXR images and statistically tested using five-fold cross-validation. The model can distinguish between three classes: viz viral, bacterial, and normal; with 99.7% ± 0.2 accuracy, 99.74% ± 0.1 sensitivity, and 0.9812 Area Under the Curve (AUC). The results are promising, and the new architecture can be used to recognize pneumonia early with cost-effectiveness and high accuracy, especially in remote areas that lack proper access to expert radiologists, and therefore, reduces pneumonia-caused mortality rates.

Keywords:

deep learning; CNN; detection; PneumoniaNet; pneumonia; Chest X-ray; CXR

1. Introduction

Pneumonia is a leading cause of death in children under five years of age, taking a life every 39 s [1] accounting for 15% of the population under five years and being responsible for 808,694 deaths in 2017 [2]. It is an acute Lower Respiratory Tract (LRT) disease which creates inflammation in the lung parenchyma that can be caused by a variety of infective organisms such as bacteria, viruses, and fungi, as well as non-infective substances, for instance, sterile gastric contents. Individuals from any age group may be affected; however, in most cases, the cause is specific to a particular group. In the case of children presenting with Pneumonia, symptoms typically include fever, cough with or without sputum and with or without difficulty breathing, and fatigue and retraction of the chest during inhalation. Currently, the diagnostic criteria for pneumonia are based on clinical presentation, findings on Chest-X-ray (CXR), culture and sensitivity from throat swabs or sputum sampling, and blood samples. This disease is preventable, especially through vaccination and, since it is treatable as well, early diagnosis of pneumonia plays a significant factor in preventing complications.

According to the World Health Organization (WHO), Acute Respiratory Infections (ARI) are the worst communicable disease amongst children and an additional 18 million healthcare workers are essential by 2030 to prevent, diagnose, and treat pneumonia [2]. In 2021, the Center for Disease Control (CDC) in the United States estimates that the number of emergency department visits with pneumonia as the primary diagnosis was 1.5 million, and the number of deaths was 43,881 [3].

Even though Chest X-rays (CXRs) have a weaker resolution as compared to Magnetic Resonance Imaging (MRI) or Computerized Tomography (CT) scans, they can be used to perform multiple assessments such as cardiomegaly, pneumonia, pneumothorax, and atelectasis. Diagnosing pneumonia using radiographs is highly subjective and depends on the knowledge and expertise of the radiologist. It is easier to diagnose pneumonia using high resolution MRI and CT scans; however, most radiologists use CXRs to perform assessments owing to quicker turn-over and cost effectiveness of the modality. On a typical radiograph, pneumonia is marked by radio-opacities or white spots in the airways, particularly in the alveoli, which indicates the presence of inflammatory exudate. These radiological findings may present a challenge to a novice radiologist, leading to false positives and false negatives owing to the fact that other diseases mimic these signs. Figure 1 shows samples of CXR images that were utilized in this study and classified as normal, bacterial pneumonia, and viral pneumonia from the pediatric group.

Recently, Artificial Intelligence (AI) has been employed to automatically detect findings consistent with pneumonia from radiographic images. The availability of labelled CXR datasets combined with massive and relatively cheap computing power have made Deep Learning (DL) methods the most known and widely spread tool to detect and classify medical images in general and pneumonia in particular. These systems are promising and can achieve human doctor accuracy in detecting multiple diseases [4]. Instead of using pretrained models and transfer learning, this paper proposes a novel simple Deep Learning structure that can detect and distinguish between three classes of pediatric pneumonia using Chest-X-ray (CXR) images. The new architecture can distinguish between viral, bacterial, and normal with a very high accuracy of 99.7%. The proposed architecture performance far exceeds the state-of-the-art models mentioned in the literature.

The model uses a multi-layered Convolutional Neural Network (CNN) that automatically extracts features from the radiographic images and correlates with any pneumonia category with high accuracy. Computer Aided Diagnostic (CAD) systems can be used to eliminate radiologist subjectivity when diagnosing pneumonia. They can effectively be used to confirm clinical findings as well as in countries or remote areas that lack resources, particularly radiological expertise. Unlike previous research [5,6,7] that focused on using transfer learning and traditional machine learning techniques, this study proposes a novel model that can differentiate between normal, bacterial pneumonia, and viral pneumonia. For this purpose, we used the well-known Guangzhou Women and Children’s Medical Center (GWCMC) dataset together with familiar data augmentation techniques.

The rest of this paper is organized as follows: Section 2 describes the relevant literature; Section 3 describes the proposed model in detail and the data used in this study, our proposed methods, and training procedure; Section 4 presents the experiment results; Section 5 discuss the results of this study; Section 6 describes the conclusion of this study, followed by references.

2. Background and State-of-the-Art Research

A study of the literature reveals that many attempts have been made to use Artificial Intelligence (AI) and Deep Learning techniques to detect the presence or absence of pneumonia (binary classification problem) [5,6,7,8,9]. However, few studies have tried to apply machine learning to detect pneumonia [10,11,12] and Deep Learning (DL), particularly Convolutional Neural Networks (CNNs), to classify pneumonia according to its etiological origin (bacterial and viral).

In 2018, Rajaraman et al. [13] evaluated the performance of different customized CNN architectures in identifying pneumonia and distinguishing between viral and bacterial types in 5232 pediatric chest radiographs. The authors evaluated the performance of Sequential CNN, Inception CNN, Residual CNN, and VGG16, and used a novel visualization technique to define the Region of Interest (ROI). Customized VGG16 outperformed the surveyed models and achieved an accuracy of 96.2% in detecting pneumonia as well as an accuracy of 91.8% in differentiating between viral and bacterial types.

Rahman et al. [14] attempted to automatically diagnose different classes of pneumonia using 5247 CXR images from the Kaggle pneumonia dataset. Using transfer learning, they analyzed the performance of four popular pretrained models, AlexNet, ResNet18, DenseNet201, and SqueezeNet. They found that DenseNet201 outperformed all other models by achieving an accuracy of 98% in detecting pneumonia and an accuracy of 93.3% in differentiating between the two etiological variants. With a similar objective, Polat et al. [15] used two different CNNs, a binary CNN and a triple CNN, to detect pneumonia in 5840 pediatric CXR images. The CNNs were trained using the Walsh function to properly extract the features from digital chest radiographs, they also used a minimum distance classifier for classification. They conducted three different parametric studies and found that their proposed method achieved an accuracy of 100% in detecting pneumonia, 92% in distinguishing between two types of pneumonia, and 90% for distinguishing features that pertained to either normal, bacterial, or viral pneumonia.

In 2021, Alqudah et al. [16] used a modified CNN framework to distinguish bacterial and viral pneumonia from normal CXRs in 5852 images. The framework consisted of two stages; firstly, a CNN was used for feature extraction, and, in the later stage, K-Nearest Neighbor (KNN) and Support Vector Machine (SVM) classifiers were utilized. They built two hybrid models, i.e., CNN-KNN and CNN-SVM, while utilizing a 10-fold cross validation methodology as well. The prior hybrid model achieved an accuracy of 94.03%, and the latter hybrid model achieved an accuracy of 93.9%. Another study [17] deployed a pretrained Xception model with data augmentation to solve the multiclass classification problem. The models classified 5840 images from the Mendeley website and achieved an accuracy of 82.69%.

Lastly, various studies employed deep learning with Chest-X-ray (CXR) images to detect pneumonia. For example, Harsh Agrawal used pre-processing techniques as an initial step before using the ResNet50 v2 deep learning structure, which lead to improving the detection accuracy of pneumonia in CXR images to 96% [18]. Alquran et al. exploited texture features and traditional machine learning algorithms to classify Chest-X-rays (CXR) into three classes, Pneumonia regardless of its source including viral or bacterial, COVID 19, and normal chest images; they obtained a 93.1% accuracy among all classes [19]. Rajasenbagam et al. also utilized deep learning to detect pneumonia infection using Chest-X-ray (CXR) images. The proposed CNN was trained on 12000 CXR and achieved a 99.34% accuracy in test images. Moreover, the suggested CNN outperformed existing CNNs such as AlexNet, VGG16Net, and InceptionNet [20].

3. Materials and Methods

The proposed recognition approach consists of four main stages: the first stage is to load and resize the whole dataset; the second stage is to split the dataset into training, validation, and testing sub-datasets; the third stage involves training and validating the PneumoniaNet CNN model using training and validation datasets; finally, testing the PneumoniaNet using testing dataset. Figure 2 shows a flow diagram for the proposed methodology.

3.1. Pneumonia Dataset

This study utilized a pediatric CXR image dataset from the Guangzhou Women and Children Medical Center (GWCMC), which was published online by Kermany et al. [21]. The dataset contains a total of 5852 Anterior-Posterior (AP) CXRs images from pediatric patients between one and five years of age. Of the total, 4097 (70%) were used for training and 1755 (30%) were used for testing purposes. Table 1 shows the distribution of the dataset images into normal, bacterial, and viral pneumonia that were used.

3.2. Data Pre-Processing and Augmentation

All the images in the dataset were utilized, preprocessed, and resized from 1024 × 1024 pixel to 256 × 256 pixel as the proposed PneumoniaNet requires, and neither low-quality nor low-resolution images were excluded. To prevent overfitting, some noise was added to the dataset; it is well known that adding some noise to the inputs of neural network, in some situations, leads to significant improvement in model generalization capability [22,23]. Moreover, adding noise acts as some sort of augmentation of the dataset. Furthermore, other augmentation techniques were also used. Since not all augmentation approaches were suitable for X-ray images, we processed the images in four steps. First, we resized the images to 256 × 256, then five augmentation techniques were used, Random Horizontal and Vertical Flip (to deal with the pneumonia symptoms on any side of the chest X-ray), Random Horizontal and Vertical Shear (to obtain deeper relation among pixels), and, finally, augmenting images with a varying rotation of images [24,25].

3.3. Proposed Architecture (PneumoniaNet)

Deep learning is one of the most powerful and state-of-the-art technologies that is inspired by the deep neuronal structure of a human brain [26], characterized by numerous hidden layers that allow for the extraction and abstraction of features at different levels. Deep learning starts with a method proposed in 2006 [27], whereby this newly developed algorithm (greedy layer-wise training) is used to train the neuron layers of deep network architecture. It is considered a form of unsupervised learning algorithm that uses unlabeled data and trains the deep network one layer after the other. Since this method is very effective and powerful, it has been chosen as a training algorithm for many deep learning networks. The most powerful, efficient, and widely used deep network is CNN, which includes multiple hidden layers that perform convolution and subsampling to extract low and high level features from the input data whether it is in a single dimension or two dimensions [28]. Basically, CNN consists of six types of layers (input, convolution, RELU, fully connected, classification, and output), and arranging and ordering of these layers is crucial and must take into consideration that they must extract fine details from the input data [26,27]. In general, CNN shows high performance in various sectors, especially in the biomedical field and computer vision, as well as other disciplines [29,30].

In this study, the proposed 50-layer CNN architecture will be utilized to classify and distinguish the input images into three classes as shown in Figure 3. This architecture will decrease the number of the layers as compared to similar pretrained networks that are usually used with transfer learning techniques, i.e., 201 layers in Densnet201, 101 layers in ResNet-101, and 144 layers in GoogleNet, to only 50 layers. Reducing the number of layers will shorten the time required for training and for finding probabilities of new input images in addition to reducing the computing resources required to run the system. Table 2 shows detailed information about layers in the proposed CNN architecture. Using Figure 3 and Table 2, we can notice that the proposed model (PneumoniaNet) has three blocks for features extraction. These blocks are the core of the model and are targeted towards extracting both deep and general features and combining them to obtain the most discriminant features.

The proposed PneumoniaNet model is unique because it combines both the deep features extracted from the three consequent convolution layers separated by ReLU and the batch normalization layer, while the general features extracted using only one convolution layer with batch normalization layer. This is known as the x-block technique and will allow the CNN model to use both general and minor changes in the Chest-X-ray (CXR) images. Moreover, PneumoniaNet will improve the flow of information and gradients through the network, making the optimization of very deep networks tractable. Also, PneumoniaNet will strengthen feature propagation, encourage feature reuse and combination, and substantially reduce the number of parameters. The network weights and biases are initialized using “glorot” weight initialization. This method initializes each weight with a small gaussian value with a zero mean. Finally, the network will be trained end-to-end.

The main difference between the proposed PneumoniaNet model and Resnet50 is that the ResNet50 model uses the output of the previous layer as an input to the next layer, which is known as the residual blocks, to learn from the reference layer inputs instead of learning from unreferenced layers; whereas the main difference between the proposed PneumoniaNet model and VGG model is that the VGG uses a deep structure of very small receptive sequential 3 × 3 filters.

3.4. K-Fold Cross-Validation

In general, evaluating any machine learning or deep learning model will be quite tricky due to variations in the size of the dataset used. Usually, machine learning engineers tend to split the data set into training and testing sets with different ratios and use the training set to train the model and testing set to test the model; then, we evaluate the performance of the model using the accuracy metric [31]. However, this method is not very reliable as the accuracy obtained for one test set can be very different from the accuracy obtained for a different test set. Therefore, the K-fold Cross Validation provides a perfect solution to this problem: the solution is obtained by dividing the data into folds and ensuring that each fold is used as a testing set at some point. Figure 4 shows a block diagram of K-fold cross-validation [32].

Moreover, K-Fold cross-validation is where a given dataset will be split into a K number of folds (groups), where each fold is used as a testing set at some point. A sample scenario will be to choose K = 10, which we will call a 10-Fold cross validation. Here, the data set is split into 10 folds. In the first iteration, the first fold is used to test the model and the rest are used to train the model. In the second iteration, the second fold is used as the testing set while the rest serve as the training set. This process is repeated until each fold of the 10 folds have been used as the testing set [33].

3.5. Running Environment

All the experiments were conducted on a desktop computer with Microsoft Windows, running an Intel core i7-6700/3.4 GHz processor, 16 GB of RAM, and a 500 GB hard disk drive (HDD) using MATLAB 2020b. To test the proposed model, we performed a five-fold methodology, and, for each fold training, we used the Adam optimizer and the cross-entropy loss function [33,34]. The initial learning rate was 0.001 and, using this value, the proposed model was trained for 100 epochs for each fold.

4. Results

By creating a 50-layer CNN, this study has tackled a harder problem than simply detecting the presence or absence of pneumonia (binary classification problem) [35,36]. The suggested architecture was trained to discriminate between normal CXR images versus those from viral or bacterial pneumonia (multiclass problem), and the model seems to have learned how to solve this problem effectively and efficiently. It seems to have succeeded in extracting the features that correlate with every specific class. Figure 5 plots the average accuracy and average loss against epochs. The best results were obtained by the proposed PneumoniaNet network both in terms of loss values and accuracy. A graphical representation of the classifier performance (Figure 6) shows the PneumoniaNet multiclass confusion matrix, where the rows represent the predictions, and the columns represent the actual class. The figure shows the number of accurately and wrongly classified images, and it is clear from the figure that the proposed model managed to discriminate all three classes with 99.7% accuracy; moreover, the error represented by the False Positives and the False Negatives is nearly 3%. Aside from accuracy, the model sensitivity is 0.9974, specificity is 0.9985, and precision is 0.9970. In Figure 7, the Receiver Operating Characteristic (ROC) curve for the proposed model is presented; a very important metric in evaluating the performance of any classifier, the curve shows the tradeoff between specificity and sensitivity. The curve in the figure is very close to the upper left corner, and the AUC is nearly one (0.9812), indicating high performance in discriminating between all three classes. The curve also shows that the proposed model capability to differentiate normal, bacterial, and viral pneumonia is almost identical.

Class Activation Mapping (CAM) helps visualize the regions that the model used to extract the underlying features that are uniquely associated with each class. It helps identify possible areas within an image that contributed to the identification of each class. Figure 8 presents the Class Activation Mapping (CAM) for all three classes under consideration, and these images show the heat map superimposed on the original CXR thus highlighting the discriminative regions of maximum activation. It also shows how the trained model localized the class specific Region of Interest (ROI) that corresponds to the appropriate pneumonia label to make predictions.

Using Figure 6 and Figure 7, we can note that our proposed PneumoniaNet is one the few models targeted towards differentiating pneumonia types, either viral or bacterial, with very high efficiency. Also, the proposed PneumoniaNet will open a new trend towards three class classifications instead of the simple two class classifications, which is easy due to the visual difference between normal and pneumonia, while differentiating between the viral and bacterial pneumonia is a difficult task even to.

5. Discussion

Previous research [13,14,17] focused on using transfer learning and pretrained models, although only two groups [15,16] attempted to build a model from scratch or to modify an existing model for the purpose of detecting pneumonia as this study did. Most researchers [13,14,15] used the GWCMC dataset that this research used; nevertheless, Rahman et al. [14] used a Kaggle dataset, and Madhubala et al. [17] used images from Mendeley. Unlike this study, a small number of researchers [13,15] did not employ data augmentation.

Table 3 compares the proposed model performance with the most recent state-of-the-art. All the models in the table use CXR images, and three of the models use the same dataset that this research used. It is clear from the table that PneumoniaNet outperforms all other models with respect to all important performance metrics. The new model accuracy is the highest, with 99.72% accuracy, a 5% increase from the second-best model accuracy of 94.03% reported by Alqudah et al. in 2021 [16]. Because the accuracy metric by itself can be misleading, other performance metrics of interest such as the new model sensitivity, specificity, precision, and AUC are reported, and they are also better than what is described in the literature.

Sensitivity is the most important metric in medical applications because it shows the percentage of correct positives, and Table 3 shows that the proposed model recall is 99.74%. It is also clear in Table 3 that all previous work [13,14,15,16,17] achieved lower accuracy, sensitivity, precision, and F1 score when compared with the suggested method. On top of that, their dataset size is smaller than the one used in this paper. The proposed model structure is simple, which means that it converges fast, and it does not require a lot of computing power. On the other hand, the suggested model’s generalization capability is not on par with that of pretrained models. Unlike pretrained models that were trained using millions of images, the proposed architecture used only 5852 CXR images for training. The image dataset used for this study is not sufficient to cover all pneumonia inherent image features and to create a reliable CNN model with high accuracy. Lastly, the authors of the previous research did not provide detailed information to perform a comprehensive assessment, they did not make available the approach they used to validate their data and it is not clear whether they used cross validation as this study did or not.

To study the proposed system complexity, the average time required for the system to make a training per fold and to generate a decision about Chest-X-ray (CXR) images has been calculated and is shown in Table 4. The table shows that the suggested architecture converges faster and is very efficient in the real-time classification of Chest-X-ray images.

The results in this research are almost perfect, which makes this method more trustworthy and dependable. Finally, the proposed model will have an impact on medicine, medical staff in rural areas can detect pediatric pneumonia quickly, cost effectively, and with high accuracy using the suggested model. Quick and accurate detection of pneumonia can mitigate fatal complications of pneumonia, particularly in seniors and in children. The proposed model can help alleviate the interpretation variability and subjectivity problem when reading a Chest-X-ray (CXR) radiograph. It can also be used to assist novice radiologists in remote areas that lack expert radiologists to make the right decision. The next step is to build a mobile application that airports can use to discriminate pneumonia using Chest-X-ray (CXR) images.

6. Conclusions

Pneumonia is a preventable and treatable communicable disease and is a leading cause of death especially amongst children. Early detection will help to achieve quicker access to proper treatment and reduce the ramifications of the disease. PneumoniaNet is a novel deep learning-based model that uses CXR images to distinguish normal radiographic images from those with features consistent with viral or bacterial pneumonia in the pediatric group aged one–five years with a 99.72% accuracy, 99.74% sensitivity, 99.85% specificity, 99.7% precision, and 0.9812 AUC. The proposed model outperforms state-of-the-art architectures based on the existing performance metrics. Promising results from this new system are going to help radiologists in rural areas with limited resources detect and identify pneumonia quickly and cost effectively, where this will impact the health of the global population, especially in children, by reducing pneumonia-related morbidity and mortality rates. In the future, we plan to build a more complex system capable of calculating the area of pneumonia area and detecting the position of pneumonia accurately.

Author Contributions

Conceptualization, A.M.A., H.A., I.A.Q. and Y.A.-I.; data curation, R.A. and W.A.M.; methodology, A.M.A., H.A., I.A.Q. and Y.A.-I.; validation, R.A. and W.A.M.; writing, R.A., Y.A.-I., A.M.A., I.A.Q., W.A.M. and H.A.; project administration, A.M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The dataset analyzed during the current study was derived from the following public domain resources. Available online: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia (accessed on 4 November 2020).

Acknowledgments

The authors would like to thank the anonymous reviewers for their valuable comments. Also, the authors would like to thank the authors of the used dataset to make their dataset available online.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ruuskanen, O.; Lahti, E.; Jennings, L.C.; Murdoch, D.R. Viral pneumonia. Lancet 2011, 377, 1264–1275. [Google Scholar] [CrossRef]
National Center for Health Statistics (NCHS). Centers for Disease Control and Prevention (CDC) FastStats: Pneumonia. Last Updated January 2021. Available online: http://www.cdc.gov/nchs/fastats/pneumonia.htm (accessed on 6 August 2021).
World Health Organization. The Top 10 Causes of Death; World Health Organization: Geneva, Switzerland, 2017; Available online: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death (accessed on 6 August 2021).
Liu, N.; Wan, L.; Zhang, Y.; Zhou, T.; Huo, H.; Fang, T. Exploiting convolutional neural networks with deeply local description for remote sensing image classification. IEEE Access 2018, 6, 11215–11228. [Google Scholar] [CrossRef]
Masad, I.S.; Alqudah, A.; Alqudah, A.M.; Almashaqbeh, S. A hybrid deep learning approach towards building an intelligent system for pneumonia detection in chest X-ray images. Int. J. Electr. Comput. Eng. (2088-8708) 2021, 11, 5530–5540. [Google Scholar] [CrossRef]
Ayan, E.; Ünver, H.M. Diagnosis of pneumonia from chest X-ray images using deep learning. In Proceedings of the 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), Istanbul, Turkey, 24–26 April 2019; pp. 1–5. [Google Scholar] [CrossRef]
Chouhan, V.; Singh, S.K.; Khamparia, A.; Gupta, D.; Tiwari, P.; Moreira, C.; Damaševičius, R.; De Albuquerque, V.H.C. A novel transfer learning based approach for pneumonia detection in chest X-ray images. Appl. Sci. 2020, 10, 559. [Google Scholar] [CrossRef] [Green Version]
Luján-García, J.E.; Yáñez-Márquez, C.; Villuendas-Rey, Y.; Camacho-Nieto, O. A transfer learning method for pneumonia classification and visualization. Appl. Sci. 2020, 10, 2908. [Google Scholar] [CrossRef] [Green Version]
Elshennawy, N.M.; Ibrahim, D.M. Deep-pneumonia framework using deep learning models based on chest x-ray images. Diagnostics 2020, 10, 649. [Google Scholar] [CrossRef]
Al Mamlook, R.E.; Chen, S.; Bzizi, H.F. Investigation of the performance of Machine Learning Classifiers for Pneumonia Detection in Chest X-ray Images. In Proceedings of the 2020 IEEE International Conference on Electro Information Technology (EIT), Chicago, IL, USA, 31 July–1 August 2020. [Google Scholar] [CrossRef]
Yee, S.L.K.; Raymond, W.J.K. Pneumonia Diagnosis Using Chest X-ray Images and Machine Learning. In Proceedings of the 2020 10th International Conference on Biomedical Engineering and Technology, Tokyo, Japan, 15–18 September 2020. [Google Scholar] [CrossRef]
Toğaçar, M.; Ergen, B.; Cömert, Z. A deep feature learning model for pneumonia detection applying a combination of mRMR feature selection and machine learning models. Irbm 2020, 41, 212–222. [Google Scholar] [CrossRef]
Rajaraman, S.; Candemir, S.; Kim, I.; Thoma, G.; Antani, S. Visualization and interpretation of convolutional neural network predictions in detecting pneumonia in pediatric chest radiographs. Appl. Sci. 2018, 8, 1715. [Google Scholar] [CrossRef] [Green Version]
Rahman, T.; Chowdhury, M.E.; Khandakar, A.; Islam, K.R.; Islam, K.F.; Mahbub, Z.B.; Kadir, M.A.; Kashem, S. Transfer learning with deep convolutional neural network (CNN) for pneumonia detection using chest X-ray. Appl. Sci. 2020, 10, 3233. [Google Scholar] [CrossRef]
Polat, Ö.; Dokur, Z.; Ölmez, T. Determination of Pneumonia in X-ray Chest Images by Using Convolutional Neural Network. Turk. J. Electr. Eng. Comput. Sci. 2021, 29, 1615–1627. [Google Scholar] [CrossRef]
Alqudah, A.M.; Qazan, S.; Masad, I.S. Artificial Intelligence Framework for Efficient Detection and Classification of Pneumonia Using Chest Radiography Images. J. Med. Biol. Eng. 2021, 41, 599–609. [Google Scholar]
Madhubala, B.; Sarathambekai, S.; Vairam, T.; Sathya Seelan, K.; Sri Sathya, R.; Swathy, A.R. Pre-Trained Convolutional Neural Network Model Based Pneumonia Classification from Chest X-ray Images. 2021. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3852043 (accessed on 6 August 2021).
Agrawal, H. Pneumonia Detection Using Image Processing and Deep Learning. In Proceedings of the 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), Coimbatore, India, 25–27 March 2021; pp. 67–73. [Google Scholar]
Alquran, H.; Alsleti, M.; Alsharif, R.; Qasmieh, I.A.; Alqudah, A.M.; Harun, N.H.B. Employing Texture Features of Chest X-Ray Images and Machine Learning in COVID-19 Detection and Classification. Mendel 2021, 27, 9–17. [Google Scholar] [CrossRef]
Rajasenbagam, T.; Jeyanthi, S.; Pandian, J.A. Detection of pneumonia infection in lungs from chest X-ray images using deep convolutional neural network and content-based image retrieval techniques. J. Ambient Intell. Humaniz. Comput. 2021, 1–8. [Google Scholar] [CrossRef]
Kermany, D.; Goldbaum, M.; Cai, W.; Valentim, C.C.; Liang, H.; Baxter, S.L.; McKeown, A.; Yang, G.; Wu, X.; Yan, F.; et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 2018, 172, 1122–1131. [Google Scholar] [CrossRef]
Shorten, C.; Khoshgoftaar, T.M. A survey on image data augmentation for deep learning. J. Big Data 2019, 6, 1–48. [Google Scholar] [CrossRef]
Lemley, J.; Bazrafkan, S.; Corcoran, P. Smart augmentation learning an optimal data augmentation strategy. IEEE Access 2017, 5, 5858–5869. [Google Scholar] [CrossRef]
Wang, J.; Perez, L. The effectiveness of data augmentation in image classification using deep learning. Convolutional Neural Netw. Vis. Recognit. 2017, 11, 1–8. [Google Scholar]
Cubuk, E.D.; Zoph, B.; Mane, D.; Vasudevan, V.; Le, Q.V. Autoaugment: Learning augmentation policies from data. arXiv 2018, arXiv:1805.09501. [Google Scholar]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
Gao, F.; Yue, Z.; Wang, J.; Sun, J.; Yang, E.; Zhou, H. A Novel Active Semisupervised Convolutional Neural Network Algorithm for SAR Image Recognition. Comput. Intell. Neurosci. 2017, 2017, 3105053. [Google Scholar] [CrossRef] [PubMed]
Bakator, M.; Radosav, D. Deep Learning and Medical Diagnosis: A Review of Literature. Multimodal Technol. Interact. 2018, 2, 47. [Google Scholar] [CrossRef] [Green Version]
Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J. Recent advances in convolutional neural networks. Pattern Recognit. 2018, 77, 354–377. [Google Scholar] [CrossRef] [Green Version]
Fang, L.; Jin, Y.; Huang, L.; Guo, S.; Zhao, G.; Chen, X. Iterative fusion convolutional neural networks for classification of optical coherence tomography images. J. Vis. Commun. Image Represent. 2019, 59, 327–333. [Google Scholar] [CrossRef]
Alqudah, A.M. Towards classifying non-segmented heart sound records using instantaneous frequency based features. J. Med. Eng. Technol. 2019, 43, 418–430. [Google Scholar] [CrossRef] [PubMed]
Fushiki, T. Estimation of prediction error by using K-fold cross-validation. Stat. Comput. 2011, 21, 137–146. [Google Scholar] [CrossRef]
Alqudah, A.M.; Qazan, S.; Al-Ebbini, L.; Alquran, H.; Qasmieh, I.A. ECG heartbeat arrhythmias classification: A comparison study between different types of spectrum representation and convolutional neural networks architectures. Journal of Ambient Intell. Humaniz. Comput. 2021, 1–31. [Google Scholar] [CrossRef]
Alqudah, A.M. AOCT-NET: A convolutional network automated classification of multiclass retinal diseases using spectral-domain optical coherence tomography images. Med. Biol. Eng. Comput. 2020, 58, 41–53. [Google Scholar] [CrossRef]
GM, H.; Gourisaria, M.K.; Rautaray, S.S.; Pandey, M. Pneumonia detection using CNN through chest X-ray. J. Eng. Sci. Technol. 2021, 16, 861–876. [Google Scholar]
Cha, S.-M.; Lee, S.-S.; Ko, B. Attention-Based Transfer Learning for Efficient Pneumonia Detection in Chest X-ray Images. Appl. Sci. 2021, 11, 1242. [Google Scholar] [CrossRef]

Figure 1. Pediatric Chest X-ray images; normal (A), bacterial pneumonia (B), and viral pneumonia (C).

Figure 2. Flow diagram of the proposed methodology.

Figure 3. Flow diagram of the proposed methodology.

Figure 4. Block diagram of K-fold cross-validation.

Figure 5. Average Accuracy against epoch (A) and average cross-entropy loss against epoch (B) for all cross-validation folds.

Figure 6. PneumoniaNet three classes confusion matrix.

Figure 7. PneumoniaNet Receiver Operating Characteristic (ROC) Curve.

Figure 8. Class Activation Mapping (CAM); (A) Normal, (B) Bacterial Pneumonia, and (C) Viral Pneumonia.

Table 1. The distribution of radiographic images used in the system.

Case	Number of Images
Normal	1581
Bacterial Pneumonia	2778
Viral Pneumonia	1493
Total	5852

Table 2. Values for information of the layers in the proposed CNN Architecture.

#	Layer	Info	Value	#	Layer	Info	Value
1	Input Layer	Size	256 × 256	19	Batch_Norm_8	Channels	16
2	Conv_1	Filters	48	20	Conv_9	Filters	16
		Kernel Size	3 × 3			Kernel Size	3×3
		Activation	RELU			Activation	RELU
3	Batch_Norm_1	Channels	48	21	Batch_Norm_9	Channels	16
4	Maxpol_1	Kernel Size	2 × 2	22	Maxpol_3	Kernel Size	2×2
4	Maxpol_1	Stride	2 × 2	22	Maxpol_3	Stride	2×2
5	Conv_2	Filters	128	23	Conv_10	Filters	128
		Kernel Size	1 × 1			Kernel Size	1×1
		Activation	RELU			Activation	RELU
6	Batch_Norm_2	Channels	128	24	Batch_Norm_10	Channels	128
7	Conv_3	Filters	64	25	Conv_11	Filters	64
		Kernel Size	1 × 1			Kernel Size	1×1
		Activation	RELU			Activation	RELU
8	Batch_Norm_3	Channels	64	26	Batch_Norm_11	Channels	64
9	Conv_4	Filters	32	27	Conv_12	Filters	32
		Kernel Size	1 × 1			Kernel Size	1×1
		Activation	RELU			Activation	RELU
10	Batch_Norm_4	Channels	32	28	Batch_Norm_12	Channels	32
11	Conv_5	Filters	32	29	Conv_13	Filters	32
		Kernel Size	3 × 3			Kernel Size	3×3
		Activation	RELU			Activation	RELU
12	Batch_Norm_5	Channels	32	30	Batch_Norm_13	Channels	32
13	Maxpol_2	Kernel Size	2 × 2	31	Conv_14	Filters	32
13	Maxpol_2	Stride	2 × 2			Kernel Size	3×3
14	Conv_6	Filters	64			Activation	RELU
		Kernel Size	1 × 1	32	Batch_Norm_14	Channels	32
		Activation	RELU	33	Maxpol_4	Kernel Size	2×2
15	Batch_Norm_6	Channels	64			Stride	2×2
16	Conv_7	Filters	32
		Kernel Size	1 × 1
		Activation	RELU
17	Batch_Norm_7	Channels	32
18	Conv_8	Filters	16
18	Conv_8	Kernel Size	1 × 1

Table 3. Comparison of performance metrics from similar work.

Reference	Accuracy (%)	Sensitivity (Recall)	Specificity	Precision	F1 Score	AUC	Dataset Size
Rajaraman et al. [13]	0.918	0.900	0.960	0.920	0.910	0.939	5232 (same)
Rahman et al. [14]	0.933	0.932	0.967	0.937	0.935	0.95	5247
Polat et al. [15]	0.90	--	--	--	--	--	5840 (same)
Alqudah et al. [16]	0.9403	0.9333	0.9668	0.9422	--	--	5852 (same)
Madhubala et al. [17]	0.8269	0.85	--	0.86	--	--	5840
Proposed Model	0.9972 ± 0.002	0.9974 ± 0.001	0.9985 ± 0.0007	0.9970 ± 0.002	0.9972 ± 0.002	0.9812	5852

Table 4. Time consumption in our proposed model.

Step	Required Time
Average Training Time per fold	2 h
Average Time Per Image	3.144 s

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alsharif, R.; Al-Issa, Y.; Alqudah, A.M.; Qasmieh, I.A.; Mustafa, W.A.; Alquran, H. PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach. Electronics 2021, 10, 2949. https://doi.org/10.3390/electronics10232949

AMA Style

Alsharif R, Al-Issa Y, Alqudah AM, Qasmieh IA, Mustafa WA, Alquran H. PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach. Electronics. 2021; 10(23):2949. https://doi.org/10.3390/electronics10232949

Chicago/Turabian Style

Alsharif, Roaa, Yazan Al-Issa, Ali Mohammad Alqudah, Isam Abu Qasmieh, Wan Azani Mustafa, and Hiam Alquran. 2021. "PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach" Electronics 10, no. 23: 2949. https://doi.org/10.3390/electronics10232949

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach

Abstract

1. Introduction

2. Background and State-of-the-Art Research

3. Materials and Methods

3.1. Pneumonia Dataset

3.2. Data Pre-Processing and Augmentation

3.3. Proposed Architecture (PneumoniaNet)

3.4. K-Fold Cross-Validation

3.5. Running Environment

4. Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI