Next Article in Journal
Factors Influencing the Prognosis of Patients with Myalgic Encephalomyelitis/Chronic Fatigue Syndrome
Next Article in Special Issue
A Novel Framework for Abnormal Risk Classification over Fetal Nuchal Translucency Using Adaptive Stochastic Gradient Descent Algorithm
Previous Article in Journal
Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST
Previous Article in Special Issue
An Efficient Deep Learning-Based Skin Cancer Classifier for an Imbalanced Dataset
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Probabilistic Approach to COVID-19 Data Analysis and Forecasting Future Outbreaks Using a Multi-Layer Perceptron Neural Network

1
Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China
2
Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran 55461, Saudi Arabia
3
School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
4
Department of Mathematical Sciences, Balochistan University of Information Technology, Engineering and Management Sciences, Quetta 87300, Pakistan
5
School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
6
Department of Computer Science, University of Buner, Buner 19290, Pakistan
7
Institute of Computer Science, Kohat University of Science and Technology, Kohat 26000, Pakistan
*
Author to whom correspondence should be addressed.
Diagnostics 2022, 12(10), 2539; https://doi.org/10.3390/diagnostics12102539
Submission received: 28 August 2022 / Revised: 2 October 2022 / Accepted: 12 October 2022 / Published: 19 October 2022
(This article belongs to the Special Issue Deep Disease Detection and Diagnosis Models)

Abstract

:
The present outbreak of COVID-19 is a worldwide calamity for healthcare infrastructures. On a daily basis, a fresh batch of perplexing datasets on the numbers of positive and negative cases, individuals admitted to hospitals, mortality, hospital beds occupied, ventilation shortages, and so on is published. Infections have risen sharply in recent weeks, corresponding with the discovery of a new variant from South Africa (B.1.1.529 also known as Omicron). The early detection of dangerous situations and forecasting techniques is important to prevent the spread of disease and restart economic activities quickly and safely. In this paper, we used weekly mobility data to analyze the current situation in countries worldwide. A methodology for the statistical analysis of the current situation as well as for forecasting future outbreaks is presented in this paper in terms of deaths caused by COVID-19. Our method is evaluated with a multi-layer perceptron neural network (MLPNN), which is a deep learning model, to develop a predictive framework. Furthermore, the Case Fatality Ratio (CFR), Cronbach’s alpha, and other metrics were computed to analyze the performance of the forecasting. The MLPNN is shown to have the best outcomes in forecasting the statistics for infected patients and deaths in selected regions. This research also provides an in-depth analysis of the emerging COVID-19 variants, challenges, and issues that must be addressed in order to prevent future outbreaks.

1. Introduction

The present COVID-19 outbreak is a serious global crisis for healthcare infrastructures. The pandemic has triggered a crisis due to which, schools, administrative institutions, and financial institutions such as banks have been shut down in many major countries. Notably, such disruptions not only cause problems for people in the short term but also have long-term effects, for example, an increase in unemployment [1]. According to a study [2], the situation causes a 2.5–3% decline in the economic stability of GDPs globally every month. Furthermore, based on previous crises, it appears that younger and less-educated workers are the most financially impacted [2]. COVID-19 is thought to have originated from animals. If communities do not follow preventive policies for this highly contagious disease, COVID-19 can spread easily to healthy humans through close contact. Traveling has been the main cause of the huge spread [3,4]. In the early days of the COVID-19 pandemic, almost all reported cases were symptomatic. In a study by Noh J and Danuser G [5] of 50 countries, the number of actual COVID-19 patients in 25 of those countries was predicted to be 5 to 20 times larger than confirmed infected cases. In several European countries in March 2020, the number of total cases/infected patients was around 2.5 times higher than actual reported patients, and currently, it is estimated that the number of unseen infected cases is still 1.5 times higher than reported cases because undiscovered or unseen patients could be symptom-less or exhibit very subtle illness symptoms [6]. Researchers in the fields of pharmacy, chemistry, mathematics, physics, statistics, economics, computer science, geophysics, and medicine have joined hands to fight against COVID-19. However, no one has reached a firm conclusion yet on how to overcome this problem. Furthermore, the structure and symptoms are always mutating. The flu, body temperature, coughing, and shortness of breath are the initial indications of the COVID-19 virus. The severe side effects of this infection may cause acute respiratory disorder (a severe form of asthma), pneumonia, heart failure, renal failure, and possibly death in the subsequent stages [7]. The COVID-19 spread could be significantly slowed down by employing precautionary measures such as minimizing direct contact, social isolation, and smart lockdowns [8].
The accurate and robust forecasting of COVID-19 cases and deaths can assist government interventions and encourage the general public to consider effective actions to slow down the spread of this disease [9]. Researchers have conducted multiple studies to explore the COVID-19 associated risk factors and emotional effects, covering various categories such as nature, health, lockdown, etc., using different models [8,10]. Machine learning models, such as random forest, support vector machine (SVM), K- nearest neighbors (KNN), artificial neural networks, and many others, have also been used to predict the COVID-19 situation [11]. The reproduction rate of a disease is of great concern to epidemiologists as this is what determines a pandemic; a reproduction rate greater than one indicates a pandemic in the population [12]. The nature of COVID-19 has been studied by taking a variety of mathematical models into account. The most used model for analyzing disease dynamics is the Susceptible-Infectious-Recovered (SIR) model. This model uses a system of differential equations that are time-dependent to predict epidemic growth. Researchers have extensively employed the SIR model and different modified forms to study Ebola and AIDS [13,14]. Godio et al. [15] studied the recent SARS-CoV-2 pandemic outbreak by taking data from Italy using the SEIR epidemiological model. They used a Particle Swarm Optimization (PSO) solver to create a stochastic method to fit the model parameters, which improved the predictability of the prediction in a medium run of thirty days. Their findings matched Spanish and South Korean statistics and forecasts. Baleanu et al. [16] and some other researchers [17,18] used the Caputo–Fabrizio derivative to create a COVID-19 fractional differential equation model. The data on COVID-19 reflect a sequence of observations and time-series prediction approaches e.g., artificial neural network-based methods and meta-predictors are all native to the statistics [19,20]. For time-series forecasting, ANNs are frequently used [21]. ANN-based techniques have many advantages over machine learning techniques and one of the key advantages is that ANN can be fed raw data and discover the desired features automatically [22]. ANNs give accurate results based on numerous factors such as performance, accuracy, latency, speed, convergence, and size [23,24]. It is important to note that this research relies on artificial neural networks (ANNs) for forecasting the COVID-19 situation in certain countries.
In this paper, we propose a model to forecast future COVID-19 scenarios in major countries and provide insights for government bodies and policymakers. This work also provides a detailed look at the current COVID-19 variants, challenges, and guidelines for preventing the outbreak effectively. This forecasting is intended to assist organizations, legislators, and the general public in implementing new tactics and reinforcing ongoing COVID-19 precautionary actions. Additionally, this study could aid in relieving the socioeconomic and psychological distress caused by COVID-19. The key contributions of this study are given below.

Key Contributions

  • Awareness about emerging variants of COVID-19: We have collected information about COVID-19 including its types and emerging variants. It is important to note that some of the variants can appear without any prior symptoms.
  • Literature review: This article gives a brief overview of the related work recently undertaken in the field of COVID-19 forecasting using data mining approaches including machine learning, and deep learning techniques.
  • Proposed Methodology: We proposed an artificial neural network-based methodology for the statistical analysis of the current pandemic situation in some eastern and western countries. The results show that our approach works well in terms of precision and model fitting to statistical data.
  • Challenges and future directions: We discussed the current issues associated with utilizing Artificial Intelligence methods to resolve the COVID-19 pandemic. Furthermore, we demonstrate how machine learning and deep learning can assist in preventing the spread of COVID-19 in the future. We also address the potential future contributions of AI and blockchain-based solutions to analyze the outbreak response.

2. Coronavirus

Coronaviruses are indeed a huge family of viruses that are found both in humans and animals [25]. Seven different types have been identified, including the ones that caused COVID-19 and the SARS and MERS illnesses. According to initial estimations, the retrovirus seemed to be more contagious than the one that caused SARS, although it appeared to be less probable to provoke catastrophic illnesses. We still have a lot to learn about the novel coronavirus (COVID-19) [26].

2.1. Symptoms of COVID-19

COVID-19 has been related to a variety of indications, ranging from simple headaches to life-threatening diseases. Upon being exposed to the illness, symptoms and signs may appear after 2 to 14 days [27]. The severity of the symptoms varies from mild to severe. COVID-19 is a virus that can cause the following symptoms in patients:
  • Temperature or chills
  • Runny nose
  • Coughing
  • Breathing problems
  • Fatigue
  • Aches in the muscles or throughout the body
  • Loss of smell or taste
  • Diarrhea
  • Sore throat
  • Nausea or vomiting
This is not an extensive list of all symptoms and manifestation. The CDC [27] continues to update the list of possible symptoms whenever new information becomes available from research labs or other academic sources. COVID-19 infection appears to put elderly persons with serious medical conditions, such as diabetes, heart disease, or respiratory problems, at an increased risk of developing more serious conditions.

2.2. Types of Coronavirus

In a new study on COVID-19, UK-based scientists discovered that there are six different varieties of COVID-19 infection, each with its own set of symptoms.
  • Flu-like without a temperature
    Fatigue, muscle aches, absence of smell, sore throat, coughing, shortness of breath, and no temperature are some of the additional symptoms.
  • Flu-like with temperature
    Fatigue, absence of smell, sore throat, coughing, uncontrollable shaking, a decrease in hunger, and a temperature.
  • Gastrointestinal
    Fatigue, absence of smell, sore throat, a decrease in hunger, chest pain, no coughing, and diarrhea.
  • Extreme level one, severe exhaustion
    Fatigue, loss of smell, cough, chest pain, a temperature, and hoarseness.
  • Extreme level two, misconception (uncertainty)
    Fatigue, absence of smell, a decrease in hunger, coughing, sore throat, chest pain, a temperature, hoarseness, muscle pain, and confusion.
  • Extreme level three, abdominal and pulmonary
    Fatigue, absence of smell, a decrease in hunger, coughing, sore throat, chest pain, a temperature, hoarseness, and muscle pain.

2.3. Emerging Variants of COVID-19

New variants are emerging with time. For example, recently, a new mutant (B.1.1.529 also known as Omicron) has emerged, which is fast spreading and can pose a big threat to the effectiveness of COVID-19 vaccinations [28]. Researchers are closely monitoring this novel mutant of COVID-19. This variant contains various changes, which were earlier reported in other mutants, particularly Delta. This new variant has been observed to be expanding rapidly within South Africa. Nowadays, the main goal is to focus on its expansion. The said mutation was identified in Botswana on 11 November 2021 [29] and was identified in a South African traveler who traveled to Hong Kong. Omicron was added to the list of “variants of concern” by the WHO, which also contains Alpha, Beta, Gamma, and Delta. Viruses transform themselves all the time and the majority of mutations are minor. Some of these mutations may be harmful to the virus itself, whereas others can make the infection more aggressive or dangerous. Table 1 illustrates the alterations with the highest risk, which are described as the “variants of concern” and are regularly observed by healthcare practitioners. Regarding vaccinations against COVID-19, the vaccinations from Chinese Sinopharm, Pfizer, and AstraZeneca are very efficacious against the variations after two doses, whereas resistance after one dosage appears to be diminished [30].
There are several variants of SARS-CoV-2, including a brand-new, extremely contagious variant that was detected in the United Kingdom [26]. Another of these new variants is known as VOC202101/02 or P.1 and was reported in visitors from Brazil who traveled to Japan in January 2021. This gene contains the 1–4 nt insertion, three reductions, four identical modifications, and 17 distinct amino acid modifications [31]. Travel restrictions were implemented in an effort to stop the spread of P.1 throughout the nation after it was discovered in the United Kingdom [32]. However, another variety from Brazil (known in the UK as VUI202101/01) was discovered in the UK and comprises a minor recessive mutation. Eight instances of this type, which appeared to be of minimal significance, had been reported as of 14 January 2021. The “expansion and importance of this mutation continues under investigative process”, according to Public Health England (PHE). At same time as the English variant, the South African variant appeared and has since been found in at least 20 countries. According to South African genomic data, the 501Y.V2 mutation swiftly supplanted other circulating progenitors in the country because it appeared to have a greater infection rate and hence is more transmittable. The N501Y and E484K spike protein variants are present in this version, as they are in the English and Brazilian variants.

2.4. Variants of Interest (VOI)

There is significant proof that the differences in the variants have a massive effect on infectivity, disease intensity, and/or resistance, affecting the epidemiologic scenario in the EU/EEA [30]. There is at least reasonable certainty in the findings for these features, which included genetic, epidemiologic, and in vitro investigations. Additionally, all of the prerequisites for the variants of concern and under investigation listed in Table 2 apply. The indications are labeled to show whether they come from the variants themselves (v) or from mutations linked to the variants (m). Evidence with a “low confidence” rating is labeled to highlight that it is inconclusive. Blank fields or null fields indicate that there are no existing evaluations or scientific evidence for the category, whereas “no” means that there has been no change associated with the feature. B.1 is the comparable virus that is presumed to be “wild-type” (with D614G and no other spike protein modifications) [27].

2.5. Variants under Observation

SARS-CoV-2 variants under observation were discovered as indications through outbreak intelligence, rules-based genomic variant screening, and initial technical data [38]. There is some indication that they are similar to the VOIs in terms of quality; however, the evidence is either inadequate or is still to be examined by the ECDC [27]. One or more outbreaks in communities or proof of the communal spread of the mutation elsewhere in the world must have been established for the mutations mentioned in Table 3.

3. Related Work

Machine learning algorithms often employ data sequences collected over time as the input data to forecast the COVID-19 pandemic situation. The COVID-19 spread has been predicted using a variety of methodologies. The Long Short-Term Memory (LSTM) algorithm is one of the methodologies that has been used. The multi-layer perceptron (MLP), for example, is now being used to forecast the spread of COVID-19. This strategy has made it easier to anticipate the maximum number of COVID-19 victims, the highest proportion of survivors, and the highest number of fatalities per region in a specific time period [44].
Al-Qanes et al. [45] developed a more advanced form of the adaptive neuro-fuzzy infererence system (ANFIS) to calculate the infected patients in different four countries: United States, Iran, Italy, and Korea. Their approach was founded on the marine predators algorithm, a revolutionary nature-inspired optimization. The ANFIS variables were optimized using this technique, improving prediction accuracy. The model has shown efficient prediction performance for MAE, RMSE, MAPE, and R 2 [45]. Other research used an improved ANFIS model by integrating the flower pollination algorithm (FPA) and salp swarm algorithm (SSA). The proposed FPASSA-ANFIS framework was evaluated by employing verified data obtained from the WHO website. Additionally, the proposed model’s performance was evaluated using two different datasets of weekly infected patients [20].
The Susceptible-Exposed-Infectious-Recovered (SEIR) approach was used by Alsayed et al. [46] to forecast pandemic peaks in Malaysia. Researchers have utilized the ANFIS approach to anticipate the number of infected people in the short term. Additionally, researchers have hypothesized that extending the treatment time may lessen the severity of the pandemic at its height. The MAPE, RMSE, and R 2 values for this study were 2.79, 46.87, and 0.9973, respectively [46]. Behnood et al. [47] evaluated the influence of several climate-related elements and the size of the population on the spread of COVID-19 by integrating the viral optimization algorithm (VOA) and ANFIS. They showed that the density of the population had a surprising impact on how well their constructed scenarios operated, highlighting the critical role that social distance plays in reducing the rate as well as the spread of COVID-19. They reported the RMSE as 22.47, MAE as 7.33, and R 2 as 0.83 [47].
Aora et al. [48] employed RNN-related LSTM variations to predict the number of positive patients in India. The LSTM model was chosen for forecasting daily as well as weekly COVID-19 patients with approximated errors of three percent for daily cases and eight percent for weekly cases based on the lowest false alarm rate. Depending on the volume of confirmed patients and everyday progression of the designation of COVID-19 hotspots, they divided Indian states into various zones [48]. A bidirectional LSTM network was used by Fokas et al. [49] to produce a reliable generalization of RNNs. This technique was used to forecast new COVID-19 infected individuals in the United States, Spain, Italy, Germany, France, and Sweden [49].
The regression model proposed by Yadav et al. [50] for the forecasting of COVID-19 cases was based on six regression analyses including quadratic, third-degree, fourth-degree, fifth-degree, sixth-degree, and exponential polynomials. The sixth-degree polynomial regression method was the best model for the forecasting of short-term new cases [50]. Geographical hierarchies were employed by Kim et al. [51] to develop Hi-COVIDNet in accordance with a neural network of two-level machinery based on information gathered from the continent and at the country level. This approach comprehended the complex connections between far-off nations and connected their unique risks of infection to the targeted community [51].
Three hybrid techniques for COVID-19 time-series forecasting were developed by Abbasimehr and Paki [52] by combining the Bayesian optimization algorithm with the multi-head attention, LSTM, and CNN deep learning techniques. These findings revealed that deep neural networks outperformed the benchmark model in terms of both the short-term and long-term predictions. In addition, the best deep learning model’s average SMAPE had short-term forecasts of 0.25 and long-term forecasts of 2.59 [52]. Additionally, deep neural networks (DNNs) have been proposed as a technique for prediction. This approach is a significant substitute for estimating a partial differential equation’s solution [11]. Based on the distribution of COVID-19 over three time periods, a recent work employed the K-means approach to group countries into various clusters [11].

4. Methods

The proposed model for this work is the multi-layer perceptron neural network (MLPNN), whose flowchart/structure is illustrated in Figure 1. For this study, we collected data from the website of the World Health Organization [53]. The data used for this research were statistical data and contained no personally identifiable human photos, audios, videos, or other materials. Additionally, all procedures were conducted in accordance with the necessary rules and laws. As shown in Figure 1, the downloaded dataset was pre-processed using features extraction. We considered the categorical features (infected cases, number of deaths, and number of weeks) for this study. We tuned the model by removing the disconnected features that were causing the class imbalance, for example, we did not consider patients who had other diseases such as heart disease, cancer, diabetes, old age, etc. These features were causing a class imbalance, e.g., it was not necessary for all COVID-19 infected patients to be heart patients and vice versa. After removing the disconnected features, we normalized the data and initialized the input data by splitting it into subsets, i.e., 80% for training and 20% for testing. This splitting is typically made in a layered or randomized way to ensure the data are dispersed in the sample data of the subgroups, which minimizes biases or deviations in the data. The classification model that we utilized in the approach was trained using the training data and test data to evaluate the classifier’s performance over an unobserved subset of the data. We applied a three-layered feed-forward network (multi-layer perceptron neural network) model for training, testing, and validation. The MPLN is discussed briefly in the following sections.

4.1. Multi-Layer Perceptron Neural Network

We employed a multi-layer perceptron neural network [54] and a feed-forward neural network with an input layer, hidden layers, and an output layer (see Figure 2). In this research, two separate multi-layer perceptron neural networks were trained, i.e., one for each of the goals— infected cases and deaths. The data of the infected cases and deaths were used from various countries including China, Bangladesh, Germany, Italy, India, Iran, Pakistan, and the United Kingdom.
Ten hidden neurons were used in a single hidden layer and a sigmoid function was also used. The sigmoid is the activation function, which is specified as
N i = 1 1 + e w k i
where w k i are the weights of input values and N i is the value of the hidden neurons.
In the output layer, there are two input neurons that show the number of deaths and number of active cases. Furthermore, Equation (2) defines the output of a hyperbolic tangent transfer function that ranges from −1 to +1, that is,
N ¯ j = 2 1 + e 2 w i j 1
where w i j is a weighted output between the hidden neuron i and the output neuron j. N ¯ j is the output of j.
The best technique for calculating the best values for all the neural network variables, for example, the input and output weights, are used in the supervised learning approach. As a result, establishing the parameters of an ANN results in the development of an ANN model. Training through observed values and optimization is known as supervised learning (see Figure 3).

4.2. Mortality/Fatality Rate

The seriousness of a pandemic can be inferred from the fatality (case fatality ratio) rates/ratios, defined by
C F R = D e a t h s C o n f i r m e d c a s e s 100
where C F R is the case fatality ratio.

4.3. Cronbach’s Alpha

Cronbach’s alpha is a risk-adjusted evaluation metric that shows us how much the expected case returns differ from the actual case returns and whether deaths from COVID-19 are above or below the active cases/deaths. We calculated the actual cases and death ratio using Cronbach’s formula [55] (Equation (4)) as follows;
C α = K K 1 S y 2 S i 2 S y 2
where C α denotes the actual cases and deaths, S 2 describes the number of samples, S y 2 represents the variance in the total score. S i 2 is the variance of the individual week, whereas S i 2 is the sum of the scores of the individual week.

4.4. Mean Absolute Error (MAE)

We used the mean absolute error (MAE) (see Equation (4)) to achieve forecasting with minimized errors. Based on the MAE’s values, the mean absolute scaled error (MASE) (Equation (5)) was calculated for the actual infected cases/deaths and predicted cases/deaths for future weeks.
M A E = 1 k y = 1 k e y
where y k and then the y t h error e y is denoted by e y = x y x y ^

4.5. Mean Absolute Scaled Error (MASE)

We computed the MASE (mean absolute scaled error) using the actual numbers of infected cases and deaths and the forecasted values of the cases and deaths using the following equation (Equation (6)).
M A S E = 1 k y = 1 k e y 1 k 1 y = 2 k x y x y 1

4.6. Symmetric Mean Absolute Percentage Error (SMAPE)

We further calculated our data using SMAPE (Equation (7)). SMAPE uses the squared values such as the root mean square error (RMSE) (Equation (8)).
S M A P E = 1 k y = 1 k e y x y + x ^ y / 2
where k represents the sample size, x y indicates the actual values of the infected cases/deaths, and x ^ y indicates the forecasted values of the cases/deaths. y k and then the y t h error e y is denoted by e y = x y x y ^

4.7. Root Mean Square Error (RMSE)

The RMSE computes the difference of the error between two actual values and the forecasted values. We compared the anticipated value and real measurements, i.e., (a) the predicted values and (b) the observed values, respectively. We divided the total number of observations by the sum of all the values. Finally, we calculated the root mean square error (RMSE) (8) below:
R M S E = ( O E ) 2 n .
where n represents the total number of infected people, O denotes the number of observed values of actual cases, and E represents the number of the total expected values.

4.8. Data Pre-Processing and Experimental Setup

Authentic sources [53] were used to collect the data. We used the datasets of various countries including China, Bangladesh, Germany, Italy, India, Iran, Pakistan, and the United Kingdom. This study contains no personally identifiable human photos, audios, videos, or other materials. All procedures were followed in compliance with the necessary rules and regulations. A Windows 10, 64-bit operating system, with 16 GB of RAM was employed. For the training and validation datasets, we used CSV files. We normalized the data and initialized the input data by splitting them into subsets, i.e., 80% for training and 20% for testing. This splitting is typically made in a layered or randomized way to ensure the data are dispersed in the sample data of the subgroups, which minimizes biases or deviations in the data. K-fold validation was used to validate the performance of our proposed framework.

4.9. Model Forecasting

A time-series analysis is a very important component of deep learning and is utilized for forecasting. Time is the only input variable (independent feature) used to forecast the target feature (dependent feature) in time-series data, which are a type of univariate regressive data. It is used to predict the future values of coming occurrences and is crucial for predicting the occurrence of respiratory disorders such as COVID-19. Positive cases are growing every day, thus it is important to predict whether the rate of growth will continue based on earlier data. Governments can mobilize resources to prevent disease transmission based on forecasts and take action in the future to slow the pace of infection increase without impacting more citizens. Forecast numbers cannot be assured because predictions depend entirely on past patterns. To counter a pandemic emergency such as COVID-19, governments can use this approximate projection of occurrences to evaluate future resource management. This section discusses the actual situation with COVID-19-infected cases and forecasts future situations for infected cases and deaths.
Table 4 exhibits the CFRs for the selected countries as well as globally. A CFR of 5.33 % was reported for China, 2.99 % for Italy, 2.85 % for the United Kingdom, 1.17 % for India, 1.58 % for Bangladesh, and 2.25 % for Pakistan, whereas a global CFR of 2.08 % was reported [53]. Due to the large number of deaths at the beginning of the pandemic, China had the highest CFR among the other countries; however, after May, China’s fatalities decreased as a result of the lockdowns used to contain the pandemic. It is worth noting that the CFR is influenced by the number of tests performed and the size of the population. Therefore, a solid approach should be developed to avoid this constraint. The CFR changes when new cases of infection and fatalities appear. Table 5 and Table 6 show the results for the alpha, MASE, SMAPE, MAE, and RMSE for actual cases and deaths, respectively. Alpha returned a base value parameter of between 0 and 1. MASE returned a mean absolute scaled error measurement of the forecasting. The symmetric mean absolute percentage measurement parameter was returned by the SMAPE function. The MAE returned the mean absolute error and the RMSE returned the root mean squared error metric. Figure 3 denotes a detailed visualization of the weeks, that is, 60 weeks on the x-axis and the number of infected patients plus the number of deaths on the y-axis. Graph (A) shows the data from Bangladesh, graph (B) from China, graph (C) from Germany, graph (D) from India, graph (E) from Iran, graph (F) from Italy, graph (G) from Pakistan, and graph (H) shows the data from the United Kingdom.
Table 7 shows the test results of the best models for the death forecasting. Table 8 shows the weekly death forecasts for the upcoming months. The model forecast results for India show an increase in weekly deaths at a faster rate compared to the other specified countries. Consequently, if the same strategy is maintained, COVID-19 will be completely out of control in India and fatalities could reach more than 121 thousand by the start of the upcoming year. The weekly death forecasts for Pakistan, Bangladesh, and Iran show decreases but at a relatively slow rate. The forecasts indicate that for Pakistan, COVID-19 deaths in the 1st week of the upcoming month in 2022 are 380, and this number will not exceed 537, with a confidence level of 95 % . However, weekly deaths will reduce to 316, indicating a reasonably considerable difference in a couple of months. For Iran, the forecast for deaths is 1367 and will not exceed 1732, whereas for Bangladesh, it is 198 and will not exceed 292. The forecasting results for Germany are also declining at a slower rate. The forecast results show that in the last week of the first month, the weekly deaths will be 775 and will not exceed 5812, with a confidence level of 95 % . The upper limit suggests an alarming situation. It is highly recommended for their governments to take steps and implement new policies as preventive measures regarding the pandemic situation. The forecast for the UK shows that weekly deaths will increase and in the last week of the upcoming month will be 126 and not exceed 8876. The results indicate that these countries’ current strategies are working effectively in controlling the pandemic but the future situation may worsen, as shown by the upper limit of the forecast; it is highly recommended that they revise their policies in a timely manner.
Finally, regarding Italy’s future scenario, the situation will not be as difficult as in India. However, there is a considerably high weekly deaths forecast (more than a couple of hundred) for the end of the current year and the start of the next year. Table 7 gives a brief overview of the best models’ test results for death forecasting. The WHO should give special consideration and help countries, such as India, Italy, and others with high mortality forecasts for COVID-19, to fight against the pandemic.
Figure 4 represents a detailed view of the number of weeks on the x-axis and the number of actual cases and predicted cases on the y-axis. Graph (A) shows the data from Bangladesh, graph (B) from China, graph (C) from Germany, graph (D) from India, graph (E) from Iran, graph (F) from Italy, graph (G) from Pakistan, and graph (H) shows the data from the United Kingdom.
Figure 5 shows a detailed view of the total number of weeks on the x-axis and the number of actual deaths and predicted deaths on the y-axis. Graph (A) shows the data from Bangladesh, graph (B) from China, graph (C) from Germany, graph (D) from India, graph (E) from Iran, graph (F) from Italy, graph (G) from Pakistan, and graph (H) shows the data from the United Kingdom.

4.10. The Model’s Performance

The results of the best accuracy, training, testing, and validation of our framework are briefly summarized in Figure 6. The results show a 99.60 % accuracy, which means that the validation effectiveness is satisfactory. These outcomes were seen when initializing the input parameters for the model, indicating that the model was properly trained and the data were error-free.
Figure 7 gives a brief visualization of the output results. The value of the training correlation coefficient of the target output was observed to be 99.44 % , the validation was observed to be 99.77 % , the testing was observed to be 64.16 , and the overall value was observed to be 90.6 % , which means that our model was efficient. The correlation quantifies the strength of a linear relationship between two variables. We used a correlation to investigate whether a relationship existed between the variables to assume or fit a specific model to our data. A value close to 1 ( 90.6 % in this research) indicated that there was a positive linear relationship between the data columns, which means that our proposed model was precisely or accurately working on the given dataset.

5. Challenges and Future Directions

We discuss the current issues associated with utilizing Artificial Intelligence methods to resolve the COVID-19 pandemic. Furthermore, we demonstrate how machine learning and deep learning can assist in preventing the transmission rate of COVID-19 in the future.

5.1. Challenges

Applications based on AI for investigating COVID-19 are presently facing numerous hurdles, for example, the scarcity, legislation, and inaccessibility of substantial data; there are a lot of noisy data as well as false feedback; the inadequate alertness of the juncture between medicine and computer science; the issue of security and data privacy, etc.
Policies and Regulations
As the epidemic spreads and the numbers of reported affected and deceased people rises, several measures to limit the outbreak have been discussed, for example, social distancing and lockdowns. Authorities have an important role in establishing regulations and rules to motivate citizens, experts, educators, entrepreneurs, medical centers, technology giants, and large corporations to cooperate in COVID-19 mitigation during an outbreak.
Large-scale training data are scarce and unavailable
Many Artificial Intelligence deep learning (AIDL) systems rely on large-scale datasets, including diagnostic image processing, with a variety of environmental variables. Yet, because of COVID-19’s explosive expansion, there are inadequate resources for AI. In practice, analyzing datasets is a time-consuming task and demands the support of trained health professionals.
Noisy data and speculation on the internet
The problems occur as a result of a reliance on easily available online social networking sites; vast amounts of audio/video, fake information, and misleading news have been reported in thousands of different online channels without any substantial modifications. Artificial intelligence-based techniques appeared to be slow when evaluating and processing noisy data. Furthermore, the outputs of Artificial Intelligence ML and DL techniques become skewed with noisy data. These issues reduce the performance and efficiency of Artificial intelligence algorithms, especially for epidemic forecasts and spreading analyses.
Lack of integration between computer science and medicine arenas
Numerous Artificial Intelligence experts have a strong hold on computer science applications, but considerable expertise in diagnostic imaging, epidemiology, pharmacology, and other relevant domains is also required to incorporate other medical information into artificial intelligence methods in the war against COVID-19. To handle COVID-19, it will be essential to arrange for specialists from different majors to work together and integrate data from numerous works.
Data security and privacy
In the era of Artificial Intelligence, the cost of acquiring confidentiality of data is incredibly low. In the presence of healthcare issues such as the current pandemic situation, several government agencies strove to gather a wide range of personal data including contact numbers, ID numbers, and medical data. How to properly maintain individual confidentiality and human rights during Artificial Intelligence discovery and handling is a topic worth tackling.
Unstructured data or incorrect structural data (e.g., numerical, text, and image data)
Working with incorrect facts and ambiguous data in textual material can be challenging. It is possible for large amounts of data from several sources to be erroneous. Furthermore, a lot of data makes it difficult to extract valuable bits of metadata.
Early detection of COVID-19 via image analysis such as chest X-rays and CT scans
Handling unbalanced datasets results in insufficient diagnostic imaging and extensive training periods and being unable to describe the problems of the efficient outcomes.
Risk assessments of old-age people and patients with other diseases
Old-age people should be screened, functioning treatments and cures should be discovered, risk assessments should be conducted, survival projections should be made, healthcare should be provided, and medical source planning should be conducted. The task at hand is to obtain the physical features and therapeutic outcomes for patients. An additional challenge is dealing with low-quality data, which can lead to skewed and incorrect predictions for old-age people and people with other diseases, for example, heart disease, diabetes, asthma, and so on.

5.2. Future Research Direction

Artificial Intelligence and blockchain-based solutions can also contribute to the fighting the outbreak in the following ways.
Non-contact illness diagnostics
Using automatic feature categorization in X-ray and CT imaging during COVID-19 outbreaks will successfully limit the outbreaks. A patient’s posture can be detected and CT image detection, X-rays, and smart camera facilities can all be utilized in AI-based systems.
Video diagnostics and consulting remotely
To deliver COVID-19 hospital admissions and early diagnosis data, a mix of Artificial Intelligence and natural language processing (NLP) modules can be utilized to construct remote diagnostic programs and automation systems.
Bio-technological research
AI-based algorithms can be utilized to accurately examine biomedical knowledge in terms of biotechnological research, such as major protein structures, genomic sequencing, and viral itineraries, to determine protein compositions and viral components.
Vaccination and drug development
AI-based algorithms can be used to find prospective medications and vaccinations, as well as replicate drug–protein and vaccine–receptor pairings, allowing for the prediction of future drug and vaccine responses in COVID-19 patients.
Fake information must be identified and screened
In order to provide real, accurate, and comprehensive COVID-19 statistics, Artificial Intelligence models must be used to filter out erroneous news and material online. Blockchain-based [56] systems can be used to track and trace the actual information source.
Impact analysis and appraisal
Various sorts of computations can use machine learning, deep learning techniques to evaluate the influence of different social management strategies on the spread of the pandemic. Data could then be used to evaluate logical and efficient strategies for disease prevention and control in the general public.
Tracking of patients’ contacts
By establishing social networking sites and an information architecture, blockchain-based federated learning can be used to detect and track the characteristics of individuals residing in close proximity to COVID-19 sufferers, effectively anticipating and tracking the pandemic progression.
Smart robots
Robotic systems are likely to be used in activities, for example, public sanitation, deliveries, supply chains, and in healthcare infrastructures that do not require human resource management, e.g., medical treatment. This can stop the COVID-19 virus from spreading.
Future work with descriptive federated learning methods
The effectiveness of federated learning methods and graphic properties that cause distinctions between COVID-19 and other strains of tuberculosis must be determined. This will aid radiologists and doctors in being more conscious of the infection and effectively analyzing probable COVID-19 X-rays and CT imaging data.
Importance of COVID-19 diagnostic tools and treatment
These are both necessary but the early detection of COVID-19 is far more important. Substantial future study efforts based on ML and DL are needed in order to identify COVID-19 therapies.

6. Conclusions

The applications of operational research that uses mathematical, statistical, and demographic modeling are crucial in assisting decision makers in education, health, socioeconomic, and other aspects of daily life. By adopting preventative measures beforehand, the transmission of COVID-19 could be considerably slowed. In order to maintain attention on the most sensitive location, country, or region, scientists, research professionals, and global leaders must be informed in advance of the emergency scenarios. For forecasting the pandemic situation, this study proposed a multi-layer perceptron neural network (MLPNN) with the integration of Cronbach’s alpha and the MAE, MASE, SMAPE, RMSE, and CFR. We also focused on the current challenges in preventing the outbreak from further spread and what is needed in the future to normalize social and economic activities. High accuracy was observed in estimating the percentages of afflicted patients and deaths. According to the MLPNN model’s encouraging results, the volume of COVID-19 people in India will rise in the upcoming weeks and the death rate will also rise. This was evident from the 95% upper limit confidence interval, which was becoming wider for subsequent weeks. In general, forecasts for the near future were more precise compared to the longer term. Furthermore, providing the breakdown of the forecasting for each of the past COVID-19 variants could be a very interesting contribution to the research and will be explored in future studies. For this research, we could not find actual data about the numbers of patients who were affected by the particular variants in the selected countries.

Author Contributions

Conceptualization, R.U.K., S.A. and S.M.H.; Data curation, R.U.K., R.K. and A.U.H.; Formal analysis, M.A., I.A. and M.I.U.; Investigation, A.U.H.; Methodology, R.U.K. and R.K.; Project administration, I.K.; Resources, S.A., I.K. and M.I.U.; Supervision, R.U.K.; Validation, A.U.; Writing—original draft, I.A.; Writing—review & editing, A.U. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported by the Deanship of Scientific Research at Najran University (Grant No: NU/RC/SERC/11/8) and research funding from the Yangtze Delta Region Institute Huzhou, University of Electronic Science and Technology of China (Grant No. U03210068).

Conflicts of Interest

During the project, the authors declare that they had no financial or commercial relationships that could be construed as potential conflict of interest.

References

  1. Burgess, S.; Sievertsen, H.H. Schools, Skills, and Learning: The Impact of COVID-19 on Education. 2020. Available online: https://cepr.org/voxeu/columns/schools-skills-and-learning-impact-covid-19-education (accessed on 12 January 2021).
  2. Fernandes, N. Economic Effects of Coronavirus Outbreak (COVID-19) on the World Economy. SSRN Electron. J. 2020, 2, 1–33. [Google Scholar] [CrossRef]
  3. Bellan, M.; Patti, G.; Hayden, E.; Azzolina, D.; Pirisi, M.; Acquaviva, A.; Aimaretti, G.; Valletti, P.A.; Angilletta, R.; Arioli, R.; et al. Fatality rate and predictors of mortality in an Italian cohort of hospitalized COVID-19 patients. Sci. Rep. 2020, 10, 1–10. [Google Scholar] [CrossRef] [PubMed]
  4. Chen, Y.; Cheng, J.; Jiang, Y.; Liu, K. A time delay dynamic system with external source for the local outbreak of 2019-nCoV. Appl. Anal. 2020, 101, 146–157. [Google Scholar] [CrossRef] [Green Version]
  5. Noh, J.; Danuser, G. Estimation of the fraction of COVID-19 infected people in U.S. states and countries worldwide. PLoS ONE 2021, 16, e0246772. [Google Scholar] [CrossRef] [PubMed]
  6. Böhning, D.; Rocchetti, I.; Maruotti, A.; Holling, H. Estimating the undetected infections in the Covid-19 outbreak by harnessing capture–recapture methods. Int. J. Infect. Dis. 2020, 97, 197–201. [Google Scholar] [CrossRef] [PubMed]
  7. Paules, C.I.; Marston, H.D.; Fauci, A.S. Coronavirus infections—More than just the common cold. JAMA 2020, 323, 707–708. [Google Scholar] [CrossRef] [Green Version]
  8. Tomar, A.; Gupta, N. Prediction for the spread of COVID-19 in India and effectiveness of preventive measures. Sci. Total Environ. 2020, 728, 138762. [Google Scholar] [CrossRef]
  9. Feroze, N. Forecasting the patterns of COVID-19 and causal impacts of lockdown in top five affected countries using Bayesian Structural Time Series Models. Chaos Solitons Fractals 2020, 140, 110196. [Google Scholar] [CrossRef]
  10. Greaney, A.J.; Loes, A.N.; Crawford, K.H.D.; Starr, T.N.; Malone, K.D.; Chu, H.Y.; Bloom, J.D. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe 2021, 29, 463–476. [Google Scholar] [CrossRef]
  11. Silva, R.F.; Xavier, F.; Saraiva, A.M.; Cugnasca, C.E. Unsupervised machine learning and pandemics spread: The case of COVID-19. In Proceedings of the 20th Brazilian Symposium on Computing Applied to Healthcare (SBCAS2020), SBC, Virtual, 18–22 October 2020; pp. 506–511. [Google Scholar] [CrossRef]
  12. Khan, R.U.; Hussain, S.M.; Ul Haq, A.; Asif, M.; Yousaf, M.; Zafar, A.; Almakdi, S.; Li, J.; Malghani, M.A. Forecasting Time Series COVID-19 Statistical Data with Auto-Regressive Integrated Moving Average and Box-Jenkins’ Models. In Proceedings of the 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), Chengdu, China, 17–19 December 2022; pp. 353–358. [Google Scholar] [CrossRef]
  13. Zakary, O.; Larrache, A.; Rachik, M.; Elmouki, I. Effect of awareness programs and travel-blocking operations in the control of HIV/AIDS outbreaks: A multi-domains SIR model. Adv. Differ. Equ. 2016, 2016, 1–17. [Google Scholar] [CrossRef] [Green Version]
  14. Khaleque, A.; Sen, P. An empirical analysis of the Ebola outbreak in West Africa. Sci. Rep. 2017, 7, 1–8. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Godio, A.; Pace, F.; Vergnano, A. SEIR modeling of the Italian epidemic of SARS-CoV-2 using computational swarm intelligence. Int. J. Environ. Res. Public Health 2020, 17, 3535. [Google Scholar] [CrossRef] [PubMed]
  16. Baleanu, D.; Mohammadi, H.; Rezapour, S. A fractional differential equation model for the COVID-19 transmission by using the Caputo–Fabrizio derivative. Adv. Differ. Equ. 2020, 2020, 1–27. [Google Scholar] [CrossRef] [PubMed]
  17. Ahmad, S.; Ullah, A.; Shah, K.; Salahshour, S.; Ahmadian, A.; Ciano, T. Fuzzy fractional-order model of the novel coronavirus. Adv. Differ. Equ. 2020, 2020, 1–17. [Google Scholar] [CrossRef]
  18. Salahshour, S.; Ahmadian, A.; Pansera, B.A.; Ferrara, M. Uncertain inverse problem for fractional dynamical systems using perturbed collage theorem. Commun. Nonlinear Sci. Numer. Simul. 2021, 94, 105553. [Google Scholar] [CrossRef]
  19. Meng, F.; Uversky, V.N.; Kurgan, L. Comprehensive review of methods for prediction of intrinsic disorder and its molecular functions. Cell. Mol. Life Sci. 2017, 74, 3069–3090. [Google Scholar] [CrossRef] [PubMed]
  20. Al-Qaness, M.A.A.; Ewees, A.A.; Fan, H.; Abd El Aziz, M. Optimization method for forecasting confirmed cases of COVID-19 in China. J. Clin. Med. 2020, 9, 674. [Google Scholar] [CrossRef] [Green Version]
  21. Wang, L.; Wang, Z.; Qu, H.; Liu, S. Optimal forecast combination based on neural networks for time series forecasting. Appl. Soft Comput. 2018, 66, 1–17. [Google Scholar] [CrossRef]
  22. Eriksson, T.A.; Bülow, H.; Leven, A. Applying neural networks in optical communication systems: Possible pitfalls. IEEE Photonics Technol. Lett. 2017, 29, 2091–2094. [Google Scholar] [CrossRef] [Green Version]
  23. King Abdullah Petroleum Studies and Research Center. Saudi Arabia Coronavirus Disease (COVID-19) Situation—Demographics; King Abdullah Petroleum Studies and Research Center: Riyadh, Saudi Arabia, 2021. [Google Scholar]
  24. Tilahun, S.L.; Ong, H.C. Prey-predator algorithm: A new metaheuristic algorithm for optimization problems. Int. J. Inf. Technol. Decis. Mak. 2015, 14, 1331–1352. [Google Scholar] [CrossRef]
  25. Public Health England. SARS-CoV-2 Variants of Concern and Variants under Investigation in England-Technical Briefing 11; Technical report; Public Health England: London, UK, 2021. [Google Scholar]
  26. Public Health England. SARS-CoV-2 Variants of Concern and Variants under Investigation in England-Technical Briefing 12; Technical report; Public Health England: London, UK, 2021. [Google Scholar]
  27. ECDC. SARS-CoV-2 Variants of Concern as of 5 August 2021; ECDC: Solna, Sweden, 2021. [Google Scholar]
  28. Callaway, E. Heavily mutated coronavirus variant puts scientists on alert. Nature 2021, 600, 21. [Google Scholar] [CrossRef] [PubMed]
  29. National Institute for Communicable Diseases. New COVID-19 Variant Detected in South Africa-NICD; National Institute for Communicable Diseases: Johannesburg, South Africa, 2021. [Google Scholar]
  30. Focosi, D.; Tuccori, M.; Baj, A.; Maggi, F. SARS-CoV-2 Variants: A Synopsis of In Vitro Efficacy Data of Convalescent Plasma, Currently Marketed Vaccines, and Monoclonal Antibodies. Viruses 2021, 13, 1211. [Google Scholar] [CrossRef] [PubMed]
  31. Aarts, M.; Dekker, M.; de Vries, S.; van der Wal, A.; te Riele, H. Generation of a mouse mutant by oligonucleotide-mediated gene modification in ES cells. Nucleic Acids Res. 2006, 34, 147. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Public Health England. Confirmed Cases of COVID-19 Variants Identified in UK-GOV.UK; Public Health England: London, UK, 2021. [Google Scholar]
  33. Collier, D.A.; De Marco, A.; Ferreira, I.A.; Meng, B.; Datir, R.P.; Walls, A.C.; Kemp, S.A.; Bassi, J.; Pinto, D.; Silacci-Fregni, C.; et al. Sensitivity of SARS-CoV-2 B. 1.1. 7 to mRNA vaccine-elicited antibodies. Nature 2021, 593, 136–141. [Google Scholar] [CrossRef] [PubMed]
  34. Davies, N.G.; Jarvis, C.I.; Edmunds, W.J.; Jewell, N.P.; Diaz-Ordaz, K.; Keogh, R.H. Increased mortality in community-tested cases of SARS-CoV-2 lineage B. 1.1. 7. Nature 2021, 593, 270–274. [Google Scholar] [CrossRef] [PubMed]
  35. Tut, G.; Lancaster, T.; Krutikov, M.; Sylla, P.; Bone, D.; Kaur, N.; Spalkova, E.; Bentley, C.; Amin, U.; Jadir, A.; et al. Profile of Humoral and Cellular Immune Responses to Single BNT162b2 or ChAdOx1 Vaccine in Residents and Staff within Residential Care Homes (VIVALDI): An observational study. Lancet Healthy Longev. 2021, 2, E544–E553. [Google Scholar] [CrossRef]
  36. Lopez Bernal, J.; Andrews, N.; Gower, C.; Gallagher, E.; Simmons, R.; Thelwall, S.; Stowe, J.; Tessier, E.; Groves, N.; Dabrera, G.; et al. Effectiveness of Covid-19 vaccines against the B. 1.617. 2 (delta) variant. N. Engl. J. Med. 2021, 385, 585–594. [Google Scholar] [CrossRef]
  37. Aziz, S.; Jim, M.; Bob, T.; Chris, R. SARS-CoV-2 Delta VOC in Scotland: Demographics, risk of hospital admission, and vaccine effectiveness. Lancet 2021, 397, 2461–2462. [Google Scholar]
  38. Davies, N.G.; Abbott, S.; Barnard, R.C.; Jarvis, C.I.; Kucharski, A.J.; Munday, J.D.; Pearson, C.A.; Russell, T.W.; Tully, D.C.; Washburne, A.D.; et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B. 1.1. 7 in England. Science 2021, 372, eabg3055. [Google Scholar] [CrossRef]
  39. Tegally, H.; Wilkinson, E.; Giovanetti, M.; Iranzadeh, A.; Fonseca, V.; Giandhari, J.; Doolabh, D.; Pillay, S.; San, E.J.; Msomi, N.; et al. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature 2021, 592, 438–443. [Google Scholar] [CrossRef]
  40. Cele, S.; Gazy, I.; Jackson, L.; Hwa, S.H.; Tegally, H.; Lustig, G.; Giandhari, J.; Pillay, S.; Wilkinson, E.; Naidoo, Y.; et al. Escape of SARS-CoV-2 501Y. V2 from neutralization by convalescent plasma. Nature 2021, 593, 142–146. [Google Scholar] [CrossRef] [PubMed]
  41. Madhi, S.A.; Baillie, V.; Cutland, C.L.; Voysey, M.; Koen, A.L.; Fairlie, L.; Padayachee, S.D.; Dheda, K.; Barnabas, S.L.; Bhorat, Q.E.; et al. Efficacy of the ChAdOx1 nCoV-19 Covid-19 vaccine against the B. 1.351 variant. N. Engl. J. Med. 2021, 384, 1885–1898. [Google Scholar] [CrossRef]
  42. Carl AB Pearson; Russell, T.W.; Davies, N.G.; Kucharski, A.J.; CMMID COVID-19 working group; Edmunds, W.J.; Eggo, R.M. Estimates of Severity and Transmissibility of Novel South Africa SARS-CoV-2 Variant 501Y.V2. Working Paper. Available online: https://cmmid.github.io/topics/covid19/sa-novel-variant.html (accessed on 19 March 2021).
  43. Funk, T.; Pharris, A.; Spiteri, G.; Bundle, N.; Melidou, A.; Carr, M.; Gonzalez, G.; Garcia-Leon, A.; Crispie, F.; O’Connor, L.; et al. Characteristics of SARS-CoV-2 variants of concern B. 1.1. 7, B. 1.351 or P. 1: Data from seven EU/EEA countries, weeks 38/2020 to 10/2021. Eurosurveillance 2021, 26, 2100348. [Google Scholar] [CrossRef] [PubMed]
  44. Khan, R.U.; Haq, A.U.; Hussain, S.M.; Ullah, S.; Almakdi, S.; Kumar, R.; Shah, H.H.; Li, J. Analyzing and Battling The Emerging Variants Of Covid-19 Using Artificial Neural Network And Blockchain. In Proceedings of the 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), Chengdu, China, 16–18 December 2022; pp. 101–105. [Google Scholar] [CrossRef]
  45. Al-Qaness, M.A.; Ewees, A.A.; Fan, H.; Abualigah, L.; Elaziz, M.A. Marine Predators Algorithm for Forecasting Confirmed Cases of COVID-19 in Italy, USA, Iran and Korea. Int. J. Environ. Res. Public Health 2020, 17, 3520. [Google Scholar] [CrossRef] [PubMed]
  46. Alsayed, A.; Sadir, H.; Kamil, R.; Sari, H. Prediction of Epidemic Peak and Infected Cases for COVID-19 Disease in Malaysia, 2020. Int. J. Environ. Res. Public Health 2020, 17, 4076. [Google Scholar] [CrossRef]
  47. Behnood, A.; Mohammadi Golafshani, E.; Hosseini, S.M. Determinants of the infection rate of the COVID-19 in the U.S. using ANFIS and virus optimization algorithm (VOA). Chaos Solitons Fractals 2020, 139, 110051. [Google Scholar] [CrossRef]
  48. Arora, P.; Kumar, H.; Panigrahi, B.K. Prediction and analysis of COVID-19 positive cases using deep learning models: A descriptive case study of India. Chaos Solitons Fractals 2020, 139, 110017. [Google Scholar] [CrossRef]
  49. Fokas, A.S.; Dikaios, N.; Kastis, G.A. Mathematical models and deep learning for predicting the number of individuals reported to be infected with SARS-CoV-2. J. R. Soc. Interface 2020, 17, 1–12. [Google Scholar] [CrossRef]
  50. Yadav, R.S. Data analysis of COVID-2019 epidemic using machine learning methods: A case study of India. Int. J. Inf. Technol. 2020, 12, 1321–1330. [Google Scholar] [CrossRef] [PubMed]
  51. Kim, M.; Kang, J.; Kim, D.; Song, H.; Min, H.; Nam, Y.; Park, D.; Lee, J.G. Hi-COVIDNet: Deep Learning Approach to Predict Inbound COVID-19 Patients and Case Study in South Korea. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, Virtual, 6–10 July 2020; pp. 3466–3473. [Google Scholar] [CrossRef]
  52. Abbasimehr, H.; Paki, R. Prediction of COVID-19 confirmed cases combining deep learning methods and Bayesian optimization. Chaos Solitons Fractals 2021, 142, 110511. [Google Scholar] [CrossRef]
  53. Worldometer. COVID-19-Coronavirus Pandemic Weekly Trends. 2021. Available online: https://www.worldometers.info/coronavirus/weekly-trends (accessed on 10 June 2021).
  54. Hamadneh, N.N.; Tahir, M.; Khan, W.A. Using Artificial Neural Network with Prey Predator Algorithm for Prediction of the COVID-19: The Case of Brazil and Mexico. Mathematics 2021, 9, 180. [Google Scholar] [CrossRef]
  55. Cronbach, L.J. Coefficient alpha and the internal structure of tests. Psychometrika 1951, 16, 297–334. [Google Scholar] [CrossRef] [Green Version]
  56. Alshehri, M.; Panda, B.; Alazeb, A.; Halawani, H.; Al Mudawi, N.; Khan, R.U.; Lera, I.; Juiz, C. A Novel Blockchain-Based Encryption Model to Protect Fog Nodes from Behaviors of Malicious Nodes. Electronics 2021, 10, 3135. [Google Scholar] [CrossRef]
Figure 1. Flowchart diagram of our proposed model.
Figure 1. Flowchart diagram of our proposed model.
Diagnostics 12 02539 g001
Figure 2. Architecture of Artificial Neural Network.
Figure 2. Architecture of Artificial Neural Network.
Diagnostics 12 02539 g002
Figure 3. COVID-19 active cases and deaths in 60 weeks.
Figure 3. COVID-19 active cases and deaths in 60 weeks.
Diagnostics 12 02539 g003aDiagnostics 12 02539 g003b
Figure 4. Actual cases in 60 weeks and predicted cases in future weeks.
Figure 4. Actual cases in 60 weeks and predicted cases in future weeks.
Diagnostics 12 02539 g004
Figure 5. Actual deaths in 60 weeks and predicted deaths in future weeks.
Figure 5. Actual deaths in 60 weeks and predicted deaths in future weeks.
Diagnostics 12 02539 g005
Figure 6. Training, validation, test, and best observed results.
Figure 6. Training, validation, test, and best observed results.
Diagnostics 12 02539 g006
Figure 7. Best fitting for training, validation, testing, and all observed results.
Figure 7. Best fitting for training, validation, testing, and all observed results.
Diagnostics 12 02539 g007
Table 1. Some of the recent variants categorized by WHO.
Table 1. Some of the recent variants categorized by WHO.
Scientific NameName Given by the WHOSpike Protein SubstitutionsAttributes
70del, A570D,1. 50% higher spread capability
B..1.1.7Alpha69del,2. Possible enhanced severity based
(S494P),on hospital admissions and case
(E484K),mortality rates
P681H, 144del,3. Treatment with EUA monoclonal
N501Y, D614G,antibodies has no effect on
T716I, D1118H,susceptibility
S982A4. Minimal effect on recovery and
(K1191N)post-vaccination serum
neutralizing
A701V, D215G,1. 50 % higher spread capability
B.1.351BetaD614G, D80A,2. Susceptibility to a combination
E484K,of bamlanivimab and etesevimab
N501Y,monoclonal antibody treatment
K417N,was drastically lowered; however,
241delthere are other EUA monoclonal
242delantibody treatments available
243del3. Condensed neutralization by
convalescent and post-vaccination sera
D138Y, D614G,1. Susceptibility to the combination
P.1GammaE484K, H655Y,of bamlanivimab and etesevimab
K417T, L18F,monoclonal antibody treatment was
N501Y, P26Sdrastically lowered; however, there
R190S, T20N,are other EUA monoclonal antibody
T1027Itreatments available
2. Condensed neutralization by convalescent
and post-vaccination sera
T95I, G142D,1. Higher spread capability
B.1.617.2DeltaT19R, (V70F),2. Possible decrease in neutralization
R158G, (A222V),by some EUA monoclonal antibody
E156-, F157-,treatments
D614G, D950N,3. Possible decrease in neutralization
(W258L), (K417N)by post-vaccination sera
P681R, L452R,
T478K
Table 2. Variants of Interest (VOI) [27].
Table 2. Variants of Interest (VOI) [27].
Labeled by the WHOAdditional Variations in the LineageCountry First DiscoveredSpike Changes of InterestDate of First DetectionInfluence on TransmissibilityPossibility of a Negative Effect on ImmunityTransmission in Europe
EtaE484KNigeriaQ677HDecember 2020Neutralization (m) [33]Communities
D614G
B.1.525
EpsilonB.1.429,United StatesD614GSeptember 2020Ambiguous [26]Neutralization (v) [26]Inconsistent/Travels
B.1.427L452R
ThetaP.3PhilippineD614GJanuary 2021Yes (m) [34]Neutralization (m) [33]Inconsistent/Travels
E484K
P681H
N501Y
B.1.616FranceD614GFebruary 2021Recognition (c) [25]One-Time Occurrence
G669S
H655Y
V483A
KappaB.1.617.1IndiaD614GDecember 2020Yes (v) [35]Neutralization (v) [25,36]Multiple Occurrences
E484Q
L452R
P681R
B.1.620Not clearD614GFebruary 2021 Neutralization (m) [33,37]Multiple Occurrences
E484K
P681H
S477N
B.1.621ColombiaD614GJanuary 2021Yes (m) [34]Neutralization (m) [33]Inconsistent/Travels
E484K
P681H
N501Y
R346K
Table 3. Variants under observation [27].
Table 3. Variants under observation [27].
Labeled by the WHOAdditional Variations in the LineageCountry First DiscoveredSpike Changes of InterestsDate of First DetectionInfluence on TransmitabilityPossibility of a Negative Effect on ImmunityProof of Link to IntensityTransmission in Europe
B.1.617.3IndiaD614GFebruary 2021Yes (m) [34]Neutralization (m) [26,33]Not found
E484Q
L452R
P681R
B.1.214.2not clear (b)D614GDecember 2020found (a)
ins214TDR
N450K
Q414K
A.23.1+E484KUKE484KDecember 2020Neutralization (m) [33]found (a)
Q613H
V367F
A.27not clear (b)A653VDecember 2020Yes (m) [34]Neutralization (m) [26]found (a)
N501Y
L452R
H655Y
A.28not clear (b)E484KOctober 2020Neutralization (m) [33]found (a)
H655Y
N501T
C.16not clear (b)L452RDecember 2020Neutralization (m) [33]found (a)
D614G
LabmdaC.37PeruD614GDecember 2020found (a)
F490S
L452Q
B.1.351+P384LSouth AfricaA701VDecember 2020Yes (v) [39]Escape (v) [40,41]not clear  [42]found (a)
D614G
E484K
K417N
N501Y
P384L
B.1.351+E516Qnot clear (b)A701VJanuary 2021Yes (v) [39]Escape (v) [40,41]not clear [42]found (a)
D614G
E484K
E516Q
K417N
N501Y
B.1.1.7+L452RUKD614GJanuary 2021Yes (v) [34]Neutralization (m) [26]Yes (v) [43]found (a)
L452R
P681H
N501Y
B.1.1.7+S494PUKD614GJanuary 2021Yes (v) [34]Neutralization (m) [36]Yes (v) [43]found (a)
N501Y
P681H
S494P
C.36+L452REgyptD614GDecember 2020Neutralization (m) [26]found (a)
L452R
Q677H
AT.1RussiaD614GJanuary 2021Neutralization (m) [33]found (a)
E484K
ins679GIAL
N679K
IotaB.1.526USA701VDecember 2020Neutralization (m) [33]found (a)
D614G
E484K
B.1.526.1USD614GOctober 2020Neutralization (m) [26]found (a)
L452R
B.1.526.2USD614GDecember 2020found (a)
S477N
B.1.1.318not clear (b)D614GJanuary 2021Neutralization (m) [33]found (a)
E484K
P681H
ZetaP.2BrazilD614GJanuary 2021Neutralization (m) [33]found (a)
E484K
B.1.1.519MexicoD614GNovember 2020Neutralization (m) [26]found (a)
T478K
AV.1UKD614GMarch 2021Neutralization (m) [33]found (a)
E484K
P681H
N439K
P.1+P681HItalyD614GFebruary 2021not clear
H655Y
E484K
N501Y
P681H
K417T
Table 4. Case Fatality Ratios of COVID-19 [53].
Table 4. Case Fatality Ratios of COVID-19 [53].
CountriesCFR (%)
Bangladesh1.58
India1.17
China5.33
Pakistan2.25
Iran2.75
Germany2.39
Italy2.99
United Kingdom2.85
World-wide2.08
Table 5. Forecasting metric results for all active cases.
Table 5. Forecasting metric results for all active cases.
CountryAlphaMASESMAPEMAERMSE
Bangladesh0.550.4766,660.9610,128.56
China0.90.150.45205.99270.5
Germany0.253.20.3419,705.3122,740.67
India10.860.220,950.7630,570.1
Iran10.590.1414,199.1518,578.85
Italy11.330.1212,815.1715,552.92
Pakistan10.970.172418.552875.24
United Kingdom0.115.740.69135,707.55152,719.52
Table 6. Forecasting metrics results of deaths.
Table 6. Forecasting metrics results of deaths.
CountryAlphaMASESMAPEMAERMSE
Bangladesh0.255.30.58137.17180.56
China0.250.23221.0521.6
Germany10.880.13213.27313.1
India0.16.040.82508.594225.44
Iran0.90.280.0952.2864.89
Italy10.450.07176.9199.38
Pakistan0.50.90.1358.3571.94
United Kingdom05.841.042523.592861.2
Table 7. Test results of the best models for death forecasting.
Table 7. Test results of the best models for death forecasting.
CountriesBest ModelMAPEDFT p-Value *ACF1 **
Bangladesh1,1,019.22 *0.04−0.002
China5,1,1inf *0.01−0.077
Germany1,1,024.870.01−0.042
India0,2,016.24 *0.010.089
Iran0,1,313.39 *0.010.040
Italy4,1,036.170.010.010
Pakistan1,1,018.62 *0.01−0.078
UK2,1,127.590.010.029
World1,1,012.010.05−0.073
* Duckey Fuller test, alternative hypothesis: stationary; ** First-order autocorrelation function.
Table 8. Upcoming 16-week forecast for weekly deaths from COVID-19 for Bangladesh, China, Germany, India, Pakistan, Iran, Italy, the United Kingdom, and the world.
Table 8. Upcoming 16-week forecast for weekly deaths from COVID-19 for Bangladesh, China, Germany, India, Pakistan, Iran, Italy, the United Kingdom, and the world.
PakistanIran
MonthWeekPoint Forecast 95 % CI(Upper)Point Forecast 95 % CI(Upper)
1138053713671732
234261513622070
332669613382493
432077213382878
2131784213383185
231690513383447
331696213383681
4315101613383893
31315106513384089
2315111213384271
3315115613384443
4315119813384606
41315123813384762
2315127613384910
3315131313385053
4315134913385190
IndiaItaly
MonthWeekPoint Forecast 95 % CI(Upper)Point Forecast 95 % CI(Upper)
1130,50632,6397141522
231,09435,8646972310
331,68239,6647403164
432,27043,9547983910
2132,85848,6798454500
233,44653,7968714944
334,03459,2758785279
434,62265,0918725549
3135,21071,2238615787
235,79877,6558516014
336,38684,3728456241
436,97491,3618436469
4137,56298,6128446695
238,150106,1138466917
338,738113,8588487132
439,326121,8368497338
BangladeshUK
MonthWeekPoint Forecast 95 % CI(Upper)Point Forecast 95 % CI(Upper)
1119829292978
21943601222148
31924201403333
41914731454344
211915201415122
21905621335695
31905991266126
41906341226476
311906671206790
21906971217094
31907261237400
41907531257709
411907791268015
21908041268314
31908281268601
41908511268876
GermanyChina
MonthWeekPoint Forecast 95 % CI(Upper)Point Forecast 95 % CI(Upper)
1188115871100
282420781204
379825251277
478629231354
2178032781391
277735970400
377638870404
477641540404
3177544030404
277546350412
377548551427
477550631458
4177552621497
277554531522
377556361539
477558120545
World
MonthWeekPoint Forecast 95 % CI(Upper)
1173,42783,134
271,63589,025
370,76394,891
470,339100,348
2170,133105,326
270,032109,863
369,983114, 019
469,960117,857
3169,948121,430
269,942124,779
369,940127,941
469,938130,941
4169,938133,801
269,937136,539
369,937139,169
469,937141,702
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Khan, R.U.; Almakdi, S.; Alshehri, M.; Kumar, R.; Ali, I.; Hussain, S.M.; Haq, A.U.; Khan, I.; Ullah, A.; Uddin, M.I. Probabilistic Approach to COVID-19 Data Analysis and Forecasting Future Outbreaks Using a Multi-Layer Perceptron Neural Network. Diagnostics 2022, 12, 2539. https://doi.org/10.3390/diagnostics12102539

AMA Style

Khan RU, Almakdi S, Alshehri M, Kumar R, Ali I, Hussain SM, Haq AU, Khan I, Ullah A, Uddin MI. Probabilistic Approach to COVID-19 Data Analysis and Forecasting Future Outbreaks Using a Multi-Layer Perceptron Neural Network. Diagnostics. 2022; 12(10):2539. https://doi.org/10.3390/diagnostics12102539

Chicago/Turabian Style

Khan, Riaz Ullah, Sultan Almakdi, Mohammed Alshehri, Rajesh Kumar, Ikram Ali, Sardar Muhammad Hussain, Amin Ul Haq, Inayat Khan, Aman Ullah, and Muhammad Irfan Uddin. 2022. "Probabilistic Approach to COVID-19 Data Analysis and Forecasting Future Outbreaks Using a Multi-Layer Perceptron Neural Network" Diagnostics 12, no. 10: 2539. https://doi.org/10.3390/diagnostics12102539

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop