Next Article in Journal
Self-Nanoemulsifying Drug Delivery System of 2-Methoxyestradiol Exhibits Enhanced Anti-Proliferative and Pro-Apoptotic Activities in MCF-7 Breast Cancer Cells
Next Article in Special Issue
The Impact of Digital Contact Tracing Apps Overuse on Prevention of COVID-19: A Normative Activation Model Perspective
Previous Article in Journal
Immune Checkpoint Inhibitor-Induced Myocarditis vs. COVID-19 Vaccine-Induced Myocarditis—Same or Different?
Previous Article in Special Issue
Did Usage of Mental Health Apps Change during COVID-19? A Comparative Study Based on an Objective Recording of Usage Data and Demographics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Turf-Based Feature Selection Technique for Predicting Factors Affecting Human Health during Pandemic

by
Alqahtani Saeed
1,
Maryam Zaffar
2,*,
Mohammed Ali Abbas
2,
Khurrum Shehzad Quraishi
3,
Abdullah Shahrose
4,
Muhammad Irfan
5,
Mohammed Ayed Huneif
6,
Alqahtani Abdulwahab
6,
Sharifa Khalid Alduraibi
7,
Fahad Alshehri
7,
Alaa Khalid Alduraibi
7 and
Ziyad Almushayti
7
1
Department of Surgery, Faculty of Medicine, Najran University, Najran 61441, Saudi Arabia
2
Faculty of Computer Sciences, IBADAT International University, Islamabad 44000, Pakistan
3
Department of Chemical Engineering, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Islamabad 44000, Pakistan
4
Department of Computer Science, HITEC University, Taxila 47080, Pakistan
5
Electrical Engineering Department, College of Engineering, Najran University Saudi Arabia, Najran 61441, Saudi Arabia
6
Department of Pediatrics, College of Medicine, Najran University, Najran 61441, Saudi Arabia
7
Department of Radiology, College of Medicine, Qassim University, Buraidah 52571, Saudi Arabia
*
Author to whom correspondence should be addressed.
Life 2022, 12(9), 1367; https://doi.org/10.3390/life12091367
Submission received: 18 July 2022 / Revised: 22 August 2022 / Accepted: 24 August 2022 / Published: 1 September 2022
(This article belongs to the Special Issue The Digital Health in the Pandemic Era)

Abstract

:
Worldwide, COVID-19 is a highly contagious epidemic that has affected various fields. Using Artificial Intelligence (AI) and particular feature selection approaches, this study evaluates the aspects affecting the health of students throughout the COVID-19 lockdown time. The research presented in this paper plays a vital role in indicating the factor affecting the health of students during the lockdown in the COVID-19 pandemic. The research presented in this article investigates COVID-19’s impact on student health using feature selections. The Filter feature selection technique is used in the presented work to statistically analyze all the features in the dataset, and for better accuracy. ReliefF (TuRF) filter feature selection is tuned and utilized in such a way that it helps to identify the factors affecting students’ health from a benchmark dataset of students studying during COVID-19. Random Forest (RF), Gradient Boosted Decision Trees (GBDT), Support Vector Machine (SVM), and 2- layer Neural Network (NN), helps in identifying the most critical indicators for rapid intervention. Results of the approach presented in the paper identified that the students who maintained their weight and kept themselves busy in health activities in the pandemic, such student’s remained healthy through this pandemic and study from home in a positive manner. The results suggest that the 2- layer NN machine-learning algorithm showed better accuracy (90%) to predict the factors affecting on health issues of students during COVID-19 lockdown time.

1. Introduction

The COVID-19 was triggered by Sars-Cov-2 coronavirus, which was initially identified in Wuhan, China in December 2019 [1,2]. This disease spread in the whole world rapidly and has significantly affected many aspects of life including mental health, social life, supply chain, energy consumption, education, etc., [3,4]. Lockdown measures were taken by governments all over the world to impede the disperse of the disease. People all over the world were restricted to quarantine and keep social distancing to determine the number of people who have become infected [5]. Studies have shown that the lockdown during this COVID-19 had different physiological effects including anxiety, stress, confusion [6], and anger [7]. Similar effects were observed in the education domain and various educational stakeholders were affected by lockdown in COVID-19. According to a report of UNSECO [8], about 1.6 billion students faced school closure issues. Face-to-face education was replaced by e-learning. This transformed the lives of the students reducing them to their homes. Students’ mental and physical health is affected by the COVID-19 lockout situation. Different studies are taking part to figure out the reasons causing the disturbance in the students’ mental health. During COVID-19, a variety of statistical tools were used to investigate the elements that influence the students’ mental health. The results of different existing studies reported that lockdown causes depression, anxiety, mental stress and health issues in quarantined populations during COVID-19 [9]. Furthermore, social distancing and different lockdown measures during COVID-19 negatively affect the health of student [10]. The efficient execution of education depends on the health of students. As the students are the main pillar of society and the nation’s leadership and control will rely on them in the future. Therefore, it is necessary to put maximum possible effort to maintain the health of students [11].
Artificial intelligence (AI) plays a vital role in predicting coronavirus effects in the future by analyzing the covid data [12]. Different Al-based supervised and unsupervised algorithms are being employed in studies for COVID-19 predictions and analysis [13]. As the education sector is the base of every country’s development, so AI techniques help the educational stakeholders and government officials of countries all over the world to plan strategies and techniques to maintain the health of students. The focus of this study is to employ the AI-based technique for the identification of factors affecting the students’ mental health in the pandemic of COVID-19. The following are the primary contributions of the proposed work:
  • Identifying the factors affecting the health of students in the lockdown phase of COVID-19;
  • To assist the educational stakeholders in taking proactive measures for maintaining the student’s health for the duration of COVID-19;
  • Proposing AI-based identification of features affecting the student’s health during the lockdown period of COVID-19;
  • Explore AI approaches namely RF, GBDT, SVM, and NN along with Turf feature selection for selecting the optimal feature set affecting the health of students.
In this study, COVID-19 related student data will be evaluated in order to determine factors affecting students’ health during COVID-19 lockdown. The paper is structured in the subsequent pattern: a summary of related literature is provided in Section 2; the suggested AI-based strategy is described in detail in Section 3; Section 4 presents the analysis evaluation of the proposed technique; and the research and future work is summarized in Section 5.

2. Related Work

Different studies are conducted to illustrate the effects of lockdown during COVID-19. In this section, an overview of existing approaches is presented, focusing on the students’ mental health during the lockdown period in COVID-19. Different studies in different countries are conducted all over the world to analyze the health of students during the lockdown in this pandemic situation. Some of the studies are selected from the related work on the mental and physical health of the students, available on Google Scholar. Table 1 shows the reference of the papers, country in which the study was conducted, also presents the different variations in sizes of datasets collected for analyzing the factors affecting the health of students in a pandemic situation. Furthermore, Table 1 analyzes that whether the existing studies are utilizing AI techniques or not. In the end, Table 1 also presents what is the conclusion of the recent studies regarding the factors affecting the mental health of students.
The paper presents 16 most relevant literature on student’s health in COVID-19. Recent studies are evaluated on 5 different parameters: the country through which dataset is taken, study level of student’s understudy, number of students in the dataset, utilization of machine learning technique for identification of factors affecting student’s health, and, lastly, the factor identified by the existing studies that may affect the health of students all through lockdown phase of COVID-19. Different the different levels and sizes of students with varying datasets sizes. The studies focus on graduate, undergraduate, college, public schools, and medical and forestry students of different countries. The recent literature indicates that there is so much gap in studies regarding machine learning utilization for analyzing the mental health of students. Different factors come across while analyzing the existing literature on the students’ mental health. Mainly, the following factors were found to be very crucial in association with the students’ mental health in the COVID-19 pandemic.
  • Loneliness [6];
  • The feeling of isolation [13];
  • Fear of academic year loss;
  • Availability of space for studies;
  • Family functioning;
  • Females have more mental health issues than male students during COVID-19;
  • Fear of own health;
  • Fear of dear one’s health;
  • Poverty;
  • Student Counseling.
Different factors are found in the literature that has an association with the mental health of students. These factors will help the educational admiration to take measures for maintaining the health of students during COVID-19. Different remote techniques and activities should be planned by educational stakeholders to minimize the anxiety of students during the lockdown period of the pandemic. However, as the health of students is an important concern so there is a need for deep insight into the data of students during such pandemic situations. However, the need for AI algorithms is still there for a better insight into data and its analysis. Main shortcomings in recent studies regarding the health of students in COVID-19 are still required to address, some of the shortcomings found in the literature that may help the educational stakeholders to build educational strategies. Firstly, there is a need to the utilization of feature selection techniques to identify the features affection health of students during the lockdown in COVID-19. To our knowledge, there has never been a study that conducted a comprehensive literature analysis and identified factors affecting the health of kids during COVID-19’s lockdown period based on feature selection, whereas [28] has presented and utilized AI, but did not consider feature selection. In the coming sections of this article, we will discuss our novel proposed approach for the analysis of factors affecting the health of students in COVID-19.

3. Methods and Materials

In this section proposed approach for identifying the factors affecting the health of students is presented. As it is very important to figure out that what are factors affecting the health of Figure 1 presents the main flow of the proposed approach main steps of the proposed approach is as follows:
  • Dataset Selection;
  • Dataset Cleaning;
  • Feature Selection;
  • Machine learning algorithm.
Each of the steps is explained further in detail in the coming subsections.

3.1. Student Dataset in COVID-19

A benchmark dataset of 1182 students in COVID-19 [29] is utilized to analyze the factors affecting the health of students in the lockdown period of COVID-19. The dataset is freely available and, hence, utilized easily for research purpose. Table 2 describes the main properties of the student dataset.

3.2. Data Preprocessing

Python programming language platform is utilized for coding the proposed approach, and its various libraries like NumPy, pandas for better insight of data [30]. Different steps are taken to preprocess the imbalanced dataset, firstly by scaling and data cleaning by deleting ids, dropping duplicating rows, and filling all NA values. Moreover, categorical features are mapped to numbers. Furthermore, to convert the text features like (stress buster, what you miss most), pretrained bert is utilized for generating word vectors. Then words are mapped to a single feature by following the normalization formula as:
x = s u m ( v e c t o r ) m a x ( v e c t o r ) m i n ( v e c t o r ) .  
Figure 2 and Figure 3 represents variable count after and before sampling, whereas SMOTE (Synthetic minority oversampling technique) addresses imbalance class issues very effectively in various domains of research [31]. SMOTE oversampling technique is applied to resample student’s datasets for COVID-19. Based on feature space similarity, the SMOTE approach combines extra minority samples [32]. Let k = nearest neighbor for xi using Euclidean distance.
Random Selection of k nearest neighbor
Feature vector difference between k and xi
Adding M in xi
Equation (2) presents the formula for calculating SMOTE
This is example 2 of an equation:
x n e w   = x i ( x i k x i ) × δ .  
x i k = A nearest neighbors of xi, and δ is an arbitrary value belongs to (0, 1).

3.3. Feature Selection

Feature selection is a process to obtain an optimal set of features, to obtain better classification accuracy. There are different types of feature selection algorithm filter and wrapper feature selection. Filter feature selection is high in speed [33] and consumes less time, and is the main reason for selecting filter feature selection in our proposed approach. Filter feature selection is further divided into two types, univariant and multivariant filter feature selection methods. The univariant filter feature ignores the features dependencies and that leads to a poor selection of feature set [34], whereas multivariant feature selection takes consideration of feature dependencies while selecting the feature set [35]. Turf is the tuned form of Relief multivariant filter feature selection. When selecting relief features, feature dependencies are taken utilizing the full feature vector, which may ignore the noisy features, so that Turf feature selection step by step low-quality features, hence, generating optimal feature set [36]. The Turf algorithm is presented in Algorithm 1.
Algorithm 1. TuRF algorithm [36].
a = features in dataset
Let p = iterations
For                   i:= 1 to p do
Estimation of feature weights through ReliefF
Features sorting through weight
+
+   remove p/a of outstanding features with smallest weights
end for
return                  final ReliefF weight estimations for outstanding features

3.4. Machine Learning Algorithms

After the selection of features, classification is performed. SVM (Support Vector Machine) is a classifier for binary classification of data. The hyperplane is used to solve the learning problem in SVM. A robust method with different kernel values is considered one of the best classifiers for classification [37]. RF (Random Forest) utilized various trees to predict. It is being utilized by different research areas of research with remarkable results. RF produces high classification accuracy with an even dataset with a large number of features. It handles unbalanced data by accessing important features. Whereas GBDT (Gradient Boosting Decision Tree) is selected due to its property of selecting fewer parameters as compared to the other classification algorithms. In existing research, in machine learning, GBDT shows tremendous results. It is based on the CART algorithm. GBDT merges the concept of regression and boosting tree and intends the use of residual gradient to optimize the assimilation process of regression tree [38]. ANN (Artificial Neural Network) is a popular classification technique utilized in different areas of research like agriculture, medical, security, education, business, art, etc. It is very easy to use and can manage complex data [39]. Moreover, the performance of the proposed approach presented in this paper is evaluated through accuracy, precision, recall, and f-measure, whereas accuracy is defined as the predicted observations over a total number of observations [40,41,42]. Precision is the fraction of the recovered instances that belong to the target class, whereas F-measure is the harmonic mean of precision and recall. Equations (3)–(6) presents the formula of evaluation parameters, whereas TP, FN, and FP stand for true positive, false negative, and false-positive respectively.
Accuracy = T P + F N T P + F N + F P + F N
Precision = T P T P + F P
Recall = T P T P + F N
F - Measure = 2 ( P r e c i s i o n × R e c a l l ) P r e c i s i o n + R e c a l l
whereas Table 3 presents the Parameters of classification algorithm utilized in proposed work.

4. Results

Results of the proposed approach for the identification of factors affecting the health of students in COVID-19 will be discussed in detail in this section. Figure 4 explains the proposed method in detail with results. The results show that the dataset of a feature vector of 16 features is balanced through applying SMOTE technique. The health of students is taken as a target feature, and the Turf feature selection technique is utilized to detect the factors influencing the health of students. Different classification algorithms are applied to the selected feature datasets of student’s health during COVID-19. The performance of the suggested method was assessed using accuracy, precision, recall, and f-measure assessment metrics.
The result s shows that the student who utilized their time during lockdown period in COVID-19 in different activities remain healthy. Utilization of time appears as the main factor affecting the health of students. The academic organization may keep that factor in front and must plan activities, guide, and motivate students to participate in some indoor actives in such a way that maintains their health. Emotional attachment of students with family members also affects the health of students, as the fear of any family loss due to COVID-19 affects the health of students. Moreover, change in the weight of students during COVID-19 also affects the health of students. Figure 5 presents the results of four classifiers, GBDT, RF, SVM, and NN, on students COVID-19 dataset, whereas the accuracy describes the number of healthy students correctly classified by proposed work over a total number of students. Results show that a Neural network (NN) outperforms other existing classification algorithms in terms of accuracy. However, GBDT also performs well on students COVID-19 dataset and showed around 87% of accuracy. Equation (7) presents the accuracy formula for the student COVID-19 dataset.
A c c u r a c y = N u m b e r   o f s t u d e n t s c o r r e c l y   c l a s s i f i e d   T o t a l   n u m b e r   o f   s t u d e n t s
Figure 6 presents the performance evaluation of the proposed work in terms of precision, whereas precision calculates the number of healthy students in the COVID-19 student dataset correctly classified by proposed work divided by the total number of healthy students in the COVID-19 dataset, classified by the proposed approach. Results show that neural network performs better than other classification algorithms. Equation (8) presents the formula of precision for calculation precision of proposed approach on student COVID-19 dataset.
P r e c i s i o n = N u m b e r   o f   h e a l t h y   s t u d e n t s   i d e n t i f i e d   b y   p r o p s e d   a p p r o a c h T o t a l   n u m b e r   o f   h e a l t h   a n d   u n h e a l t h y   s t u d e n t s     c l a s s i f i e d   b y   p r o p o s e d   a p p r o a c h
The results in Figure 7 show the performance evaluation of the proposed work in terms of recall. The recall is the calculation of a total number of healthy students in the COVID-19 student dataset classified by the proposed approach divided by the total number of healthy students in the COVID-19 student dataset. The results show that the GBDT classifier outperforms other classifiers in recall performance evaluation measures. Furthermore, RF and NN also show better performance. Equation (9) presents the formula for calculating the recall for evaluating proposed approach on students COVID-19 students.
This is example 2 of an equation:
R e c a l l = N u m b e r   o f   h e a l t h y   s t u d e n t s c l a s s f i e d   b y   t h e   p r o p o s e d   a p p r o a c h T o t a l   n u m b e r   o f   h e a l t h y   s t u d e n t s
Figure 8 presents a comparison of the performance of four classifiers in terms F-measure performance evaluation measure, whereas the f-measure of the proposed approach considers precision and recall both, presented already in Equation (6). Results show that GBDT and NN give better performance on the proposed work on the COVID-19 student dataset in terms of F-measure.

5. Conclusions

COVID-19 affects every field of life, the educational sector all over the world faces different issues. During the lockdown, students face a lot of issues, whereas health issue becomes the main issue. Results presented in the proposed approach identifies the main factors affecting the health of students during the lockdown. Results show that the health of students affects the factors that how they utilized their time during the lockdown in COVID-19, whereas weight and family concerns also appear as factors affecting the health of students during a lockdown of COVID-19. Henceforth, there is a need to take proactive measures to discover the approaches to sustain the health of students, either by guiding them in health time utilization activities or by counseling them about family matters. These well-timed taken measures may reduce the health issues in students caused by pandemic situation in COVID-19. Moreover, reported results in this paper `show that neural network outperforms and shows 90% accuracy on the proposed approach as compared to GBDT, RF, and SVM.

Author Contributions

M.Z., A.S. (Alqahtani Saeed), M.A.A. and K.S.Q. performed data curation, methodology, visualization, software, investigation, simulations, analysis, and writing the manuscript. A.S. (Abdullah Shahrose), M.I., A.K.A., S.K.A., Z.A., M.A.H., A.A. and F.A. were responsible for conceptualization, editing, investigation, resource management, project management, analysis, comparisons, and visualization. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to express their gratitude to the Ministry of Education and the Deanship of Scientific Research, Najran University, Kingdom of Saudi Arabia, for their financial and technical support under code number NU/MID/18/031.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

The authors would like to express their gratitude to the Ministry of Education and the Deanship of Scientific Research, Najran University. Kingdom of Saudi Arabia, for their financial and technical support under code number NU/MID/18/031.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, Q.; Med, M.; Guan, X.; Wu, P.; Wang, X. Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. N. Engl. J. Med. 2020, 382, 1199–1209. [Google Scholar] [CrossRef] [PubMed]
  2. Wei, J.T.; Liu, Y.X.; Zhu, Y.C.; Qian, J.; Ye, R.Z. Impacts of transportation and meteorological factors on the transmission of COVID-19. Int. J. Hyg. Environ. Health 2020, 230, 113610. [Google Scholar] [CrossRef] [PubMed]
  3. Rahman, M.; Paul, K.C.; Hossian, A.; Ali, G.G.N.; Rahman, S. Machine Learning on the COVID-19 Pandemic, Human Mobility and Air Quality: A Review. IEEE Access 2021, 9, 72420–72450. [Google Scholar] [CrossRef] [PubMed]
  4. Silva, P.C.L.; Batista, P.V.C.; Lima, H.S.; Alves, M.A.; Guimarães, F.G. COVID-ABS: An agent-based model of COVID-19 epidemic to simulate health and economic effects of social distancing interventions. Chaos Solitons Fractals 2020, 139, 110088. [Google Scholar] [CrossRef]
  5. Shafiq, S.; Nipa, S.N.; Sultana, S.; Rahman, M.R.U.; Rahman, M. Exploring the triggering factors for mental stress of university students amid COVID-19 in Bangladesh: A perception-based study. Child. Youth Serv. Rev. 2021, 120, 105789. [Google Scholar] [CrossRef]
  6. Azlan, C.A.; Wong, J.H.D.; Tan, L.K.; Huri, M.H.N.A.D.; Ung, N.M. Teaching and learning of postgraduate medical physics using Internet-based e-learning during the COVID-19 pandemic–A case study from Malaysia. Phys. Med. 2020, 80, 10–16. [Google Scholar] [CrossRef]
  7. Karasmanaki, E.; Tsantopoulos, G. Impacts of social distancing during COVID-19 pandemic on the daily life of forestry students. Child. Youth Serv. Rev. 2021, 120, 105781. [Google Scholar] [CrossRef]
  8. d’Orville, H. COVID-19 causes unprecedented educational disruption: Is there a road towards a new normal? Prospects 2020, 49, 11–15. [Google Scholar] [CrossRef]
  9. Amerio, A.; Brambilla, A.; Morganti, A.; Aguglia, A.; Bianchi, D. COVID-19 lockdown: Housing built environment’s effects on mental health. Int. J. Environ. Res. Public Health 2020, 17, 5973. [Google Scholar] [CrossRef]
  10. Elmer, T.; Mepham, K.; Stadfeld, C. Students under lockdown: Comparisons of students’ social networks and mental health before and during the COVID-19 crisis in Switzerland. PLoS ONE 2020, 15, e0236337. [Google Scholar]
  11. Andriningrum, H.; Gunawan, I. Cultivatation of Healthy Life for Students in School: A Literature Review. In Proceedings of the International Conference on Education and Technology (ICET 2018), Malang, Indonesia, 26–27 October 2018. [Google Scholar]
  12. Vaishya, R.; Javiad, M.; Khan, I.H.; Haleem, A. Artificial Intelligence (AI) applications for COVID-19 pandemic. Diabetes Metab. Syndr. Clin. Res. Rev. 2020, 14, 337–339. [Google Scholar] [CrossRef] [PubMed]
  13. Hussain, A.A.; Bouachir, O.; Turjman, F.A.; Aloqaily, M. AI techniques for COVID-19. IEEE Access 2020, 8, 128776–128795. [Google Scholar] [CrossRef]
  14. Rzymski, P.; Nowicki, M. COVID-19-related prejudice toward Asian medical students: A consequence of SARS-CoV-2 fears in Poland. J. Infect. Public Health 2020, 13, 873–876. [Google Scholar] [CrossRef]
  15. Hasan, N.; Bao, Y. Impact of “e-Learning crack-up” perception on psychological distress among college students during COVID-19 pandemic: A mediating role of “fear of academic year loss”. Child. Youth Serv. Rev. 2020, 118, 105355. [Google Scholar] [CrossRef] [PubMed]
  16. Llieva, G.; Yankova, T.; Belcheva, S.K.; Lvanova, S. Effects of COVID-19 Pandemic on University Students’ Learning. Information 2021, 12, 163. [Google Scholar]
  17. Wang, C.; Zhao, H.; Zhang, H. Chinese College Students Have Higher Anxiety in New Semester of Online Learning During COVID-19: A Machine Learning Approach. Front. Psychol. 2020, 11, 3465. [Google Scholar] [CrossRef]
  18. Pan, Y.; Yang, Z.; Han, H.; Qi, S. Family functioning and mental health among secondary vocational students during the COVID-19 epidemic: A moderated mediation model. Personal. Individ. Differ. 2021, 171, 110490. [Google Scholar] [CrossRef]
  19. Cleofas, J.V.; Rocha, I.C.N. Demographic, gadget and internet profiles as determinants of disease and consequence related COVID-19 anxiety among Filipino college students. Educ. Inf. Technol. 2021, 26, 6771–6786. [Google Scholar] [CrossRef]
  20. Khattar, A.; Jain, P.R.; Quadri, S.M.K. Effects of the disastrous pandemic COVID 19 on learning styles, activities and mental health of young Indian students-a machine learning approach. In Proceedings of the 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 13–15 May 2020. [Google Scholar]
  21. Khasawneh, A.I.; Humeidan, A.A.; Alsulaiman, J.W.; Bloukh, S.; Ramadan, M. Medical students and COVID-19: Knowledge, attitudes, and precautionary measures. A descriptive study from Jordan. Front. Public Health 2020, 8, 258. [Google Scholar] [CrossRef]
  22. Baloch, G.M.; Sundarasen, S.; Chinna, K.; Nurunnabi, M.; Kamaludin, K. COVID-19: Exploring impacts of the pandemic and lockdown on mental health of Pakistani students. PeerJ 2021, 9, e10612. [Google Scholar] [CrossRef]
  23. Son, C.; Hegde, S.; Smith, A.; Wang, X.; Sasangohar, F. Effects of COVID-19 on college students’ mental health in the United States: Interview survey study. J. Med. Internet Res. 2020, 22, e21279. [Google Scholar] [CrossRef] [PubMed]
  24. Saddik, B.; Hussein, A.; Askari, F.S.S.; Kheder, W.; Temsah, M.H. Increased levels of anxiety among medical and non-medical university students during the COVID-19 pandemic in the United Arab Emirates. Risk Manag. Healthc. Policy 2020, 13, 2395. [Google Scholar] [CrossRef] [PubMed]
  25. Khoshaim, H.B.; Sukayt, A.A.; Chinna, K.; Nurunnabi, M.; Sundarasen, S. Anxiety Level of University Students During COVID-19 in Saudi Arabia. Front. Psychiatry 2020, 11, 1397. [Google Scholar] [CrossRef] [PubMed]
  26. Calder, N.; Jafri, M.; Guo, L. Mathematics Education Students’ Experiences during Lockdown: Managing Collaboration in eLearning. Educ. Sci. 2021, 11, 191. [Google Scholar] [CrossRef]
  27. Ranjbar, K.; Hosseinpour, H.; Shahriarirad, R.; Ghaem, H.; Jafari, K. Students’ attitude and sleep pattern during school closure following COVID-19 pandemic quarantine: A web-based survey in south of Iran. Environ. Health Prev. Med. 2021, 26, 1–10. [Google Scholar] [CrossRef]
  28. Yuduang, N.; Ong, A.K.; Vista, N.B.; Prasetyo, Y.T.; Nadlifatin, R.; Persada, S.F.; Gumasing, M.J.; German, J.D.; Robas, K.P.; Chuenyindee, T.; et al. Utilizing structural equation modeling–artificial neural network hybrid approach in determining factors affecting perceived usability of mobile mental health application in the Philippines. Int. J. Environ. Res. Public Health 2022, 19, 6732. [Google Scholar] [CrossRef]
  29. Chaturvedi, K.; Vishwakarma, K.D.; Singh, N. COVID-19 and its impact on education, social life and mental health of students: A survey. Child. Youth Serv. Rev. 2021, 121, 105866. [Google Scholar] [CrossRef]
  30. McKinney. W. Why python for data analysis? In Book Python for Data Analysis: Data Wrangling with Pandas, NumPy, and Ipython; O’ Reilly Media, Inc.: Sebastopol, CA, USA, 2012; Volume 1, pp. 1–17. [Google Scholar]
  31. Byeon, H. Predicting the depression of the South Korean elderly using SMOTE and an imbalanced binary dataset. Int. J. Adv. Comput. Sci. Appl. 2021, 12, 74–79. [Google Scholar] [CrossRef]
  32. Shen, F.; Zhao, X.; Kou, G.; Alsaadi, F.E. A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique. Appl. Soft Comput. 2021, 98, 106852. [Google Scholar] [CrossRef]
  33. Wah, Y.B.; Ibrahim, N.; Hamid, H.A.; Rahman, S.A.; Fong, S. Feature Selection Methods: Case of Filter and Wrapper Approaches for Maximising Classification Accuracy. Pertanika J. Sci. Technol. 2018, 26, 329–340. [Google Scholar]
  34. Yusta, S.C. Different metaheuristic strategies to solve the feature selection problem. Pattern Recognit. Lett. 2009, 30, 525–534. [Google Scholar] [CrossRef]
  35. Moore, J.H.; White, B.C. Tuning Relief, F for genome-wide genetic analysis. In Proceedings of the European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, Valencia, Spain, 11–13 April 2007. [Google Scholar]
  36. Urbanowicz, R.J.; Meeker, M.; Cava, W.L.; Olson, R.S.; Moore, J.H. Relief-based feature selection: Introduction and review. J. Biomed. Inform. 2018, 85, 189–203. [Google Scholar] [CrossRef] [PubMed]
  37. Sun, J.; Zhang, R.; Chen, M.; Chen, B.; Wang, X. Identification of Porosity and Permeability While Drilling Based on Machine Learning. Arab. J. Sci. Eng. 2021, 46, 7031–7045. [Google Scholar] [CrossRef]
  38. Kostopoulos, G.; Kotsiantis, S.; Pintelas, P. Estimating student dropout in distance higher education using semi-supervised techniques. In Proceedings of the 19th Panhellenic Conference on Informatic, Athens, Greece, 1–3 October 2015. [Google Scholar]
  39. Chuenyindee, T.; Buaphiban, T. Utilization of random forest classifier and artificial neural network for predicting factors influencing the perceived usability of COVID-19 contact tracing “Morchana” in Thailand. Int. J. Environ. Res. Public Health 2022, 19, 7979. [Google Scholar]
  40. Velazquez, R.M.; Tobon, D.P.; Sanchez, A.; Saddik, A.E.; Petriu, E. A Machine Learning Approach as an Aid for Early COVID-19 Detection. Sensor 2021, 21, 4202. [Google Scholar] [CrossRef]
  41. Almalki, Y.E.; Qayyum, A.; Irfan, M.; Haider, N.; Glowacz, A. A Novel Method for COVID-19 Diagnosis Using Artificial Intelligence in Chest X-ray Images of the article. Healthc. J. 2021, 9, 522. [Google Scholar] [CrossRef]
  42. Irfan, M.; Iftikhar, M.A.; Yasin, S.; Draz, U.; Ali, T.; Hussain, S.; Bukhari, S.; Alwadie, A.S.; Rahman, S.; Glowacz, A.; et al. Role of Hybrid Deep Neural Networks (HDNNs), Computed Tomography, and Chest X-rays for the Detection of COVID-19. Int. J. Environ. Res. Public Health 2021, 18, 3056. [Google Scholar] [CrossRef]
Figure 1. Proposed flow AI-Based feature selection of factors affecting the health of students in COVID-19.
Figure 1. Proposed flow AI-Based feature selection of factors affecting the health of students in COVID-19.
Life 12 01367 g001
Figure 2. Variable count before sampling.
Figure 2. Variable count before sampling.
Life 12 01367 g002
Figure 3. Variable count after sampling.
Figure 3. Variable count after sampling.
Life 12 01367 g003
Figure 4. Some feature selection process of factors affecting the student’s health during COVID-19.
Figure 4. Some feature selection process of factors affecting the student’s health during COVID-19.
Life 12 01367 g004
Figure 5. Comparison of accuracy of proposed COVID-19 approach.
Figure 5. Comparison of accuracy of proposed COVID-19 approach.
Life 12 01367 g005
Figure 6. Comparison of precision of proposed COVID-19 approach.
Figure 6. Comparison of precision of proposed COVID-19 approach.
Life 12 01367 g006
Figure 7. This Comparison of recall of proposed COVID-19 approach.
Figure 7. This Comparison of recall of proposed COVID-19 approach.
Life 12 01367 g007
Figure 8. This Comparison of f-measure of proposed COVID-19 approach.
Figure 8. This Comparison of f-measure of proposed COVID-19 approach.
Life 12 01367 g008
Table 1. Analysis of studies on student’s health during COVID-19 in different countries.
Table 1. Analysis of studies on student’s health during COVID-19 in different countries.
ReferenceCountryLevel of StudentsSize of DatasetMachine LearningEffect on Mental and Physical Health in COVID-19
[6]MalaysiaPostgraduate-NoLoneliness, anxiety, stress, and depression.
[14]Asian students in PolandMedical students85NoFeeling of isolation to students who live abroad.
[15]BangladeshCollege Students400NoIn COVID-19, perceptions of e-Learning failure and fear about academic year failure were connected with psychological distress.
[16]BulgariaGraduate and undergraduate students134YesAvailability of separate rooms for students affects their education.
[17]ChinaNon-graduating undergraduate students1172YES
XGBOOST
School closure, Social distancing or Isolation, and Online learning are the reason for anxiety.
[18]ChinaSecondary vocational students5783NoGood family functioning can positively affect the mental health of students.
[19]PhilippinesCollege Students952NoSocioeconomic gaps and the digital divide affect the mental health of students.
[20]IndiaUndergraduate and post-graduate516YesUncertainty regarding examination affects the mental health of students.
[21]JordanMedical Student1404NoStudents focus on strategies to prevent covid.
[22]PakistanHigher Educational Institutions494NoUnaffordability of digital devices and the internet.
[23]USA (United States of America)University Students195NoFear of own health and dear one’s health affects the mental health of students.
[24]UAE (United Arab Emirates)Medical and non-medical students1485NoFear of the unknown might affect the mental health of students so that students must be aware of the COVID-19.
[25]Saudi ArabiaUniversity students400YesFemales and fourth-semester students face anxiety during COVID-19.
[26]New ZealandMater level
Graduate-level
Teaching degree
(Mathematics education learning)
3NoTeachers help in the transition of a new way of learning that affects students.
[27]GreeceUndergraduate forestry students181NoStudents must be counseled properly to control negative emotions during the lockdown.
[7]IranPublic school students20,697NoBehavioral and socializing changes during COVID-19 affects mental health.
Table 2. Student COVID-19 dataset description.
Table 2. Student COVID-19 dataset description.
Number of Students1182
Number of features19
FeaturesId of the student, home location of students, Student_age, Time consumed _online Class, Rating of Online Class experience, Instruction medium for an online class, Time consumed_ self-study, Time consumed_ fitness, sleeping_ time, Time consumed_ social media, preferred social media platform, Time consumed_ TV, meals _per day, changes _weight, Health issue_ lockdown, Stressbusters, Utilization _time, what you miss the most
Target featureHealth issue during lockdown
Number of classes2
Table 3. Parameters of classification algorithm utilized in proposed work.
Table 3. Parameters of classification algorithm utilized in proposed work.
SRNameParameters
1GBDTn_estimators = 19, learning_rate = 0.3, max_depth = 7, random_state = 0
2SVMSVC(C = 5, break_ties = False, cache_size = 200, class_weight = Balanced, degree = 3, gamma = 11, kernel = ‘rbf’)
3Random Forestbootstrap = True, criterion = ‘gini’, max_depth = 15, max_features = ‘auto’,
min_samples_leaf = 1, min_samples_split = 2,
n_estimators = 20
4NN-2 LayersMomentum0.9, learning rate = 0.003, layers = 2, drop_out = 0.1, optimizer = adam, Loss Binary Class
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Saeed, A.; Zaffar, M.; Abbas, M.A.; Quraishi, K.S.; Shahrose, A.; Irfan, M.; Huneif, M.A.; Abdulwahab, A.; Alduraibi, S.K.; Alshehri, F.; et al. A Turf-Based Feature Selection Technique for Predicting Factors Affecting Human Health during Pandemic. Life 2022, 12, 1367. https://doi.org/10.3390/life12091367

AMA Style

Saeed A, Zaffar M, Abbas MA, Quraishi KS, Shahrose A, Irfan M, Huneif MA, Abdulwahab A, Alduraibi SK, Alshehri F, et al. A Turf-Based Feature Selection Technique for Predicting Factors Affecting Human Health during Pandemic. Life. 2022; 12(9):1367. https://doi.org/10.3390/life12091367

Chicago/Turabian Style

Saeed, Alqahtani, Maryam Zaffar, Mohammed Ali Abbas, Khurrum Shehzad Quraishi, Abdullah Shahrose, Muhammad Irfan, Mohammed Ayed Huneif, Alqahtani Abdulwahab, Sharifa Khalid Alduraibi, Fahad Alshehri, and et al. 2022. "A Turf-Based Feature Selection Technique for Predicting Factors Affecting Human Health during Pandemic" Life 12, no. 9: 1367. https://doi.org/10.3390/life12091367

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop