Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST

Deng, Jia; Fu, Yan; Liu, Qi; Chang, Le; Li, Haibo; Liu, Shenglin

doi:10.3390/diagnostics12102538

Open AccessArticle

Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST

by

Jia Deng

¹,

Yan Fu

^1,*

,

Qi Liu

¹,

Le Chang

²,

Haibo Li

³ and

Shenglin Liu

⁴

¹

School of Mechanical Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China

²

York Region Secondary Virtual School, York Region, Markham, ON L3R 3Y3, Canada

³

Shenzhen Rehabilitation & Aiding Devices Industry Association, Shenzhen 518000, China

⁴

Wuhan Union Hospital, Wuhan 430022, China

^*

Author to whom correspondence should be addressed.

Diagnostics 2022, 12(10), 2538; https://doi.org/10.3390/diagnostics12102538

Submission received: 13 September 2022 / Revised: 5 October 2022 / Accepted: 8 October 2022 / Published: 19 October 2022

(This article belongs to the Special Issue Machine Learning in Signal and Image Analysis for Biomedical Application)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Objective: Among various assessment paradigms, the cardiopulmonary exercise test (CPET) provides rich evidence as part of the cardiopulmonary endurance (CPE) assessment. However, methods and strategies for interpreting CPET results are not in agreement. The purpose of this study is to validate the possibility of using machine learning to evaluate CPET data for automatically classifying the CPE level of workers in high-latitude areas. Methods: A total of 120 eligible workers were selected for this cardiopulmonary exercise experiment, and the physiological data and completion of the experiment were recorded in the simulated high-latitude workplace, within which 84 sets of data were used for XGBOOST model training and36 were used for the model validation. The model performance was compared with Support Vector Machine and Random Forest. Furthermore, hyperparameter optimization was applied to the XGBOOST model by using a genetic algorithm. Results: The model was verified by the method of tenfold cross validation; the correct rate was 0.861, with a Micro-F1 Score of 0.864. Compared with RF and SVM, all data achieved a better performance. Conclusion: With a relatively small number of training samples, the GA-XGBOOST model fits well with the training set data, which can effectively evaluate the CPE level of subjects, and is expected to provide automatic CPE evaluation for selecting, training, and protecting the working population in plateau areas.

Keywords:

CPET; XG-BOOST; cardiopulmonary assessment

1. Introduction

In high-altitude areas, people’s physical condition, training, and skills should be taken into special consideration [1]. Cardiopulmonary Endurance (CPE) is a core component of evaluation criteria and an important index in considering job placement, training and occupational risk in high-latitude workplaces [2]. Cardio Pulmonary Exercise Testing (CPET) is a cardiological test that measures the heart’s ability to respond to external stress in a controlled clinical environment [3]. Considering the paradigmatic changes in research, CPET provides more accurate and extensive experimental data, compared with traditional research methods such as walking or running tests. CPET also improves the accuracy of subsequent data processing and analysis. It has become an important clinical tool for assessing exercise capacity and predicting a prognosis in patients with heart failure and other cardiac diseases [4]. CPET provides a wealth of data for CPE evaluation. The most widely used parameters are maximal oxygen uptake (VO2max) and anaerobic threshold (AT) oxygen uptake. VO2max is the maximum oxygen uptake of the human body, which defines the maximum energy that can be obtained through aerobic metabolism during the maximum exercise period [4]. Studies have shown that VO2max can reflect the maximum exercise limit of the cardiopulmonary system and is a key predictor of cardiopulmonary function [5,6,7]. The anaerobic threshold (AT) is defined as the highest level of sustained activity at which the measured oxygen uptake accounts for the entire energy requirement [8]. AT has been shown to have a high correlation with exercise endurance, and has numerous clinical applications, such as prediction [9,10,11,12]. Exercise economy is used to define the energy consumption under a certain absolute exercise intensity, and VO2 represents the energy consumption of a certain physical labor or power output [13]. Carriere et al. showed that the identification of AT and RCP has a potential role in prognostic stratification of HFrEF [14]. Obviously, different parameters reflect various features of CPE, which are hard to be replaceable for each other; thus, it is widely accepted that multi-parameters should be collected for a more accurate cardiopulmonary endurance assessment. Traditionally, the multi-parameter interpretation has, nevertheless, placed more pressure on the expertise of professionals to interpret the data and draw a conclusive assessment [15]. Stringer used VO2max, ventilation, work rate and a series of data to study the cardiopulmonary function in patients with COPD [16]. Wagner et al. used VO2max, V-VCO slope and EOV to analyze and manage heart failure with reduced ejection fraction [17]. Remington et al. collected variables including VO2max, VE/VCO2 slope, VTand O2 pulse to study aortic stiffness [18]. These CPET interpretations are based on individual experts’ subjective analysis, which varied the results across subjects and testing scenarios. Moreover, clinical studies intended to set up universally applicable guidelines; however, the systematic review conducted by the Canadian Cardiovascular Society Guidelines identified that those guidelines are of “low” quality due to various factors such as small sample size, single-center design, heterogeneous patient populations, variable CPET methodology and outcome measurements [19]. This may lead to the underuse of CPET, despite its advantages in providing rich, useful information with noninvasive test procedures.

Researchers have attempted to apply machine learning to interpret CPET data, achieving suggestive results. Leopold et al. developed a greedy heuristic algorithm based on feature clustering to study the ability of CPET to predict the anaerobic mechanical power outputs [20]. Braccioni et al. used a random forest algorithm to analyze the relationship between symptoms and cardiopulmonary parameters of lung transplant recipients; this was based on incremental CPET [21]. Inbar et al. combined SVM with CPET data and obtained a prediction model with quite an accurate overall prediction ability [22]. Li et al. proposed a sequential clustering method to identify elite athletes based on CPET data including oxygen consumption, carbon dioxide, heart rate, and stroke output, but without good performance [23]. Gratiela et al. proposed a fuzzy evaluation method based on multiple physiological parameters such as VO2max, SV and HR to evaluate CPE levels of athletes and non-athletes with an accuracy of about 90% [24]. Chen et al. proposed a congestive force failure (CHF) detection model based on decision tree and support vector machine [25]. Unfortunately, the above work only selected some special values of cardiopulmonary metabolic indicators based on a limited dataset. Furthermore, due to the black-box principle of machine learning, the lack of interpretability seriously affected its wider application in the medical field, in which it is critical for users to understand and trust the logic of the corresponding decision-making [26].

XGBOOST, known in full as extreme Gradient Boosting, shares a fundamental idea with GBDT, which is to grow a tree through constant feature splitting. Compared with the two traditional machine-learning algorithms, SVM and RF, XGBOOST achieves higher accuracy and a higher Macro-F1 Score, and can improve the generalization ability and robustness of a single learner by learning from the residual of the previous tree; thus, it has a better capability of interpretability. Song et al. used XGBOOST to improve a steel property optimization model [27]. Pan used the XGBOOST algorithm to predict hourly PM2.5 concentration [28]. Ogunleye built an XGBOOST model to diagnose chronic kidney disease [29]. The purpose of this study is to build a CPET evaluation model based on XGBOOST and the genetic algorithm (GA), to achieve accurate and explanatory evaluation results based on high-noise and large-volume of data collected from CPET.

2. Materials and Methods

2.1. Participants

A total of 120 young men (21 ± 3.5 yrs.) with good physical fitness were selected for this experiment. The healthy male subjects were within the height range of 174 ± 3.51 cm, weight range of 65.2 ± 8.44 and BMI range of 21.55 ± 2.90 kg/m². Before the experiment, physical screenings were conducted on the subjects, to ensure that there were no signs of the following diseases: cardiovascular disease; bone disease and metabolic system disease. All subjects were instructed to avoid alcohol, to exercise vigorously, and to obtain sufficient sleep for the first 24 h of the experiment. All subjects signed the consent and notified about the potential risks in the experiment.

2.2. The Cardiopulmonary Test (CPET)

The CPET test was conducted on the Quark CPET measurement platform (COSMED Co., Rome, Italy). The entire exercise process for all participants was completed under the supervision of professional medical staff in the hospital to guarantee reliable and highly comparable data. All subjects in this experiment completed the maximum exercise test on a power bicycle, as shown in Figure 1. As it is difficult to maintain a constant bicycle speed throughout the test, the bicycle speed is a component of power-per-minute, subjects were required to maintain a maximum of 55 to 65 r/min during the pedaling. After a 3 min rest period and a 3 min warm-up period, subjects pedaled at a rate of 30 w/min until exhausted.

Before the CPET data were used to train the automatic assessment model, a classification standard was set up, as Figure 2 shows. A CPE evaluation index system was first established and then optimized. The weights to the indexes were assigned based on the C-OWA operator and entropy weight method, as shown in Table 1 [30]. The TOPSIS-RSR method was used to determine subjects’ CPE rating levels [30]. Finally, GA-XGBOOST was used to optimize the weight matrix.

2.3. Algorithms

In this study, XGBOOST was used as the classification value of the model: input training set X, namely CPET data and output tag set. The first and second derivatives of each sample were calculated. The tree with the maximum Gain was selected by greedy strategy, and the sum of all the first and second derivatives of the tree can be used to calculate the classification value of the sample. Its process was approximately shown in Figure 3. The hyperparameter settings of the XGBOOST algorithm were as follows: the default base learner was GB-tree; the number of base learners was 100; the learning rate is 0.1; and the other parameters were default values. Detailed code for XGBOOST and genetic algorithms is provided in the Supplementary material.

The genetic algorithm was used to optimize the machine learning model. The steps of the genetic algorithm are shown in Figure 4. The parameters of XGBOOST were categorized according to 3 types: general parameters; BOOST parameters; and learning target parameters. Among the BOOST parameters, the number of iterators (N_estimator), the learning rate (Learning_rate) and the maximum depth of the number (max_depth) largely determine the performance of the model; thus, they were optimized in modeling. After setting the genetic algorithm parameters, the genetic operator was determined and the Macro-F1 Score was selected as the fitness function, with the maximum value as the optimal solution of the genetic algorithm.

The RF algorithm mainly optimizes the number of the decision tree and the maximum feature ratio. The number 100 was selected for this decision tree, and the root of the maximum feature ratio was selected for the number of features. Other parameters were given default values. In the SVM algorithm the penalty function was 1; the kernel function was linear kernel function; the kernel function constant was 0; the highest term coefficient was 0; and the multiclassification strategy was “1 pair of complementary classes”.

2.4. Performance Analysis

2.4.1. Tenfold Validation

The most popular algorithms—support vector machine (SVM), random forest (RF) and extreme gradient enhancement (XGBOOST)—were trained and validated the performance. A total of 120 subjects were divided into the categories of training set and test set in a ratio of 7:3. To ensure that the final evaluation model was not affected by this process, the machine learning model was constructed and screened by using ten-fold cross validation (rounded if it is not an integer) in the training set. The model with the best performance will be assessed by the test set. All the calculations were run on Python 3.6.2 (https://www.lfd.uci.edu/~gohlke/pythonlibs/#xgboost (accessed on 10 June 2022)).

2.4.2. Statistical Analyses

The Macro-Average method was used to evaluate the performance of the machine learning model. Accuracy, Macro-Precision, Macro-Recall and Macro-F1 Score were applied to evaluate the model performance:

Accuracy = \frac{T P + T N}{T P + T N + F P + F N}

(1)

Macro Precision = \frac{1}{n} \sum_{i = 1}^{n} Pi

(2)

Macro Recall = \frac{1}{n} \sum_{i = 1}^{n} R i

(3)

Macro F 1 = \frac{1}{n} \sum_{i = 1}^{n} F 1 i

(4)

2.4.3. Confusion Matrix

The test set was used to evaluate the results of the XGBOOST model classification. Confusion matrix is a commonly used auxiliary tool in machine learning, which can intuitively understand the prediction of each category in the sample. The darker the color, the greater the density/number and the better the prediction accuracy.

3. Results

3.1. Performance of Classifying Models

As Table 2 shows, XGBOOST has the highest accuracy rate. The three machine learning models can identify the CPE level of the human body, and the scores of the Macro-F1 are all above 81%. Among them, the XGBOOST algorithm has achieved better results than the first two algorithms in both accuracy and the Macro-F1 Score.

The confusion matrix of XGBOOST to test set classification is shown in Figure 5. Figure 5a shows the confusion matrix predicted by XG-BOOST. The actual number of subjects with average, good and excellent levels was 12, 13 and 11, respectively, while the predicted number of subjects with those three levels were 11, 14 and 9, respectively. This means that the model may be more sensitive to the judgment of intermediate levels. Figure 5b showed the process in more detail: subjects 2, 19 and 35 are overestimated, while subjects 29 and 34 are overestimated. These may be due to individual errors and personal factors. The accuracy of test set recognition was 0.861 and the Macro-F1 Score was 0.864. XGBOOST correctly identified all excellent individuals but failed to correctly identify three “good” and “general” individuals.

3.2. Hyperparameter Optimization of XGBOOST Based on Genetic Algorithm

The official documentation for XGBOOST divides the parameters of XGBOOST into three categories: general parameters; BOOST parameters; and learning target parameters. Among the BOOST parameters, the number of iterators (n_estimator), the learning rate (learning_rate) and the maximum depth of the number (max_depth) largely determine the performance of the model. In this study, the genetic algorithm was used to optimize the three variables of XGBOOST, with the following steps:

Step 1: Set genetic algorithm parameters

The individuals were coded in binary. The initial population and maximum iteration times were 50 and 100, respectively. The crossover probability PC was set as 0.95, and the mutation probability PM was set as 0.05. The gene length was 13;

Step 2: Determine genetic operators

Crossover operators: Single-point crossover was selected for the first three and four–six loci, and multi-point crossover was selected for the last six loci. Suppose that four crossover points are randomly generated (point 2, point 4, point 9 and point 12), two new individuals can be generated by crossing the two, as shown in Figure 6.

Mutation operator: Single-point variation was selected for the first three positions and positions four–six of the mutation operator, and uniform variation was selected for the last six positions. Suppose that four crossover points (point 2, point 4, point 9 and point 12) are randomly generated, then the individual generated after the mutation of the altered individual is shown in Figure 7.

Selection operator: To avoid the elimination of excellent individuals, this study adopted the ranking method and elite strategy to “recommend” the top 5% individuals to ensure that they were not eliminated;

Step 3: Determine fitness function

The Macro-F1 Score was selected as the fitness function, and the maximum value was used as the optimal solution for the genetic algorithm. The genetic algorithm was used to optimize the XGBOOST algorithm, and the fitness iteration curve is shown in Figure 8. The best Macro-F1 Score was 0.915. Table 3 lists the optimized parameters.

Using the genetic algorithm to adjust parameters for three times, the results of the 50th iteration are better than the default parameters of XGBOOST, as shown in Figure 8. The accuracy and the Macro-F1 Score of XGBOOST, optimized based on the genetic algorithm, are improved by 6.5% and 5.9%, respectively.

3.3. Interpretability of GA-XGBOOST Output

SHAP is applied to explain the importance of features in GA-XGBOOST established in this paper. Shapley value can describe the importance of each feature when the model makes a prediction on specific samples. Compared with traditional feature-importance description methods, SHAP has better consistency [31]. The SHAP value of each feature is drawn for each sample, as shown in Figure 9. The ordinate signifies different physiological characteristics, the abscissa signifies the SHAP value, a point represents a sample, and the color represents the feature value (red is high, blue is low). The absolute average SHAP value of each feature is taken as the importance of the feature, as shown in Figure 10.

The results show that the first five feature parameters—

X_{12}

,

X_{6}

,

X_{7}

,

X_{2}

,

and X_{16}

; namely, VO2_MAX, OP_MAX, VE/VO2_AT, HR_MAX, and C(a-v)O2_MAX—have the greatest impact on the classification results. Among them, maximal oxygen uptake has the greatest influence on the model. The higher the characteristic value, the lower the SHAP value, which can improve the CPE of the subjects; when using the XGBOOST model for classification, “1” is excellent, “2” is good, and “3” is general, as mentioned above.

X_{6}

, the oxygen pulse at exhaustion, is the second important characteristic, and the higher the value of this characteristic, the greater the CPE.

X_{9}

signifies the ventilation at the anaerobic threshold, which is the least important feature. The above experimental results are consistent with the clinical and related findings in the previous review [5,6,10,11,12]. Figure 11 shows the SHAP feature-dependence diagram of the two indicators. (VO2_MAX) is approximately positively correlated with the target variable (C(A-V)O2_MAX). As shown in Figure 11b, (VE/VO2_AT) is negatively correlated with the target variable (VO2_MAX). Baba R pointed out that any subject with a defective motor system would decrease VO2, which leads to a rapid increase in oxygen equivalent [29]. This is consistent with the results explored in this study.

4. Discussion

The goal of the current study is to develop and validate a computer-aided algorithm for automatic evaluation of CPET test results and to obtain a relatively reasonable and accurate model. In this study, it is shown that a wide range of 16 CPET parameters, after being automatically selected and weighted, are fed into the model, generating quite an accurate CPE level assessment, to distinguish between the “Excellent”, “Good” and “General” individual CPE level. Different classification scaling methods have been proposed to distinguish CPE in different populations: Guo et al. studied the CPE of adults and divided them into the categories of excellent, good, medium, poor and very poor [32]. ACSM’s Health-Related Physical Fitness Assessment Manual classifies adult males by percentile: excellent (95–99%); very good (80–95%); good (60–80%); general (40–60%); weak (20–40%); and quite weak (0–20%) [32]. Considering the general requirements for high-latitude on-site work, the three level granularity is sufficient [33].

A series of data (WR, VO2/kg, HR, and VE, etc.) provided by CPET may obtain different results in different expert diagnoses, which means that manual interpretation can be very confusing and subjective. As such, a machine learning approach on analysis of all CPET data can diminish the differences between individuals’ expertise and achieve a relatively constant criteria. The results showed that the CPE evaluations based on XGBOOST were accurate and consistent with the clinical diagnosis results of expert scoring. Distinct from the subjective selection of CPET parameters in other studies, this study determines the physiological parameters of the selected subjects through Pearson correlation analysis, principal component analysis (PCA), the TOPSIS-RSR rating evaluation and the C-OWA weighting method. Furthermore, it generates a classification scale for the assessment of the CPE rating. The complete process proves the feasibility of fully automatic CPE assessment. CPET possesses a major advantage over traditional cardiopulmonary endurance assessment in that it provides multiple parameters related to cardiopulmonary endurance, which facilitate a more comprehensive assessment rather than relying on a single parameter. In view of the weaknesses of CPET (its large amount of data and strong subjectivity in expert evaluation), the use of a machine learning algorithm for modeling can well solve these two problems and achieve the goals of high accuracy, speed and objectivity.

At present, many CPET-related studies have been published; however, only a few have been successfully modeled using machine learning methods. Traditional CPET research methods mainly use statistical methods to generate a general result from multivariate CPET and cannot be traced back for an explanation on how key features are connected to the assessed classification level. In Ong’s research, CPET data were generalized to indicate the serious level of chronic obstructive pulmonary disease (COPD) but could not indicate which parameters doctors should mainly focus on to treat the syndrome of this disease [34]. Several research studies have used machine learning to investigate CPET and have yielded considerable results: A sequential clustering method proposed by Li et al. selected CPET data including oxygen consumption, carbon dioxide, heart rate and stroke output for training, but the actual recognition effect of the model was not good. Deak et al. proposed a fuzzy evaluation method based on multiple physiological parameters such as VO2max, SV and HR, but it could not quantitatively evaluate the CPE level of the body. The contribution of this study is that relatively high accuracy was achieved after the qualitative classification method was proposed; unlike the traditional regression algorithms currently available, the neural network model used in this study can constantly learn from new data, and its predictive power can be improved by providing new CPET experimental data. In our study, the correlation analysis method was used to screen and analyze the importance of each CPET parameter, and then a model was established based on XGBOOST, which proved to be relatively successful in the verification stage. As in many scientific tasks and other complex tasks correct models will undoubtedly help facilitate medical diagnosis and implement appropriate measures that are less susceptible to the subjective judgment of experts or doctors. Current efforts represent a practical possibility for medical interpretation software to aid in objective assessment.

Some study limitations are to be noted. First, a comparatively small sample size may increase the error, and the model accuracy may also be affected by the quality of CPET original data. The accuracy and the Macro-F1 Score of the XGBoost CPE assessment model in this study were not particularly ideal. Therefore, more experimental data can be collected in the future to modify the model: on the one hand, to study how to select physiological characteristic parameters, and on the other hand, to optimize the parameters of XGBOOST. Secondly, the measurement error caused by the technical limitation of experimental equipment should not be ignored. To fit the model into the personnel placement and management of high-latitude worksites, future work should be compared with more CPE testing protocols and high-latitude work factors should be added into the protocols to train and test the model. Finally, although machine learning has achieved good success in many fields, its wider application in some security-sensitive application fields is affected by the lack of interpretability, due to the black-box principle of machine learning. In high-latitude on-site workers’ selection, training and occupational prevention, it is of vital importance to build up a model with good interpretability. Based on the advantage of XGBoost’s capacity to interpret the black-box output, this study has proved the possibility of automatic interpretation from various information granularity. Further studies can be completed to train the model on further diagnostic expertise.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics12102538/s1.

Author Contributions

Conceptualization, Y.F. and S.L.; methodology, Y.F., J.D. and Q.L.; software, J.D. and Q.L.; validation, Q.L., J.D. and L.C.; formal analysis, Q.L. and J.D.; investigation, H.L. and S.L.; writing—original draft preparation, Y.F., J.D. and L.C., writing—review & editing, Y.F., J.D. and L.C. All authors have read and agreed to the published version of the manuscript.

Funding

National Innovative Research Project, China (No: 2020-519).

Institutional Review Board Statement

The experiment was conducted in accordance with the rules of the 1975 Declaration of Helsinki revised in 2013, and was approved by the Ethics Committee of Chinese PLA General Hospital (S2020-519-01; 24 December 2020).

Informed Consent Statement

Informed consent was obtained from all the subjects involved in the study.

Data Availability Statement

The data used to support the findings of this study cannot be made freely available to protect patient privacy. Requests for access to these data should be made to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Amelchugov, S.; Inzhutov, I.; Nikitina, O.; Berseneva, M. On the issue of personnel management in high-latitude construction. MATEC Web Conf. 2018, 245, 03021. [Google Scholar] [CrossRef]
Huang, B. Analysis and Comprehensive Evaluation of Core Elements of Physical Fitness of Young Male at Different Altitudes. Ph.D thesis, Tianjin Institute of Physical Education, Tianjin, China, 2020. [Google Scholar]
Balady, G.J.; Arena, R.; Sietsema, K.; Myers, J.; Coke, L.; Fletcher, G.F.; Forman, D.; Franklin, B.; Guazzi, M.; Gulati, M.; et al. Clinician’s guide to cardiopulmonary exercise testing in adults: A scientific statement from the American Heart Association. Circulation 2010, 122, 191–225. [Google Scholar] [CrossRef] [Green Version]
Hill, D.W.; Rowell, A.L. Running velocity at VO2max. Med. Sci. Sport. Exerc. 1996, 28, 114–119. [Google Scholar] [CrossRef] [PubMed]
Pedersen, B.K.; Saltin, B. Exercise as medicine–evidence for prescribing exercise as therapy in 26 different chronic diseases. Scand. J. Med. Sci. Sport. 2015, 25, 1–72. [Google Scholar] [CrossRef] [Green Version]
Mezzani, A.; Guazzi, M. Balancing the evidence on the cardiovascular determinants of oxygen uptake improvement after endurance training in the elderly: What are the next steps? Eur. J. Prev. Cardiol. 2016, 23, 730–732. [Google Scholar] [CrossRef] [Green Version]
Jones, L.W.; Eves, N.D.; Hayowsky, M.; Joy, A.A.; Douglas, P.S. Cardiorespiratory exercise testing in clinical oncology research: Systematic review and practice recommendations. Lancet Oncol. 2008, 9, 757–765. [Google Scholar] [CrossRef]
Svedahl, K.; MacIntosh, B.R. Anaerobic threshold: The concept and methods of measurement. Can. J. Appl. Physiol. 2003, 28, 299–323. [Google Scholar] [CrossRef] [Green Version]
Braga AC, M.; Pinto, A.; Pinto, S.; de Carvalho, M. The role of moderate aerobic exercise as determined by cardiopulmonary exercise testing in ALS. Neurol. Res. Int. 2018, 2018, 8218697. [Google Scholar] [CrossRef] [Green Version]
Jones, A.M.; Carter, H. The effect of endurance training on parameters of aerobic fitness. Sport. Med. 2000, 29, 373–386. [Google Scholar] [CrossRef] [PubMed]
Abbasi, A.; Tartibian, B. Prediction and comparison of maximum O₂ pulse in male adolescents. Facta Univ.-Ser. Phys. Educ. Sport 2006, 4, 81–92. [Google Scholar]
Laukkanen, J.A.; Kurl, S.; Salonen, J.T.; Lakka, T.; Rauramaa, R. Peak oxygen pulse during exercise as a predictor for coronary heart disease and all cause death. Heart 2006, 92, 1219–1224. [Google Scholar] [CrossRef] [PubMed]
Petek, B.J.; Churchill, T.W.; Sawalla Guseh, J.; Loomer, G.; Gustus, S.K.; Lewis, G.D.; Weiner, R.B.; Baggish, A.L.; Wasfy, M.M. Utility of the oxygen pulse in the diagnosis of obstructive coronary artery disease in physically fit patients. Physiol. Rep. 2021, 9, e15105. [Google Scholar] [CrossRef] [PubMed]
Carriere, C.; Corrà, U.; Piepoli, M.; Bonomi, A.; Merlo, M.; Barbieri, S.; Salvioni, E.; Binno, S.; Mapelli, M.; Righini, F.; et al. Anaerobic threshold and respiratory compensation point identification during cardiopulmonary exercise tests in chronic heart failure. Chest 2019, 156, 338–347. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Biccard, B.M. Improving the evidence-base for preoperative cardiopulmonary exercise testing. Br. J. Anaesth. 2018, 120, 419–421. [Google Scholar] [CrossRef] [Green Version]
Stringer, W.; Marciniuk, D. The role of cardiopulmonary exercise testing (CPET) in pulmonary rehabilitation (PR) of chronic obstructive pulmonary disease (COPD) patients. COPD J. Chronic Obstr. Pulm. Dis. 2018, 15, 621–631. [Google Scholar] [CrossRef] [PubMed]
Wagner, J.; Agostoni, P.; Arena, R.; Belardinelli, R.; Dumitrescu, D.; Hager, A.; Myers, J.; Rauramaa, R.; Riley, M.; Takken, T.; et al. The role of gas exchange variables in cardiopulmonary exercise testing for risk stratification and management of heart failure with reduced ejection fraction. Am. Heart J. 2018, 202, 116–126. [Google Scholar] [CrossRef] [PubMed]
Remington, H.E. Aortic Stiffness and Cardiopulmonary Exercise Test Variables. Master’s Thesis, School of Kinesiology, Ann Arbor, MI, USA, 2020. [Google Scholar]
Duceppe, E.; Parlow, J.; MacDonald, P.; Lyons, K.; McMullen, M.; Srinathan, S.; Graham, M.; Tandon, V.; Styles, K.; Bessissow, A.; et al. Canadian Cardiovascular Society Guidelines on Perioperative Cardiac Risk Assessment and Management for Patients Who Undergo Noncardiac Surgery. Can. J. Cardiol. 2017, 33, 17–32. [Google Scholar] [CrossRef] [Green Version]
Leopold, E.; Navot-Mintzer, D.; Shargal, E.; Tsuk, S.; Tuller, T.; Scheinowitz, M. Prediction of the Wingate anaerobic mechanical power outputs from a maximal incremental cardiopulmonary exercise stress test using machine-learning approach. PLoS ONE 2019, 14, e0212199. [Google Scholar] [CrossRef]
Braccioni, F.; Bottigliengo, D.; Ermolao, A.; Schiavon, M.; Loy, M.; Marchi, M.R.; Gregori, D.; Rea, F.; Vianello, A. Dyspnea, effort and muscle pain during exercise in lung transplant recipients: An analysis of their association with cardiopulmonary function parameters using machine learning. Respir. Res. 2020, 21, 1–11. [Google Scholar] [CrossRef] [PubMed]
Inbar, O.; Inbar, O.; Reuveny, R.; Segel, M.J.; Greenspan, H.; Scheinowitz, M. A Machine Learning Approach to the Interpretation of Cardiopulmonary Exercise Tests: Development and Validation. Pulm. Med. 2021, 2021, 5516248. [Google Scholar] [CrossRef] [PubMed]
Li, H.L.; Xia, Y.Y.; Zou, J.C. Selection method of middle and long distance endurance athletes based onCPET time series clustering. Comput. Eng. 2022, 48, 262–268. [Google Scholar]
Deak, G.; Miron, R.; Astilean, A.; Domuţa, C. Fuzzy based method for assessing the training level of nonathletes and athletes. In Proceedings of the 2010 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR), IEEE, Cluj-Napoca, Romania, 28–30 May 2010; Volume 2, pp. 1–5. [Google Scholar]
Chen, W.; Zheng, L.; Li, K.; Wang, Q.; Liu, G.; Jiang, Q. A novel and effective method for congestive heart failure detection and quantification using dynamic heart rate variability measurement. PLoS ONE 2016, 11, e0165304. [Google Scholar] [CrossRef] [PubMed]
Bellos, C.; Papadopoulos, A.; Rosso, R.; Fotiadis, D.I.; Papadopoulos, A. Categorization of patients’ health status in COPD disease using a wearable platform and random forests methodology. In Proceedings of the 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics, IEEE, Hong Kong, China, 5–7 January 2012; pp. 404–407. [Google Scholar]
Song, K.; Yan, F.; Ding, T.; Gao, L.; Lu, S. A steel property optimization model based on the XGBoost algorithm and improved PSO. Comput. Mater. Sci. 2020, 174, 109472. [Google Scholar] [CrossRef]
Pan, B. Application of XGBoost algorithm in hourly PM2. 5 concentration prediction. In IOP Conference Series: Earth and Environmental Science; IOP Publishing: Bristol, UK, 2018; Volume 113, p. 012127. [Google Scholar]
Ogunleye, A.; Wang, Q.G. XGBoost model for chronic kidney disease diagnosis. IEEE/ACM Trans. Comput. Biol. Bioinform. 2019, 17, 2131–2140. [Google Scholar] [CrossRef]
Qi, L. Research and Application of the Intelligent Evaluation Method of Cardiopulmonary Endurance. Master’s thesis, Huazhong University of Science and Technology, Wuhan, China, 2020. [Google Scholar]
Winter, E. The shapley value. In Handbook of game theory with economic applications; Elsevier: Amsterdam, The Netherland, 2002; Volume 3, pp. 2025–2054. [Google Scholar]
Hui, G. Development of Self-Evaluation Method for Adult Cardiopulmonary Endurance and Establishment of Reference Grade. Ph.D. Thesis, Beijing Sport University, Beijing, China, 2016. [Google Scholar]
Yuri, F.; Meir, M. ACSM’s Health-Related Physical Fitness Assessment Manual; American College of Sports Medicine: Baltimore, MD, USA, 2021. [Google Scholar]
Ong, K.C.; Ong, Y.Y. Cardiopulmonary exercise testing in patients with chronic obstructive pulmonary disease. Ann. Acad. Med. Singap. 2000, 29, 648–652. [Google Scholar] [PubMed]

Figure 1. Illustration of cardiopulmonary exercise testing.

Figure 2. Schema of the study’s design.

Figure 3. Pseudocodes of the XGBOOST algorithm.

Figure 4. Optimization Flow Chart of Genetic Algorithm.

Figure 5. (a) Confusion matrix predicted by XGBOOST; (b) Hierarchical classification.

Figure 6. Crossover Operator.

Figure 7. Mutation Operator.

Figure 8. Iteration diagram of genetic algorithm.

Figure 9. SHAP value of each feature for each sample.

Figure 10. Importance of each feature.

Figure 11. SHAP feature dependency graph examples of important indicators.

Table 1. Determination of the weights of subjective and objective indicators.

	Index	Weights
Cardiovascular Capacity	$HR_AT (X_{1}$ )	0.078
	$HR_MAX (X_{2})$	0.058
	$SV_AT (X_{3})$	0.073
	$SV_MAX (X_{4})$	0.048
	$OP_MAX (X_{6})$	0.075
Respiratory Metabolic Capacity	$VE / V O_{2}_AT (X_{7})$	0.096
	$VE / V O_{2}_MAX (X_{8})$	0.064
	$VE_AT (X_{9})$	0.075
	$VE_MAX (X_{10})$	0.087
Metabolic Capacity	$V O_{2}_AT (X_{11})$	0.064
	$V O_{2}_MAX (X_{12})$	0.111
	$RQ_MAX (X_{14})$	0.053
	$C (a - v) O_{2}_AT (X_{15})$	0.054
	$C (a - v) O_{2}_MAX (X_{16})$	0.066

Table 2. Performance parameters of machine learning algorithms.

Model	Accuracy	Macro-Recall	Macro-Precision	Macro-F1 Score
RF	0.810	0.810	0.886	0.812
SVM	0.843	0.843	0.876	0.841
XGBOOST	0.861	0.861	0.879	0.864

Table 3. Optimal parameters of XGBOOST.

Parameter Name	Meaning	Default Value	Optimal Value
max_depth	The maximum depth of number	10	221
learning_rate	Learning rate	0.1	0.285
n_estimator	Number of iterators	100	3

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Deng, J.; Fu, Y.; Liu, Q.; Chang, L.; Li, H.; Liu, S. Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST. Diagnostics 2022, 12, 2538. https://doi.org/10.3390/diagnostics12102538

AMA Style

Deng J, Fu Y, Liu Q, Chang L, Li H, Liu S. Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST. Diagnostics. 2022; 12(10):2538. https://doi.org/10.3390/diagnostics12102538

Chicago/Turabian Style

Deng, Jia, Yan Fu, Qi Liu, Le Chang, Haibo Li, and Shenglin Liu. 2022. "Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST" Diagnostics 12, no. 10: 2538. https://doi.org/10.3390/diagnostics12102538

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Automatic Cardiopulmonary Endurance Assessment: A Machine Learning Approach Based on GA-XGBOOST

Abstract

1. Introduction

2. Materials and Methods

2.1. Participants

2.2. The Cardiopulmonary Test (CPET)

2.3. Algorithms

2.4. Performance Analysis

2.4.1. Tenfold Validation

2.4.2. Statistical Analyses

2.4.3. Confusion Matrix

3. Results

3.1. Performance of Classifying Models

3.2. Hyperparameter Optimization of XGBOOST Based on Genetic Algorithm

3.3. Interpretability of GA-XGBOOST Output

4. Discussion

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI