Next Article in Journal
Outcome and Safety after 103 Radioembolizations with Yttrium-90 Resin Microspheres in 73 Patients with Unresectable Intrahepatic Cholangiocarcinoma—An Evaluation of Predictors
Next Article in Special Issue
Using Quantitative Imaging for Personalized Medicine in Pancreatic Cancer: A Review of Radiomics and Deep Learning Applications
Previous Article in Journal
Lymphoma-Associated Biomarkers Are Increased in Current Smokers in Twin Pairs Discordant for Smoking
Previous Article in Special Issue
Deep Learning Segmentation of Triple-Negative Breast Cancer (TNBC) Patient Derived Tumor Xenograft (PDX) and Sensitivity of Radiomic Pipeline to Tumor Probability Boundary
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development and Validation of an Efficient MRI Radiomics Signature for Improving the Predictive Performance of 1p/19q Co-Deletion in Lower-Grade Gliomas

by
Quang-Hien Kha
1,
Viet-Huan Le
1,2,
Truong Nguyen Khanh Hung
1,3 and
Nguyen Quoc Khanh Le
1,4,5,6,*
1
International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan
2
Department of Thoracic Surgery, Khanh Hoa General Hospital, Nha Trang City 65000, Vietnam
3
Department of Orthopedic and Trauma, Cho Ray Hospital, Ho Chi Minh City 70000, Vietnam
4
Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 106, Taiwan
5
Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 106, Taiwan
6
Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan
*
Author to whom correspondence should be addressed.
Cancers 2021, 13(21), 5398; https://doi.org/10.3390/cancers13215398
Submission received: 23 August 2021 / Revised: 19 October 2021 / Accepted: 26 October 2021 / Published: 27 October 2021
(This article belongs to the Collection Radiomics and Cancers)

Abstract

:

Simple Summary

Low-grade gliomas (LGG) with the 1p/19q co-deletion mutation have been proven to have a better survival prognosis and response to treatment than individuals without the mutation. Identifying this mutation has a vital role in managing LGG patients; however, the current diagnostic gold standard, including the brain-tissue biopsy or the surgical resection of the tumor, remains highly invasive and time-consuming. We proposed a model based on the eXtreme Gradient Boosting (XGBoost) classifier to detect 1p/19q co-deletion mutation using non-invasive medical images. The performance of our model achieved 87% and 82.8% accuracy on the training and external test set, respectively. Significantly, the prediction was based on only seven optimal wavelet radiomics features extracted from brain Magnetic Resonance (MR) images. We believe that this model can address clinicians in the rapid diagnosis of clinical 1p/19q co-deletion mutation, thereby improving the treatment prognosis of LGG patients.

Abstract

The prognosis and treatment plans for patients diagnosed with low-grade gliomas (LGGs) may significantly be improved if there is evidence of chromosome 1p/19q co-deletion mutation. Many studies proved that the codeletion status of 1p/19q enhances the sensitivity of the tumor to different types of therapeutics. However, the current clinical gold standard of detecting this chromosomal mutation remains invasive and poses implicit risks to patients. Radiomics features derived from medical images have been used as a new approach for non-invasive diagnosis and clinical decisions. This study proposed an eXtreme Gradient Boosting (XGBoost)-based model to predict the 1p/19q codeletion status in a binary classification task. We trained our model on the public database extracted from The Cancer Imaging Archive (TCIA), including 159 LGG patients with 1p/19q co-deletion mutation status. The XGBoost was the baseline algorithm, and we combined the SHapley Additive exPlanations (SHAP) analysis to select the seven most optimal radiomics features to build the final predictive model. Our final model achieved an accuracy of 87% and 82.8% on the training set and external test set, respectively. With seven wavelet radiomics features, our XGBoost-based model can identify the 1p/19q codeletion status in LGG-diagnosed patients for better management and address the drawbacks of invasive gold-standard tests in clinical practice.

1. Introduction

Amongst tumors from the central nervous system (CNS) negatively affecting millions of patients worldwide regardless of their age or gender, brain cancers account for the highest prevalence of more than ninety percent [1]. Gliomas, which originate from astrocytes, oligodendrocytes, oligo-astrocytes, and glioneuronal cells, are the most common type of brain cancers [1]. According to the 2016 World Health Organization (WHO) classification of CNS tumors, gliomas could be divided into diffuse low-grade gliomas (LGGs) (i.e., grade-II and grade-III gliomas), and high-grade gliomas. The criterion of diagnosing LGGs, which make up 30 percent of gliomas [2,3], requires evidence of isocitrate dehydrogenase (IDH) 1 and 2 mutations, without or with the presence of 1p/19q codeletion. More specifically, the diagnosis of oligodendroglioma requires both IDH and 1p/19q mutation, whereas only the mutation of IDH is considered to diagnose diffuse astrocytoma [2,4,5]. Various studies indicated that the LGGs with 1p/19q codeletion were more indolent [6,7,8]. Patients diagnosed with 1p/19q codeletion mutation status LGG significantly had their survival time improved and were more sensitive to therapeutics in terms of chemotherapy and radiotherapy compared with those with 1p/19q non-deleted tumors [9,10,11,12,13]. Thus, the manipulation of 1p/19q co-deleted mutation and the early detection of this chromosomal abnormality among patients diagnosed with LGGs is highly appreciated, as this can facilitate short-term and long-term management.
The current gold-standard procedure of identifying 1p/19q codeletion clinically is the examination of the tumor’s histopathological sample. The pathology samples are collected via the brain-tissue biopsy or the surgical resection of the tumor, which contain many underlying risks [3,14,15,16]. Additionally, the results interpreted from the cytological analysis of LGGs lack critical information about genomic biomarkers or pathognomonic imaging changes of gliomas or other brain cancers [10,16].
The extraction of tumor properties from medical images (e.g., magnetic resonance imaging (MRI), computed tomography (CT), positron emission tomography (PET), etc.) is one of the various approaches conducted to address the limitations of mentioned methods [10,16,17]. Following the ubiquity of medical images, the term “radiomics” is developed and defined as quantifiable data or features derived from those images that can be utilized to uncover disease properties for diagnosis, making decisions, and further clinical evaluation of morbidities (e.g., tumors, lesions, etc.) [17,18,19,20,21].
Since the advent of precision and personalized medicine, machine learning (ML) has received great interest as a promising tool for diagnosing and predicting optimal treatment for cancer patients [3,22,23,24]. Recent ML models were proposed to assist the non-invasive detection of 1p/19q co-deleted LGGs, based on the intensity, texture, and geometry obtained via radiomics features from medical images [4,5,10,18,19,25,26,27,28,29,30,31,32,33]. However, the performance of most models remained unstable on the external test set, as the accuracy score ranged from 0.68 to 0.93. Additionally, the database used to create these models contained mostly retrospective cohorts, or they had to use many features for the prediction, and yet, the predictors could not replace the histopathological methods in the tumor genetic profiling.
In this study, we hypothesize that a ML model could predict the mutation of 1p/19q chromosomal arms using only a number of optimal radiomic features. We propose an eXtreme Gradient Boosting (XGBoost) model to tackle the limitations of most models in terms of prolonged runtime, stable performance on different data, and reproducibility under various conditions, which would then contribute to targeted decisions and reduce the adverse effects induced by invasive diagnostic methods on LGG patients. Both our training and validation database included preoperative patients from retrospective studies [10,34], and their condition of 1p/19q was verified by clinical gold standard examinations. With only seven optimization features, we believe that further studies can build on this framework to optimize cancer prediction models in the future.

2. Results

The updated The Cancer Imaging Archive (TCIA) database of 159 LGG patients [10] was used as our training database. All MR images were downloaded via the National Biomedical Imaging Archive (NBIA) Data Retriever (version 3.6, https://wiki.cancerimagingarchive.net/display/NBIA/Downloading+Images+Using+the+NBIA+Data+Retriever, accessed on 20 March 2021). Statuses of 1p/19q chromosomal arms were fully reported based on the gold standard histopathology examination via brain biopsy or open surgical tumor resection.

2.1. Baseline Comparison among Different Machine Learning Algorithms and Selecting the Baseline Machine Learning Model

We obtained 159 complete region-of-interest (ROI) segmentations from the previous study by Akkus et al. [10,26]. Next, 851 radiomic features were derived from each segmentation via 3D Slicer Software (version 10.4.2) [35]. The lack of negative instances (or 1p/19q non-deleted mutations) may affect the performance of models (especially for specificity score). Therefore, the Synthetic Minority Over-sampling Technique (SMOTE) [36] would be applied in the following steps to handling the imbalance and improve the model’s specificity. Among the five machine learning algorithms (with default parameters) surveyed on the initial training set (159 instances and 851 radiomic features), the eXtreme Gradient Boosting (XGBoost) algorithm outperformed other classifier algorithms, as it correctly predicted 69.17% total instances using all features (Table 1). The accuracy score of Random Forest (RF) and Adaptive Boosting (AdaBoost) ranked second and third at 66.67% and 60.99%, respectively. On the other hand, the performance of k-Nearest Neighbors (kNN) and Logistic Regression (LR) algorithm were noticeably low. kNN observed the lowest accuracy score with 42.16%, whereas LR accurately predicted 47.8% total observations.
XGBoost also ranked first in sensitivity with 73.16%, precision with 81.19%. Even though the specificity was relatively low at 61.51%, it still outperformed the other algorithms. Moreover, XGBoost showed a significant performance in terms of AUC and AUPRC (0.71 and 0.827, respectively). In this experiment, XGBoost was therefore chosen to be our baseline algorithm, for feature selections with SHapley Additive exPlanations (SHAP) analysis [37] and further considerations.

2.2. Radiomics Signature Building

We performed SMOTE on the dataset with 159 instances and 851 original features before features selection. In this study, we performed two steps of features selection using Spearman’s Correlation Coefficient (SCC) [38] and the combination of XGBoost-SHAP analysis. First, with the threshold of SCC equal to 0.8, we kept features with SCC greater than 0.8 and discarded features with at least one SCC less than 0.8. As a result, 68 features were removed.
Then, features selection continued with the combination of XGBoost-SHAP analysis. Only 37 features were retained after this step. The SHAP values bee swarm plot expressed the SHAP values of the 20 best features and ranked them by their importance (out of 37 retained features) (Figure 1). In addition, the below part of Table 1 shows that the performance was increased when we only used our 37 features inserting into our model.

2.3. Model Ensembling and Predictions on the Training Set

Finally, we added each feature from the 37 aforementioned features to the XGBoost-baseline model to seek the model with the best combination of XGBoost algorithm and certain set of features. We found the model with the highest accuracy score at 87% with the first seven features and the XGBoost algorithm with the following parameters: max_depth = 6; min_child_weight = 1; max_delta_step = 0; lambda = 1; alpha = 0; scale_pos_weight = 1 (Figure 2). Names of the chosen radiomic features were provided in Table 2.
The original first-order skewness was the best radiomic feature in our study. Skewness of the first-order matrix refers to the measurement of figure distribution’s symmetry around the Mean value (i.e., the average gray level of intensity that lies within the ROI). Depending on the symmetry, this feature could be positive or negative.
Our final model, build on XGBoost algorithm (max_depth = 6; min_child_weight = 1; max_delta_step = 0; lambda = 1; alpha = 0; scale_pos_weight = 1) and seven optimal features (Table 2), was applied to predict the LGG 1p/19q codeletion mutations on the training set, with five-fold cross-validation. The results were promising, with 87% in accuracy, 88.2% in sensitivity, and 77.2% in specificity. Our model achieved an AUC of 0.87, and we also plotted the area under a receiver operating characteristics (AUC ROC) curve and the precision–recall curve to visualize the performance of our model in each fold (Figure 3).

2.4. Comparison with Previous Studies on 1p/19q Status Prediction

LGG gliomas are malignant tumors; however, the detection of 1p/19q codeletion mutation could enhance the prognosis and the therapeutic procedures [9,10,11,12,13]. Various studies were conducted to address the limitations in early diagnosing 1p/19q mutated LGG gliomas. We compared concretely our proposed model with models formerly developed by other authors in Table 3. It was notable that the deep learning algorithm-based models were superior to our model, whereas the XGBoost-7 outperformed a variety of the ML frameworks.

2.5. Performance Results on the External Test Set

We applied our optimal model to the external test set with unseen data. The dataset contained 65 LGG patients with 52 1p/19q non-deleted and 13 1p/19q codeleted mutations. Our model accurately classified 82.80% of 1p/19q codeleted LGG, with 94.10% in specificity and 33.30% in sensitivity.

2.6. Performance Results of Our Radiomics Model on Different Patient Subgroups

We also used the model to predict 1p/19q codeletion mutations in different patient groups according to grades and types of LGG cells in the training and external test set (Table 4).
The accuracy scores were not lower than 64% in two sets. In particular, the model’s predictive ability achieved high accuracy for LGG grade 2, LGG type Astrocytoma, and Oligoastrocytoma in the training set and LGG grade 3 in the test set, with AUC reaching 0.876, 0.85, 0.836, and 0.847, respectively. Moreover, the model’s ability to recognize codeletion mutations was considerable for LGG grade 2, 3, or LGG type Oligoastrocytoma and Oligodendroglioma (with high sensitivity over 80% for these grades and types).

3. Discussion

In this study, we proposed a predictive model based on XGBoost algorithm and seven radiomic features. Our training database contained 159 LGG patients with semi-automated segmentations by Akkus et al. [26]. The authors’ method fully segmented tumors (or ROIs); however, there was still the possibility of errors due to user-to-user variability. A study by Rundo et al. [40] performed semi-automatic segmentation of brain tumors based on the presence of a cellular automata model and Gamma Knife. The method proposed by Rundo et al. was effective in brain tumor recognition and reproducible. However, auto-segmentation can reduce the time for segmentation while minimizing human error. Pereira et al. [41] proposed an MRI-based auto-segmentation method for patients with gliomas. In the future, we will apply the auto-segmentation technique in further studies and take advantage of the development of deep learning algorithms in identifying cancer mutations for better management of cancer patients.
Many studies have used machine learning or deep learning to identify 1p/19q codeletion mutations in LGG patients based on medical imaging. However, some studies still need to use many radiomic features [33], or the model’s performance is not high [28,30,39]. On the other hand, our model achieved a high performance using only seven optimal radiomic biomarkers. We compared our model’s performance with previously developed ML models by other authors. The similar XGBoost-based model by Shboul et al. [33] achieved the AUC, sensitivity, and specificity of 0.8 ± 0.04, 0.75 ± 0.08, and 0.85 ± 0.06, respectively. Given that the total features sorted out by Shboul et al. [33] were more than ours (15 vs. 7), our model achieved a higher score of AUC (0.8 vs. 0.85), and sensitivity (0.75 vs. 0.88), but lower specificity (0.85 vs. 0.77). Furthermore, our model outperformed the SVM-based model proposed by van der Voort et al. [39] (AUC 0.723 vs. 0.85, sensitivity 0.732 vs. 0.88, specificity 0.617 vs. 0.77), and Rathore et al. [30] (accuracy 75.15% vs. 86.80%, sensitivity 0.82 vs. 0.88, specificity 0.74 vs. 0.77). A two-step classification based on the presence of the T2-FLAIR mismatch sign yielded by Batchala et al. [31] in 2019 surpassed our results regarding the training process, but overall lower scores were observed in the test set (accuracy ranged from 79.2% to 81.1% vs. 82.80%). The authors also included the contribution of humans in the procedure; this was appreciated. However, it may be prone to inconvenience and excessive total cost of the implementation in real-world conditions. Since we used fewer features than previous studies and our wavelet features were not hard to be produced, clinicians may actually use our model to predict 1p/19q mutation status with less time and more accuracy than other studies.
Referring to the deep learning models, the compared results were also eminently favorable. In 2019, a review article by Kocak et al. [27] yielded that the model could correctly classified the 1p/19q deletion status at an average AUC of 0.869, which was somewhat lower than our XGBoost model’s performance (0.869 vs. 0.85). Matsui et al. [28], in 2016, introduced a comprehensive model to predict the mutation of concerned chromosomal arms, and achieved only 58.5% accuracy on MRI database, despite the reasonable results on the combination of different forms of medical images. Nonetheless, there were various remarkable performances via the application of deep neural networks. In 2017, the performance of a convolutional neural network (CNN)-based model by Akkus et al. [10] generated an accuracy of 87.7% on the same dataset used in this study.
Yogananda et al. [5], in 2020, proposed a network correctly predicted the 1p/19q co-deletion with a prominent accuracy level of 93.46%, compared with 86.80% obtained from our research. The difference lies in the fact that the author has used deep learning algorithms to learn and predict. Deep learning algorithms have been proven highly effective in auto-segmentation, feature extraction, and learning capacity. However, training a deep learning model requires big data, massive time, and hardware. In contrast, our current model used a machine-learning algorithm (XGBoost), which requires less data and reduces the runtime to complete. Our model’s highly accurate prediction had clinical significance as it significantly reduced the time for diagnosing the 1p/19q codeletion mutation to adjust the treatment plan accordingly. Moreover, using only a few radiomics features could reduce the complexity of the diagnostic process but still ensures the model’s performance.
Shortening the time to detect the type of 1p/19q mutation in patients with LGG is important in terms of the treatment and prognosis of LGG. In clinical practice, it is necessary to shorten the runtime of the model, while ensuring high accuracy in predictions. Our model achieved high performance and stability based on seven optimal features with a ten-second runtime for MRI images with tumor segmentation. Problems can arise during the extraction of radiomics features from tumor images. In this study, the time from extracting radiomics features until the prediction was made was 2.8 + −0.3 min, achieving AUC 0.85 + −0.06. The deep learning model proposed by Yogananda et al., with a runtime of three minutes (from tumor segmentation to results), achieved the AUC of 0.95 + −0.01 in predictions. Despite the similar runtime, our model had a lower predictive performance compared with the model proposed by Yogananda et al. In the future, we will focus on improving the model’s performance while maintaining or further shortening the total time of the process from image input to output.
In terms of the performance of our model on the external test set, the sensitivity was low with 33.30%, compared with 94.10% of specificity. The overall accuracy reached 82.80%, which indicated that the performance was reasonable. However, the imbalance of sensitivity and specificity was due to the imbalance of the external test set, with 52 1p/19q non-deleted LGG patients (accounting for 80% of the data) and only 13 1p/19q codeletion LGG patients (accounting for 20% of the data). This could be interpreted as our model can identify accurately LGG patients with 1p/19q non-deleted status to apply appropriate cancer therapies. It is necessary to evaluate our model on different external test sets with more balanced data for further assessment and to validate the model’s sensitivity. Moreover, our model can be applied into cross-group training of LGG patients. As shown in Table 5, we reached promising performance when training with different grades, or tumor subtypes. Especially, our model worked well on oligoastrocytoma and oligodendroglioma subtypes with high sensitivity even though we had a small sample size in each problem.
Wavelet features made up the majority of the most important features in our study. Many studies have shown the strengths of wavelet features in image compression and preprocessing and use these features for classification [42,43,44]. Although no study has clearly shown the wavelet features’ advantage over the remaining features, the aforementioned advantages make wavelet features important in predicting based on medical imaging.
To the best of our knowledge, in the field of identifying the LGG 1p/19q codeletion status, our XGBoost-7 was the first model that contained the least number of involved radiomic features, and still yielded reasonable results against the training and external test set. This appealing finding indicated a prospect, in which the physicians would be capable of detecting the LGG with 1p/19q codeletion by T1- or T2-weighted MR images and only seven radiomic features, without the need of invasive biopsy or tumor resection. Patients would receive suitable management plans as quickly as possible, hence attenuating the risks of treatment-induced adverse effects and unnecessary medical procedures.
However, some limitations still need to be addressed. First, the modest datasets used for training and validating tasks conflicted with the real-world situations, as a large number of LGG patients with known 1p/19q codeletion were not included. For the next steps, data of LGG patients from medical centers would be included to our trained model for evaluation. Second, the binary classification of LGG patients was insufficient, since this study lacked the crucial prognostic marker IDH 1 and 2 mutation [4,5]. In the future, we would conduct the multiclassification to contribute to preoperative treatment plans. Third, only MR images in the axial plane were considered. This unintendedly overlooked some potential characteristics of the tumor. For the next project, the authors would try to propose a model that is compatible with all MRI planes, thence comprehensively predict the 3D shape of the tumor. Fourth, also a constraint of most studies, only a retrospective database was considered; hence, the application on prospective cohorts would gauge the robustness of our model more completely. Finally, despite the stability and promising results of XGBoost algorithm towards the unbalanced database [45,46], and the given result on the training and external validation cohort, the above citations implied that the combination of the present model and deep learning algorithms [47] may significantly enhance the outcomes in the future.

4. Materials and Methods

Our workflow (which was exhibited in Figure 4), from data extraction to the external validation, comprised three main steps: (I) Data extraction from two public databases from former articles [10,34]: one database was used for training and another one was used as the external test set; (II) radiomic feature extraction and feature selection using SCC, XGBoost, and SHAP analysis; (III) applying the ML algorithm on refined features to predict the 1p/19q codeletion status of LGG patients on the training and external test set.

4.1. Patient Cohort

For the training database, we obtained 159 LGG patients (WHO Grade II and III) with confirmed preoperative diagnosis, histopathological result of LGG, 1p/19q mutation status, and complete region-of-interest (ROI) segmentation of tumors in three axial slices in NiFTI format from the database published on The Cancer Imaging Archive (TCIA) by Akkus et al. [10,48]. The National Biomedical Imaging Archive (NBIA) Retriever version 3.6, which is an open-source software for downloading medical images from TCIA, was used to download and extract the data. There were 57 patients with non-deleted 1p/19q condition and 102 co-deleted patients. Among the 102 co-deleted patients, there were 66 grade-II and 36 grade-III patients, compared with 38 and 19 patients in the respective grades of the non-deleted group. In terms of the tumor types, i.e., astrocytoma, oligo-astrocytoma, and oligodendroglioma, the number of patients diagnosed with oligo-astrocytoma accounted for the highest proportion of 61% regardless of the 1p/19q mutated condition, whereas people suffering from astrocytoma made up only 10.7% of the total patients.
For the external test set, we recruited 65 LGG patients from the previous study by Bakas et al. [34], which was published on TCIA [48]. The 1p/19q mutation status of all patients was confirmed by histopathological samples. In detail, 6 of the 13 1p/19q-co-deleted and 22 of the 52 1p/19q-non-deleted patients were WHO-Grade II LGGs, and all of the 13 1p/19q-co-deleted LGGs were oligodendrogliomas. The external test cohort was collected using the same steps as the training cohort with the support of the NBIA Retriever. Detailed information of patients was shown in Table 5.

4.2. Feature Extraction

We used 3D Slicer software [35] (version 4.10.2; released on 10 October 2012; last updated on May 17, 2019) to extract radiomic features from the MR images with ROI segmentations. We installed seven 3D Slicer extensions for the radiomic feature extractions, including SlicerRadiomics (https://github.com/AIM-Harvard/SlicerRadiomics, accessed on 20 March 2021) integrates PyRadiomics library [49] in 3D Slicer, supports calculation and extraction of radiomic features, DCMQI [50], PETDICOMExtension [51], QuantitativeReporting [50] (https://github.com/QIICR/QuantitativeReporting, accessed on 20 March 2021), SlicerDevelopmentToolbox (https://github.com/QIICR/SlicerDevelopmentToolbox, accessed on 20 March 2021), and SlicerRT [52].
Initially, the MR records were orderly imported to 3D Slicer for radiomic feature extraction. A total of 851 features derived from each record were afterwards stored in .tsv-formatted files. The features were classified into nine categories, e.g., original, wavelet high-high-high (HHH), wavelet high-high-low (HHL), wavelet HLH, wavelet HLL, wavelet LHH, wavelet LHL, wavelet LLH, and wavelet LLL. Each category comprised six sub-categories, namely first-order, Gray Level Co-occurrence Matrix (GLCM), Gray Level Size Zone (GLSZM), Gray Level Run Length Matrix (GLRLM), Neighbouring Gray Tone Difference Matrix (NGTDM), and Gray Level Dependence Matrix (GLDM), except for the original radiomics category with one sub-category (Shape) more than the others. However, in this study, we classified the features into four categories according to Aerts et al. [9], including Tumor Shape (containing 14 original shape features), Tumor Intensity (containing 18 original first-order features), Textures (containing 75 GLCM, GLSZM, GLRLM, NGTDM, and GLDM features), and 744 Wavelet features. The information about the radiomics classes was concretely described by Zwanenburg et al. [53] and Van Griethuysen et al. [49]

4.3. Data Mining

4.3.1. Determining the Ground-Truth Labels

After the feature extraction step, our training database contained 159 samples with 851 features. For the ground-truth labels, we labelled “n/n” (i.e., 1p/19q non-codeletion status) as “0”, and “d/d” (i.e., 1p/19q codeletion status) as “1”.

4.3.2. Select the Baseline Machine Learning Models

To evaluate which algorithms worked well on these radiomics features, we applied five ML algorithms to the training database. The ML algorithms included Logistic Regression (LR), k-Nearest Neighbors (kNN), Random Forest (RF), Adaptive Boosting (AdaBoost), and eXtreme Gradient Boosting (XGBoost). All ML algorithms were implemented using scikit-learn package in Python. All five algorithms were used inside grid search techniques (GridSearchCV method) to find the optimal parameters for preliminary assessment.

LR

LR is described as an algorithm used for calculating the correlations between independent variables toward one dependent variable, and the trendy prediction of related dependent values based on known independent inputs [40]. This algorithm can be divided into simple (SLR) and multiple LR (MLR). In this study, we applied MLR analysis to find whether there is a linear correlation between radiomic features and corresponding 1p/19q deletion status.
The following pattern exhibits the SLR equation:
y =   β o + β 1 X 1 +   β 2 X 2 + +   β n X n + ε
y, βo, β1…βn, X1…Xn, and ε are denoted as the dependent variable, the intercept constant, the slope (also known as the regression coefficient), the independent variables (the radiomic features), and the random error, respectively.

kNN

Evelyn Fix and Joseph Hodges introduced an algorithm called “k-Nearest Neighbors” in 1951 [54]. Its main principle based on the intuitive assumption that nearby cases usually have many similarities. In this study, kNN was used as a supervised ML classifier. It appraised which class each neighbor was in and calculated the most preferred class within k neighbors of the concerned observation. Further details about kNN could be found at [55].

RF

The RF classifier algorithm is the combination of various decision trees where each tree would play a role in voting for the most popular class. Eventually, each separated input data would be classified into its relative class [56]. This algorithm, specifically, could be sufficiently used in different tasks including multiple or binary classification, and regression analysis. RF, in this study, was used to solve the binary classification problem on detecting 1p/19q mutation.

AdaBoost

AdaBoost is a boosting algorithm, which integrates the downgrade components into one robust model. The weight assignations to each training sample and classifier are implemented, i.e., the weaker the item and the stronger the classifier, the higher weight is added, to make it contribute more significantly to the outcome. A systematic review by Ying et al. [43] revealed more information about AdaBoost, its contribution and drawbacks in the field of high-yield ML. Recently, numerous projects have exploited AdaBoost and other boosting algorithms to build models used for radiomics prediction.

XGBoost

Similar to AdaBoost, XGBoost is the application of a sufficient gradient boosting algorithm, and hence, it becomes a favored choice in the midst of boosting techniques for its superior predictive performance, exact classification, and the capacity of administering imbalanced data. To be more particular, L1 and L2 regularization are also included in this algorithm, which are responsible for handling sparseness and attenuating overfitting. Moreover, the clinical records, e.g., the database of LGG patients used in this project, usually experience discrepancy data, which can be addressed using XGBoost. The detailed equation of XGBoost and its applications could be found in [33,57].

4.4. Handling the Imbalance between Two Classes and Features Selection Using the Spearman’s Correlation Coefficient (SCC) and SHAP Analysis

Prior to features selection, it was necessary to address the data imbalance. We used the Synthetic Minority Over-sampling Technique (SMOTE) [36]. SMOTE helps select minority instances from each class from the dataset and generate a synthetic point between the instances to make the classes more balanced [36]. In this study, SMOTE can address the low specificity on the training set.
We implemented the first step of radiomic feature selection by using the SCC. The pair-wise correlation coefficient between each pair of radiomic features was calculated. In this experiment, 0.8 was the threshold for determining if one feature would be excluded or included. Every pair, of which the SCC exceeded 0.8, was retained, whereas the SCCs of below 0.8 were prone to the exclusion. From 851 features, 68 features were filtered out, and leaving 783 features for further considerations. Moreover, SHAP analysis and RFE were integrated to find out the optimal 7 features. Our 7 features can be found in Table 2 for reproducing the methods.

4.5. Statistical Analysis

Statistical analysis was performed using Python. Our model is trained using 5-fold cross validation method, and then it is applied on an external validation set. Measurement metrics included Sensitivity, Specificity, Accuracy, ROC curve, and AUC.
Sensitivity is defined as the percentage of the 1p/19q co-deleted patients correctly predicted by the model on total patients with 1p/19q codeletion.
Sensitivity   = True   Positive True   Positive + False   Negative
Specificity is calculated by the ratio of patients with 1p/19q non-deleted patients recognized by the model and all 1p/19q non-deleted patients.
Specificity   = True   Negative True   Negative + False   Positive
Accuracy of a model is validated by the number of its correct predictions dividing by the whole number of involved patients.
Accuracy   = True   Positive + True   Negative Total   patients
The ROC–AUC and precision–recall (PR) curves for each fold in five-fold cross validation were plotted to visualize the overall performances of each algorithm.

5. Conclusions

We proposed an XGBoost-based model with solely seven wavelet radiomic features, to implement the classification of LGG 1p/19q codeletion status in the cohort of LGG patients, which was believed to contribute to the non-invasive diagnosis, individual cancer therapy, and long-term management for patients diagnosed with LGG. In the future, we intend to apply the model to multicenter data with many patients to evaluate the predictive power of the model. Additionally, we plan to conduct our experiments with auto-segmentation, using the state-of-the-art deep learning (convolutional neural networks, recurrent neural networks, etc.). Hopefully, our future work can contribute more to the early diagnosis of LGG cancer with 1p/19q codeletion mutation in clinical practice.

Author Contributions

Conceptualization, Q.-H.K. and N.Q.K.L.; methodology, Q.-H.K. and N.Q.K.L.; validation, Q.-H.K., V.-H.L. and T.N.K.H.; formal analysis, Q.-H.K. and N.Q.K.L.; investigation, Q.-H.K. and T.N.K.H.; data curation, Q.-H.K. and N.Q.K.L.; writing—original draft preparation, Q.-H.K. and V.-H.L.; writing—review and editing, Q.-H.K., V.-H.L., T.N.K.H. and N.Q.K.L.; visualization, Q.-H.K. and V.-H.L.; supervision, N.Q.K.L.; funding acquisition, N.Q.K.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partly supported by the Taiwan Higher Education Sprout Project by the Ministry of Education (DP2-110-21121-01-A-06) and the Ministry of Science and Technology, Taiwan (MOST110-2221-E-038-001-MY2).

Institutional Review Board Statement

Ethical review and approval were waived for this study, due to all data deriving from TCIA public database.

Informed Consent Statement

Patient consent was waived because all data were derived from the public TCIA database.

Data Availability Statement

Our source codes and data are freely available at https://github.com/khanhlee/LGG-radiomics, accessed on 20 March 2021.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Patel, A.P.; Fisher, J.M.; Nichols, E.; Abdela, J.; Abdelalim, A.; Abraha, H.N.; Agius, D.; Alahdab, F.; Alam, T.; Allen, C.A. Global, regional, and national burden of brain and other CNS cancer, 1990–2016: A systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 2019, 18, 376–393. [Google Scholar] [CrossRef] [Green Version]
  2. Louis, D.N.; Perry, A.; Reifenberger, G.; von Deimling, A.; Figarella-Branger, D.; Cavenee, W.K.; Ohgaki, H.; Wiestler, O.D.; Kleihues, P.; Ellison, D.W. The 2016 World Health Organization Classification of Tumors of the Central Nervous System: A summary. Acta Neuropathol. 2016, 131, 803–820. [Google Scholar] [CrossRef] [Green Version]
  3. Cuccarini, V.; Erbetta, A.; Farinotti, M.; Cuppini, L.; Ghielmetti, F.; Pollo, B.; DiMeco, F.; Grisoli, M.; Filippini, G.; Finocchiaro, G.; et al. Advanced MRI may complement histological diagnosis of lower grade gliomas and help in predicting survival. J. Neuro-Oncol. 2015, 126, 279–288. [Google Scholar] [CrossRef] [PubMed]
  4. Goyal, A. Erratum. The T2-FLAIR–mismatch sign as an imaging biomarker for IDH and 1p/19q status in diffuse low-grade gliomas: A systematic review with a Bayesian approach to evaluation of diagnostic test performance. Neurosurg. Focus 2020, 48, E10. [Google Scholar] [CrossRef]
  5. Yogananda, C.G.B.; Yogananda, C.; Shah, B.R.; Yu, F.F.; Pinho, M.C.; Nalawade, S.S.; Murugesan, G.K.; Wagner, B.C.; Mickey, B.; Patel, T.R.; et al. A novel fully automated MRI-based deep-learning method for classification of 1p/19q co-deletion status in brain gliomas. Neurooncol. Adv. 2020, 2, vdaa066. [Google Scholar] [PubMed]
  6. Forst, D.A.; Nahed, B.V.; Loeffler, J.S.; Batchelor, T.T. Low-grade gliomas. Oncologist 2014, 19, 403–413. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Sanai, N.; Chang, S.; Berger, M.S. Low-grade gliomas in adults: A review. J. Neurosurg. 2011, 115, 948–965. [Google Scholar] [CrossRef]
  8. Ruiz, J.; Lesser, G.J. Low-grade gliomas. Curr. Treat. Options Oncol. 2009, 10, 231–242. [Google Scholar] [CrossRef]
  9. Franceschi, E.; Mura, A.; De Biase, D.; Tallini, G.; Pession, A.; Foschini, M.P.; Danieli, D.; Pizzolitto, S.; Zunarelli, E.; Lanza, G.; et al. The role of clinical and molecular factors in low-grade gliomas: What is their impact on survival? Future Oncol. 2018, 14, 1559–1567. [Google Scholar] [CrossRef]
  10. Akkus, Z.; Ali, I.; Sedlář, J.; Agrawal, J.P.; Parney, I.F.; Giannini, C.; Erickson, B.J. Predicting deletion of chromosomal arms 1p/19q in low-grade gliomas from MR images using machine intel-ligence. J. Digit. Imaging 2017, 30, 469–476. [Google Scholar] [CrossRef] [Green Version]
  11. Garcia, C.R.; Slone, S.A.; Pittman, T.; Clair, W.H.S.; Lightner, D.D.; Villano, J.L. Comprehensive evaluation of treatment and outcomes of low-grade diffuse gliomas. PLoS ONE 2018, 13, e0203639. [Google Scholar] [CrossRef] [PubMed]
  12. Ricard, D.; Kaloshi, G.; Amiel-Benouaich, A.; Lejeune, J.; Marie, Y.; Mandonnet, E.; Kujas, M.; Mokhtari, K.; Taillibert, S.; Laigle-Donadey, F.; et al. Dynamic history of low-grade gliomas before and after temozolomide treatment. Ann. Neurol. 2007, 61, 484–490. [Google Scholar] [CrossRef] [PubMed]
  13. Bent, M.J.V.D.; Brandes, A.; Taphoorn, M.J.B.; Kros, J.M.; Kouwenhoven, M.; Delattre, J.-Y.; Bernsen, H.J.J.A.; Frenay, M.; Tijssen, C.C.; Grisold, W.; et al. Adjuvant Procarbazine, Lomustine, and Vincristine Chemotherapy in Newly Diagnosed Anaplastic Oligodendroglioma: Long-Term Follow-Up of EORTC Brain Tumor Group Study 26951. J. Clin. Oncol. 2013, 31, 344–350. [Google Scholar] [CrossRef] [PubMed]
  14. Riche, M.; Amelot, A.; Peyre, M.; Capelle, L.; Carpentier, A.; Mathon, B. Complications after frame-based stereotactic brain biopsy: A systematic review. Neurosurg. Rev. 2020, 44, 301–307. [Google Scholar] [CrossRef]
  15. Malone, H.; Yang, J.; Hershman, D.L.; Wright, J.D.; Bruce, J.N.; Neugut, A.I. Complications Following Stereotactic Needle Biopsy of Intracranial Tumors. World Neurosurg. 2015, 84, 1084–1089. [Google Scholar] [CrossRef]
  16. Pouratian, N.; Schiff, D. Management of Low-Grade Glioma. Curr. Neurol. Neurosci. Rep. 2010, 10, 224–231. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Gillies, R.J.; Kinahan, P.; Hricak, H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016, 278, 563–577. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Chaddad, A.; Kucharczyk, M.; Daniel, P.; Sabri, S.; Jean-Claude, B.J.; Niazi, T.; Abdulkarim, B. Radiomics in Glioblastoma: Current Status and Challenges Facing Clinical Implementation. Front. Oncol. 2019, 9, 374. [Google Scholar] [CrossRef] [Green Version]
  19. Chen, C.; Ou, X.; Wang, J.; Guo, W.; Ma, X. Radiomics-Based Machine Learning in Differentiation Between Glioblastoma and Metastatic Brain Tumors. Front. Oncol. 2019, 9, 806. [Google Scholar] [CrossRef] [Green Version]
  20. Lee, G.; Lee, H.Y.; Park, H.; Schiebler, M.; van Beek, E.J.; Ohno, Y.; Seo, J.B.; Leung, A. Radiomics and its emerging role in lung cancer research, imaging biomarkers and clinical management: State of the art. Eur. J. Radiol. 2016, 86, 297–307. [Google Scholar] [CrossRef]
  21. Narang, S.; Lehrer, M.; Yang, D.; Lee, J.; Rao, A. Radiomics in glioblastoma: Current status, challenges and potential opportunities. Transl. Cancer Res. 2016, 5, 383–397. [Google Scholar] [CrossRef]
  22. Rafique, R.; Islam, S.R.; Kazi, J.U. Machine learning in the prediction of cancer therapy. Comput. Struct. Biotechnol. J. 2021, 19, 4003–4017. [Google Scholar] [CrossRef]
  23. Huang, C.; Clayton, E.A.; Matyunina, L.V.; McDonald, L.D.; Benigno, B.B.; Vannberg, F.; McDonald, J.F. Machine learning predicts individual cancer patient responses to therapeutic drugs with high accuracy. Sci. Rep. 2018, 8, 1–8. [Google Scholar] [CrossRef] [Green Version]
  24. Iqbal, M.J.; Javed, Z.; Sadia, H.; Qureshi, I.A.; Irshad, A.; Ahmed, R.; Malik, K.; Raza, S.; Abbas, A.; Pezzani, R.; et al. Clinical applications of artificial intelligence and machine learning in cancer diagnosis: Looking into the future. Cancer Cell Int. 2021, 21, 1–11. [Google Scholar] [CrossRef] [PubMed]
  25. Lu, C.-F.; Hsu, F.-T.; Hsieh, K.L.-C.; Kao, Y.-C.J.; Cheng, S.-J.; Hsu, J.B.-K.; Tsai, P.-H.; Chen, R.-J.; Huang, C.-C.; Yen, Y.; et al. Machine Learning–Based Radiomics for Molecular Subtyping of Gliomas. Clin. Cancer Res. 2018, 24, 4429–4436. [Google Scholar] [CrossRef] [Green Version]
  26. Akkus, Z.; Sedlar, J.; Coufalova, L.; Korfiatis, P.; Kline, T.L.; Warner, J.D.; Agrawal, J.; Erickson, B.J. Semi-automated segmentation of pre-operative low grade gliomas in magnetic resonance imaging. Cancer Imaging 2015, 15, 1–10. [Google Scholar] [CrossRef] [Green Version]
  27. Kocak, B.; Durmaz, E.S.; Ates, E.; Sel, I.; Gunes, S.T.; Kaya, O.K.; Zeynalova, A.; Kilickesmez, O. Radiogenomics of lower-grade gliomas: Machine learning–based MRI texture analysis for predicting 1p/19q codeletion status. Eur. Radiol. 2019, 30, 877–886. [Google Scholar] [CrossRef] [PubMed]
  28. Matsui, Y.; Maruyama, T.; Nitta, M.; Saito, T.; Tsuzuki, S.; Tamura, M.; Kusuda, K.; Fukuya, Y.; Asano, H.; Kawamata, T.; et al. Prediction of lower-grade glioma molecular subtypes using deep learning. J. Neuro-Oncol. 2019, 146, 321–327. [Google Scholar] [CrossRef]
  29. Aliotta, E.; Dutta, S.W.; Feng, X.; Tustison, N.J.; Batchala, P.P.; Schiff, D.; Lopes, M.B.; Jain, R.; Druzgal, T.J.; Mukherjee, S.; et al. Automated apparent diffusion coefficient analysis for genotype prediction in lower grade glioma: Association with the T2-FLAIR mismatch sign. J. Neuro-Oncol. 2020, 149, 1–11. [Google Scholar] [CrossRef]
  30. Rathore, S.; Mohan, S.; Bakas, S.; Sako, C.; Badve, C.; Pati, S.; Singh, A.; Bounias, D.; Ngo, P.; Akbari, H.; et al. Multi-institutional noninvasive in vivo characterization of IDH, 1p/19q, and EGFRvIII in glioma using neuro-Cancer Imaging Phenomics Toolkit (neuro-CaPTk). Neurooncol. Adv. 2020, 2, iv22–iv34. [Google Scholar]
  31. Batchala, P.; Muttikkal, T.; Donahue, J.; Patrie, J.; Schiff, D.; Fadul, C.; Mrachek, E.; Lopes, M.-B.; Jain, R.; Patel, S. Neuroimaging-Based Classification Algorithm for Predicting 1p/19q-Codeletion Status in IDH-Mutant Lower Grade Gliomas. Am. J. Neuroradiol. 2019, 40, 426–432. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Kim, D.; Wang, N.; Ravikumar, V.; Raghuram, D.R.; Li, J.; Patel, A.; Wendt, R.E., 3rd; Rao, G.; Rao, A. Prediction of 1p/19q codeletion in diffuse glioma patients using pre-operative multiparametric magnetic reso-nance imaging. Front. Comput. Neurosci. 2019, 13, 52. [Google Scholar] [CrossRef] [Green Version]
  33. Shboul, Z.A.; Chen, J.; Iftekharuddin, K.M. Prediction of Molecular Mutations in Diffuse Low-Grade Gliomas using MR Imaging Features. Sci. Rep. 2020, 10, 1–13. [Google Scholar] [CrossRef] [Green Version]
  34. Bakas, S.; Akbari, H.; Sotiras, A.; Bilello, M.; Rozycki, M.; Kirby, J.; Freymann, J.B.; Farahani, K.; Davatzikos, C. Advancing the Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features. Sci. Data 2017, 4, 170117. [Google Scholar] [CrossRef] [Green Version]
  35. Fedorov, A.; Beichel, R.; Kalpathy-Cramer, J.; Finet, J.; Fillion-Robin, J.-C.; Pujol, S.; Bauer, C.; Jennings, D.; Fennessy, F.; Sonka, M.; et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn. Reson. Imaging 2012, 30, 1323–1341. [Google Scholar] [CrossRef] [Green Version]
  36. Chawla, N.V.; Bowyer, K.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  37. Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
  38. Spearman Rank Correlation Coefficient. In The Concise Encyclopedia of Statistics; Springer: New York, NY, USA, 2008; pp. 502–505.
  39. van der Voort, S.R.; Incekara, F.; Wijnenga, M.; Kapas, G.; Gardeniers, M.; Schouten, J.W.; Starmans, M.; Nandoe Tewarie, R.; Lycklama, G.J.; French, P.J.; et al. Predicting the 1p/19q codeletion status of presumed low-grade glioma with an externally validated machine learning algorithm. Clin. Cancer Res. 2019, 25, 7455–7462. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Rundo, L.; Militello, C.; Russo, G.; Vitabile, S.; Gilardi, M.C.; Mauri, G. GTV cut for neuro-radiosurgery treatment planning: An MRI brain cancer seeded image segmentation method based on a cellular automata model. Nat. Comput. 2018, 17, 521–536. [Google Scholar] [CrossRef]
  41. Pereira, S.; Pinto, J.A.A.D.S.R.; Alves, V.; Silva, C. Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images. IEEE Trans. Med. Imaging 2016, 35, 1240–1251. [Google Scholar] [CrossRef]
  42. Chaddad, A.; Daniel, P.; Niazi, T. Radiomics Evaluation of Histological Heterogeneity Using Multiscale Textures Derived From 3D Wavelet Transformation of Multispectral Images. Front. Oncol. 2018, 8, 96. [Google Scholar] [CrossRef]
  43. Dettori, L.; Semler, L. A comparison of wavelet, ridgelet, and curvelet-based texture classification algorithms in computed tomography. Comput. Biol. Med. 2007, 37, 486–498. [Google Scholar] [CrossRef] [PubMed]
  44. Weyn, B.; van de Wouwer, G.; Scheunders, P.; van Dyck, D.; van Marck, E.; Jacob, W. Automated breast tumor diagnosis and grading based on wavelet chromatin texture description. Cytometry 1998, 33, 32–40. [Google Scholar] [CrossRef]
  45. Thongsuwan, S.; Jaiyen, S.; Padcharoen, A.; Agarwal, P. ConvXGB: A new deep learning model for classification problems based on CNN and XGBoost. Nucl. Eng. Technol. 2020, 53, 522–531. [Google Scholar] [CrossRef]
  46. Wang, C.; Deng, C.; Wang, S. Imbalance-XGBoost: Leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost. Pattern Recognit. Lett. 2020, 136, 190–197. [Google Scholar] [CrossRef]
  47. Le, N.Q.K.; Huynh, T.-T. Identifying SNAREs by Incorporating Deep Learning Architecture and Amino Acid Embedding Representation. Front. Physiol. 2019, 10. [Google Scholar] [CrossRef]
  48. Clark, K.; Vendt, B.; Smith, K.; Freymann, J.; Kirby, J.; Koppel, P.; Moore, S.; Phillips, S.; Maffitt, D.; Pringle, M.; et al. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. J. Digit. Imaging 2013, 26, 1045–1057. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Van Griethuysen, J.J.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.; Fillion-Robin, J.-C.; Pieper, S.; Aerts, H.J. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef] [Green Version]
  50. Herz, C.; Fillion-Robin, J.-C.; Onken, M.; Riesmeier, J.; Lasso, A.; Pinter, C.; Fichtinger, G.; Pieper, S.; Clunie, D.; Kikinis, R.; et al. dcmqi: An Open Source Library for Standardized Communication of Quantitative Image Analysis Results Using DICOM. Cancer Res. 2017, 77, e87–e90. [Google Scholar] [CrossRef] [Green Version]
  51. Ulrich, E.; Fedorov, A. 3D Slicer PET-DICOM Extension Documentation. 2017. Available online: https://www.slicer.org/wiki/Documentation/4.6/Extensions/PETDICOM (accessed on 20 March 2021).
  52. Pinter, C.; Lasso, A.; Wang, A.; Jaffray, D.; Fichtinger, G. SlicerRT: Radiation therapy research toolkit for 3D Slicer. Med. Phys. 2012, 39, 6332–6338. [Google Scholar] [CrossRef]
  53. Zwanenburg, A.; Vallières, M.; Abdalah, M.A.; Aerts, H.J.W.L.; Andrearczyk, V.; Apte, A.; Ashrafinia, S.; Bakas, S.; Beukinga, R.J.; Boellaard, R.; et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology 2020, 295, 328–338. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Fix, E.; Hodges, J.L. Nonparametric discrimination: Consistency properties. Randolph Field Tex. Proj. 1951, 21–49. [Google Scholar] [CrossRef]
  55. Kozma, L. k Nearest Neighbors Algorithm (kNN). 2008. Available online: http://www.lkozma.net/knn2.pdf (accessed on 20 March 2021).
  56. Pal, M. Random forest classifier for remote sensing classification. Int. J. Remote. Sens. 2005, 26, 217–222. [Google Scholar] [CrossRef]
  57. Le, N.Q.K.; Kha, Q.H.; Nguyen, V.H.; Chen, Y.-C.; Cheng, S.-J.; Chen, C.-Y. Machine Learning-Based Radiomics Signatures for EGFR and KRAS Mutations Prediction in Non-Small-Cell Lung Cancer. Int. J. Mol. Sci. 2021, 22, 9254. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Top 20 features extracted using SHAP analysis. The horizontal axis showed how much the features contribute to the output predictor variable via SHAP values. Features were ranked based on their contribution to the output predictor; the higher the SHAP value, the higher the rank of the corresponding feature. The vertical axis with the color ranged from blue to red represented the value of a feature from low to high, respectively. The density of the colored dots in any SHAP value represented the strength or weakness of the feature in that SHAP value range. The original first order Skewness feature had the greatest impact on the predictability of the model. The SHAP value of original first order Skewness was 0.48 (absolute value of −0.4 plus +0.08). The density of red and purple points concentrated in the range from 0 to +0.08 indicated that the predictive value of this feature was highest as its corresponding SHAP value was from 0 to +0.08. On the other hand, the feature wavelet HHH first order Median ranked last in the graph, with the lowest SHAP value 0.06 (absolute value −0.05 plus +0.01), although it was still a relatively important feature (density thick red and purple dots).
Figure 1. Top 20 features extracted using SHAP analysis. The horizontal axis showed how much the features contribute to the output predictor variable via SHAP values. Features were ranked based on their contribution to the output predictor; the higher the SHAP value, the higher the rank of the corresponding feature. The vertical axis with the color ranged from blue to red represented the value of a feature from low to high, respectively. The density of the colored dots in any SHAP value represented the strength or weakness of the feature in that SHAP value range. The original first order Skewness feature had the greatest impact on the predictability of the model. The SHAP value of original first order Skewness was 0.48 (absolute value of −0.4 plus +0.08). The density of red and purple points concentrated in the range from 0 to +0.08 indicated that the predictive value of this feature was highest as its corresponding SHAP value was from 0 to +0.08. On the other hand, the feature wavelet HHH first order Median ranked last in the graph, with the lowest SHAP value 0.06 (absolute value −0.05 plus +0.01), although it was still a relatively important feature (density thick red and purple dots).
Cancers 13 05398 g001
Figure 2. Performance results when using different numbers of top-rank features in predicting 1p/19 status using XGBoost algorithm. The optimal cut-off point belongs to the first seven features (accuracy = 87%).
Figure 2. Performance results when using different numbers of top-rank features in predicting 1p/19 status using XGBoost algorithm. The optimal cut-off point belongs to the first seven features (accuracy = 87%).
Cancers 13 05398 g002
Figure 3. Predicting 1p/19q status using XGBoost classifier on top seven radiomics features. Cross-validation is implemented to get see the model consistency. (A) ROC curve analysis (AUC = 0.87), (B) PRC curve analysis (AUC = 0.906).
Figure 3. Predicting 1p/19q status using XGBoost classifier on top seven radiomics features. Cross-validation is implemented to get see the model consistency. (A) ROC curve analysis (AUC = 0.87), (B) PRC curve analysis (AUC = 0.906).
Cancers 13 05398 g003
Figure 4. The workflow of our study. Firstly, data was extracted from two public database from former articles, one database was used for training and another one was used as the external test set; secondly, radiomic feature extraction and feature selection were performed using SCC, XGBoost, and SHAP analysis; finally, machine learning algorithm XGBoost classifier was applied on refined features to predict the 1p/19q codeletion status of LGG patients on training and external test set.
Figure 4. The workflow of our study. Firstly, data was extracted from two public database from former articles, one database was used for training and another one was used as the external test set; secondly, radiomic feature extraction and feature selection were performed using SCC, XGBoost, and SHAP analysis; finally, machine learning algorithm XGBoost classifier was applied on refined features to predict the 1p/19q codeletion status of LGG patients on training and external test set.
Cancers 13 05398 g004
Table 1. Comparative performance among different machine learning algorithms using all radiomics features and top 37 features. The experiments were done and reported via 5-fold cross validation and SMOTE technique.
Table 1. Comparative performance among different machine learning algorithms using all radiomics features and top 37 features. The experiments were done and reported via 5-fold cross validation and SMOTE technique.
FeaturesAlgorithmSensitivitySpecificityPrecisionAccuracyAUCAUPRC
All featuresLogistic Regression61.75 ± 5.1732.26 ± 6.6950.19 ± 9.3147.80 ± 5.380.510 ± 0.0950.687 ± 0.064
k-Nearest Neighbors56.20 ± 3.8727.97 ± 3.4644.19 ± 4.9842.16 ± 3.930.434 ± 0.0650.607 ± 0.027
Random Forest72.13 ± 3.0757.88 ± 7.7779.48 ± 12.3466.67 ± 3.090.685 ± 0.0610.799 ± 0.040
AdaBoost68.47 ± 7.3147.03 ± 15.8071.33 ± 11.3960.99 ± 10.440.599 ± 0.0820.740 ± 0.051
XGBoost73.16 ± 2.8461.51 ± 12.2581.19 ± 9.8369.17 ± 6.090.710 ± 0.0790.827 ± 0.057
37 featuresLogistic Regression64.95 ± 12.0535.71 ± 6.7944.10 ± 8.1548.49 ± 8.200.587 ± 0.1060.706 ± 0.091
k-Nearest Neighbors69.82 ± 8.0442.83 ± 13.2562.86 ± 8.6158.55 ± 9.760.590 ± 0.1160.720 ± 0.077
Random Forest74.10 ± 7.6054.16 ± 10.9475.29 ± 11.1866.61 ± 7.570.713 ± 0.0780.858 ± 0.041
AdaBoost71.80 ± 7.5046.67 ± 6.6770.67 ± 7.9462.28 ± 4.720.632 ± 0.0990.755 ± 0.090
XGBoost77.89 ± 5.7555.85 ± 7.4673.52 ± 3.9669.21 ± 4.830.753 ± 0.0580.809 ± 0.045
Table 2. Seven radiomics features using our final model.
Table 2. Seven radiomics features using our final model.
FormTypeMatrixName
original First OrderSkewness
WaveletLLHGLCMInverse Difference Moment Normalized (IDMN)
WaveletLHLGLCMInformational Measure of Correlation 1 (IMC1)
WaveletHLLFirst OrderKurtosis
WaveletHLHGLCMCluster Shade
WaveletHHHFirst OrderMean
WaveletLLLGLCMIMC1
Table 3. Performance comparisons between our XGBoost-7 and other typical models.
Table 3. Performance comparisons between our XGBoost-7 and other typical models.
StudyAlgorithmDatasetAccuracySensitivitySpecificityAUC
van der Voort et al.SVMLGG 28469.865.772.10.755
LGG 108 [39]69.373.261.70.723
Yogananda et al.Deep learningLGG 268 [5]93.50.99595.3
Akkus et al.CNNLGG 159 [10]87.793.382.2-
Shboul et al.XGBoostLGG 81-78830.83
LGG 23 [33]-75850.8
OursXGBoostLGG 159 [10]87.088.277.20.87
Table 4. Performance results of our model on different patient subgroups. The final model used XGBoost algorithm and top seven features. The results show that we had promising performance on cross-grade and cross-tumor subtype problems.
Table 4. Performance results of our model on different patient subgroups. The final model used XGBoost algorithm and top seven features. The results show that we had promising performance on cross-grade and cross-tumor subtype problems.
SubtypeTraining DataExternal Validation Data
AccSensSpecAUCAUPRCAccSensSpecAUCAUPRC
Grade283.7 ± 11.787.9 ± 6.676.3 ± 22.40.876 ± 0.110.891 ± 0.0782.116.7100.00.6950.855
380 ± 15.197.2 ± 25.947.4 ± 4.10.759 ± 0.200.815 ± 0.1388.650.096.60.8470.885
TypeAstrocytoma76.5 ± 8.625 ± 9.092.3 ± 11.90.85 ± 0.090.785 ± 0.078-----
Oligoastrocytoma80.4 ± 2.082.1 ± 0.978 ± 4.00.836 ± 0.040.858 ± 0.05-----
Oligodendroglioma93.3 ± 6.9100.0 ± 00.0 ± 00.726 ± 0.070.864 ± 0.0564.058.369.20.7330.828
-: there is no ‘d/d’ class in this group.
Table 5. Patients’ characteristics.
Table 5. Patients’ characteristics.
FeatureSubtypeTraining CohortExternal Test Cohort
d/d (n = 102)n/n (n = 57)d/d (n = 13)n/n (n = 52)
Grade266 (64.7%)38 (66.7%)6 (46.2%)22 (42.3%)
336 (35.3%)19 (33.3%)7 (53.8%)30 (57.7%)
TypeAstrocytoma4 (3.9%)13 (22.8%)0 (0%)21 (40.4%)
Oligoastrocytoma56 (54.9%)41 (71.9%)0 (0%)18 (34.6%)
Oligodendroglioma42 (41.2%)3 (5.3%)13 (100%)13 (25%)
“d/d” means 1p and 19q are co-deleted, “n/n” means neither 1p nor 19q were deleted.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kha, Q.-H.; Le, V.-H.; Hung, T.N.K.; Le, N.Q.K. Development and Validation of an Efficient MRI Radiomics Signature for Improving the Predictive Performance of 1p/19q Co-Deletion in Lower-Grade Gliomas. Cancers 2021, 13, 5398. https://doi.org/10.3390/cancers13215398

AMA Style

Kha Q-H, Le V-H, Hung TNK, Le NQK. Development and Validation of an Efficient MRI Radiomics Signature for Improving the Predictive Performance of 1p/19q Co-Deletion in Lower-Grade Gliomas. Cancers. 2021; 13(21):5398. https://doi.org/10.3390/cancers13215398

Chicago/Turabian Style

Kha, Quang-Hien, Viet-Huan Le, Truong Nguyen Khanh Hung, and Nguyen Quoc Khanh Le. 2021. "Development and Validation of an Efficient MRI Radiomics Signature for Improving the Predictive Performance of 1p/19q Co-Deletion in Lower-Grade Gliomas" Cancers 13, no. 21: 5398. https://doi.org/10.3390/cancers13215398

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop