Article

Outcome Prediction Based on Automatically Extracted Infarct Core Image Features in Patients with Acute Ischemic Stroke

by Manon L. Tolhuisen 1,2,*; Jan W. Hoving 2; Miou S. Koopman 2; Manon Kappelhof 2; Henk van Voorst 1; Agnetha E. Bruggeman 2; Adam M. Demchuck 3; Diederik W. J. Dippel 4; Bart J. Emmer 2; Serge Bracard 5; Francis Guillemin 6; Robert J. van Oostenbrugge 7,8; Peter J. Mitchell 9; Wim H. van Zwam 10; Michael D. Hill 11; Yvo B. W. E. M. Roos 12; Tudor G. Jovin 13; Olvert A. Berkhemer 2,4,14; Bruce C. V. Campbell 9,15; Jeffrey Saver 16; Phil White 17,18; Keith W. Muir 19; Mayank Goyal 11; Henk A. Marquering 1,2; Charles B. Majoie 2 and Matthan W. A. Caan 1,* on behalf of the MR CLEAN-NO IV and HERMES investigators
1 Department of Biomedical Engineering and Physics, Amsterdam UMC, Location AMC, 1105 AZ Amsterdam, The Netherlands
2 Department of Radiology and Nuclear Medicine, Amsterdam UMC, Location AMC, 1105 AZ Amsterdam, The Netherlands
3 Department of Clinical Neurosciences and Radiology, Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, AB T2N 1N4, Canada
4 Department of Neurology, Erasmus MC University Medical Center, 3015 GD Rotterdam, The Netherlands
5 Department of Diagnostic and Interventional Neuroradiology, IADI, Inserm, CHRU, Université de Lorraine, 54500 Nancy, France
6 CIC-Epidémiologie Clinique, 1433, CHRU, Inserm, Université de Lorraine, 54500 Nancy, France
7 Department of Neurology, Maastricht UMC, 6229 HX Maastricht, The Netherlands
8 Cardiovascular Research Institute Maastricht, 6229 ER Maastricht, The Netherlands
9 Department of Neurology, Royal Melbourne Hospital, Parkville, VIC 3050, Australia
10 Department of Radiology, Maastricht UMC, 6229 HX Maastricht, The Netherlands
11 Department of Clinical Neurosciences, University of Calgary, Calgary, AB T2N 1N4, Canada
12 Department of Neurology, Amsterdam UMC, Location AMC, 1105 AZ Amsterdam, The Netherlands
13 Department of Neurology, Stroke Institute, University of Pittsburgh Medical Center, Pittsburgh, PA 15213, USA
14 Department of Radiology and Nuclear Medicine, Erasmus MC University Medical Center, 3015 GD Rotterdam, The Netherlands
15 Department of Medicine, University of Melbourne, Parkville, VIC 3010, Australia
16 Department of Neurology and Comprehensive Stroke Center, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
17 Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, UK
18 Department of Neuroradiology, Newcastle upon Tyne Hospitals, Newcastle upon Tyne NE1 7RU, UK
19 Institute of Neuroscience and Psychology, University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
* Authors to whom correspondence should be addressed.
Diagnostics 2022, 12(8), 1786; https://doi.org/10.3390/diagnostics12081786
Submission received: 11 June 2022 / Revised: 9 July 2022 / Accepted: 18 July 2022 / Published: 23 July 2022
(This article belongs to the Special Issue Artificial Intelligence in Stroke Imaging)

Abstract

Follow-up infarct volume (FIV) measured on follow-up diffusion-weighted imaging (FU-DWI) is only moderately associated with functional outcome in acute ischemic stroke patients. However, FU-DWI may contain other imaging biomarkers that could aid in improving outcome prediction models for acute ischemic stroke. We included FU-DWI data from the HERMES, ISLES, and MR CLEAN-NO IV databases. Lesions were segmented using a deep learning model trained on the HERMES and ISLES datasets. We assessed the performance of three classifiers in predicting functional independence for the MR CLEAN-NO IV trial cohort based on: (1) FIV alone, (2) the most important features obtained from a trained convolutional autoencoder (CAE), and (3) radiomics. Furthermore, we investigated feature importance in the radiomic-feature-based model. For outcome prediction, we included 206 patients: 144 scans were included in the training set, 21 in the validation set, and 41 in the test set. The classifiers that included the CAE and the radiomic features showed AUC values of 0.88 and 0.81, respectively, while the model based on FIV had an AUC of 0.79. This difference was not found to be statistically significant. Feature importance results showed that lesion intensity heterogeneity received more weight than lesion volume in outcome prediction. This study suggests that predictions of functional outcome should not be based on FIV alone and that FU-DWI images capture additional prognostic information.

1. Introduction

Acute ischemic stroke (AIS) has a major impact on patients’ lives: the majority of AIS patients do not return to functional independence or their functional status before experiencing AIS—even with adequate treatment [1]. Accurate estimations of functional outcome after treatment could help to guide patients in setting realistic expectations and deciding on the focus of the rehabilitation process [2].
Follow-up infarct volume (FIV) as measured by radiological follow-up imaging has been suggested as a prognostic marker for functional outcome [3]. However, previous studies have indicated that FIV is only moderately associated with functional outcome: only 12% of functional outcomes are explained by FIV [3]. It has been suggested that current imaging techniques—such as computed tomography (CT), CT perfusion, and diffusion-weighted imaging (DWI)—are not able to accurately predict or measure infarcted tissue [4]. A complicating factor is the fact that progression from severely ischemic tissue to actual infarction is likely not constant over time and not always clearly visible on CT or DWI scans. In addition, cells within the ischemic region may potentially remain viable, depending on their tolerance to ischemia [4]. A previous study showed that ischemic lesions may still evolve in the subacute phase even after successful treatment, resulting in smaller or larger lesions after 1-week follow-up [5].
Previous studies have suggested that tissue estimated as infarcted on radiological imaging may contain additional prognostic information that could improve outcome prediction for AIS. For example, intensity heterogeneity on images in infarcted regions may reflect a variance in tissue vulnerability to ischemia and may represent the degree of ischemia [6]. In addition, Wang et al. showed that textural features, including heterogeneity, assessed based on T2 FLAIR and ADC images were associated with follow-up National Institutes of Health Stroke Scale (NIHSS) and modified Rankin Scale (mRS) scores [7]. Moreover, the shape of the lesion may contain important information on the potential progression from ischemia to infarcted tissue [8].
Previous studies have demonstrated the potential of machine learning (ML) to use automatically extracted imaging biomarkers for outcome prediction in AIS. For example, Qiu et al. [9] trained a support vector machine (SVM) to show that thrombus radiomic features were more predictive for recanalization in patients treated with intravenous alteplase than manually extracted thrombus features. In addition, Hilbert et al. [10] showed that features automatically extracted by an autoencoder combined with a dense layer outperformed ML models trained on handcrafted imaging biomarkers in predicting successful reperfusion and functional outcome at 90 days after stroke onset.
We hypothesized that infarct volume alone, as measured by DWI, is not sufficient to represent the pathological changes in the ischemic brain region and that DWI data may contain additional prognostic information that is still unknown. We compared the performance of an ML model based on FIV alone with a radiomic-features-based model and a model based on features obtained from a deep learning autoencoder network in the prediction of favorable functional outcome.

2. Materials and Methods

Figure 1 shows the workflow of this study, which can be split into two stages: feature extraction and outcome classification. Before extracting features from the study dataset, we used an external dataset to train the convolutional autoencoder (CAE) and a deep learning network for the delineation of the infarct lesions.

2.1. Datasets

The external dataset included patients from the HERMES collaboration [11] with available diffusion-weighted imaging (DWI) at 24 h; it also included DWI images from patients with subacute lesions from SISS ISLES 2015 [12]. The HERMES collaboration was formed to pool patient-level data from seven randomized, controlled clinical trials that showed the efficacy of endovascular treatment (EVT) over best medical management alone for patients with an occlusion of arteries of the proximal anterior circulation (ICA, M1, and M2) [11]. Each trial in the HERMES collaboration was approved by the relevant national or local medical ethical committee. All imaging data and clinical reports were anonymized, and informed consent was obtained for each patient according to each trial protocol. Patients included in these trials consented to participation in the individual trials as well as the use of their data for future research.
The study dataset included patients from the MR CLEAN-NO IV trial with available DWI at 24 h post-treatment. The MR CLEAN-NO IV trial was a randomized clinical trial in which the effect of immediate endovascular treatment (EVT) on 90-day functional outcome in patients with AIS was compared to that of intravenous treatment with alteplase (IVT) followed by endovascular treatment [13]. Patients who were directly admitted to an EVT-capable hospital were included if they were eligible for IVT and EVT and over the age of 18 with a proximal occlusion of the anterior circulation. Informed consent was obtained following a deferred consent procedure in accordance with national legislation in the three participating countries [14].
Since imaging was acquired in a multicenter and international setting, scanner types and image acquisition parameters varied. Images were acquired with a field strength of 1.5 or 3 Tesla. The slice thickness ranged from 3 to 6 mm. For this study, patients were excluded if their DWI images contained motion artifacts or registration errors that could not be corrected.

2.2. Pre-Processing and Image Analysis

2.2.1. Image Registration

All DWI images were transformed to standard MNI space via non-rigid registration using the SPM8 toolbox [15], resulting in isotropic voxel dimensions of 1 mm. Intensities were normalized using the white stripe normalization toolbox [16]. Images processed by the CAE were additionally subsampled to an isotropic voxel spacing of 3 mm before analysis.

2.2.2. Lesion Segmentation

To delineate the infarct lesions for the MR CLEAN-NO IV population, we trained a DeepMedic network [17]. DeepMedic is a multi-scale 3D convolutional neural network with a fully connected conditional random field and has been shown to be computationally efficient; it performed best at brain lesion segmentation in the ISLES 2015 challenge [12]. We trained the network on the HERMES DWI images for which lesion segmentations were available [3]. Images were split into a training set (70%), a validation set (10%), and a test set (20%). The trained network was applied to the MR CLEAN-NO IV image dataset. Each resulting segmentation was checked by one of two experienced observers (authors J.W.H. and M.L.T.) and, in cases of erroneous segmentation, manually adjusted using ITK-SNAP [18]. Hemorrhagic transformation was included within the lesion. For cases where no consensus could be reached, the segmentations were assessed by two expert neuroradiologists (authors C.B.M. and M.S.K., with >20 and >5 years of experience, respectively) to reach a consensus.

2.3. Feature Extraction

2.3.1. Convolutional Autoencoder

We developed and optimized a CAE for the reconstruction of DWI images (Figure 2) using the Keras library [19]. A CAE consists of several layers that downsample an image (encoder) to a compressed feature space (latent space), followed by several upsampling layers (decoder) that restore the original image dimensions. By learning to reduce the dimensions of the feature space and to reconstruct images from this low-dimensional latent space, the CAE learns the most important features that describe the source image.
The encoder consisted of four 4 × 4 × 4 convolutional layers with stride 2 and rectified linear unit activation. Since each of these layers halves the spatial dimensions of the feature space, it was favorable to use input dimensions that can be halved four times without remainder, i.e., multiples of 16. Therefore, we first zero-padded the input image to dimensions of 64 × 80 × 64. For each subsequent convolutional layer, the number of filters was doubled, starting at 16. Each convolutional layer was followed by group normalization to reduce the chance of overfitting. After the final convolutional layer of the encoder, the feature space was flattened, and a dense layer was added to reduce the number of features in the latent space to 100.
The decoder reversed the encoder by repeatedly upsampling the feature space by a factor of 2 and then applying a convolutional layer with stride 1, which maintains the feature space dimensions. The number of filters in the first four convolutional layers of the decoder mirrored that of the encoder in reverse order. Again, each convolutional layer was followed by group normalization. After the fourth convolutional layer, three additional convolutional layers were added to gradually reduce the fourth (channel) dimension of the feature space to 1, resulting in the input image dimensions. After the last convolutional layer, the output image was cropped to the original image dimensions.
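The dimensions described for the encoder can be checked with a few lines of arithmetic. The sketch below is not the authors' Keras code; it only tracks tensor shapes, and it assumes "same" padding (so that each stride-2 layer halves every spatial dimension), which the text does not state. The function names are hypothetical.

```python
import math


def conv_out(size, stride=2):
    """Output length of a strided convolution, assuming 'same' padding."""
    return math.ceil(size / stride)


def encoder_shapes(input_shape=(64, 80, 64), n_layers=4, first_filters=16):
    """Return the (x, y, z, filters) shape after each encoder layer."""
    shapes = []
    spatial = tuple(input_shape)
    filters = first_filters
    for _ in range(n_layers):
        # Each stride-2 layer halves every spatial dimension.
        spatial = tuple(conv_out(s) for s in spatial)
        shapes.append(spatial + (filters,))
        # The number of filters doubles for the next layer.
        filters *= 2
    return shapes


shapes = encoder_shapes()
flattened = math.prod(shapes[-1])  # features entering the dense layer
for shape in shapes:
    print(shape)
print("flattened features:", flattened)
```

Under these assumptions, the padded 64 × 80 × 64 volume shrinks to 4 × 5 × 4 with 128 filters, so the dense layer compresses 10,240 values into the 100 latent features.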
The loss function of the CAE was the mean squared error (MSE) between the source image and the resulting image.
For the development and optimization of the CAE, the data from the HERMES and ISLES challenges were combined and divided into a training (80%) set and a validation set (20%). To increase the number of training samples available, we performed data augmentation for the training set by flipping the images over the z-axis. The network was trained for 200 epochs with a batch size of 2. We used the validation set to optimize the CAE. After optimization, we extracted features from the MR CLEAN-NO IV DWI dataset.
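As an illustration of the reconstruction loss and the flip augmentation, the following NumPy sketch computes the MSE between two volumes and doubles a training set with mirrored copies. Which array axis corresponds to the flip described above depends on how the volumes are stored, so `axis=0` is an assumption, and the helper names and the random volume are hypothetical stand-ins rather than study data.

```python
import numpy as np


def mse(source, reconstruction):
    """Mean squared error between a source volume and its reconstruction."""
    source = np.asarray(source, dtype=np.float64)
    reconstruction = np.asarray(reconstruction, dtype=np.float64)
    return float(np.mean((source - reconstruction) ** 2))


def augment_with_flips(volumes, axis=0):
    """Return the original volumes followed by their mirrored copies.

    The flip axis is an assumption; it depends on the array orientation.
    """
    return list(volumes) + [np.flip(v, axis=axis) for v in volumes]


# Random stand-in for one padded DWI volume (not real image data).
rng = np.random.default_rng(0)
volume = rng.random((64, 80, 64))

augmented = augment_with_flips([volume])
print(len(augmented), "training volumes after augmentation")
print("MSE of a perfect reconstruction:", mse(volume, volume))
```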

2.3.2. Radiomics

Radiomic features extracted from medical images aim to identify and quantify pathological effects that may be invisible to the human eye [20]. Radiomic features are extracted from a region of interest (ROI), in our case, the infarct lesion, and include first-order statistics, shape, and textural features (Figure 3). Examples of first-order statistics are minimum, maximum, and mean intensity within the lesion. Shape features contain both 2D metrics, such as the maximum diameter within a slice of the ROI, and 3D metrics, including the 3D volume of the ROI. Textural features are computed using filtering methods and matrices that capture the relationships between multiple voxels. An example of these matrices is the gray-level size zone matrix (GLSZM), which counts, for each gray level, the zones of connected voxels that share that gray level, grouped by zone size. Metrics computed from this matrix represent coarseness and homogeneity within the lesion. In total, 100 radiomic features were extracted using the PyRadiomics toolkit [20].
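To make the GLSZM concrete, the sketch below builds one for a small 2D toy image and derives the two "large area" emphasis features from it. It is a simplified illustration rather than the PyRadiomics implementation: it works in 2D with 8-connectivity, assumes gray levels have already been discretized to integers starting at 1, and uses hypothetical function names and a made-up toy image.

```python
from collections import deque


def glszm(image, mask):
    """Count zones per (gray level, zone size) inside a 2D mask.

    A zone is a set of 8-connected masked pixels with the same gray level.
    Returns a dict mapping (gray_level, zone_size) -> number of zones.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    zones = {}
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            level, size = image[r][c], 0
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:  # breadth-first flood fill of one zone
                y, x = queue.popleft()
                size += 1
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and not seen[ny][nx]
                                and image[ny][nx] == level):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            zones[(level, size)] = zones.get((level, size), 0) + 1
    return zones


def large_area_emphasis(zones):
    """Return (high, low) gray-level large-area emphasis; levels must be >= 1."""
    n_zones = sum(zones.values())
    high = sum(n * i**2 * j**2 for (i, j), n in zones.items()) / n_zones
    low = sum(n * j**2 / i**2 for (i, j), n in zones.items()) / n_zones
    return high, low


# Toy 4 x 4 "lesion" with three gray levels; every pixel is inside the mask.
image = [
    [1, 1, 2, 2],
    [1, 1, 2, 3],
    [3, 1, 2, 3],
    [3, 3, 2, 2],
]
mask = [[True] * 4 for _ in range(4)]

zones = glszm(image, mask)
high, low = large_area_emphasis(zones)
print(zones)
print(high, low)
```

Both features weight each zone by the square of its size, so a few large homogeneous areas dominate the value; the gray-level term then determines whether bright or dark areas are emphasized.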

2.4. Classification

An SVM classifier was optimized for each of the three feature sets: FIV, radiomic features, and CAE features. The SVM classifier separated the outcome groups by optimizing a hyperplane that described the boundary with maximal distance between the features that belonged to the different outcome groups. We assessed its performance at accurately predicting functional independence, defined as an mRS score of 0–2, at 90 days. For the implementation of the SVM, we used the scikit-learn toolkit [21]. The optimization and testing were performed in the same way for all feature sets. From the MR CLEAN-NO IV DWI dataset, 80% of the DWI images were used for 5-fold cross-validation. The remaining 20% of the images were used to test the performance of the final classifier. Before optimizing the SVM, all features were normalized with the scikit-learn ‘RobustScaler’ function, which scales each feature based on its median and interquartile range. To optimize the SVM, we performed a grid search to find the optimal kernel type, kernel coefficient (gamma), and regularization parameter (C). The following options were used: a linear, radial basis function, polynomial, or sigmoid kernel, and a gamma of 1 × 10⁻² to 1 × 10³ in steps of one order of magnitude. The performance of the classifiers was evaluated based on the area under the receiver operating characteristic curve (AUC) computed for the test set. The AUCs were compared pairwise and tested for statistically significant differences, with the highest AUC as a reference, using DeLong’s test [22]. Classification accuracy, precision, and recall were also reported. For the radiomics-based classifier, we investigated feature importance based on the Shapley additive explanation (SHAP) values [23]. For the CAE, we visualized a representative predicted validation image and compared it to the original validation image.
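The optimization procedure described above could be set up roughly as follows with scikit-learn. This is a sketch under stated assumptions, not the authors' code: the feature table is synthetic (generated with `make_classification`), the grid of C values and the use of AUC as the cross-validation score are assumptions because the text does not specify them, and `max_iter` is capped only to keep the example fast.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import RobustScaler
from sklearn.svm import SVC

# Synthetic stand-in for a feature table (NOT study data): 206 rows, 20 features.
X, y = make_classification(n_samples=206, n_features=20, n_informative=5,
                           random_state=0)

# Hold out 41 cases for testing; the remaining 165 are used for cross-validation.
X_dev, X_test, y_dev, y_test = train_test_split(
    X, y, test_size=41, stratify=y, random_state=0)

# Scaling lives inside the pipeline so it is re-fitted on each training fold.
pipeline = Pipeline([
    ("scale", RobustScaler()),       # median / interquartile-range scaling
    ("svm", SVC(max_iter=10_000)),   # iteration cap only to keep the sketch fast
])

gammas = np.logspace(-2, 3, 6)       # 1e-2 ... 1e3, one value per order of magnitude
c_values = [0.1, 1.0, 10.0]          # assumed grid; not given in the text
param_grid = [
    {"svm__kernel": ["linear"], "svm__C": c_values},
    {"svm__kernel": ["rbf", "poly", "sigmoid"],
     "svm__gamma": gammas, "svm__C": c_values},
]

# 5-fold cross-validated grid search; scoring by AUC is an assumption.
search = GridSearchCV(pipeline, param_grid, scoring="roc_auc", cv=5)
search.fit(X_dev, y_dev)

# Evaluate the refitted best model once on the held-out test set.
test_auc = roc_auc_score(y_test, search.decision_function(X_test))
print(search.best_params_)
print("test AUC:", round(test_auc, 2))
```

Keeping the scaler inside the pipeline means the median and interquartile range are estimated from the training folds only, which avoids leaking information from the validation folds or the test set into the model.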

3. Results

3.1. Study Population

From the 307 patients with FU-DWI images in the HERMES dataset, we excluded 55 patients due to poor image quality. No images were excluded from the ISLES dataset (n = 64), which resulted in a total dataset of 316 images. From these images, 253 DWI images were included in the training set and 63 in the validation set.
The MR CLEAN-NO IV dataset contained 220 patients with available FU-DWI scans. We excluded 11 patients due to poor image quality and 3 patients due to uncorrectable registration errors. This resulted in 206 patients in the study population, from which 144 scans were included in the training set, 21 in the validation set, and 41 in the test set. The baseline and follow-up characteristics for the MR CLEAN-NO IV subpopulation and the overall study population are provided in the Supplemental Material, Table S1.

3.2. Autoencoder Image Reconstruction

The training MSE of the CAE was 2.0 × 10⁻³ (arbitrary units), and the validation error was 5.1 × 10⁻³. Figure 4 shows the reconstruction of a validation image (left) established by the CAE (middle) and the corresponding difference map (right). The difference map shows small intensity differences in most of the healthy brain regions. The largest differences in intensities were present at the transition between brain tissue and cerebrospinal fluid. It is of note that some predicted voxels within the lesion and ventricles also differed in intensity from the original. The CAE was able to reconstruct the lesion at a location similar to that of the original image.

3.3. Functional Outcome Prediction

Table 1 shows the results of the best-performing classifiers that were trained on FIV, CAE features, and radiomic features. We found the highest test accuracy for the FIV-based SVM classifier (0.74). The precision was highest for the radiomic-features-based SVM classifier (0.80), while the recall was highest for the SVM classifier based on FIV (0.73). Based on the AUC (Figure 5), the SVM classifier trained on radiomic features showed the best performance (0.88). However, this improvement in outcome prediction was not statistically significant compared to the SVM classifier based on FIV (p = 0.15) or the SVM classifier based on CAE features (p = 0.37).

3.4. Radiomic Feature Importance

Figure 6 lists the 15 radiomic features with the largest impact, based on SHAP values, on the outcome prediction generated by the SVM classifier. The majority of these were textural features [24]. The two most important features were ‘large area of high gray-level emphasis’ and ‘large area of low gray-level emphasis’, which are both based on the GLSZM. These features represented the presence of large areas with high or low intensities within the lesion and steered the classifier towards unfavorable functional outcome classifications. The lesion volume features with the most impact on the classification were mesh volume (volume of the 3D mesh reconstructed from the delineation) and voxel volume (number of lesion voxels multiplied by the volume of a single voxel). These features were in 9th and 10th place, respectively.

4. Discussion

We compared the predictive performance of ML models based on three different feature sets: FIV, radiomic, and CAE features. We showed that the accuracy of favorable outcome prediction based on radiological imaging characteristics was improved when using automatically extracted imaging biomarkers from FU-DWI images. However, we were unable to show statistically significant differences in independent data. We found that intensity heterogeneity in the FU-DWI lesion was most important for functional outcome prediction.
The model based on radiomic features most accurately predicted favorable functional outcome, and our SHAP analysis showed that its most important features were related to textural information. Thus, the SVM classifier weighted the decisions regarding predicted outcome mostly on texture and, to a lesser extent, on lesion volume. The most important textural features were related to intensity heterogeneity. This corresponds with the current literature [7] and may reflect the heterogeneity in tissue vulnerability. Possibly, intensity heterogeneity in our study population was related to the presence of hemorrhage, which is negatively associated with functional outcome [25]. We performed an exploratory analysis to study whether hemorrhage was present in patients with heterogeneous lesions. Together with an expert neuroradiologist (C.B.M.), we visually inspected the DWI and T2* images of the patients with high values for heterogeneity and negative SHAP values (which corresponded to predictions of unfavorable outcome). Information about treatment outcome was not provided. Hemorrhage could not be observed in these patients. This study suggests that functional outcome predictions should not be based on FIV alone as an imaging biomarker and that FU-DWI images capture additional prognostic information about the ischemic tissue in patients with a large vessel occlusion (LVO).
The radiomic-feature-based SVM outperformed the CAE-feature-based SVM and was best at correctly classifying patients with favorable outcomes: 20% of the patients for whom a favorable outcome was predicted by the radiomic-features-based SVM did not achieve functional independence, compared to 25% for the CAE-feature-based SVM. However, the recall for the radiomic-feature-based classifier was only 65%, while for the CAE-feature-based SVM, 73% of patients with favorable outcomes were selected. Considering these results, we think that the CAE-feature-based SVM is more appropriate for clinical decision making since patients with the potential for a favorable outcome should not be missed.
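The trade-off discussed here follows directly from the definitions of precision and recall. In the minimal sketch below, the counts are hypothetical and chosen only so that the ratios are easy to verify; they are not the confusion matrix of any classifier in this study.

```python
def precision_recall(tp, fp, fn):
    """Precision and recall for the 'favorable outcome' class.

    tp: predicted favorable, achieved functional independence
    fp: predicted favorable, did not achieve functional independence
    fn: predicted unfavorable, achieved functional independence
    """
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return precision, recall


# Hypothetical counts for illustration only (NOT taken from the study).
precision, recall = precision_recall(tp=8, fp=2, fn=4)
print(f"precision = {precision:.2f}")  # share of favorable predictions that were correct
print(f"recall    = {recall:.2f}")     # share of independent patients that were identified
```

A precision of 0.80 means that 20% of the patients predicted to have a favorable outcome did not achieve it, whereas recall measures how many patients with a favorable outcome the classifier identified; a model that should not miss such patients is therefore judged primarily on recall.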
An advantage of the use of CAE features over radiomic features is that no lesion delineations are required for feature extraction. In this study, lesion delineation required manual annotations, which is time-consuming and introduces user dependency. In addition, since the CAE features are based on the entire brain volume, information on surrounding tissue relative to the lesion is incorporated. A disadvantage of using the CAE is that features are less interpretable. Future studies could potentially perform activation visualization to study which information about the brain was most important for the classifier [26]. In addition, ischemic lesion location could be a feature of interest for predicting functional outcome [27].
This study has some limitations. First, selection bias might have occurred since, for functional outcome prediction, we only included patients who complied with the inclusion criteria of MR CLEAN-NO IV. Consequently, our results cannot be generalized to ischemic stroke patients with more distal occlusions, posterior circulation occlusions, or a stroke with minor symptoms. In addition, our results are not generalizable to patients not eligible for IVT and/or EVT, or who present outside the treatment window or with a baseline NIHSS < 2. Moreover, in our healthcare system, a follow-up MRI for AIS patients is mostly acquired only in a research setting. Therefore, we only included patients from centers that participated in the MR CLEAN-NO IV trial in whom a follow-up MRI was performed as a secondary outcome measure as required by the trial protocol. Second, the mRS score was used since it is a common endpoint in AIS trials for the assessment of independence in daily activities. However, it is coarse and mainly focuses on motor function, with less attention to the assessment of cognitive function and emotional processing. Third, the performance of the CAE may have been hampered by the optimization process of the CAE itself. Improving the CAE, for example, by adding more data to the training set, may improve the accuracy of classifications of functional independence by the SVM based on CAE features.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics12081786/s1, Table S1: Baseline and follow-up characteristics of the selected study and total population of the MR CLEAN-NO IV trial.

Author Contributions

Conceptualization, M.L.T., H.A.M., C.B.M. and M.W.A.C.; methodology, M.L.T.; software, M.L.T.; validation, M.L.T.; formal analysis, M.L.T.; investigation, M.L.T.; resources, M.L.T., A.E.B., H.v.V., M.K., A.M.D., D.W.J.D., B.J.E., S.B., F.G., R.J.v.O., P.J.M., W.H.v.Z., M.D.H., Y.B.W.E.M.R., T.G.J., O.A.B., B.C.V.C., J.S., P.W., K.W.M., M.G. and C.B.M.; data curation, M.L.T., J.W.H., M.S.K. and C.B.M.; writing—original draft preparation, M.L.T.; writing—review and editing, J.W.H., M.S.K., M.K., H.v.V., A.E.B., A.M.D., B.J.E., S.B., F.G., R.J.v.O., P.J.M., W.H.v.Z., M.D.H., T.G.J., O.A.B., B.C.V.C., J.S., P.W., K.W.M., M.G., H.A.M., C.B.M. and M.W.A.C.; visualization, M.L.T.; supervision, H.A.M., C.B.M. and M.W.A.C.; project administration, M.L.T.; funding acquisition, A.M.D., D.W.J.D., B.J.E., S.B., F.G., R.J.v.O., P.J.M., W.H.v.Z., M.D.H., Y.B.W.E.M.R., T.G.J., B.C.V.C., J.S., P.W., K.W.M., M.G. and C.B.M. All authors have read and agreed to the published version of the manuscript.

Funding

The CONTRAST consortium is supported by the Netherlands Cardiovascular Research Initiative (CVON), an initiative of the Dutch Heart Foundation, and by the Brain Foundation Netherlands, Medtronic, and Cerenovus. AMC and Erasmus MC received additional unrestricted funding on behalf of CONTRAST for the execution of MR CLEAN-NO IV from Stryker European Operations BV.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki. Ethical approval from the relevant institutional review boards and ethics committees was acquired by each of the individual trials in the pooled HERMES collaboration data as well as the MR CLEAN-NO IV trial, as reported in the original trial reports [13,28,29,30,31,32,33,34]. Ethical approval was received from: ‘Central Medical Ethics Committee and the Research Board of the Erasmus MC University Medical Center, Rotterdam, the Netherlands’, with approval code: MEC-2010-041, on 2 February 2010. MR CLEAN-NO IV received ethical approval from: ‘Central Medical Ethics Committee and the Research Board of the Erasmus MC University Medical Center, Rotterdam, the Netherlands’, with approval code MEC-2017-368 on 19 October 2017.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The SISS ISLES 2015 is an open dataset that is available after registration with https://www.smir.ch (accessed on 17 July 2022). Trial data can be made available on reasonable request via mrclean@erasmusmc.nl.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results. Dr. Majoie reports grants from CVON/the Dutch Heart Foundation, the TWIN Foundation, the European Commission, the Dutch Health Evaluation program, and Stryker outside the submitted work (paid to the institution) and is a shareholder of Nico.lab International Ltd. A. van der Lugt and D.W.J. Dippel are the recipients of unrestricted grants from the Dutch Heart Foundation, the Dutch Brain Foundation, the Netherlands Organisation for Health Research and Development, Health Holland Top Sector Life Science, Stryker, Medtronic, Penumbra Inc., and Cerenovus for the conduct of trials regarding acute treatment for stroke. Dr. Caan reports a grant from the TKI-PPP program, co-funded by the PPP Allowance made available by Health~Holland, Top Sector Life Sciences & Health and is a shareholder of Nico.lab International Ltd. Dr. H.A. Marquering is a co-founder and shareholder of Nico.lab International Ltd. Dr. Saver is an employee of the University of California. The University of California has patent rights in retrieval devices for stroke. The University of California Regents received payments, on the basis of clinical trial contracts, for the number of subjects enrolled in multicenter clinical trials sponsored by Medtronic, Stryker, Cerenovus, and Rapid Medical; Dr. Saver served as an unpaid site investigator under these contracts. Dr. Saver has received contracted hourly payments and travel reimbursement for services as a scientific consultant advising on rigorous trial design and conduct to Medtronic, Stryker, and Cerenovus. Dr. Saver has received contracted stock options for services as a scientific consultant advising on rigorous trial design and conduct to Rapid Medical. Prof. White declares institutional unrestricted educational grants from Medtronic, Penumbra, and Stryker, an institutional grant from Microvention, and personal consultancy work for Microvention. Prof. K. Muir has received fees for advisory board roles or speaking from Boehringer Ingelheim, Biogen, Bayer, Daiichi Sankyo, and ReNeuron, and receives institutional support from Boehringer Ingelheim for the ATTEST-2 trial.

References

  1. Jansen, I.G.H.; Mulder, M.J.H.L.; Goldhoorn, R.J.B. Endovascular treatment for acute ischaemic stroke in routine clinical practice: Prospective, observational cohort study (MR CLEAN Registry). BMJ 2018, 360, k949. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Langhammer, B.; Sunnerhagen, K.S.; Lundgren-Nilsson, Å.; Sällström, S.; Becker, F.; Stanghelle, J.K. Factors enhancing activities of daily living after stroke in specialized rehabilitation: An observational multicenter study within the Sunnaas International Network. Eur. J. Phys. Rehabil. Med. 2017, 53, 725–734. [Google Scholar] [CrossRef] [PubMed]
  3. Boers, A.M.M.; Jansen, I.G.H.; Beenen, L.F.M.; Devlin, T.G.; San Roman, L.; Heo, J.H.; Ribó, M.; Brown, S.; Almekhlafi, M.A.; Liebeskind, D.S.; et al. Association of follow-up infarct volume with functional outcome in acute ischemic stroke: A pooled analysis of seven randomized trials. J. Neurointerv. Surg. 2018, 10, 1137–1142. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Goyal, M.; Ospel, J.M.; Menon, B.; Almekhlafi, M.; Jayaraman, M.; Fiehler, J.; Psychogios, M.; Chapot, R.; Van Der Lugt, A.; Liu, J.; et al. Challenging the Ischemic Core Concept in Acute Ischemic Stroke Imaging. Stroke 2020, 51, 3147–3155. [Google Scholar] [CrossRef] [PubMed]
  5. Konduri, P.; van Kranendonk, K.; Boers, A.; Treurniet, K.; Berkhemer, O.; Yoo, A.J.; van Zwam, W.; van Oostenbrugge, R.; van der Lugt, A.; Dippel, D.; et al. The Role of Edema in Subacute Lesion Progression After Treatment of Acute Ischemic Stroke. Front. Neurol. 2021, 12, 705221. [Google Scholar] [CrossRef]
  6. Wu, O.; Christensen, S.; Hjort, N.; Dijkhuizen, R.M.; Kucinski, T.; Fiehler, J.; Thomalla, G.; Röther, J.; Østergaard, L. Characterizing physiological heterogeneity of infarction risk in acute human ischaemic stroke using MRI. Brain 2006, 129, 2384–2393. [Google Scholar] [CrossRef] [Green Version]
  7. Wang, H.; Lin, J.; Zheng, L.; Zhao, J.; Song, B.; Dai, Y. Texture analysis based on ADC maps and T2-FLAIR images for the assessment of the severity and prognosis of ischaemic stroke. Clin. Imaging 2020, 67, 152–159. [Google Scholar] [CrossRef]
  8. Frindel, C.; Rouanet, A.; Giacalone, M.; Cho, T.H.; Østergaard, L.; Fiehler, J.; Pedraza, S.; Baron, J.C.; Wiart, M.; Berthezène, Y.; et al. Validity of Shape as a Predictive Biomarker of Final Infarct Volume in Acute Ischemic Stroke. Stroke 2015, 46, 976–981. [Google Scholar] [CrossRef] [Green Version]
  9. Qiu, W.; Kuang, H.; Nair, J.; Assis, Z.; Najm, M.; McDougall, C.; McDougall, B.; Chung, K.; Wilson, A.T.; Goyal, M.; et al. Radiomics-based intracranial thrombus features on CT and CTA predict recanalization with intravenous alteplase in patients with acute ischemic stroke. Am. J. Neuroradiol. 2019, 40, 39–44. [Google Scholar] [CrossRef]
  10. Hilbert, A.; Ramos, L.A.; van Os, H.J.A.; Olabarriaga, S.D.; Tolhuisen, M.L.; Wermer, M.J.H.; Barros, R.S.; van der Schaaf, I.; Dippel, D.; Roos, Y.B.W.E.M.; et al. Data-efficient deep learning of radiological image data for outcome prediction after endovascular treatment of patients with acute ischemic stroke. Comput. Biol. Med. 2019, 115, 103516. [Google Scholar] [CrossRef]
  11. Goyal, M.; Menon, B.K.; Van Zwam, W.H.; Dippel, D.W.J.; Mitchell, P.J.; Demchuk, A.M.; Dávalos, A.; Majoie, C.B.L.M.; Van Der Lugt, A.; De Miquel, M.A.; et al. Endovascular thrombectomy after large-vessel ischaemic stroke: A meta-analysis of individual patient data from five randomised trials. Lancet 2016, 387, 1723–1731. [Google Scholar] [CrossRef]
  12. Maier, O.; Menze, B.H.; von der Gablentz, J.; Häni, L.; Heinrich, M.P.; Liebrand, M.; Winzeck, S.; Basit, A.; Bentley, P.; Chen, L.; et al. ISLES 2015—A public evaluation benchmark for ischemic stroke lesion segmentation from multispectral MRI. Med. Image Anal. 2017, 35, 250–269. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. LeCouffe, N.E.; Kappelhof, M.; Treurniet, K.M.; Rinkel, L.A.; Bruggeman, A.E.; Berkhemer, O.A.; Wolff, L.; van Voorst, H.; Tolhuisen, M.L.; Dippel, D.W.J.; et al. A Randomized Trial of Intravenous Alteplase before Endovascular Treatment for Stroke. N. Engl. J. Med. 2021, 385, 1833–1844. [Google Scholar] [CrossRef] [PubMed]
  14. Kompanje, E.J.O.; van Dijck, J.T.J.M.; Chalos, V.; van den Berg, S.A.; Janssen, P.M.; Nederkoorn, P.J.; van der Jagt, M.; Citerio, G.; Stocchetti, N.; Dippel, D.W.J.; et al. Informed consent procedures for emergency interventional research in patients with traumatic brain injury and ischaemic stroke. Lancet Neurol. 2020, 19, 1033–1042. [Google Scholar] [CrossRef]
  15. Friston, K.J.; Ashburner, J.T.; Kiebel, S.; Nichols, T.E.; Penny, W.D. (Eds.) Statistical Parametric Mapping: The Analysis of Functional Brain Images, 1st ed.; Elsevier: Amsterdam, The Netherlands; Academic Press: Cambridge, MA, USA, 2007; ISBN 9780123725608. [Google Scholar]
  16. Shinohara, R.T.; Sweeney, E.M.; Goldsmith, J.; Shiee, N.; Mateen, F.J.; Calabresi, P.A.; Jarso, S.; Pham, D.L.; Reich, D.S.; Crainiceanu, C.M. Statistical normalization techniques for magnetic resonance imaging. NeuroImage Clin. 2014, 6, 9–19. [Google Scholar] [CrossRef] [Green Version]
  17. Kamnitsas, K.; Ledig, C.; Newcombe, V.F.J.; Simpson, J.P.; Kane, A.D.; Menon, D.K.; Rueckert, D.; Glocker, B. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med. Image Anal. 2017, 36, 61–78. [Google Scholar] [CrossRef]
  18. Yushkevich, P.A.; Piven, J.; Hazlett, H.C.; Smith, R.G.; Ho, S.; Gee, J.C.; Gerig, G. User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliability. Neuroimage 2006, 31, 1116–1128. [Google Scholar] [CrossRef] [Green Version]
  19. Chollet, F.; Keras Team. GitHub. 2015. Available online: https://github.com/fchollet/keras (accessed on 17 July 2022).
  20. Van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.C.; Pieper, S.; Aerts, H.J.W.L. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef] [Green Version]
  21. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in {P}ython. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  22. DeLong, E.R.; DeLong, D.M.; Clarke-Pearson, D.L. Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach. Biometrics 1988, 44, 837–845. [Google Scholar] [CrossRef]
  23. Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS’17:), Long Beach, CA, USA, 4–9 December 2017; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 32, pp. 4765–4774. [Google Scholar]
  24. Zwanenburg, A.; Leger, S.; Vallières, M.; Löck, S. Image biomarker standardisation initiative. arXiv 2016, arXiv:1612.07003. [Google Scholar] [CrossRef] [Green Version]
  25. Van Kranendonk, K.R.; Treurniet, K.M.; Boers, A.M.M.; Berkhemer, O.A.; Van Den Berg, L.A.; Chalos, V.; Lingsma, H.F.; Van Zwam, W.H.; Van Der Lugt, A.; Van Oostenbrugge, R.J.; et al. Hemorrhagic transformation is associated with poor functional outcome in patients with acute ischemic stroke due to a large vessel occlusion. J. Neurointerv. Surg. 2019, 11, 464–468. [Google Scholar] [CrossRef]
  26. Rios, T.; Van Stein, B.; Menzel, S.; Back, T.; Sendhoff, B.; Wollstadt, P. Feature Visualization for 3D Point Cloud Autoencoders. In Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK, 19–24 July 2020. [Google Scholar] [CrossRef]
  27. Xie, Y.; Oster, J.; Micard, E.; Chen, B.; Douros, I.K.; Liao, L.; Zhu, F.; Soudant, M.; Felblinger, J.; Guillemin, F.; et al. Impact of Pretreatment Ischemic Location on Functional Outcome after Thrombectomy. Diagnostics 2021, 11, 2038. [Google Scholar] [CrossRef] [PubMed]
  28. Berkhemer, O.A.; Fransen, P.S.S.; Beumer, D.; van den Berg, L.A.; Lingsma, H.F.; Yoo, A.J.; Schonewille, W.J.; Vos, J.A.; Nederkoorn, P.J.; Wermer, M.J.H.; et al. A Randomized Trial of Intraarterial Treatment for Acute Ischemic Stroke. N. Engl. J. Med. 2014, 372, 11–20. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Saver, J.L.; Goyal, M.; Bonafe, A.; Diener, H.-C.; Levy, E.I.; Pereira, V.M.; Albers, G.W.; Cognard, C.; Cohen, D.J.; Hacke, W.; et al. Stent-Retriever Thrombectomy after Intravenous t-PA vs. t-PA Alone in Stroke. N. Engl. J. Med. 2015, 372, 2285–2295. [Google Scholar] [CrossRef] [Green Version]
  30. Campbell, B.C.V.; Mitchell, P.J.; Kleinig, T.J.; Dewey, H.M.; Churilov, L.; Yassi, N.; Yan, B.; Dowling, R.J.; Parsons, M.W.; Oxley, T.J.; et al. Endovascular Therapy for Ischemic Stroke with Perfusion-Imaging Selection. N. Engl. J. Med. 2015, 372, 1009–1018. [Google Scholar] [CrossRef] [Green Version]
  31. Jovin, T.G.; Chamorro, A.; Cobo, E.; de Miquel, M.A.; Molina, C.A.; Rovira, A.; San Román, L.; Serena, J.; Abilleira, S.; Ribó, M.; et al. Thrombectomy within 8 Hours after Symptom Onset in Ischemic Stroke. N. Engl. J. Med. 2015, 372, 2296–2306. [Google Scholar] [CrossRef] [Green Version]
  32. Goyal, M.; Demchuk, A.M.; Menon, B.K.; Eesa, M.; Rempel, J.L.; Thornton, J.; Roy, D.; Jovin, T.G.; Willinsky, R.A.; Sapkota, B.L.; et al. Randomized assessment of rapid endovascular treatment of ischemic stroke. N. Engl. J. Med. 2015, 372, 1019–1030. [Google Scholar] [CrossRef] [Green Version]
  33. Muir, K.W.; Ford, G.A.; Messow, C.M.; Ford, I.; Murray, A.; Clifton, A.; Brown, M.M.; Madigan, J.; Lenthall, R.; Robertson, F.; et al. Endovascular therapy for acute ischaemic stroke: The Pragmatic Ischaemic Stroke Thrombectomy Evaluation (PISTE) randomised, controlled trial. J. Neurol. Neurosurg. Psychiatry 2017, 88, 38–44. [Google Scholar] [CrossRef] [Green Version]
  34. Bracard, S.; Ducrocq, X.; Mas, J.L.; Soudant, M.; Oppenheim, C.; Moulin, T.; Guillemin, F. Mechanical thrombectomy after intravenous alteplase versus alteplase alone after stroke (THRACE): A randomised controlled trial. Lancet Neurol. 2016, 15, 1138–1147. [Google Scholar] [CrossRef]
Figure 1. Study workflow for functional outcome prediction. Three different feature sets were extracted: follow-up infarct volume, features extracted by a convolutional autoencoder, and radiomic features. Each feature set was split into a training (80%) set and a test (20%) set. A support vector machine (SVM) was trained on the training set to classify favorable outcome. The SVMs were tested on the test set. The results were evaluated for each SVM.
Figure 2. The convolutional autoencoder architecture. The dimensions of the input image were 64 × 80 × 64. The encoder consisted of four 4 × 4 × 4 convolutional layers with stride 2 and rectified linear unit activation. For each subsequent convolutional layer, the number of filters was doubled, starting at 16. Each convolutional layer was followed by group normalization. After the final convolutional layer of the encoder, the feature space was flattened, and a dense layer was added. The decoder contained the same components as the encoder in the opposite direction, except that the feature space was upsampled first by a factor of 2, and the stride of the convolutional layers was kept at 1. After the fourth convolutional layer, three additional convolutional layers reduced the fourth dimension of feature space to 1, resulting in the original image dimensions.
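The encoder dimensions described in the Figure 2 caption can be traced with a little arithmetic. The following is a minimal sketch, not the authors' implementation: it assumes "same" padding, so that each stride-2 convolution halves every spatial dimension (the caption does not state the padding), and the helper names `conv_output_shape` and `encoder_shapes` are hypothetical.

```python
import math

def conv_output_shape(spatial, stride):
    """Spatial size after a strided convolution, ASSUMING 'same' padding."""
    return tuple(math.ceil(n / stride) for n in spatial)

def encoder_shapes(input_shape=(64, 80, 64), n_layers=4, first_filters=16, stride=2):
    """Return (spatial_shape, n_filters) after each encoder convolution."""
    shapes = []
    spatial, filters = input_shape, first_filters
    for _ in range(n_layers):
        spatial = conv_output_shape(spatial, stride)
        shapes.append((spatial, filters))
        filters *= 2  # the caption states that the filter count doubles per layer
    return shapes

shapes = encoder_shapes()
for i, (spatial, filters) in enumerate(shapes, start=1):
    print(f"conv {i}: spatial={spatial}, filters={filters}")

# Length of the flattened feature space that feeds the dense layer.
last_spatial, last_filters = shapes[-1]
flattened = math.prod(last_spatial) * last_filters
print("flattened length before the dense layer:", flattened)
```

Under this padding assumption, the fourth convolution yields a 4 × 5 × 4 volume with 128 filters, i.e., 10,240 values before the dense layer. A different padding scheme would change these numbers.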
Figure 3. Illustration of the three radiomic feature classes. Radiomic features consist of shape, texture, and first-order statistics features. Shape features describe the 2D and 3D size and shape of the lesion. Textural features describe the intensity distribution and relations between neighboring voxels. First-order statistics describe the intensity distributions of the lesion.
Figure 4. Example of image reconstruction using a trained convolutional autoencoder. (Left) An axial slice of the original validation image. (Middle) The corresponding slice of the predicted image. (Right) The absolute difference between the normalized original and predicted images.
Figure 5. Receiver operating characteristic curves for the best-performing support vector machine model based on three different inputs: features extracted by a convolutional autoencoder, radiomic features, and follow-up infarct volume.
Figure 6. SHAP summary plot showing the top 15 radiomic features (and their feature classes) in terms of impact on the classification based on the SHAP values. Negative and positive SHAP values represent unfavorable and favorable outcome classifications, respectively. The feature values are represented by a color map, ranging from blue (low value) to red (high value). Abbreviations of second-order radiomic feature classes in gray-level matrices: size zone (glszm), dependence (gldm), and run length (glrlm).
Table 1. Training and testing accuracy, AUC, precision, and recall for the best-performing SVM classifiers based on FIV, the autoencoder features, and the radiomic features. The p-values resulting from DeLong's tests against the radiomic features are presented in the last column.
| Feature Extraction Method | Training Accuracy (n = 144) | Testing Accuracy (n = 41) | AUC (n = 41) | Precision (n = 41) | Recall (n = 41) | DeLong's Test p-Value |
|---|---|---|---|---|---|---|
| FIV only * | 0.73 | 0.74 | 0.79 | 0.78 | 0.73 | 0.15 |
| Autoencoder ** | 0.76 | 0.71 | 0.81 | 0.70 | 0.71 | 0.37 |
| Radiomics *** | 0.75 | 0.71 | 0.88 | 0.80 | 0.65 |  |
* SVM parameters: {C: 1000, gamma: 0.01, kernel: rbf}, ** SVM parameters: {C: 0.1, gamma: 0.01, kernel: linear}, *** SVM parameters: {C: 1, gamma: 0.001, kernel: sigmoid}.
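For readers less familiar with the metrics in Table 1, the sketch below shows how accuracy, precision, recall, and AUC are computed for a binary classifier. The labels and scores are hypothetical toy values, not study data. Coding favorable outcome as 1 and using a 0.5 decision threshold are assumptions made purely for illustration, and the function names are likewise hypothetical.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn

def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical toy data: 1 = favorable outcome (assumed coding), 0 = unfavorable.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 decision threshold

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
accuracy = (tp + tn) / (tp + fp + tn + fn)
precision = tp / (tp + fp)  # share of predicted positives that are truly positive
recall = tp / (tp + fn)     # share of true positives that were detected
auc_value = auc(y_true, scores)

print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} AUC={auc_value:.4f}")
```

Unlike the other three metrics, the AUC does not depend on the decision threshold, which is one reason a model can have a higher AUC but a lower recall than another.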
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tolhuisen, M.L.; Hoving, J.W.; Koopman, M.S.; Kappelhof, M.; van Voorst, H.; Bruggeman, A.E.; Demchuck, A.M.; Dippel, D.W.J.; Emmer, B.J.; Bracard, S.; et al. Outcome Prediction Based on Automatically Extracted Infarct Core Image Features in Patients with Acute Ischemic Stroke. Diagnostics 2022, 12, 1786. https://doi.org/10.3390/diagnostics12081786
