Next Article in Journal
The Pathophysiology of Collateral Circulation in Acute Ischemic Stroke
Next Article in Special Issue
An Efficient Combination of Convolutional Neural Network and LightGBM Algorithm for Lung Cancer Histopathology Classification
Previous Article in Journal
Physical Functional Ability and Quantitative Assessment of the Multifidus Muscle of the Lumbar Spine in the Elderly
Previous Article in Special Issue
Landmark-Assisted Anatomy-Sensitive Retinal Vessel Segmentation Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine-Learning-Based Diagnostics of Cardiac Sarcoidosis Using Multi-Chamber Wall Motion Analyses

1
Institute for Radiology, Nuclear Medicine and Molecular Imaging, Heart and Diabetes Center North Rhine Westphalia, Bad Oeynhausen, University of Bochum, 32545 Bochum, Germany
2
Department of Engineering Systems & Environment, University of Virginia, Charlottesville, VA 22904, USA
3
Clinic for Electrophysiology, Heart and Diabetes Center North-Rhine Westphalia, Ruhr-University of Bochum, 32545 Bad Oeynhausen, Germany
4
Clinic for Thoracic and Cardiovascular Surgery, Heart and Diabetes Center North-Rhine Westphalia, Ruhr-University of Bochum, 32545 Bad Oeynhausen, Germany
5
Cardiology Institute and Clinic, Poznan University of Medical Sciences, 61-701 Poznan, Poland
*
Author to whom correspondence should be addressed.
Diagnostics 2023, 13(14), 2426; https://doi.org/10.3390/diagnostics13142426
Submission received: 26 May 2023 / Revised: 13 July 2023 / Accepted: 19 July 2023 / Published: 20 July 2023
(This article belongs to the Special Issue 2nd Edition: AI/ML-Based Medical Image Processing and Analysis)

Abstract

:
Background: Hindered by its unspecific clinical and phenotypical presentation, cardiac sarcoidosis (CS) remains a challenging diagnosis. Objective: Utilizing cardiac magnetic resonance imaging (CMR), we acquired multi-chamber volumetrics and strain feature tracking for a support vector machine learning (SVM)-based diagnostic approach to CS. Method: Forty-five CMR-negative (CMR(−), 56.5(53.0;63.0)years), eighteen CMR-positive (CMR(+), 64.0(57.8;67.0)years) sarcoidosis patients and forty-four controls (CTRL, 56.5(53.0;63.0)years)) underwent CMR examination. Cardiac parameters were processed using the classifiers of logistic regression, KNN(K-nearest-neighbor), DT (decision tree), RF (random forest), SVM, GBoost, XGBoost, Voting and feature selection. Results: In a three-cluster analysis of CTRL versus vs. CMR(+) vs. CMR(−), RF and Voting classifier yielded the highest prediction rates (81.82%). The two-cluster analysis of CTRL vs. all sarcoidosis (All Sarc.) yielded high prediction rates with the classifiers logistic regression, RF and SVM (96.97%), and low prediction rates for the analysis of CMR(+) vs. CMR(−), which were augmented using feature selection with logistic regression (89.47%). Conclusion: Multi-chamber cardiac function and strain-based supervised machine learning provides a non-contrast approach to accurately differentiate between healthy individuals and sarcoidosis patients. Feature selection overcomes the algorithmically challenging discrimination between CMR(+) and CMR(−) patients, yielding high accuracy predictions. The study findings imply higher prevalence of cardiac involvement than previously anticipated, which may impact clinical disease management.

1. Introduction

Sarcoidosis is a systemic, non-caseating granulomatous disease of unclear etiology affecting multiple organs [1]. The presentation of cardiac sarcoidosis ranges from asymptomatic to palpitations, syncope, symptoms of congestive heart failure and sudden cardiac death [2]. Although cardiac sarcoidosis (CS) is considered rare with symptomatic involvement estimated at 5% [3], it remains a detrimental predictor of mortality [4]. Moreover, autopsy results revealed that the majority of CS remains unrecognized, with actual cardiac involvement detected in approximately 27% of sarcoidosis patients [5]. Additionally, delayed immunosuppressive therapy decreased CS transplant-free survival from 83% to 53% at 10 years if heart failure was present at clinical presentation [6]. Hence, CS is often considered a clinical chameleon, further exemplified by the observation that the ECG is normal in 25% of CS patients despite extensive cardiac involvement [7]. Moreover, due to the patchy nature of myocardial involvement, endomyocardial biopsies yield an insufficient diagnostic rate [8]. Along with CMR(+)-associated congestive heart failure and conduction abnormalities, sudden death is one of the most common causes of mortality [9]. Therefore, methods for earlier and improved detection rate are essential.
Cardiac magnetic resonance (CMR) imaging is an effective diagnostic tool for CS detection [10], establishing diagnostic superiority over echocardiography [11]. Its diagnostic accuracy was underlined by a two-fold higher rate of CS detection with CMR compared to the standard guideline-based clinical evaluation [12]. In particular, late gadolinium enhancement (LGE) pattern is of high diagnostic and prognostic value and increases the odds of all-cause mortality and arrhythmogenic events for CS [13]. Nonetheless, the recent literature presents worrisome inconsistency among observers (low interobserver agreement) of LGE imaging, even at experienced centers, underscoring its limitation in CS detection [11,14]. Despite state-of-the-art cardiomyopathy protocols incorporating cardiac cine imaging, LGE and tissue characterization sequences, such as native and post-contrast T1-mapping, extracellular volume (ECV) quantification and T2-mapping [10], the CS detection rate appears to require improvement beyond the diagnostic performance of these features.
Cardiac wall deformation characterized by strain has been described as a promising diagnostic feature for the purpose of CS detection. Recent echocardiographic studies have demonstrated diagnostic discrimination based on cardiac functional and strain features between sarcoidosis patients and controls [15,16]. Moreover, CMR-quantified strain identified a potential early diagnostic marker of impaired left ventricular global longitudinal strain of sarcoidosis patients with otherwise normal CMR findings [17]. With the objective of optimizing CMR diagnostic accuracy, we hypothesize that multi-chamber-derived cardiac strain and function yield accurate diagnostic performance for CS. Analyses of the CMR-attained broad spectrum of cardiac features demands algorithmic assistance in the form of machine learning. Machine learning is a function of artificial intelligence and performs complex classification, regression and prediction of data matrices extending beyond the capacity of traditional prediction approaches. For example, a recently conducted study with similar design demonstrated competitive diagnostic accuracies for amyloidotic cardiomyopathy detection achieved via supervised machine learning [18]. Additionally, strain- and cardiac-function-based diagnostics offer novel perspectives for non-contrast CMR examination, which is relevant for sarcoidosis patients as hypercalcemia-induced renal dysfunction is not uncommon [19,20].

2. Methods

2.1. Study Cohort

This is a retrospective observational study, conducted at a single center. The study cohort consisted of 63 systemic sarcoidosis patients in whom the diagnosis was histologically validated from various tissues, including skin, lung or lymph nodes. The control group (CTRL) consisted of healthy volunteers. All volunteers submitted written informed consent prior to CMR examination. CTRL subjects were excluded if they had a medical history of cardiovascular disease, medication or surgery for cardiovascular disease, cardiovascular risk factors and metabolic disorders. All enrolled subjects received CMR at our center for evaluation of cardiac involvement. In patients with biopsy-proven extracardiac sarcoidosis, CMR was conducted on the basis of the Heart Rhythm Society diagnostic criteria [21] upon clinical suspicion of cardiac involvement. In adherence to guidelines of the Heart Rhythm Society (HRS) 2017 [22] recommending CMR for CS diagnostics, CMR findings were typically based on contrast-enhanced myocardial scarring patterns, myocardial edema, quantitative tissue characterization and cardiac functional analysis. Patients were classified as CMR-positive (CMR(+)) if their CMR examination demonstrated CS-associated features, or CMR-negative (CMR(−)) if no signs of cardiac involvement were found. All enrolled sarcoidosis patients received routine CMR according to standard cardiomyopathy protocol with adequate imaging quality. Pathological CMR findings, such as ischemic or non-ischemic heart disease and vascular abnormalities, would further result in exclusion of the volunteer from the study. All examinations were conducted in accordance with the 1964 declaration of Helsinki and the approval of the local ethics committee was waived (Ethik-Kommission der Medizinischen Fakultät der Ruhr-Universität Bochum; registration number 2023-1071).

2.2. Cardiac Magnetic Resonance Imaging

The CMR imaging was performed using a 3.0 Tesla multi-transmit magnetic resonance scanner (Achieva, Philips Healthcare, Best, The Netherlands; Release 5.3.1 and 5.6.1) incorporating dStream technology. All patients underwent vector electrocardiogram-triggered cardiac acquisitions. The maximum gradient performance was 40 mT/m with a slew rate of 200 mT/m/ms. A cardiac phased-array coil was used for signal reception. An axially acquired stack covering the whole heart and a short-axis stack covering the entire left and right ventricles (12–16 slices, no gap) as well as standard two-, three- and four-chamber views were utilized with retrospectively gated cine steady-state free-precession acquisitions (TR/TE/flip angle = 2.7 ms/1.35 ms/42°) for the assessment of heart function in all four cardiac chambers and morphology. Parallel imaging technique with a SENSE-reduction factor of 2 was applied to keep breath-holding times ≤12 s. Within one cardiac cycle, >25 reconstructed heart frames were acquired in order to achieve greater temporal resolution, as recently demonstrated [23]. The spatial resolution was 1.5 × 1.5 × 8 mm3.

2.3. Cardiac Function and Strain Quantification

For cardiac function and strain analysis on cine steady-state free-precession acquisitions, CVI42® software package (Circle Cardiovascular Imaging Inc., Calgary, AB, Canada, Version: Release 5.16.2 (3381)) was utilized (for parameter list see Table 1). Left ventricular (LV) and left atrial (LA) endocardial and epicardial contouring was performed in longitudinal 2-chamber and 4-chamber views. LA contouring excluded the left atrial appendage and ostia of the pulmonary veins. Only the 4-chamber view was employed for analysis of the right atrium (RA). RA contouring excluded the right atrial appendage and superior and inferior ostia of the vena cava. Automatic volumetric and strain analysis for the LV and right ventricle (RV) contouring additionally entailed the short-axis stacks. Manual adaption was performed if needed. Volumetric biventricular and biatrial quantification were obtained using the disc-summation technique (Simpson approach) at end-diastole and end-systole.

2.4. Descriptive Statistics

Descriptive statistics were generated using SPSS (version 27.0.0.0, IBM Deutschland GmbH, iBM, Armonk, NY, USA). For normally distributed continuous variables, the mean ± standard deviation was reported, whereas for non-parametric variables, the median with interquartile range was provided. For comparison of baseline characteristics, either the univariate analysis of variance (ANOVA) or the Welch test were employed if the assumptions of ANOVA were not met. For determining inter-group differences, a post hoc Tukey’s Honest Significant Difference (HSD) test was conducted when there was homogeneity of variance. If homogeneity of variance was violated, a Games–Howell test was used. In the case of nonparametric data, the Kruskal–Wallis test was applied.

2.5. Cardiac Features

Multiparametric CMR assessment encompassed bi-atrial and bi-ventricular volumetric and functional parameters along with strain and strain rate of the left and right atrium and left ventricle, cumulatively making up 36 cardiac features.

2.6. Correlation Matrix

Linear correlation coefficients were derived using Spearman’s correlation between variables forming a matrix. The coefficients span from −1 to +1, where +1 demonstrates a strong positive correlation and vice versa. Normalization for all data was performed for bias reduction.

2.7. Classification Algorithms

The generated coefficients were fed into the following machine learning classifier algorithms: support vector machine (SVM), k-nearest-neighbor (KNN), decision tree (DT), random forest (RF), logistic regression, GBoost, XGBoost and Voting. Simplified descriptions along with illustrations of various algorithmic functions were recently described [18]. Mella and Pentakoti define the Voting classifier as a machine learning model that combines the predictions of multiple individual models to make a final prediction. Instead of using a single model, it creates an ensemble by training on numerous models. The Voting classifier predicts the output class based on the highest probability or majority vote from the individual models. By doing so, it simplifies the process of model selection and achieves better accuracy by leveraging the collective wisdom of the ensemble. Rather than training and evaluating separate models individually, a single model is created that makes predictions based on the combined majority voting of the ensemble’s models for each output class [24]. Data were randomly assigned into a training or testing category. The training set is 70% of the dataset used to train a model. It consists of input data along with corresponding known output values, allowing the model to learn from this labeled data. During training, the model adjusts its internal parameters and learns patterns from the training set to make accurate predictions.
Once the model is trained, it is essential to assess its performance on new, unseen data. This is where the remaining 30% of the data come into play as the test set. The test set is a separate dataset that was not used during training. It contains input data, similar to the training set, but lacks the corresponding labels. The trained model then makes predictions on the test set, and its performance is evaluated by comparing these predictions to the true labels withheld from the model (Table 2). A flow chart of data processing is given by Figure 1.
SVM utilizes a hyperplane for discrimination between two clusters. Flexibility of the hyperplane affects accuracy. KNN orientates around the Euclidian distance, where similar variables remain in closer algorithmic proximity than dissimilar data. DT processing is algorithmically structured into “nodes” for variables and “branches” for decisions. Random forest is characterized as an extension of DT, whereby a multitude of decision trees may generate random forests, which can correct for overfitting and generally outperform DT [25]. As defined by Mohd Ali et al., random forest classifier is also a machine learning model used in both classification and regression problems that, unlike fitting a single “best” tree model, strategically combines multiple simple decision trees to optimize predictive performance. This approach accommodates diverse travel decision heuristics by utilizing a multitude of decision trees. Each tree in the ensemble captures different sources of uncertainty and variability in the data. Consequently, this technique enhances the accuracy of model estimation and prediction [26]. Lastly, logistic regression operates by estimating parameters using logistic function to convert log-odds into probability. A voting classifier is a machine learning model predicting output class based on the majority of votes from an ensemble of models. It supports two types of voting: hard voting, which takes the class with the highest majority of votes; and soft voting, which takes the class with the highest average probability. GBoost refers to gradient boosting classifier, which merges groups of weak learning models to augment their predictive performance, particularly via reweighting of data. XGBoost refers to extreme gradient boosting, commonly based on a DT model which applies customized algorithms to maximize the predictive potential. Due to poor prediction rates of CMR(+) vs. CMR(−) discrimination, feature selection was applied. Feature selection is important in machine learning to reduce data complexity and improve accuracy. It focuses on selecting relevant features, which improves model performance, prevents overfitting, enhances interpretability, reduces computational complexity and optimizes data collection and storage. This results in higher performance of the models to classify CMR(+) vs. CMR(−). All data processing was performed using Python (Version No. 3.8.12). Algorithmic differentiation requires identification of hyperparameters utilizing the support vector classifier, ultimately forming a “best fit” hyperplane of the dataset. Algorithmic fine-tuning was conducted throughout the training and testing of the individual classifiers. Output of the individual classifier algorithms is presented in confusion matrices and summarized in terms of precision (positive predictive value), recall score (sensitivity) and the F1-score (test accuracy).

2.8. Algorithmic Feature Selection

Algorithmic feature selection was applied to the analyses of CMR(+) vs. CMR(−) patients. The individual contributions of each feature were evaluated based on RF classifier and presented via feature rates. The five features with the highest predictive values were selected to augment diagnostic accuracy. A closer proximity towards “1” would imply that this feature positively predicts cardiac sarcoidosis. To validate performance enhancement using feature selection application, ROC analyses were carried out to compare discrimination between cluster CMR(+) vs. CMR(−) without and with feature selection.

3. Results

3.1. Baseline Features

This study enrolled 44 control subjects, 45 CMR(−) and 18 CMR(+) patients. CMR(+) (median (interquartile range), 64.0(57.8;67.0)years) patients represented the oldest patient group in contrast to CMR(−) (mean ± standard deviation, 56.2 ± 15.5 years) and CTRL (56.5(53.0;63.0)years), without statistical significant difference. The left (LV-EF) and right (RV-EF) ventricular ejection fraction (%) of both CMR(−) and CMR(+) patients was significantly reduced in contrast to CTRL (e.g., LV-EF: 49 ± 15; 61 (50;65); 67 ± 5; p < 0.001). CMR(+) patients demonstrated significantly elevated left ventricular volumetrics compared to CMR(−) patients and CTRL; for example, regarding indexed end-diastolic volume (96.5 ± 26.0; 69.3 (60.0;83.7); 71.6 ± 9.6; p < 0.001). In contrast to CTRL, left ventricular longitudinal, radial and circumferential strains were found to be significantly impaired for CMR(+)patients (p-values: <0.001–0.007). Further details are summarized in Table 1.

3.2. Diagnostic Accuracy of Machine Learning Models

The CMR multi-chamber wall motion and functional analyses resulted in a 36-feature matrix for algorithmic processing (supplemental data Tables S1–S3). Among the various classifier algorithms used, RF and Voting classifier demonstrated the highest level of prediction rate for the differentiation between the three clusters of CTRL, CMR(−) and CMR(+) patients (82%). Analyses of CTRL versus All Sarc. found logistic regression, RF and SVM to yield the highest prediction rates (97%). Algorithmic discrimination between CMR(+) versus CMR(−) appeared challenging, reaching maximum accuracy of 68% for the DR, GBoost and XGBoost classifiers. Differentiation between CMR(+) versus (CMR(−) analyses was augmented using algorithmic feature selection, raising it to a maximum accuracy of 89% utilizing logistic regression. Further details are summarized in Figure 2.

3.3. Confusion Matrices

Confusion matrices were generated and exemplary top-performing classifier algorithms were exhibited (Figure 3). These presented differences between the true and predicted outcomes based on the testing data category. The confusion matrices demonstrate that CTRL subjects were predicted with highest accuracy in the three-cluster model; for example, based on a precision (positive predictive value) of 86%, recall (sensitivity) of 82% and F1-score (test accuracy) of 79%. In particular, differentiation of CTRL from All Sarc. in the two-cluster model appeared effective with regard to a constant 97% precision, recall and F1-score. However, inter-group differences for sarcoidosis patients upon division into CMR(+) and CMR(−) patients resulted in poor performance, with a constant 68% precision, recall and F1-score. Algorithmic accuracy was augmented using algorithmic feature selection, yielding 89% precision, recall and F1-score.

3.4. Algorithmic Feature Selection

The five features with the highest algorithmic impact for the two-cluster model of CMR(+) versus CMR(−) are displayed in Table 2. All five parameters were based on left ventricular cardiac motion or volumetrics. In particular, longitudinal strain rates of the left ventricle appeared to be of high discriminative value for machine learning algorithms. Among the most valuable impact features for both analyses was routinely available indexed left ventricular end systolic volume. The ROC curves of Figure 4 demonstrate a profound enhancement of model performance in distinguishing the two clusters after selecting the most important features.

4. Discussion

To our knowledge, this study is the first to demonstrate diagnostic prediction rates for cardiac sarcoidosis based on CMR-acquired multi-chamber wall motion and volumetrics analyses using supervised machine learning algorithms.
The present study reports the following novelties:
(i)
Accurate algorithmic discrimination was achieved between healthy subjects and all sarcoidosis patients, particularly with Voting and RF classifiers;
(ii)
Poor algorithmic discrimination of CMR(−) and CMR(+) patients was improved to accurate levels via algorithmic feature selection application, particularly when using logistic regression and SVM classifiers;
(iii)
The algorithmic challenge associated with discrimination between both patient groups implies cardiac involvement may be more prevalent than anticipated, potentially evading CMR detection.

4.1. The Benefit of Machine Learning in Cardiac Imaging

Implementations of machine learning algorithms in recent non-CS cardiac imaging have presented additive benefit for diagnostic support [27,28,29]. The present CMR study showed highly accurate discrimination of healthy subjects from all sarcoidosis patients. In terms of the confusion matrices, diagnostic prediction rate was higher for discrimination between healthy individuals and all sarcoidosis compared to distinguishing between CMR(−) and CMR(+) patients. A possible explanation may be the limited number of confirmed CMR(+) subjects included, which in the context of machine learning may result in imbalanced class distribution that can affect the accuracy of the model. Most interesting was the poor algorithmic discrimination between CMR(−) versus CMR(+) patients in the two-cluster model, for which promising algorithmic accuracy was expected given the baseline data. Based on the multi-parametric data input, it can be surmised that multi-chamber wall motion and volumetrics have a greater overlap than expected, compromising algorithmic discrimination. Utilizing feature selection application, accurate prediction rates between CMR(+) versus CMR(−) patients were maintained, based primarily on left ventricular wall motion features. Thus, machine learning is principally capable of distinguishing between both CMR groups. However, this study holds a greater serendipitous value, which implies that cardiac involvement tends to be a commonality rather than exception. This is further underscored by accurate algorithmic discrimination between CTRL and all sarcoidosis patients in the three- and two-cluster models. These findings indicate that cardiac involvement is more prevalent than recognized, perhaps at a subvisual level, in agreement with echocardiographic findings [16,30,31].
To the best of our knowledge, machine learning has not been applied to CMR data for the purpose of cardiac sarcoidosis prediction. A recent study by Kasushika et al. applied deep learning algorithms to echocardiographic movies for CS detection and found no significant difference between the area under the curve (AUC) of the pretrained algorithm in contrast to the diagnostic accuracy of five experienced cardiologists [32]. Moreover, they observed lowered AUC and negative predictive value compared to the cardiologists’ interpretation for CS with preserved LVEF, but did not reach statistical significance. An additional benefit of machine learning integration was presented when using a deep convolutional neural network (DCNN) applied to polar maps of F-FDG PET, identifying the greatest sensitivity and specificity for when using the ReliefF classifier for feature weight estimation, in contrast to standardized uptake value (SUV)-based classification and coefficient of variance (CoV)-based classification [33]. These examples underscore the benefit of implementing machine learning algorithms as a tool for diagnostic support. Nonetheless, machine learning application for cardiac imaging data is currently in its infancy, requiring larger cohort studies for validation.

4.2. Weighting of the Input Features

Various imaging studies have demonstrated that cardiac wall deformation is a valuable parameter for CS detection, capable of discriminating from healthy subjects [15,16,34]. In contrast to the focus of prior cardiac imaging studies [15,16,30,31], left ventricular strain rate rather than strain received greater diagnostic weighting for algorithmic discrimination. However, echocardiographic findings from Di Stefano et al. substantiate this observation, reporting compromised left ventricular longitudinal, circumferential, longitudinal early diastolic and circumferential early diastolic strain rates in CS patients with positive endomyocardial biopsy [34]. The power of machine learning lies in its ability to be adjustable, per example in the form of reweighting features. Shifting algorithmic weighting towards left ventricular wall motion features augmented discrimination between both CMR groups, emphasizing the adaptability of machine learning.

4.3. Clinical Outlooks

The mismatch between the relatively low clinical detection rate of CMR(+) in contrast to CMR(−) sarcoidosis patients implies that CS can remain undetected and that diagnostic methods require optimization. The recent literature has demonstrated that clinical data, such as from echocardiography and ECG, only reasonably predict positive CMR or FDG-cPET findings for the purpose of CS detection [35], reiterating its clinical chameleon-like nature. Baseline data have found single parameters, such as left ventricular strains or ejection fraction, to appear sufficient for diagnostic differentiation. Although these may represent useful diagnostic indicators, their entirety may not suffice for diagnostic manifestation. Instead, the present study assesses the potential of machine-learning-processed multi-parametric CMR data with the objective of pushing up the boundaries of diagnostic accuracy rates. Due to the rarity and complexity of the disease, generating a validated algorithm for CS diagnosis is challenging. This study presents novel diagnostic aspects of non-contrast CMR, for which machine learning of multi-chamber volumetrics and strain achieved high diagnostic discrimination between healthy subjects and sarcoidosis patients. This binary distinction represents a novel additive clinical benefit of machine learning integration, requiring validation in larger cohort studies. Moreover, whilst machine learning can be adjusted to distinguish between both CMR groups, this study found signs that cardiac involvement appears far more widespread than anticipated. If validated by future studies, these findings would alter clinical disease management, such as in patient monitoring and earlier therapeutic strategies.

5. Limitations

This study utilized a retrospective single-center observational design with typical limitations. Considering that this study was conducted at a specialized cardiovascular clinic, it is important to acknowledge the presence of selection bias in our cohort. This implies that the level of cardiac involvement observed among our participants is likely higher compared with other studies of patient cohorts with systemic cardiac sarcoidosis. Furthermore, with regard to the baseline data, we acknowledge that CMR(+) patients were, on average, detected in more advanced stages of the disease. Therefore, we cannot make claims regarding early disease detection based on these data. Although the cohorts were not matched, the study included healthy individuals over the age of 50 to improve data homogeneity. Moreover, it is important to note that overlapping structural changes associated with further underlying diseases, such as hypertensive heart disease, cannot be ruled out. The current study demonstrates a supervised machine learning approach. Its diagnostic applicability in an unsupervised setting has not been examined. Moreover, while algorithms can identify numerical patterns in large datasets and provide supportive output to physicians in clinical practice, they cannot establish a cause-and-effect relationship. It is essential to recognize that this study is preliminary in nature and requires further validation with larger datasets and additional risk assessment.

6. Conclusions

Supervised machine learning based on multi-chamber strains and volumetrics provides a non-contrast and non-invasive approach to effectively discriminate between healthy and sarcoidosis patients that is most competitive when using Voting and RF classifiers. Feature selection facilitates the otherwise difficult discrimination between CMR(+) and CMR(−) sarcoidosis patients, especially when using logistic regression and the SVM classifier. Our results suggest that cardiac involvement is more widespread than assumed and may evade CMR detection, potentially necessitating changes in clinical disease management.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics13142426/s1, Table S1: Data matrix derived from clusters CTRL, CMR(+) and CMR(−); Table S2: Data matrix derived from clusters CTRL and CS; Table S3: Data matrix derived from clusters CMR(+) and CMR(−).

Author Contributions

Conceptualization, H.K., J.P. and M.P.; Methodology, N.M. and M.P.; Software, N.M.; Validation, J.E. and V.S.; Formal analysis, R.A. and M.P.; Investigation, J.E. and R.A.; Resources, W.B.; Data curation, J.E., N.M. and R.A.; Writing—original draft, J.E.; Writing—review & editing, N.M., H.K., V.S., C.S., P.S., J.B., J.P. and M.P.; Visualization, C.S., W.B. and M.P.; Supervision, N.M., H.K., C.S., P.S., J.B. and M.P.; Project administration, J.E., H.K., W.B. and M.P.; Funding acquisition, H.K. and W.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the 1964 declaration of Helsinki and the approval of the local ethics committee was waived (Ethik-Kommission der Medizinischen Fakultät der Ruhr-Universität Bochum; registration number 2023-1071).

Informed Consent Statement

Informed consent was obtained from all subjects involved in this study.

Data Availability Statement

Data is available on request due to restrictions for privacy protection.

Acknowledgments

We acknowledge support by the Open Access Publication Funds of the Ruhr-Universität Bochum.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Soto-Gomez, N.; Peters, J.I.; Nambiar, A.M. Diagnosis and Management of Sarcoidosis. Am. Fam. Physician 2016, 93, 840–848. [Google Scholar]
  2. Hamzeh, N.; Steckman, D.A.; Sauer, W.H.; Judson, M.A. Pathophysiology and clinical management of cardiac sarcoidosis. Nat. Rev. Cardiol. 2015, 12, 278–288. [Google Scholar] [CrossRef] [PubMed]
  3. Chow, K.L.; O’Donnell, J.L.; Crozier, I. Prevalence, incidence and survival outcomes of cardiac sarcoidosis in the South Island, New Zealand. Int. J. Cardiol. 2022, 357, 128–133. [Google Scholar] [CrossRef] [PubMed]
  4. Swigris, J.J.; Olson, A.L.; Huie, T.J.; Fernandez-Perez, E.R.; Solomon, J.; Sprunger, D.; Brown, K.K. Sarcoidosis-related mortality in the United States from 1988 to 2007. Am. J. Respir. Crit. Care Med. 2011, 183, 1524–1530. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Silverman, K.J.; Hutchins, G.M.; Bulkley, B.H. Cardiac sarcoid: A clinicopathologic study of 84 unselected patients with systemic sarcoidosis. Circulation 1978, 58, 1204–1211. [Google Scholar] [CrossRef] [Green Version]
  6. Kandolin, R.; Lehtonen, J.; Airaksinen, J.; Vihinen, T.; Miettinen, H.; Ylitalo, K.; Kaikkonen, K.; Tuohinen, S.; Haataja, P.; Kerola, T.; et al. Cardiac sarcoidosis: Epidemiology, characteristics, and outcome over 25 years in a nationwide study. Circulation 2015, 131, 624–632. [Google Scholar] [CrossRef]
  7. Roberts, W.C.; McAllister, H.A., Jr.; Ferrans, V.J. Sarcoidosis of the heart. A clinicopathologic study of 35 necropsy patients (group 1) and review of 78 previously described necropsy patients (group 11). Am. J. Med. 1977, 63, 86–108. [Google Scholar] [CrossRef] [PubMed]
  8. Uemura, A.; Morimoto, S.-I.; Hiramitsu, S.; Kato, Y.; Ito, T.; Hishida, H. Histologic diagnostic rate of cardiac sarcoidosis: Evaluation of endomyocardial biopsies. Am. Heart J. 1999, 138 Pt 1, 299–302. [Google Scholar] [CrossRef]
  9. Yazaki, Y.; Isobe, M.; Hiroe, M.; Morimoto, S.-I.; Hiramitsu, S.; Nakano, T.; Izumi, T.; Sekiguchi, M. Prognostic determinants of long-term survival in Japanese patients with cardiac sarcoidosis treated with prednisone. Am. J. Cardiol. 2001, 88, 1006–1010. [Google Scholar] [CrossRef]
  10. Patel, A.R.; Kramer, C.M. Role of Cardiac Magnetic Resonance in the Diagnosis and Prognosis of Nonischemic Cardiomyopathy. JACC Cardiovasc. Imaging 2017, 10 Pt A, 1180–1193. [Google Scholar] [CrossRef]
  11. Kouranos, V.; Tzelepis, G.E.; Rapti, A.; Mavrogeni, S.; Aggeli, K.; Douskou, M.; Prasad, S.; Koulouris, N.; Sfikakis, P.; Wells, A.; et al. Complementary Role of CMR to Conventional Screening in the Diagnosis and Prognosis of Cardiac Sarcoidosis. JACC Cardiovasc. Imaging 2017, 10, 1437–1447. [Google Scholar] [CrossRef] [PubMed]
  12. Patel, M.R.; Cawley, P.J.; Heitner, J.F.; Klem, I.; Parker, M.A.; Jaroudi, W.A.; Meine, T.J.; White, J.B.; Elliott, M.D.; Kim, H.W.; et al. Detection of myocardial damage in patients with sarcoidosis. Circulation 2009, 120, 1969–1977. [Google Scholar] [CrossRef] [Green Version]
  13. Coleman, G.C.; Shaw, P.W.; Balfour, P.C., Jr.; Gonzalez, J.A.; Kramer, C.M.; Patel, A.R.; Salerno, M. Prognostic Value of Myocardial Scarring on CMR in Patients With Cardiac Sarcoidosis. JACC Cardiovasc. Imaging 2017, 10, 411–420. [Google Scholar] [CrossRef] [PubMed]
  14. Patel, A.R. Detection of Cardiac Sarcoidosis: A Balancing Act Between Symptoms and Imaging Findings. JACC Cardiovasc. Imaging 2017, 10, 1448–1450. [Google Scholar] [CrossRef]
  15. Tigen, K.; Sunbul, M.; Karaahmet, T.; Tasar, O.; Dundar, C.; Yalcinsoy, M.; Takir, M.; Akkaya, E. Early Detection of Bi-ventricular and Atrial Mechanical Dysfunction Using Two-Dimensional Speckle Tracking Echocardiography in Patients with Sarcoidosis. Lung 2015, 193, 669–675. [Google Scholar] [CrossRef]
  16. Kusunose, K.; Fujiwara, M.; Yamada, H.; Nishio, S.; Saijo, Y.; Yamada, N.; Hirata, Y.; Torii, Y.; Ise, T.; Yamaguchi, K.; et al. Deterioration of biventricular strain is an early marker of cardiac involvement in confirmed sarcoidosis. Eur. Heart J. Cardiovasc. Imaging 2020, 21, 796–804. [Google Scholar] [CrossRef] [PubMed]
  17. Dabir, D.; Meyer, D.; Kuetting, D.; Luetkens, J.; Homsi, R.; Pizarro, C.; Nadal, J.; Thomas, D. Diagnostic Value of Cardiac Magnetic Resonance Strain Analysis for Detection of Cardiac Sarcoidosis. Rofo 2018, 190, 712–721. [Google Scholar] [CrossRef] [Green Version]
  18. Eckstein, J.; Moghadasi, N.; Körperich, H.; Valdés, E.W.; Sciacca, V.; Paluszkiewicz, L.; Burchert, W.; Piran, M. A Machine Learning Challenge: Detection of Cardiac Amyloidosis Based on Bi-Atrial and Right Ventricular Strain and Cardiac Function. Diagnostics 2022, 12, 2693. [Google Scholar] [CrossRef] [PubMed]
  19. Correia, F.A.S.C.; Marchini, G.S.; Torricelli, F.C.; Danilovic, A.; Vicentini, F.C.; Srougi, M.; Nahas, W.C.; Mazzucchi, E. Renal manifestations of sarcoidosis: From accurate diagnosis to specific treatment. Int. Braz. J. Urol. 2020, 46, 15–25. [Google Scholar] [CrossRef] [Green Version]
  20. Casella, F.J.; Allon, M. The kidney in sarcoidosis. J. Am. Soc. Nephrol. 1993, 3, 1555–1562. [Google Scholar] [CrossRef]
  21. Birnie, D.H.; Sauer, W.H.; Bogun, F.; Cooper, J.M.; Culver, D.A.; Duvernoy, C.S.; Judson, M.A.; Kron, J.; Mehta, D.; Nielsen, J.C.; et al. HRS expert consensus statement on the diagnosis and management of arrhythmias associated with cardiac sarcoidosis. Heart Rhythm 2014, 11, 1305–1323. [Google Scholar] [CrossRef] [PubMed]
  22. Al-Khatib, S.M.; Stevenson, W.G.; Ackerman, M.J.; Bryant, W.J.; Callans, D.J.; Curtis, A.B.; Deal, B.J.; Dickfeld, T.; Field, M.E.; Fonarow, G.C.; et al. 2017 AHA/ACC/HRS Guideline for Management of Patients with Ventricular Arrhythmias and the Prevention of Sudden Cardiac Death: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines and the Heart Rhythm Society. J. Am. Coll. Cardiol. 2018, 72, e91–e220. [Google Scholar] [PubMed]
  23. Weise Valdés, E.; Barth, P.; Piran, M.; Laser, K.T.; Burchert, W.; Körperich, H. Left-Ventricular Reference Myocardial Strain Assessed by Cardiovascular Magnetic Resonance Feature Tracking and fSENC-Impact of Temporal Resolution and Cardiac Muscle Mass. Front. Cardiovasc. Med. 2021, 8, 764496. [Google Scholar] [CrossRef] [PubMed]
  24. Mella, N.V.V.P.; Pentakoti, V.M. Crop yield prediction and Fertilizer Recommendation using Voting Based Ensemble Classifier. J. Eng. Sci. 2022, 13, 262–270. [Google Scholar]
  25. Hastie, T.; Tibshirani, R.; Friedman, J.H.; Friedman, J.H. The Elements of Statistical Learning, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  26. Ali, N.F.M.; Sadullah, A.F.M.; Majeed, A.P.A.; Razman, M.A.M.; Musa, R.M. The identification of significant features towards travel mode choice and its prediction via optimised random forest classifier: An evaluation for active commuting behavior. J. Transp. Health 2022, 25, 101362. [Google Scholar]
  27. Karr, J.; Cohen, M.; McQuiston, S.A.; Poorsala, T.; Malozzi, C. Validation of a deep-learning semantic segmentation approach to fully automate MRI-based left-ventricular deformation analysis in cardiotoxicity. Br. J. Radiol. 2021, 94, 20201101. [Google Scholar] [CrossRef]
  28. Neisius, U.; El-Rewaidy, H.; Nakamori, S.; Rodriguez, J.; Manning, W.J.; Nezafat, R. Radiomic Analysis of Myocardial Native T(1) Imaging Discriminates Between Hypertensive Heart Disease and Hypertrophic Cardiomyopathy. JACC Cardiovasc. Imaging 2019, 12, 1946–1954. [Google Scholar] [CrossRef]
  29. Huang, S.; Shi, K.; Zhang, Y.; Yan, W.-F.; Guo, Y.-K.; Li, Y.; Yang, Z.-G. Texture analysis of T2-weighted cardiovascular magnetic resonance imaging to discriminate between cardiac amyloidosis and hypertrophic cardiomyopathy. BMC Cardiovasc. Disord. 2022, 22, 235. [Google Scholar] [CrossRef]
  30. Joyce, E.; Ninaber, M.K.; Katsanos, S.; Debonnaire, P.; Kamperidis, V.; Bax, J.J.; Taube, C.; Delgado, V.; Marsan, N.A. Subclinical left ventricular dysfunction by echocardiographic speckle-tracking strain analysis relates to outcome in sarcoidosis. Eur. J. Heart Fail. 2015, 17, 51–62. [Google Scholar] [CrossRef]
  31. Aggeli, C.; Felekos, I.; Tousoulis, D.; Gialafos, E.; Rapti, A.; Stefanadis, C. Myocardial mechanics for the early detection of cardiac sarcoidosis. Int. J. Cardiol. 2013, 168, 4820–4821. [Google Scholar] [CrossRef]
  32. Katsushika, S.; Kodera, S.; Nakamoto, M.; Ninomiya, K.; Kakuda, N.; Shinohara, H.; Matsuoka, R.; Ieki, H.; Uehara, M.; Higashikuni, Y.; et al. Deep Learning Algorithm to Detect Cardiac Sarcoidosis from Echocardiographic Movies. Circ. J. 2021, 86, 87–95. [Google Scholar] [CrossRef]
  33. Togo, R.; Hirata, K.; Manabe, O.; Ohira, H.; Tsujino, I.; Magota, K.; Ogawa, T.; Haseyama, M.; Shiga, T. Cardiac sarcoidosis classification with deep convolutional neural network-based features using polar maps. Comput. Biol. Med. 2019, 104, 81–86. [Google Scholar] [CrossRef] [Green Version]
  34. Di Stefano, C.; Bruno, G.; Calle, M.C.A.; Acharya, G.A.; Fussner, L.M.; Ungprasert, P.; Cooper, L.T.; Blauwet, L.A.; Ryu, J.H.; Pellikka, P.A.; et al. Diagnostic and predictive value of speckle tracking echocardiography in cardiac sarcoidosis. BMC Cardiovasc. Disord. 2020, 20, 21. [Google Scholar] [CrossRef]
  35. Freeman, A.M.; Curran-Everett, D.; Weinberger, H.D.; Fenster, B.E.; Buckner, J.K.; Gottschall, E.B.; Sauer, W.H.; Maier, L.A.; Hamzeh, N.Y. Predictors of cardiac sarcoidosis using commonly available cardiac studies. Am. J. Cardiol. 2013, 112, 280–285. [Google Scholar] [CrossRef]
Figure 1. Flow chart illustrating the processing steps of machine learning along with the applied classifiers. CTRL—control subjects, CMR(+)—positive cardiac magnetic resonance patients, CMR(−)—negative cardiac magnetic resonance patients, KNN—K-nearest-neighbor, DT—deep trees, RF—random forest, SVM—support vector machine, GBOOST—gradient boosting, XGBOOST—extreme gradient boosting.
Figure 1. Flow chart illustrating the processing steps of machine learning along with the applied classifiers. CTRL—control subjects, CMR(+)—positive cardiac magnetic resonance patients, CMR(−)—negative cardiac magnetic resonance patients, KNN—K-nearest-neighbor, DT—deep trees, RF—random forest, SVM—support vector machine, GBOOST—gradient boosting, XGBOOST—extreme gradient boosting.
Diagnostics 13 02426 g001
Figure 2. Graphic discrimination between classifier performance of the different analyses, yielding below (red) and above (green) 80% prediction rates with corresponding outcome statistics. CTRL—control subjects, CMR(+)—positive cardiac magnetic resonance patients, CMR(−)—negative cardiac magnetic resonance patients, All Sarc.—all sarcoidosis patients, KNN—K-nearest-neighbor, DT—deep trees, RF—random forest, SVM—support vector machine, GBOOST—gradient boosting, XGBOOST—extreme gradient boosting.
Figure 2. Graphic discrimination between classifier performance of the different analyses, yielding below (red) and above (green) 80% prediction rates with corresponding outcome statistics. CTRL—control subjects, CMR(+)—positive cardiac magnetic resonance patients, CMR(−)—negative cardiac magnetic resonance patients, All Sarc.—all sarcoidosis patients, KNN—K-nearest-neighbor, DT—deep trees, RF—random forest, SVM—support vector machine, GBOOST—gradient boosting, XGBOOST—extreme gradient boosting.
Diagnostics 13 02426 g002
Figure 3. Selected confusion matrices for 36-feature analyses of the various cluster analyses, depicting the highest yielding classifier(s) per cluster with corresponding performance statistics. Precision—positive predictive value, recall score—sensitivity, F1-score—test accuracy, CTRL—control subjects, CMR(+)—positive cardiac magnetic resonance patients, CMR(−)—negative cardiac magnetic resonance patients, All Sarc.—all sarcoidosis patients, DT—deep trees, RF—random forest.
Figure 3. Selected confusion matrices for 36-feature analyses of the various cluster analyses, depicting the highest yielding classifier(s) per cluster with corresponding performance statistics. Precision—positive predictive value, recall score—sensitivity, F1-score—test accuracy, CTRL—control subjects, CMR(+)—positive cardiac magnetic resonance patients, CMR(−)—negative cardiac magnetic resonance patients, All Sarc.—all sarcoidosis patients, DT—deep trees, RF—random forest.
Diagnostics 13 02426 g003
Figure 4. ROC curves for (A) CMR(+) vs. CMR(–) with no feature selection and (B) (+) v. CMR(–) after selecting features.
Figure 4. ROC curves for (A) CMR(+) vs. CMR(–) with no feature selection and (B) (+) v. CMR(–) after selecting features.
Diagnostics 13 02426 g004
Table 1. Baseline data of control subjects and sarcoidosis patients.
Table 1. Baseline data of control subjects and sarcoidosis patients.
CRTLCMR(−)CMR(+)Comparisonp-Values
Number444518
Male231912
Age c56.5(53.0;63.0) §56.5(53.0;63.0) §64.0(57.8;67.0) § n.s.
Weight b74.2 ± 12.581.9 ± 15.185.6 ± 17.0CTRL—CMR(−)
CRTL—CMR(+)
0.036
0.016
Height c172 ± 10168(164;177) §177 ± 10 n.s.
Heart rate b65 ± 1072 ± 1371 ± 13CTRL—CMR(−)0.015
BMI c24.8 ± 2.628.0 ± 4.527.3 ± 4.8CTRL—CMR(−)<0.001
LV_EF c67 ± 561(50;65)49 ± 15CTRL—CMR(−)
CRTL—CMR(+)
<0.001
<0.001
LV_ESVi c23.0(20.9;26.8) §27.9(21.1;38.3) §50.2 ± 21.2CTRL—CMR(−)
CRTL—CMR(+)
CMR(−)—CMR(+)
0.022
<0.001
0.004
LV_EDVi c71.6 ± 9.669.3(60.0;83.7) §96.5 ± 26.0CRTL—CMR(+)
CMR(−)—CMR(+)
<0.001
<0.001
LV_longitudinal_strain c−16.5 ± 2.4−15.9(−17.5;−14.4) §−10.9 ± 3.8CRTL—CMR(+)
CMR(−)—CMR(+)
<0.001
0.001
LV_radial_strain b29.5 ± 7.426.7 ± 8.519.9 ± 7.5CRTL—CMR(+)
CMR(−)—CMR(+)
<0.001
0.007
LV_circumferential_strain c−17.3 ± 3.1−17.1(−18.7;−15.2) §−12.8 ± 4.0CRTL—CMR(+)
CMR(−)—CMR(+)
<0.001
0.005
RV_EF c61 ± 649.5(40.7;56.3) §44.1 (26.2;53.6) §CTRL—CMR(−)
CRTL—CMR(+)
<0.001
<0.001
RV_ESVi c30.1 ± 8.138.4(29.2;48.7) §44.4 (36.7;74.7) §CTRL—CMR(−)
CRTL—CMR(+)
0.003
<0.001
RV_EDVi c76.0 ± 12.771.1(63.7;88.5) §83.1 (66.1;92.2) § n.s.
b—ANOVA–Tukey–HSD, c—Kruskal–Wallis test, (§)—median (interquartile range), n.s.—not significant, CTRL—control subjects, CMR(+)—CMR-positive patients, CMR(−)—CMR-negative patients. LV_EF—left ventricular ejection fraction, LV_ESVi—indexed left ventricular end systolic volume, LV_EDVi—indexed left ventricular end diastolic volume, LV_longitudinal_strain—left ventricular longitudinal strain, LV_radial_strain, LV_circumferential_strain, RV_EF—right ventricular ejection fraction, RV_ESVi—indexed right ventricular end systolic volume, RV_EDVi—indexed right ventricular end diastolic volume.
Table 2. Feature rates of the five most valuable parameters for machine learning discrimination generated by random forest classifier.
Table 2. Feature rates of the five most valuable parameters for machine learning discrimination generated by random forest classifier.
ParameterFeature Rates
LV_ESVi0.045
LV_syst_radial_LAX_SR0.045
LV_diast_radial_LAX_SR0.049
LV_radial_LAX_S0.056
LA_syst_long_LAX_SR0.060
LV_ESVi—indexed left ventricular end systolic volume, LA_syst_radial_LAX_SR—left atrial systolic radial strain rate in longitudinal axis, LA_diast_radial_LAX_SR—left atrial diastolic radial strain rate in longitudinal axis, LV_radial_LAX_S—left ventricular radial strain in longitudinal axis, LA_syst_long_LAX_SR—left atrial systolic longitudinal strain rate in longitudinal axis.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Eckstein, J.; Moghadasi, N.; Körperich, H.; Akkuzu, R.; Sciacca, V.; Sohns, C.; Sommer, P.; Berg, J.; Paluszkiewicz, J.; Burchert, W.; et al. Machine-Learning-Based Diagnostics of Cardiac Sarcoidosis Using Multi-Chamber Wall Motion Analyses. Diagnostics 2023, 13, 2426. https://doi.org/10.3390/diagnostics13142426

AMA Style

Eckstein J, Moghadasi N, Körperich H, Akkuzu R, Sciacca V, Sohns C, Sommer P, Berg J, Paluszkiewicz J, Burchert W, et al. Machine-Learning-Based Diagnostics of Cardiac Sarcoidosis Using Multi-Chamber Wall Motion Analyses. Diagnostics. 2023; 13(14):2426. https://doi.org/10.3390/diagnostics13142426

Chicago/Turabian Style

Eckstein, Jan, Negin Moghadasi, Hermann Körperich, Rehsan Akkuzu, Vanessa Sciacca, Christian Sohns, Philipp Sommer, Julian Berg, Jerzy Paluszkiewicz, Wolfgang Burchert, and et al. 2023. "Machine-Learning-Based Diagnostics of Cardiac Sarcoidosis Using Multi-Chamber Wall Motion Analyses" Diagnostics 13, no. 14: 2426. https://doi.org/10.3390/diagnostics13142426

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop