Next Article in Journal
New Real-Time Impulse Noise Removal Method Applied to Chest X-ray Images
Next Article in Special Issue
Comparing Three Profoundly Influential Prognostic Scores in Cirrhotic Patients with Acute-on-Chronic-Liver Failure Admitted to the ICU: Prediction of One-Month Mortality—A Retrospective Cohort Study
Previous Article in Journal
Multi-Stage Temporal Convolution Network for COVID-19 Variant Classification
Previous Article in Special Issue
Accuracy of Noninvasive Diagnostic Tests for the Detection of Significant and Advanced Fibrosis Stages in Nonalcoholic Fatty Liver Disease: A Systematic Literature Review of the US Studies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Ultrasound Radiomics for the Detection of Early-Stage Liver Fibrosis

Ultrasound Research Lab, Department of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA
*
Author to whom correspondence should be addressed.
Diagnostics 2022, 12(11), 2737; https://doi.org/10.3390/diagnostics12112737
Submission received: 16 October 2022 / Revised: 6 November 2022 / Accepted: 7 November 2022 / Published: 9 November 2022
(This article belongs to the Special Issue Diagnosis and Management of Liver Diseases)

Abstract

:
Objective: The study evaluates quantitative ultrasound (QUS) texture features with machine learning (ML) to enhance the sensitivity of B-mode ultrasound (US) for the detection of fibrosis at an early stage and distinguish it from advanced fibrosis. Different ML methods were evaluated to determine the best diagnostic model. Methods: 233 B-mode images of liver lobes with early and advanced-stage fibrosis induced in a rat model were analyzed. Sixteen features describing liver texture were measured from regions of interest (ROIs) drawn on B-mode images. The texture features included a first-order statistics run length (RL) and gray-level co-occurrence matrix (GLCM). The features discriminating between early and advanced fibrosis were used to build diagnostic models with logistic regression (LR), naïve Bayes (nB), and multi-class perceptron (MLP). The diagnostic performances of the models were compared by ROC analysis using different train-test sampling approaches, including leave-one-out, 10-fold cross-validation, and varying percentage splits. METAVIR scoring was used for histological fibrosis staging of the liver. Results: 15 features showed a significant difference between the advanced and early liver fibrosis groups, p < 0.05. Among the individual features, first-order statics features led to the best classification with a sensitivity of 82.1–90.5% and a specificity of 87.1–89.8%. For the features combined, the diagnostic performances of nB and MLP were high, with the area under the ROC curve (AUC) approaching 0.95–0.96. LR also yielded high diagnostic performance (AUC = 0.91–0.92) but was lower than nB and MLP. The diagnostic variability between test-train trials, measured by the coefficient-of-variation (CV), was higher for LR (3–5%) than nB and MLP (1–2%). Conclusion: Quantitative ultrasound with machine learning differentiated early and advanced fibrosis. Ultrasound B-mode images contain a high level of information to enable accurate diagnosis with relatively straightforward machine learning methods like naïve Bayes and logistic regression. Implementing simple ML approaches with QUS features in clinical settings could reduce the user-dependent limitation of ultrasound in detecting early-stage liver fibrosis.

1. Introduction

Liver fibrosis is a major health problem that, if not treated, leads to advanced liver cirrhosis and hepatocellular carcinoma (HCC) [1,2]. Cirrhosis accounts for more than 44,000 deaths in the United States and 2 million deaths worldwide yearly [3]. The disease is associated with a high burden of disability and increased healthcare utilization. Therefore, diagnosing liver fibrosis in its early stages can significantly improve a patient’s chance of making a full recovery. The damage may be limited and reversible in early-stage fibrosis compared to advanced fibrosis, which causes widespread, irreversible damage that eventually leads to cirrhosis.
Biopsy remains the gold standard for diagnosing liver fibrosis and assessing disease severity [4,5]. However, its use is invasive, and sampling and interpretative variations influence the results [5,6]. The development of biomarkers has evolved as an attractive alternative to liver biopsy. They include direct biological markers reflecting the pathophysiology of liver fibrogenesis and indirect biomarkers consisting of routine laboratory data reflecting the consequences of liver damage [7]. While helpful, a potential drawback of these biomarkers is the lack of specificity in discriminating inflammation from hepatocellular injury, which often leads to low accuracy in detecting early or intermediate fibrosis stages compared to cirrhosis [8,9,10]. Imaging biomarkers from conventional MRI and computer tomography (CT) are routinely used to assess cirrhosis and its complications. Both modalities, however, are costly and time-consuming, and the discrimination of early and advanced stages of fibrosis remains challenging [11].
Ultrasound (US) is widely used in diagnosing patients with suspected liver disease. It is highly accurate, relatively inexpensive, noninvasive, and valuable in assessing morphological and structural changes associated with liver disease [12,13]. US has the capacity to evaluate hepatic parenchyma concurrently with ascites, hepatic masses, or other abdominal indications and to image a large liver volume with reduced sampling error [14]. B-mode ultrasound has been the primary imaging modality in assessing patients with nonalcoholic fatty liver disease (NAFLD) [15]. It is an excellent method to detect moderate and severe steatosis but has low sensitivity for detecting mild steatosis. Hepatorenal index (HRI) and quantitative ultrasound methods based on attenuation parameters improve the identification of patients with mild steatosis [15]. Patients with NAFLD cirrhosis, on the other hand, present liver changes such as a nodular liver surface, inhomogeneous parenchyma, a rounded inferior border of the liver, a hypertrophic left liver lobe, and changes in the liver vessels’ borders [16,17,18,19]. These changes can also be observed with B-mode US [20]. Despite these capabilities of ultrasound, its role in the diagnosis and staging of liver fibrosis continues to be a subject of debate [21,22,23]. Limitations of this technique include its operator-dependent nature. The interpretation of liver ultrasound images varies with ultrasound equipment, image quality, and physical differences among patients. For the diagnosis of liver cirrhosis, the typical findings have a high specificity (82–100%) but a low and variable sensitivity (20–91%) [23]. Therefore, there is a need for an objective approach for assessing this stage of the disease with ultrasound.
In this study, we propose using computerized analysis of liver features on B-mode ultrasound (US) to characterize liver fibrosis with high precision. Prior studies on hepatic fibrosis have primarily assessed qualitative structural changes on ultrasound that are visible [20,21,22,23]. It is hypothesized that computerized ultrasound image analysis provides liver fibrosis assessment on a continuous scale of user-independent quantitative imaging biomarkers. The main contribution of the study is the differentiation of early- and late-stage liver fibrosis with quantitative US image texture features and machine learning. The study also emphasizes easy-to-use, simple methods of ML for diagnosis. The study was performed in a rat model with fibrosis induced under controlled experimental conditions. Different machine learning methods were evaluated to determine the ultrasound radiomics models for diagnosis to achieve high sensitivity and specificity.

2. Materials and Methods

2.1. Image Acquisition

Forty-five male Wistar rats (procured at 350–400 g body weight from Charles River Laboratories) were acclimated in the housing facilities for one week. Thirty-four rats were fed 0.01% diethylnitrosamine (DEN) (Sigma Aldrich, St. Louis, MO, USA) in their drinking water and ingested ad libitum for 12 weeks to induce fibrosis [24,25]. Eleven rats were not fed DEN in their drinking water as a control group. At the end of the experiments, the rats were euthanized. An autopsy was performed, and tissues from lobes on the right and left sides of the liver were harvested for histologic examination. The University’s Institutional Animal Care and Use Committee approved animal studies and protocols.
Four to six B-mode images were acquired using Visualsonics VevoLAZR (Fujifilm, Toronto, ON, Canada). The ultrasound images were acquired following the termination of DEN administration to the rats. During the experiments, the rats were placed supine under inhalational isoflurane vaporizer anesthesia (VetEquip Inc., Livermore, CA, USA) and oxygenated by a nose cone. Imaging presets (gain = 18 dB, high sensitivity, 100% power, transmit frequency of 21 MHz, and high line density) and time compensation gains were optimized and standardized.

2.2. Histopathological Assessment

The necropsy samples from the right and left liver lobes were preserved in 10% phosphate-buffered formalin for 48 to 72 h. The samples were processed for histological examination using hematoxylin and eosin (H&E) and trichrome staining. The slides were examined for fibrotic changes under a microscope (Olympus BX51, Olympus America Inc., Melville, NY, USA). Each histologic section was graded according to the METAVIR scoring system for hepatic fibrosis by a veterinary pathologist: F0, no fibrosis; F1, portal fibrosis without septa (mild); F2, portal fibrosis with rare septa (moderate); F3, numerous septa without cirrhosis (severe); and F4, cirrhosis.

2.3. Computerized Analysis

The computerized analysis is a multistep process consisting of image analysis, feature extraction, and diagnostic modeling (Figure 1).
Image analysis: A total of 233 images, including 184 from the fibrosis group and 49 from the control group, were analyzed offline using an IDL platform [26,27]. Five to six regions of interest (ROI) were manually outlined to include multiple representative samples of the liver parenchyma and exclude imaging artifacts (Figure 1A).
Feature extraction: Sixteen quantitative image features describing the liver tissue were measured for each region (Figure 1B). These features defining the distributions and relationships between the gray levels of image pixels [28,29,30,31] included first-order statistics, run-length features, and grayscale connectivity (co-occurrence matrix).
  • First-order statistics features included echo intensity (brightness level), heterogeneity (local variance in gray levels within ROIs), kurtosis, and skewness. Echo intensity and heterogeneity features were computed directly from the pixels’ mean intensity and standard deviation within the region of interest.
  • Run-length features included short-run emphasis, long-run emphasis, gray-level nonuniformity, run percentage, and run-length nonuniformity. A gray level run is a set of consecutive, collinear picture points having the same gray level value. The length of the run is the number of pixels in the run.
  • Gray-level co-occurrence matrix (GLCM) included GLCM mean, GLCM variance, GLCM correlation, entropy, contrast, dissimilarity, and angular second momentum (ASM).
Features selection: The computed values for all features were averaged over all images and shown as mean ± standard deviation. A two-tailed t-test was used to determine differences between the early (METAVIR scores of F0–1) and significant (METAVIR scores of F2–4) fibrosis groups. A p-value < 0.05 was considered statistically significant (Figure 1B).
Diagnostic model building by machine learning: Three methods, including logistic regression, naïve Bayes, and multilayer perceptron were used to construct diagnostic models (Figure 1C). Weka software for machine learning was used for ML analysis [32]. Logistic regression determines the impact of multiple independent variables presented simultaneously to predict membership in one or other of the two dependent variable categories, which in this case were early and late fibrosis [33]. The probability of late fibrosis was determined using the maximum likelihood method. The naive Bayes classifier greatly simplifies learning by assuming that features are independent. Although independence is generally a poor assumption, in practice, naive Bayes often competes well with more sophisticated classifiers [34]. Multilayer perceptron is an advanced machine learning approach in which the network is multilayered and consists of a system of simple, interconnected neurons, or nodes involving nonlinear mapping between input and output vectors [35].

2.4. Data Train-Test Splitting for Machine Learning

In developing ML models, it is desirable that the trained models perform well on new, unseen data. Different training-testing data sampling approaches were tested for the consistency of the ML models. They included leave-one-out n-fold cross-validation and percentage proportions. In leave-one-out (round robin), n − 1 samples (n = number of cases) were trained to predict the probability of malignancy for the remaining nth samples. The process was repeated n times to predict every case. This approach ensures that test cases are always left out of training while training occurs on the most data available. In 10-folds cross-validation, the data set was divided into 10 equal sets without overlaps. Then, at the first run, sets (1–9) were used for training and developing a model, and the model was used on set 10 to get the performance. Next, sets 1–8 and set 10 were used for training to develop a model to test the performance on set 9. The process was repeated until each fold was used once as a test set to evaluate performance.
In the percentage-split method, the training-testing data was split based on percentage. A testing-training percentage split of 50:50, 60:40, 70:30, and 80:20 was used for building and evaluating the ML model. Each percentage split was repeated three times by random sample selection.

2.5. Evaluating the Diagnostic Performance by Different Machine Learning Methods

The class attribute predicted by the model was diagnosis 1 for advanced fibrosis and 0 for early fibrosis. The output probabilities of fibrosis ranged from 0 to 1. The machine-learning methods logistic regression, naïve Bayes, and multilayer perceptron were used to construct diagnostic models (Figure 1C).
The area under the ROC curve (AUC) and the sensitivity and specificity for each ROC curve were used to evaluate each ML model’s diagnostic performance. Sensitivity is defined as the percentage of elements correctly classified in class 1 (True Positive (TP)) to all elements labeled in class 1 (True Positive (TP) + False Negative (FN)) and it is calculated as follows:
S e n s i t i v i t y = T P / T P + F N
Specificity is defined by the percentage of elements classified correctly in class 2 (True Negative (TN)), in comparison with all other elements in class 2 (True Negative (TN) False Positive (FP)), and it can be presented as follows:
S p e c i f i c i t y = T N / T N + F P  
The coefficient of variation (CV) of AUC between trials was used to assess the variability in the diagnostic performance of each ML method. The coefficient of variation is defined as the ratio of the standard deviation of the AUC and its mean.

3. Results

3.1. The Classification Performance of Individual Quantitative Ultrasound Features

Of the 16 features extracted from liver ultrasound images, 15 showed statistically significant differences (p < 0.05) between early and advanced fibrosis. Features normalized to the maximum-minimum range are shown in Figure 2. All first-order histogram features showed a statistical difference between the two groups. The most significant differences observed were in echo-intensity and heterogeneity (56.8 ± 10.2 and 22.5 ± 2.9) for advanced fibrosis versus 35.7 ± 9 and 16.9 ± 2.6 for early-stage cases, p-value < 0.05. Kurtosis and skewness features also discriminated between the two groups (p-value < 0.05), with values of 0.3 ± 0.2 and 0.6 ± 0.1 for advanced fibrosis versus 1.9 ± 1.6 and 0.9 ± 0.6 for early-stage fibrosis. Except for the contrast feature, all the gray-level co-occurrence matrix features showed a statistical difference, p < 0.05 (Table 1). Among the run length features, all features showed a statistical difference (p < 0.05).
The sensitivity and specificity of the individual features for detecting early and advanced fibrosis are summarized in Table 2. The first-order statistics showed high sensitivity, ranging from 82.1% to 90.5%, and specificity, ranging from 87.1% to 89.8%. Run-length features showed a broader range of variation, from a high sensitivity of 96% for some features to a lower sensitivity of 67.5%. Similarly, specificity also varied from 71.9% to 100%. The sensitivity and specificity of the GLCM features are 61.6% to 83.3% and 84.4% to 100%, respectively.

3.2. The Classification Performance of Combined Ultrasound Features with Machine Learning

3.2.1. The Impact of Train-Test Data Sampling on the Diagnostic Performance of ML

The effects of different sampling approaches on the performance of the ML methods are compared in Table 3. The leave-one-out cross-validation approach showed better performance for LR with an AUC of 0.95 than that of 10-fold (AUC = 0.90). A similar performance was observed with nB and MLP.
For the test-train percentage split approach, 50:50 % and 60:40% showed a higher performance and lesser variability between different trials (Table 3). A broader variation in AUC was observed with a 70–30% test-train percentage split. This pattern was particularly noticeable for LR, with AUC ranging between 0.86 to 0.96 for different trials. NB and MLP also showed similar behavior but with less variability between trials, with AUC ranging from 0.94 to 0.98 for nB and 0.95 to 0.99 for MLP. The variability in AUC was more prominent when using a larger test sample (80:20%) for LR, ranging from 0.83 to 0.97. Similarly, nB and MLP showed the same pattern but with less variability, with 0.95–098 and 0.93–0.99, respectively.

3.2.2. Comparison of the Diagnostic Performance of the Machine Learning Models

The diagnostic performance for naïve Bayes and MLP was higher and more consistent than that of logistic regression (Table 3). Performance ranged from 0.95–0.96 for nB and 0.95–0.96 for MLP, while LR showed a lower performance ranging from 0.91–0.92. The coefficient of variation for different trials had more variability with LR (CV 3–5%, Table 2). nB and MLP showed lesser variability (CV 1–2%, Table 3).

4. Discussion

4.1. Quantitative Ultrasound Features Can Distinguish Early and Advanced Fibrosis

Liver fibrosis is associated with the disrupted architecture of the liver tissue. Connective tissue fibers in the form of wide or narrow stripes divide the liver parenchyma into pseudocopula of unequal size and irregular shape. However, these subtle changes cannot be distinguished with high accuracy by visual observation of ultrasound images. Earlier studies have suggested a significant potential for quantitative ultrasound (QUS) to complement conventional B-mode for liver diseases [36]. QUS imaging techniques have been described in the literature for their potential to analyze echo interferences based on radiofrequency signals or statistical properties of the echo features [37]. However, until recently, the use of QUS for studying liver fibrosis has been limited, and most studies have evaluated it for assessing liver steatosis [38,39]. In this study, we evaluate QUS features for assessing liver fibrosis. The results showed that 15 of 16 features exhibited statistical differences and could discriminate between early and advanced fibrosis. Brightness (echo intensity) has been reported to increase with steatosis [40]. In this study, US echo intensity increased with steatosis-free fibrosis confirmed by histology. In addition to echo-intensity, the variance measure, heterogeneity, also increased in advanced fibrosis cases. These increases are likely due to collagen septae formation, as reported previously [41]. The acoustic impedance difference between fibrotic tissue and hepatocytes increases due to the larger number of acoustic interfaces in fibrotic tissue [41]. This change leads to the observed coarse and echogenic pattern, with the regional variation dependent on collagen septae presence or absence to scatter US waves. Our study showed that the changes in the quantified liver image texture could distinguish between early and advanced liver fibrosis with high accuracy. Quantitative studies on liver fibrosis by texture analysis of ultrasound images are, to date, limited and have only recently been reported [42,43]. Park et al. evaluated image texture analysis for staging of hepatic fibrosis [42]. Of the various first- and second-order features based on the gray-level histogram and co-occurrence matrix (GLCM) studied, the standard deviation showed a significant difference between significant and mild fibrosis. The study also found that skewness and kurtosis increased with the progression of fibrosis, indicating increased heterogeneity of the liver parenchyma. Zhou et al. [43] proposed a new method for liver fibrosis characterization by using radiomics of ultrasound backscatter homodyned-K imaging. The study showed their approach to be superior to the radiomics of uncompressed envelope images, concluding that radiomics can increase the ability of ultrasound backscatter statistical parameters for evaluating liver fibrosis.

4.2. Constructing the Best Machine Learning Models Using Different Classifiers

ML techniques have emerged as a potential tool for prediction and decision-making in many disciplines. Developing a machine learning model would be a valuable aid in identifying cirrhosis and making effective clinical decisions in real-time. We evaluated three different ML classifiers: logistic regression, naïve Bayes, and neural networks represented by a multilayer perceptron. Each method has its strengths and weaknesses. The reason for using multiple ML methods is to compare their relative diagnostic performances and determine the best diagnosis model. The study results showed that all three algorithms, naïve Bayes, logistic regression, and multilayer perceptron, performed well. However, nB and MLP achieved better performance and lower variability than logistic regression. The aim is to construct a simple model that captures a pattern representative of the population but excludes the inherent noise in the training data. Our results showed that using a model as simple as nB with LOO or 10-fold cross-validations can produce high and consistent performance in detecting liver fibrosis with ultrasound. The results show that complex models representing a nonlinear mapping between input and output vectors, like those in multilayer perceptron did not provide significant benefit.
Table 4 compares machine learning algorithms for liver disease diagnosis [44,45,46,47,48,49,50,51,52]. In the current study, a high level of diagnostic performance was observed. A factor contributing to such high performance could be the controlled setting of experiments where high frequencies were used for imaging and image parameters were fixed throughout the study. These practices, if translated into clinical studies, could help in improving the diagnostic performance of the US for grading the extent of fibrosis.

4.3. The Effects of the Data Splitting Approach on ML Model Performance

The study shows nonequivalent data splits with larger testing sets, like 70:30% and 80:20%, had weaker performance and more variability than equal splits or n-fold cross-validations. These results show that the data split alone in testing and training can induce statistically significant variation. Larger variations are expected when using split validation techniques compared to the more rigorous validation techniques [53]. It was reported that more computationally expensive techniques, like cross-validation, provide higher precision and stability. Our results agree with the literature from this perspective. There is a strong relationship between sample size and reported performance because ML models are biased to produce under or overly optimistic results when a small sample is used for training. In supervised learning, the ideal model is expected to estimate the training data regularities and take a broad view of unseen new data. However, when the training data contains noise, it may not be representative of the population. In this case, it is unlikely that the model would be able to approximate the data regularities.
A potential limitation of the study is the lack of independence between the sequential images acquired from the same subject. This approach, which is used extensively in the literature [54] is aimed at increasing the sample size for ML-based analyses. In this study, images were acquired from different parts of the liver and in diverse planes to minimize the effect of data independence. The study was performed under a controlled imaging protocol with fixed experimental settings. Clinical studies are often performed with variable imaging scanners, settings, and protocols. These factors contributing to the diagnostic variability in clinical settings were not a part of the current study.

5. Conclusions

This study evaluated the use of QUS features with ML methods to build a reliable model for detecting early and advanced liver fibrosis. QUS showed high discrimination between early and advanced fibrosis. Comparing the three ML methods, nB and MLP showed better and more consistent performance when compared to logistic regression. Implementing simple ML approaches with QUS can achieve high performance in detecting liver fibrosis and thus improves US sensitivity. More complex methods like MLP also yielded high performance but no significant improvement over simpler methods. Future larger studies with clinical image data using further standardization of ROI selection by automation, can help to minimize the user-dependent variability and increase the sensitivity of clinical ultrasound for assessing liver fibrosis without needing biopsies.

Author Contributions

Data curation, M.A.-H., L.R.S. and M.B.K.; Formal analysis, M.A.-H. and L.R.S.; Funding acquisition, C.M.S.; Investigation, L.R.S., M.B.K. and C.M.S.; Methodology, M.A.-H., L.R.S., H.S. and C.M.S.; Project administration, M.A.-H., L.R.S., M.B.K. and C.M.S.; Resources, C.M.S.; Software, T.W.C.; Supervision, H.S.; Visualization, M.A.-H. and C.M.S.; Writing—original draft, M.A.-H. and L.R.S.; Writing—review & editing, M.A.-H., L.R.S. and C.M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by NIH grants CA204446-01, EB022612-01, and S10657301.

Institutional Review Board Statement

This study was conducted according to the guidelines of the Declaration of Helsinki and was approved by the Institutional Animal Care and Use Committee and approved by the University of Pennsylvania (IACUC number 804998, approved on 26 September 2017).

Informed Consent Statement

Not applicable. The study did not involve humans.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to the evolving nature of the project.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Scaglione, S.; Kliethermes, S.; Cao, G.; Shoham, D.; Durazo, R.; Luke, A.; Volk, M.L. The epidemiology of cirrhosis in the United States a population-based study. J. Clin. Gastroenterol. 2015, 49, 690–696. [Google Scholar] [CrossRef] [PubMed]
  2. Mokdad, A.A.; Lopez, A.D.; Shahraz, S.; Lozano, R.; Stanaway, J.; Murray, C.J.; Naghavi, M. Liver cirrhosis mortality in 187 countries between 1980 and 2010: A systematic analysis. BMC Med. 2014, 12, 145. [Google Scholar] [CrossRef] [Green Version]
  3. Tapper, E.B.; Parish, N.D. Mortality due to cirrhosis and liver cancer in the United States, 1999–2016: Observational study. BMJ 2018, 362, K2817. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Nash, E.C. Needle biopsy of the liver. J. Natl. Med. Assoc. 1951, 43, 359–364. [Google Scholar] [PubMed]
  5. Regev, A.; Berho, M.; Jeffers, L.J.; Milikowski, C.; Molina, E.G.; Pyrsopoulos, N.T.; Feng, Z.Z.; Reddy, K.R.; Schiff, E.R. Sampling error and intraobserver variation in liver biopsy in patients with chronic HCV infection. Am. J. Gastroenterol. 2002, 97, 2614–2618. [Google Scholar] [CrossRef]
  6. Cadranel, J.F.; Rufat, P.; Degos, F. Practices of liver biopsy in France: Results of a prospective nationwide survey. For the Group of Epidemiology of the French Association for the Study of the Liver (AFEF). Hepatology 2000, 32, 477–481. [Google Scholar] [CrossRef]
  7. Lydatakis, H.; Hager, I.P.; Kostadelou, E.; Mpousmpoulas, S.; Pappas, S.; Diamantis, I. Noninvasive markers to predict the liver fibrosis in non-alcoholic fatty liver disease. Liver Int. 2006, 26, 864–871. [Google Scholar] [CrossRef]
  8. Gibson, P.R.; Fraser, J.R.; Brown, T.J.; Finch, C.F.; Jones, P.A.; Colman, J.C.; Dudley, F.J. Hemodynamic and liver function predictors of serum hyaluronan in alcoholic liver disease. Hepatology 1992, 15, 1054–1059. [Google Scholar] [CrossRef]
  9. Gressner, O.A.; Weiskirchen, R.; Gressner, A.M. Biomarkers of liver fibrosis: Clinical translation of molecular pathogenesis or based on liver-dependent malfunction tests. Clin. Chim. Acta 2007, 381, 107–113. [Google Scholar] [CrossRef]
  10. Chou, R.; Wasson, N. Blood tests to diagnose fibrosis or cirrhosis in patients with chronic hepatitis C virus infection. Ann. Intern. Med. 2013, 159, 372. [Google Scholar] [CrossRef]
  11. Huber, A.T.; Ebner, L.; Heverhagen, J.; Christe, A. State-of-the-art imaging of liver fibrosis and cirrhosis: A comprehensive review of current applications and future perspectives. Eur. J. Radiol. Open. 2015, 26, 90–100. [Google Scholar] [CrossRef] [Green Version]
  12. Gerstenmaier, J.F.; Gibson, R.N. Ultrasound in chronic liver disease. Insights Imaging 2014, 5, 441–455. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Rix, A.; Lederle, W.; Theek, B.; Lammers, T.; Moonen, C.; Schmitz, G.; Kiessling, F. Advanced Ultrasound Technologies for Diagnosis and Therapy. J. Nucl. Med. 2018, 59, 740–746. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Aubé, C.; Oberti, F.; Korali, N.; Namour, M.-A.; Loisel, D.; Tanguy, J.-Y.; Valsesia, E.; Pilette, C.; Rousselet, M.C.; Bedossa, P.; et al. Ultrasonographic diagnosis of hepatic fibrosis or cirrhosis. J. Hepatol. 1999, 30, 472–478. [Google Scholar] [CrossRef]
  15. Petzold, G. Role of Ultrasound Methods for the Assessment of NAFLD. J. Clin. Med. 2022, 11, 4581. [Google Scholar] [CrossRef] [PubMed]
  16. Hagström, H.; Nasr, P.; Ekstedt, M.; Hammar, U.; Stål, P.; Hultcrantz, R.; Kechagias, S. Fibrosis Stage but Not NASH Predicts Mortality and Time to Development of Severe Liver Disease in Biopsy-Proven NAFLD. J. Hepatol. 2017, 67, 1265–1273. [Google Scholar] [CrossRef]
  17. Vilar-Gomez, E.; Calzadilla-Bertot, L.; Wai-Sun Wong, V.; Castellanos, M.; Aller-de la Fuente, R.; Metwally, M.; Eslam, M.; Gonzalez-Fabian, L.; Alvarez-Quiñones Sanz, M.; Conde-Martin, A.F.; et al. Fibrosis Severity as a Determinant of Cause-Specific Mortality in Patients With Advanced Nonalcoholic Fatty Liver Disease: A Multi-National Cohort Study. Gastroenterology 2018, 155, 443–457.e17. [Google Scholar] [CrossRef]
  18. Dulai, P.S.; Singh, S.; Patel, J.; Soni, M.; Prokop, L.J.; Younossi, Z.; Sebastiani, G.; Ekstedt, M.; Hagstrom, H.; Nasr, P.; et al. Increased Risk of Mortality by Fibrosis Stage in Nonalcoholic Fatty Liver Disease: Systematic Review and Meta-Analysis. Hepatology 2017, 65, 1557–1565. [Google Scholar] [CrossRef] [Green Version]
  19. Poynard, T.; Bedossa, P.; Opolon, P. Natural History of Liver Fibrosis Progression in Patients with Chronic Hepatitis, C. The OBSVIRC, METAVIR, CLINIVIR, and DOSVIRC Groups. Lancet 1997, 349, 825–832. [Google Scholar] [CrossRef]
  20. Berzigotti, A.; Ashkenazi, E.; Reverter, E.; Abraldes, J.G.; Bosch, J. Non-Invasive Diagnostic and Prognostic Evaluation of Liver Cirrhosis and Portal Hypertension. Dis. Markers 2011, 31, 129–138. [Google Scholar] [CrossRef]
  21. Choong, C.-C.; Venkatesh, S.K.; Siew, E.P.Y. Accuracy of Routine Clinical Ultrasound for Staging of Liver Fibrosis. J. Clin. Imaging Sci. 2012, 2, 58. [Google Scholar] [CrossRef] [PubMed]
  22. Ladenheim, J.A.; Luba, D.G.; Yao, F.; Gregory, P.B.; Jeffrey, R.B.; Garcia, G. Limitations of liver surface US in the diagnosis of cirrhosis. Radiology 1992, 185, 21–23; discussion 23–24. [Google Scholar] [CrossRef] [PubMed]
  23. Ong, T.Z.; Tan, H.J. Ultrasonography is not reliable in diagnosing liver cirrhosis in clinical practice. Singap. Med J. 2003, 44, 293–295. [Google Scholar] [PubMed]
  24. D’Souza, J.C.; Sultan, L.R.; Hunt, S.J.; Schultz, S.M.; Brice, A.; Wood, A.K.W.; Sehgal, C.M. B-mode ultrasound for the assessment of hepatic fibrosis: A quantitative multiparametric analysis for a radiomics approach. Sci. Rep. 2019, 9, 8708. [Google Scholar] [CrossRef] [Green Version]
  25. Sultan, L.R.; Karmacharya, M.B.; Hunt, S.J.; Wood, A.K.W.; Sehgal, C.M. Subsequent Ultrasound Vascular Targeting Therapy of Hepatocellular Carcinoma Improves the Treatment Efficacy. Biology 2021, 10, 79. [Google Scholar] [CrossRef]
  26. Xiong, H.; Sultan, L.R.; Cary, T.W.; Schultz, S.M.; Bouzghar, G.; Sehgal, C.M. The diagnostic performance of leak-plugging automated segmentation versus manual tracing of breast lesions on ultrasound images. Ultrasound 2017, 25, 98–106. [Google Scholar] [CrossRef] [Green Version]
  27. Sultan, L.R.; Chen, Y.T.; Ms, T.W.C.; Ashi, K.; Sehgal, C.M. Quantitative pleural line characterization outperforms traditional lung texture ultrasound features in detection of COVID-19. J. Am. Coll. Emerg. Physicians Open 2021, 2, e12418. [Google Scholar] [CrossRef]
  28. Szczypiński, P.M.; Strzelecki, M.; Materka, A.; Klepaczko, A. MaZda—A sofware package for image texture analysis. Comput. Methods Programs Biomed. 2009, 94, 66–76. [Google Scholar] [CrossRef]
  29. Brzakovic, D.; Tou, J.T. Image understanding vis texture analysis. In Proceedings of the IEEE Artificial Intelligence Applications First Conference, Denver, CO, USA, 5–7 December 1984; pp. 585–590. [Google Scholar]
  30. Kashyap, R.L.; Khotanzad, A. A model-based method for rotation invariant texture classification. IEEE Trans. Pattern Anal. Mach. Intell. 1986, 8, 472–481. [Google Scholar] [CrossRef]
  31. Paris, M.T.; Mourtzakis, M. Muscle Composition Analysis of Ultrasound Images: A Narrative Review of Texture Analysis. Ultrasound Med. Biol. 2021, 47, 880–895. [Google Scholar] [CrossRef] [PubMed]
  32. Daw, S.; Basak, R. Machine Learning Applications Using Waikato Environment for Knowledge Analysis. In Proceedings of the 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC), Eroda, India, 11–13 March 2020; pp. 346–351. [Google Scholar]
  33. Hosmer, D.W., Jr.; Lemeshow, S.; Sturdivant, R.X. Applied Logistic Regression, 3rd ed.; Wiley: Hoboken, NJ, USA, 2013. [Google Scholar]
  34. Wickramasinghe, I.; Kalutarage, H. Naive Bayes: Applications, Variations and Vulnerabilities—A Review of Literature with Code Snippets for Implementation. 2020. Available online: https://github.com/HarshaKumaraKalutarage/Naive-Bayes-Applications-and-Vulnerabilities (accessed on 1 October 2022).
  35. Collobert, R.; Bengio, S. Links between Perceptrons, MLPs and SVMs. In Proceedings of the Twenty-First International Conference on Machine Learning, (ICML), Banff, AB, Canada, 4–8 July 2004. [Google Scholar]
  36. Destrempes, F.; Gesnik, M.; Chayer, B.; Roy-Cardinal, M.H.; Olivié, D.; Giard, J.M.; Sebastiani, G.; Nguyen, B.N.; Cloutier, G.; Tang, A. Quantitative ultrasound, elastography, and machine learning for assessment of steatosis, inflammation, and fibrosis in chronic liver disease. PLoS ONE 2022, 17, e0262291. [Google Scholar] [CrossRef]
  37. Yilmaz, Y.; Yesil, A.; Gerin, F.; Ergelen, R.; Akin, H.; Celikel, A.; Imeryuz, N. Detection of hepatic steatosis using the controlled attenuation parameter: A comparative study with liver biopsy. Scand. J. Gastroenterol. 2014, 49, 611–616. [Google Scholar] [CrossRef] [PubMed]
  38. Chauhan, A.; Sultan, L.R.; Furth, E.E.; Jones, L.P.; Khungar, V.; Sehgal, C.M. Diagnostic accuracy of hepatorenal index in the detection and grading of hepatic steatosis. J. Clin. Ultrasound 2016, 44, 580–586. [Google Scholar] [CrossRef]
  39. Ghoshal, G.; Lavarello, R.J.; Kemmerer, J.P.; Miller, R.J.; Oelze, M.L. Ex vivo study of quantitative ultrasound parameters in fatty rabbit livers. Ultrasound Med. Biol. 2012, 38, 2238–2248. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Mathiesen, U.; Franzen, L.; Åselius, H.; Resjö, M.; Jacobsson, L.; Foberg, U.; Frydén, A.; Bodemar, G. Increased liver echogenicity at ultrasound examination reflects degree of steatosis but not of fibrosis in asymptomatic patients with mild/moderate abnormalities of liver transaminases. Dig. Liver Dis. 2002, 34, 516–522. [Google Scholar] [CrossRef]
  41. Chen, W.; Rock, J.B.; Yearsley, M.M.; Ferrell, L.D.; Frankel, W.L. Different collagen types show distinct rates of increase from early to late stages of hepatitis C-related liver fibrosis. Hum. Pathol. 2014, 45, 160–165. [Google Scholar] [CrossRef]
  42. Park, E.J.; Kim, S.H.; Park, S.J.; Baek, T.W. Texture Analysis of Gray-Scale Ultrasound Images for Staging of Hepatic Fibrosis. Taehan Yongsang Uihakhoe Chi. 2021, 82, 116–127. [Google Scholar] [CrossRef]
  43. Zhou, Z.; Zhang, Z.; Gao, A.; Tai, D.-I.; Wu, S.; Tsui, P.-H. Liver Fibrosis Assessment Using Radiomics of Ultrasound Homodyned-K imaging Based on the Artificial Neural Network Estimator. Ultrason. Imaging 2022, 44, 229–241. [Google Scholar] [CrossRef] [PubMed]
  44. Byra, M.; Styczynski, G.; Szmigielski, C.; Kalinowski, P.; Michałowski, Ł.; Paluszkiewicz, R.; Ziarkiewicz-Wróblewska, B.; Zieniewicz, K.; Sobieraj, P.; Nowicki, A. Transfer learning with deep convolutional neural network for liver steatosis assessment in ultrasound images. Int. J. Comput. Assist. Radiol. Surg. 2018, 13, 1895–1903. [Google Scholar] [CrossRef] [Green Version]
  45. Reddy, D.S.; Bharath, R.; Rajalakshmi, P. A Novel Computer-Aided Diagnosis Framework Using Deep Learning for Classification of Fatty Liver Disease in Ultrasound Imaging. In Proceedings of the Internation Conference on the E-Health Networking, Applications and Services, Ostrava, Czech Republic, 17–20 September 2018; pp. 1–5. [Google Scholar]
  46. Han, A.; Byra, M.; Heba, E.; Andre, M.P.; Erdman, J.W., Jr.; Loomba, R.; Sirlin, C.B.; O’Brien, W.D., Jr. Noninvasive Diagnosis of Nonalcoholic Fatty Liver Disease and Quantification of Liver Fat with Radiofrequency Ultrasound Data Using One-dimensional Convolutional Neural Networks. Radiology 2020, 295, 342–350. [Google Scholar] [CrossRef]
  47. Byra, M.; Han, A.; Boehringer, A.S.; Zhang, Y.N.; O’Brien, W.D., Jr.; Erdman, J.W., Jr.; Loomba, R.; Sirlin, C.B.; Andre, M. Liver Fat Assessment in Multiview Sonography Using Transfer Learning with Convolutional Neural Networks. J. Ultrasound Med. 2022, 41, 175–184. [Google Scholar] [CrossRef] [PubMed]
  48. Cha, D.I.; Kang, T.W.; Min, J.H.; Joo, I.; Sinn, D.H.; Ha, S.Y.; Kim, K.; Lee, G.; Yi, J. Deep learning-based automated quantification of the hepatorenal index for evaluation of fatty liver by ultrasonography. Ultrasonography 2021, 40, 565–574. [Google Scholar] [CrossRef]
  49. Chou, T.-H.; Yeh, H.-J.; Chang, C.-C.; Tang, J.-H.; Kao, W.-Y.; Su, I.-C.; Li, C.-H.; Chang, W.-H.; Huang, C.-K.; Sufriyana, H.; et al. Deep learning for abdominal ultrasound: A computer-aided diagnostic system for the severity of fatty liver. J. Chin. Med. Assoc. 2021, 84, 842–850. [Google Scholar] [CrossRef] [PubMed]
  50. Kuppili, V.; Biswas, M.; Sreekumar, A.; Suri, H.S.; Saba, L.; Edla, D.R.; Marinhoe, R.T.; Sanches, J.; Suri, J.S. Extreme learning machine framework for risk stratification of fatty liver disease using ultrasound tissue characterization. J. Med. Syst. 2017, 41, 152. [Google Scholar] [CrossRef] [PubMed]
  51. Biswas, M.; Kuppili, V.; Edla, D.R.; Suri, H.S.; Saba, L.; Marinhoe, R.T.; Sanches, J.; Suri, J.S. Symtosis: A liver ultrasound tissue characterization and risk stratification in optimized deep learning paradigm. Comput. Methods Programs Biomed. 2018, 155, 165–177. [Google Scholar] [CrossRef]
  52. Zamanian, H.; Mostaar, A.; Azadeh, P.; Ahmadi, M. Implementation of combinational deep learning algorithm for non-alcoholic fatty liver classification in ultrasound images. J. Biomed. Phys. Eng. 2021, 11, 73–84. [Google Scholar] [CrossRef]
  53. Singh, V.; Pencina, M.; Einstein, A.J.; Liang, J.X.; Berman, D.S.; Slomka, P. Impact of train/test sample regimen on performance estimate stability of machine learning in cardiovascular imaging. Sci. Rep. 2021, 11, 14490. [Google Scholar] [CrossRef]
  54. Sultan, L.R.; Cary, T.W.; Al-Hasani, M.; Karmacharya, M.B.; Venkatesh, S.S.; Assenmacher, C.-A.; Radaelli, E.; Sehgal, C.M. Can Sequential Images from the Same Object Be Used for Training Machine Learning Models? A Case Study for Detecting Liver Disease by Ultrasound Radiomics. AI 2022, 3, 739–750. [Google Scholar] [CrossRef]
Figure 1. Schematic representation of the image and data analysis process. Subfigure (A) shows the image analysis process which includes region of interest ( ROI) placement on the selected liver images. Subfigure (B) shows the feature extraction from the selected liver ROIs. Features extracted include: first order statistics and computerized texture features. These features are then further selected by t-test, for building the predcitive model. Subfigure (C) demsontrate the different steps towards building the predtive diagnostic model, consisting of: training the data set by different classfiers and with different data spitting methods, then applying these methods for testing the diagnostic performance of data.
Figure 1. Schematic representation of the image and data analysis process. Subfigure (A) shows the image analysis process which includes region of interest ( ROI) placement on the selected liver images. Subfigure (B) shows the feature extraction from the selected liver ROIs. Features extracted include: first order statistics and computerized texture features. These features are then further selected by t-test, for building the predcitive model. Subfigure (C) demsontrate the different steps towards building the predtive diagnostic model, consisting of: training the data set by different classfiers and with different data spitting methods, then applying these methods for testing the diagnostic performance of data.
Diagnostics 12 02737 g001
Figure 2. Comparison of image features on a normalized scale (0–1). Orange and blue bars represent early and advanced-stage fibrosis, respectively. EI: echo-intensity, HT: heterogeneity, SK: skewness, KT: kurtosis, SR: short run length, LR: long run length, GU: gray-level nonuniformity, RU: run length nonuniformity, R%: run percentage, AM: angular second momentum, CT: contrast, DS: dissimilarity, ET: entropy; GM: gray-level mean, GV: gray level variance, GC: gray-level correlation. * denotes statistical significance, p-value < 0.05.
Figure 2. Comparison of image features on a normalized scale (0–1). Orange and blue bars represent early and advanced-stage fibrosis, respectively. EI: echo-intensity, HT: heterogeneity, SK: skewness, KT: kurtosis, SR: short run length, LR: long run length, GU: gray-level nonuniformity, RU: run length nonuniformity, R%: run percentage, AM: angular second momentum, CT: contrast, DS: dissimilarity, ET: entropy; GM: gray-level mean, GV: gray level variance, GC: gray-level correlation. * denotes statistical significance, p-value < 0.05.
Diagnostics 12 02737 g002
Table 1. The mean and standard deviation of the ultrasound texture features of early fibrosis compared to advanced liver fibrosis. GLNU: gray-level nonunformity; RLNU: run length nonuniformity; ASM: Angular second momentum; GLCM: gray level mean; GLCV: gray level variance; GLCC: gray level correlation.
Table 1. The mean and standard deviation of the ultrasound texture features of early fibrosis compared to advanced liver fibrosis. GLNU: gray-level nonunformity; RLNU: run length nonuniformity; ASM: Angular second momentum; GLCM: gray level mean; GLCV: gray level variance; GLCC: gray level correlation.
First Order Statistics
Early FibrosisAdvanced Fibrosisp-Value
Echo-intensity35.7 ± 9.056.8 ± 10.20.00
Heterogeneity16.9 ± 2.922.5 ± 2.60.00
Kurtosis1.9 ± 3.60.3 ± 0.60.00
Skewness0.9 ± 0.20.2 ± 0.00.00
Run Length Features
Early FibrosisAdvanced Fibrosisp-Value
Short run0.3 ± 0.00.2 ± 0.00.00
Long run15.3 ± 1.612.2 ± 1.50.00
GLNU0.3 ± 0.00.2 ± 0.00.00
RLNU0.3 ± 0.00.2 ± 0.00.00
Run %0.3 ± 0.10.7 ± 0.20.00
Gray-Level Co-Occurrence Features
Early FibrosisAdvanced Fibrosisp-Value
ASM0.1 ± 0.00.4 ± 1.10.00
Contrast3.6 ± 0.03.7 ± 0.70.43
Dissimilarity1.4 ± 0.11.7 ± 0.60.00
Entropy3.3 ± 0.13.4 ± 0.30.00
GLCM2.1 ± 0.2146.0 ± 43.60.00
GLCV1075.8 ± 216.61383.0 ± 512.70.00
GLCC0.0 ± 0.00.1± 0.00
Table 2. The diagnostic performance of the ultrasound texture features in detecting early and advanced liver fibrosis.
Table 2. The diagnostic performance of the ultrasound texture features in detecting early and advanced liver fibrosis.
GLCM
Features
SensitivitySpecificityRun Length
Features
SensitivitySpecificityFirst Order Histogram FeaturesSensitivitySpecificity
ASM65.9%100%Short run length96%71.9%Echo-intensity90.5% 87.5%
GLCM mean83.3% 87.5%Long run length82.5%87.5%Heterogeneity88.1% 87.5%
GLCM variance71.4 % 93.7%Run length nonuniformity72.2%93.7%Kurtosis82.6%87.1%
GLCM correlation71.4 % 87%Gray level
nonuniformity
67.5% 100 %Skewness82.1%89.8%
Entropy61.1% 100%Run Percentage80.4%70.2%
Contrast73.8% 84.4%
Dissimilarity72.3%91.8%
Table 3. The effect of train-test sampling on the diagnostic performance (AUC) on different ML models. LR: logistic regression, Nb: naïve bayes, MLP: multilayer perceptron.
Table 3. The effect of train-test sampling on the diagnostic performance (AUC) on different ML models. LR: logistic regression, Nb: naïve bayes, MLP: multilayer perceptron.
Train-Test Method LRNbMLP
Leave-One-Out0.950.940.95
10-fold0.900.940.96
Percentage split 50–50%Trial 10.880.940.96
Trial 20.920.960.96
Trial 30.870.950.95
Trial 40.960.960.98
Trial 50.940.940.94
Mean0.910.950.96
STD0.040.010.01
CV4%1%2%
Percentage split 60–40%Trial 10.900.940.96
Trial 20.870.960.97
Trial 30.910.940.94
Trial 40.940.950.97
Trial 50.930.950.95
Mean0.910.950.96
STD0.030.010.01
CV3%1%1%
Percentage split 70–30%Trial 10.860.940.95
Trial 20.890.940.95
Trial 30.920.960.96
Trial 40.940.980.99
Trial 50.960.970.97
Trial 10.860.980.99
Mean0.920.960.96
STD0.40.20.2
CV4%2%2%
Percentage split 80–20%Trial 10.860.980.99
Trial 20.830.980.97
Trial 30.990.950.94
Trial 40.920.960.99
Trial 50.970.950.93
Mean0.910.950.96
STD0.040.010.02
CV5%1%2%
Table 4. Literature review of the use of ML for liver disease diagnosis.
Table 4. Literature review of the use of ML for liver disease diagnosis.
StudyGoal-Standard ReferenceSample SizeML Classifier (s)ResultsNotes
Byra et al. [44] Liver biopsy55 patients with severe obesity, 38 of whom had fatty liver diseaseDeep learning Sensitivity: 100%; specificity: 88%; accuracy: 96%; AUC: 0.98NAFLD-fatty liver
Redyy et al. [45]Radiologist qualitative score157 B-mode ultrasound liver images from unknown number of participantsDeep learning Sensitivity: 95%; specificity: 85%; accuracy: 90.6%; AUC: 0.96NAFLD-fatty liver
Han et al. [46]MRI proton density fat fraction(>5% hepatic fat content)204 participants, 140 of whom had NAFLD, 64 control participantsOne-dimensional CNNsSensitivity: 97%; specificity: 94%; accuracy: 96%; AUC: 0.98
-
Prospective study of NAFL/steatosis cases
-
Ultrasound data were acquired from a single scanner platform by one physician.
Byral et al. [47]MRI proton density fat fraction(>5% hepatic fat content)135 adult participants with known or suspected NAFLDTransfer learning with a pretrained CNN by four ultrasound views of liver routinely obtainedSCC: 0.81; AUC: 0.91 (PDFF ≥ 5%)NAFLD/fatty liver
Cha et al. [48]Liver biopsy295 subjects, 198 mild fatty liver, one moderate degree of fatty liverDCNN-based organ segmentation with Gaussian mixture modeling for automated quantification of the HRIICC of two radiologists and DCNN were 0.919, 0.916, and 0.734
-
Retrospective study, composed of subjects who were donors for liver transplantation. This would have led to selection bias
-
Image data only included several major US machine vendors; the algorithm may not work effectively with US images from machines made by other manufacturers.
Chou et al. [49]Abdominal ultrasound21855 B-mode ultrasound images, 2070 patients with different severities from none to severe fatty liverPretrained CNN models The areas under the receiver operating characteristic curves were 0.974 (mild steatosis vs. others), 0.971 (moderate steatosis vs. others), 0.981 (severe steatosis vs. others), 0.985 (any severity vs. normal), and 0.996 (moderate-to-severe steatosis clinically abnormal vs. normal-to-mild steatosis clinically normal)
-
Inconsistent quality of the images in this study might have caused several biases.
-
Some of the images came from different types of US machines.
Kuppili et al. [50]Liver biopsy (not defined)Adapting four types of K-fold cross-validation (K = 2, 3, 5 and 10) protocols on three kinds of data sizes: S0-original, S4-four splits, S8-sixty-four splits (a total of 12 cases), and 46 types of grayscale features, ELMa, SVMUsing the K10 cross-validation protocol, ELM showed an accuracy of 96.75% compared to 89.01% for SVM, and correspondingly, the AUC: 0.97 and 0.91, respectivelyRetrospective study, all NAFLD cases no steatosis
Biswas et al. [51]Liver biopsy (not defined)Support vector machine (SVM) and extreme learning machine (ELM) results. The liver US data consists of 63 patients (27 normal/36 abnormal). Using the K10 cross-validation protocol (90% training and 10% testing) CNNa, SVM, ELMDetection and risk stratification accuracies are: 82%, 92%, and 100% for SVM, ELM, and DL systems, respectively. The corresponding area under the curve is: 0.79, 0.92, and 1.0, respectively.
-
Retrospective study all NAFLD cases no steatosis
-
The systems take longer convergence time unlike ELM, which only has single layer.
Zamanian et al. [52]Liver biopsy (>5% hepatocyte steatosis)55 patients with fatty liver.CNN +  SVM We implemented pre-trained convolutional neural networks of Inception-
ResNetv2,
GoogleNet, AlexNet, and ResNet101 to extract features from the images and after combining these resulted features.
The area under the receiver operating characteristic curve (AUC) for the introduced combined network resulted in 0.9999.Prospective study, 50% steatosis cases
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Al-Hasani, M.; Sultan, L.R.; Sagreiya, H.; Cary, T.W.; Karmacharya, M.B.; Sehgal, C.M. Ultrasound Radiomics for the Detection of Early-Stage Liver Fibrosis. Diagnostics 2022, 12, 2737. https://doi.org/10.3390/diagnostics12112737

AMA Style

Al-Hasani M, Sultan LR, Sagreiya H, Cary TW, Karmacharya MB, Sehgal CM. Ultrasound Radiomics for the Detection of Early-Stage Liver Fibrosis. Diagnostics. 2022; 12(11):2737. https://doi.org/10.3390/diagnostics12112737

Chicago/Turabian Style

Al-Hasani, Maryam, Laith R. Sultan, Hersh Sagreiya, Theodore W. Cary, Mrigendra B. Karmacharya, and Chandra M. Sehgal. 2022. "Ultrasound Radiomics for the Detection of Early-Stage Liver Fibrosis" Diagnostics 12, no. 11: 2737. https://doi.org/10.3390/diagnostics12112737

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop