Next Article in Journal
Complete Genomic Characterisation and Mutation Patterns of Iraqi SARS-CoV-2 Isolates
Next Article in Special Issue
Electrical Stimulation for Preventing Skin Injuries in Denervated Gluteal Muscles—Promising Perspectives from a Case Series and Narrative Review
Previous Article in Journal
Differences in Striatal Metabolism in [18F]FDG PET in Parkinson’s Disease and Atypical Parkinsonism
Previous Article in Special Issue
Neuromuscular Stability of Dental Occlusion in Patients Treated with Aligners and Fixed Orthodontic Appliance: A Preliminary Electromyographical Longitudinal Case-Control Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Specification of Neck Muscle Dysfunction through Digital Image Analysis Using Machine Learning

1
Institute of Precision Medicine, Medical and Life Sciences, Hochschule Furtwangen, 78054 Villingen-Schwenningen, Germany
2
Physiotherapie, Fakultät Gesundheit, Sicherheit, Gesellschaft, Hochschule Furtwangen, Studienzentrum Freiburg, 79110 Freiburg, Germany
*
Author to whom correspondence should be addressed.
Diagnostics 2023, 13(1), 7; https://doi.org/10.3390/diagnostics13010007
Submission received: 16 November 2022 / Revised: 17 December 2022 / Accepted: 18 December 2022 / Published: 21 December 2022
(This article belongs to the Special Issue 2022 Padua Days on Muscle and Mobility Medicine – On-Site)

Abstract

:
Everyone has or will have experienced some degree of neck pain. Typically, neck pain is associated with the sensation of tense, tight, or stiff neck muscles. However, it is unclear whether the neck muscles are objectively stiffer with neck pain. This study used 1099 ultrasound elastography images (elastograms) obtained from 38 adult women, 20 with chronic neck pain and 18 asymptomatic. For training machine learning algorithms, 28 numerical characteristics were extracted from both the original and transformed shear wave velocity color-coded images as well as from respective image segments. Overall, a total number of 323 distinct features were generated from the data. A supervised binary classification was performed, using six machine-learning algorithms. The random forest algorithm produced the most accurate model to distinguish the elastograms of women with chronic neck pain from asymptomatic women with an AUC of 0.898. When evaluating features that can be used as biomarkers for muscle dysfunction in neck pain, the region of the deepest neck muscles (M. multifidus) provided the most features to support the correct classification of elastograms. By constructing summary images and associated Hotelling’s T2 maps, we enabled the visualization of group differences and their statistical confirmation.

1. Introduction

Almost everyone has or will have experienced some degree of neck pain [1]. Most neck pain is unspecific with the tendency to become recurrent [2]. The one-year prevalence of disabling neck pain has been estimated between 1.7% and 11.5% [3], and the risk is especially high in middle-aged women who have already experienced neck pain [2]. Typically, neck pain is associated with the sensation of tense, tight, or stiff neck muscles [4]. Therapeutic interventions often include measures to reduce neck muscle tone. However, it is unclear whether the neck muscles are objectively stiffer with neck pain. Some investigations documented higher stiffness of the neck muscles with neck pain [5,6], while others did not find differences compared to asymptomatic study participants [7,8]. Knowledge of a potentially increased objective stiffness of the neck muscles is important when related to diagnosis or therapeutic decisions [9].
Currently, shear-wave elastography is the superior modality for measurements of tissue stiffness. Ultrasound shear wave elastography (SWE) is more precise than muscle hardness meters [10] and enables reliable measurements of neck muscle stiffness [11,12]. However, the methods to measure muscle stiffness vary between studies. Muscle stiffness is usually not homogeneous but varies locally. Most SWE systems provide a region of interest (ROI) for the measurements that is much smaller than the visible area of the muscle. The active and individual positioning of the ROI has a biasing potential. Some systems allow for the manual circumscription of a visually determined muscle region to include a large muscle area. Often, the elastograms include some black areas in which the shear waves could not be tracked. These areas affect the measurements and must be avoided, which again reduces the included measured muscle area. It can be assumed that the inclusion of all pixels with sufficient quality over the full visible muscle area provides the most representative stiffness measurements for the respective muscle.
Although some research has been carried out on investigating neck muscle stiffness, only one study [7] has attempted to investigate muscle stiffness differences in the deep neck muscles between chronic neck pain and asymptomatic patients. Inclusion of the deep neck muscles may be of relevance because the superficially accessible trapezius muscle primarily controls scapular movement [12]. Moreover, evidence supports differences in the activation of the deep neck muscles with neck pain [13,14]. Several studies support altered and less variable activation of the deep-lying spinal muscles in chronic spinal pain [15,16,17].
New technologies have become a standard tool to address diverse research questions in broad contexts in medical diagnosis. This covers various methods, e.g., utilizing virtual reality to detect motion sickness in Parkinson’s disease patients [18], or using machine learning to reason about aging and a healthy lifestyle based on soft tissue measurements [19]. Specifically, the usage of artificial intelligence algorithms in image analysis has seen a growing occurrence in recent publications.
Machine learning algorithms have been employed, for example, for the classification of chronic liver diseases [20], breast tumors [21], thyroid nodules [22], prostate cancer [23] and knee cartilage degeneration [24]. On the other hand, deep learning convolutional neural network algorithms have been used for the segmentation of muscles as well as the automatic measurement of muscle thickness and muscle fat infiltration [25,26,27]. To the best of our knowledge, there have been no attempts to employ machine learning algorithms in ultrasound elastography images for the classification of neck pain. Moreover, such an automated approach enables a computed image analysis, which may provide new insights into differences in the mechanical properties of the neck muscles in individuals with neck pain compared to asymptomatic individuals. The study presented here extends the analysis carried out by Dieterich et al. in 2020 and proposes the application of modern image analysis and machine learning methods considering different aspects of the stiffness areas in the neck muscles to investigate the changes of objective muscle stiffness with chronic neck pain detectable in 2-D ultrasound elastography.
The primary objective of this research was to train a machine learning model that by analyzing elastography shear wave images from the neck area can objectively predict neck muscle dysfunction with high accuracy. The secondary aim was to examine the spatial disposition of the stiffness difference in the neck extensor muscles in the active and resting state between women with and without chronic neck pain.
The remainder of the paper is organized into four additional distinct sections. The second section is concerned with the materials and methodology employed for this study. In the third section, we present the results. In the fourth section, we explain our findings in more detail and compare them with the findings of other studies. The final section consists of a summary of the study’s findings.

2. Materials and Methods

The current study used ultrasound elastography images (elastograms) from a cross-sectional, observational study undertaken by Dieterich et al. in 2020 [7]. The original study has been approved by Medical Ethics Committee of the Medical Center Göttingen, Germany (No.23/10/15). Participants provided written informed consent prior to study participation, and all procedures were performed in accordance with the Declaration of Helsinki. The elastograms were obtained from 38 adult women, 20 with chronic neck pain and 18 asymptomatic. The inclusion criteria for the pain group were non-specific pain longer than six months with symptom duration of at least one week, neck stiffness sensation, and a Neck Disability Index higher than 10/50. On the other hand, for the control group, inclusion criteria included no history of recurrent neck or low back pain or neck pain that affected the neck function or required treatment. Exclusion criteria were major circulatory, neurological, or respiratory disorders, pregnancy, cervical spinal surgery, participation in neck muscle training in the past 6 months, body mass index higher than 30, and intake of medications that may affect muscle stiffness. On the days of imaging, participants were asked to not use pain medication. As shown in the demographic summary (Table 1), the sample size, age, and body mass index of the two groups are comparable. The range of motion in flexion, extension, and maximal voluntary isometric contraction is reduced in the pain group.
During the imaging sessions, participants performed 14 diverse activities including a head lift from prone, stressful office work, they balanced a weight (1 kg) on the head and graded isometric neck extension at force levels of 12 N, 24 N, 36 N, and 48 N. The activities were repeated thrice, in three different imaging sessions. Overall, this would lead to (20 + 18) × 14 × 3 = 1596 images. However, not all participants were able to perform a complete set of activities during each imaging session, and images from four (two control and two pain) participants had to be removed due to incomplete clinical data. During the quality control, eight images with a missing quality map were excluded from the analysis. The images have been recorded from 34 (16 control–18 pain) participants in a median of 14 (IQR: 13–14) images per participant in the first session, 11 (IQR: 10–11) images in the second session, and nine (IQR: 8–9) images in the third session. In total, 1099 images (512 control-587 pain), 454 from the first session, 356 images from the second session, and 289 images from the third session were useable for the image analysis.
Elastograms were obtained on the neck extensor muscle group 1 cm lateral to the spinous processes, the transducer centered at the C4 level in longitudinal orientation (Acuson S3000; Siemens, Germany, 9L4 linear transducer with 4 cm footprint). A maximal shear wave speed of 10m/s was set. Gain (14–20 dB), dynamic range (45–65 dB), and image depth 3.5–4.5 cm were adjusted for good visualization.
Each measurement produced three images, a B-mode ultrasound image; a color-coded elastogram indicating the shear wave velocity in the ROI with a range of 0.5–10 m/s; and a color-coded quality map presenting the quality of shear wave detection in each pixel ranging from low quality to high quality. The images have a resolution of 1024 × 768 and were stored in BMP file format. For the data analysis, we categorized the images according to functional differences of the performed tasks. We distinguished a low force and a high force category. The low force category consisted of images obtained while the participants were in a resting state or performing neck extension with a maximal force level of 12 N, whereas the high force category consisted of images obtained while the participants performed neck extension at levels of 36 N and 48 N. Alternatively, images were categorized according to tasks that require precise muscle activation (targeting a specific force level, balancing task) and less precise muscle activation (office work and prone neck extension).
The image analysis was carried out using the free open-source programming language Python (3.10) [28]. The packages used in the analysis are: Numpy (1.23.4) [29] and Scipy (1.9.3) [30] to extend the Python core functionally for scientific computing; Matplotlib (3.6.2) [31] for figure generation and visualization; Pandas (1.3.5) [32] for producing and the manipulation of data tables; Pillow (9.3.0) [33] and Scikit-Image (0.19.3) [34] for image processing; and Scikit-Learn (1.1.3) [35] for machine learning.
For the training of the machine learning algorithms, 28 features were handcrafted, extracting numeric information from the shear wave velocity color-coded images (Table 2). The first set of numerical features was computed by performing various statistical operations on the whole image, such as computing grand mean, median, and standard deviation. Furthermore, the images were separated into three color channels: red, green, and blue. The statistical operations and grand sum were computed for each color channel of each image. The images were converted into grayscale color mode and the same features were extracted. For the second set of features, an inverse mapping was performed from RGB color mode to stiffness using the full range shear velocity scale (Figure 1). The number of pixels in the images was quantified from three stiffness level categories: low-level stiffness category with a shear velocity between 0.5–3.67 m/s, medium-level stiffness category with a shear velocity between 3.67–6.84 m/s, and high-level stiffness category with a shear velocity between 6.84–10 m/s. The third set of features was obtained with Gabor filtering, which was used in several studies for texture classification [36,37,38]. In this study, we used the mean and variance of the resulting convoluted image as features, using Gabor kernels with four different angles: 45°, 90°, 135° and 180°. Finally, the images were divided into ten equal horizontal segments and the aforementioned operations were repeated for each of the segments leading to a total number of 308 features. To the 308 features extracted from each shear wave elastography image, 14 activities performed by the participants were added as 13 dummy variables in order to investigate the impact of different physical activities on the differences between both groups. Additionally, to investigate the technical variability, as a negative control, two dummy variables were added, representing the three imaging sessions. Overall, this gives a total of 323 features extracted from each of the 1099 images.
A supervised binary classification was performed, using six classification machine learning algorithms from the package scikit-learn [35], as follows: Random Forest (RF) [39], Decision Trees (DT) [40], Support Vector Machines (SVM) [41], K Nearest Neighbor (KNN) [42], Logistic Regression (LR) [43] and Naive Bayes (NB) [44]. The main task was a prediction of one of the two target classes: participants with neck pain and participants without neck pain. The class label was assigned to the images before the machine learning. Figure 2 depicts two pairs of elastograms used for the training of machine learning models. For model training and validation, we carried out nested 10-fold cross-validation with the whole dataset [45]. The model training and validation were repeated 10 times with new hyperparameter optimization using a grid search algorithm and using new ten random folds. Finally, with an intention to find reliable biomarkers that can be used to objectify and enhance future diagnostics, we extracted the features used for training the RF model and sorted them by importance (Figure 3).
To comprehensively explore the spatial disposition of the stiffness differences between the two groups, summary images were produced, and visual analysis was carried out (Figure 4 and Figure 5). For quality control, the quality maps generated during the imaging sessions were used and pixels with a quality lower than 50% on the quality map were excluded. The summary images were generated by calculating the arithmetic mean pixelwise on the intersection of the region of interest of all images in each group. Summary images retain the color coding of the original SWE images. Additionally, they were generated for the different categories of activities performed by the participants. Therefore, they provide a visual overview for each group and enable a visual comparison by depicting the average level of stiffness in each pixel per group. To statistically verify the difference between both groups, for RGB values of each pixel Hotelling’s T2 test, a multivariate test comparing 3-dimensional mean vectors was performed [46]. Using this method, Hotelling’s T2 maps were generated for each of the different categories, revealing the regions with significant differences. The map is a color-coded representation of p-values for each pixel, indicating three categories: p-value greater than 0.05, lower than 0.05, and lower than 0.01.

3. Results

The arithmetic mean and the confidence interval of the performance metrics, as empirical quantiles (2.5%; 97.5%), were computed from 100 cross-validation results. The RF model showed the highest mean AUC score, closely followed by KNN and SVM with the Radial Basis Function (RBF) kernel (Table 3).
A major part of the most important features extracted from the RF model are coming from the deepest horizontal segment, which corresponds to the multifidus muscle closest to the spine (Figure 3).
The summary images of both groups use the same color coding as the elastograms and enable a visual group comparison. Hotelling’s T2 test maps mark the regions of significant differences between the two groups.
The visual inspection of the summary images (Figure 4 and Figure 5) shows apparent group differences in the lower half of the images corresponding to the deeper neck muscles m. semispinalis cervicis and m. multifidus. The large regions of significant difference are visible on Hotelling’s T2 maps (Figure 4 and Figure 5) in the lower layer of the image, while the upper half of the maps represents m. semispinalis capitis, m. splenius capitis, and m. trapezius, and displays very small regions of significant difference. Summary images of the other tasks are available in the Supplementary Data.

4. Discussion

In the current study, we trained and evaluated machine learning models to support the diagnosis of muscle dysfunction in neck pain. The RF model as well as KNN and SVM showed a very good performance for distinguishing elastograms of women with chronic neck pain from asymptomatic women. As we are looking for biomarkers for muscle dysfunction in neck pain, we evaluated the 20 most important features from RF. Interestingly, the region of deepest neck muscle (M. multifidus) provided nine of these 20 features. By constructing summary images and associated Hotelling’s T2 maps, we enabled the visualization of group differences and its statistical confirmation. Each of these findings is discussed in detail in the following paragraphs.
Among the six machine learning classifiers that were assessed, the RF demonstrated the highest mean performance, reaching an AUC score of 0.898 (with sensitivity, specificity, accuracy of 78.2%, 84.0% and 81.2%, respectively). The second-best performing algorithms are KNN (accuracy 78.9%, sensitivity 79.6%, specificity 78.2% and 0.886 AUC score) and SVM with RBF kernel (accuracy 81.4%, sensitivity 80.2%, specificity 82.5%, and 0.884 AUC score). The other algorithms showed clearly lower performances. A possible explanation for these results may be the heterogeneity of the medical data, the variance introduced by the pseudo-random selection process, or overfitting due to the high number of features in the training step. In reviewing the literature, we could not find other studies that employed machine learning algorithms on SWE data to analyze neck muscle pain. The most similar approach was used in a study by Huang et al. in 2020 [22], who did a comparative analysis of five prediction models for cervical lymph node metastasis prediction. Consistent with our findings, an RF model demonstrated the highest predictive accuracy (0.889 AUC score). Another study that used an SVM model to classify SWE images of chronic liver disease reported a maximum predictive accuracy of 87.3% [20].
The most indicative features chosen from the RF algorithm are represented in Figure 3. What stands out is that many of the features are extracted from the lowest horizontal segment, which corresponds to the multifidus muscle. This suggests that a considerable amount of information for the distinction of both groups is located in the tissue of the deep neck muscles close to the spine. Contrary to our expectations, the type of activity that participants performed did not play an important role in the classification.
The secondary aim of this study was to examine the spatial disposition of the stiffness differences in the neck extensor muscles in the active and resting state. For this purpose, we generated summary images and Hotelling’s T2 maps for each activity performed during the imaging sessions. The summary images and Hotelling’s T2 maps (Figure 4 and Figure 5) indicate the presence of differences in stiffness in the neck muscle tissue between groups, which are more pronounced with active muscle recruitment. Figure 5 demonstrates the positive correlation between force intensity, stiffness levels, and difference between groups. Generally, higher muscle stiffness is correlated with higher muscle activation [47,48,49]. Other neck pain studies reported reduced activation of the deep neck muscles in the pain group [13,14], which was also observed in our summary images of the neck muscles with graded extension (Figure 5). On the contrary, this was not observed in the summary images of neck extension in prone (Figure 4). Reduced deep muscle activity might be a result of an adaptive mechanism to avoid or reduce pain sensation. It can be assumed that pain is higher with higher muscle activity, so the adaptive mechanism might be involuntary pain management. The observation of the resting and active states in prone summary images indicates higher muscle stiffness during the resting state in the pain group (Figure 4). This might be a result of a reduced ability to relax the deep neck muscles, which again might indicate a protective strategy [50]. Up to now, there has been little research that directly assesses passive stiffness in neck muscles [9]. The higher passive stiffness in the pain group suggests that chronic pain in the neck muscles influences the passive level of stiffness. The summary images of graded neck extension display the effect of the force intensity on the spatial disposition and stiffness intensity (Figure 5). The stiffness builds up, originating from the deep neck muscles, corresponding to the lower portion of semispinalis cervicis and m. multifidus. This finding was also reported by Yavuz et al. in 2015 [49]. With the increase of force, the stiffness of the superficial muscles increases, and this increase in stiffness is pronounced in the control group. A possible interpretation might be that asymptomatic participants could use the neck muscles more actively and reach higher muscle activation levels with higher force, while the participants with neck pain involuntarily limited themselves to control and minimize the painful sensations. On the other hand, the compensation or adaptation to force may lead to different patterns of activation between groups.
With regard to the research method, some limitations should be acknowledged. The data presented here was obtained from a rather small number of individuals. An absence of a feature selection step in some of the models might have led to over-fitting or may have introduced bias in the training step, resulting in a limited performance. An additional weakness is that the machine learning models were not validated with external data, which limits the generalizability of the study results.
Further work is required to confirm and validate our findings using independent data from more comprehensive cohorts including diverse genders, consisting of measurements recorded by different persons using different machines. Moreover, additional research is needed to better understand the discrepancies between both groups and to ascertain the possibility of adopting machine learning algorithms as a robust and reliable diagnostic tool for neck pain diagnosis.

5. Conclusions

This study highlights the potential usefulness of machine learning methods, especially the RF, KNN and SVM models, for diagnosing neck pain by analyzing SWE images. Moreover, the feature importance extracted from the RF model indicate that the deeper neck muscles contain important information for the classification. The summary images and Hotelling’s T2 maps illustrate the difference of muscle stiffness in passive state, which becomes more pronounced during active muscle recruitment. Neck pain is a complex condition and until now the diagnosis has been subjective, dependent on the participant’s personal sensation of pain and the diagnostic experience of the diagnostician. The feasibility to train a classification machine learning algorithm with very good accuracy provides initial tentative evidence of objective alterations in the neck muscles of patients with chronic neck pain that can be captured with shear wave elastography. Such a machine learning algorithm that can classify with an accuracy of or better than a trained diagnostician will improve medical diagnosis and therapy.

Supplementary Materials

Author Contributions

F.P. wrote the manuscript and carried out the analysis. J.S. aided in conducting the analysis and interpreting the results. A.D. provided the sample datasets and supervised the project. M.K. contributed to the interpretation of the results and supervised the project. A.D, and M.K. revised the manuscript and contributed to the editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by an institutional grant (IAF Tandem 2019) of the Institute of Applied Research of Furtwangen University. We acknowledge support for the Article Processing Charge from the DFG.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The complete Python code developed in this project is open-source and is available on GitHub (https://github.com/fpaskali/neck-swe-classification (accessed on 9 December 2022)) under L-GPL-3 license. The data that support the findings of this study are available from AD, upon reasonable request.

Acknowledgments

We would like to thank Weronika Schary for her constructive comments and support in editing the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Haldeman, S.; Carroll, L.J.; Cassidy, J.D. The Empowerment of People with Neck Pain: Introduction. The Bone and Joint Decade 2000-2010 Task Force on Neck Pain and Its Associated Disorders. J. Manip. Physiol. Ther. 2009, 32, S10–S16. [Google Scholar] [CrossRef] [PubMed]
  2. Blanpied, P.R.; Gross, A.R.; Elliott, J.M.; Devaney, L.L.; Clewley, D.; Walton, D.M.; Sparks, C.; Robertson, E.K. Neck Pain: Revision 2017. J. Orthop. Sports Phys. Ther. 2017, 47, A1–A83. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Hogg-Johnson, S.; van der Velde, G.; Carroll, L.J.; Holm, L.W.; Cassidy, J.D.; Guzman, J.; Côté, P.; Haldeman, S.; Ammendolia, C.; Carragee, E.; et al. The Burden and Determinants of Neck Pain in the General Population: Results of the Bone and Joint Decade 2000-2010 Task Force on Neck Pain and Its Associated Disorders. Spine 2008, 33, S39–S51. [Google Scholar] [CrossRef] [PubMed]
  4. MacDermid, J.C.; Walton, D.M.; Bobos, P.; Lomotan, M.; Carlesso, L. A Qualitative Description of Chronic Neck Pain Has Implications for Outcome Assessment and Classification. Open Orthop. J. 2016, 10, 746–756. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Taş, S.; Korkusuz, F.; Erden, Z. Neck Muscle Stiffness in Participants With and Without Chronic Neck Pain: A Shear-Wave Elastography Study. J. Manip. Physiol. Ther. 2018, 41, 580–588. [Google Scholar] [CrossRef] [PubMed]
  6. Hvedstrup, J.; Kolding, L.T.; Ashina, M.; Schytz, H.W. Increased Neck Muscle Stiffness in Migraine Patients with Ictal Neck Pain: A Shear Wave Elastography Study. Cephalalgia 2020, 40, 565–574. [Google Scholar] [CrossRef]
  7. Dieterich, A.V.; Yavuz, U.Ş.; Petzke, F.; Nordez, A.; Falla, D. Neck Muscle Stiffness Measured With Shear Wave Elastography in Women With Chronic Nonspecific Neck Pain. J. Orthop. Sports Phys. Ther. 2020, 50, 179–188. [Google Scholar] [CrossRef]
  8. Ishikawa, H.; Muraki, T.; Morise, S.; Sekiguchi, Y.; Yamamoto, N.; Itoi, E.; Izumi, S.-I. Changes in Stiffness of the Dorsal Scapular Muscles before and after Computer Work: A Comparison between Individuals with and without Neck and Shoulder Complaints. Eur. J. Appl. Physiol. 2017, 117, 179–187. [Google Scholar] [CrossRef]
  9. Aljinović, J.; Barišić, I.; Poljičanin, A.; Kuzmičić, S.; Vukojević, K.; Gugić Bokun, D.; Vlak, T. Can Measuring Passive Neck Muscle Stiffness in Whiplash Injury Patients Help Detect False Whiplash Claims? Wien. Klin. Wochenschr. 2020, 132, 506–514. [Google Scholar] [CrossRef]
  10. Akagi, R.; Kusama, S. Comparison Between Neck and Shoulder Stiffness Determined by Shear Wave Ultrasound Elastography and a Muscle Hardness Meter. Ultrasound Med. Biol. 2015, 41, 2266–2271. [Google Scholar] [CrossRef]
  11. Xie, Y.; Thomas, L.; Hug, F.; Johnston, V.; Coombes, B.K. Quantifying Cervical and Axioscapular Muscle Stiffness Using Shear Wave Elastography. J. Electromyogr. Kinesiol. Off. J. Int. Soc. Electrophysiol. Kinesiol. 2019, 48, 94–102. [Google Scholar] [CrossRef] [PubMed]
  12. Dieterich, A.V.; Andrade, R.J.; Le Sant, G.; Falla, D.; Petzke, F.; Hug, F.; Nordez, A. Shear Wave Elastography Reveals Different Degrees of Passive and Active Stiffness of the Neck Extensor Muscles. Eur. J. Appl. Physiol. 2017, 117, 171–178. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Schomacher, J.; Falla, D. Function and Structure of the Deep Cervical Extensor Muscles in Patients with Neck Pain. Man. Ther. 2013, 18, 360–366. [Google Scholar] [CrossRef] [PubMed]
  14. O’Leary, S.; Falla, D.; Jull, G. The Relationship between Superficial Muscle Activity during the Cranio-Cervical Flexion Test and Clinical Features in Patients with Chronic Neck Pain. Man. Ther. 2011, 16, 452–455. [Google Scholar] [CrossRef]
  15. Schomacher, J.; Falla, D. Reduced Specificity of Semispinalis Cervicis Activity in Women with Persistent Neck Pain. Eur. J. Pain Suppl. 2011, 5, 41. [Google Scholar] [CrossRef]
  16. Falla, D.; Jull, G.; Hodges, P.W. Feedforward Activity of the Cervical Flexor Muscles during Voluntary Arm Movements Is Delayed in Chronic Neck Pain. Exp. Brain Res. 2004, 157, 43–48. [Google Scholar] [CrossRef]
  17. Wallwork, T.L.; Stanton, W.R.; Freke, M.; Hides, J.A. The Effect of Chronic Low Back Pain on Size and Contraction of the Lumbar Multifidus Muscle. Man. Ther. 2009, 14, 496–500. [Google Scholar] [CrossRef]
  18. Petel, A.; Jacob, D.; Aubonnet, R.; Frismand, S.; Petersen, H.; Gargiulo, P.; Perrin, P. Motion Sickness Susceptibility and Visually Induced Motion Sickness as Diagnostic Signs in Parkinson’s Disease. Eur. J. Transl. Myol. 2022, 32, 4. [Google Scholar] [CrossRef]
  19. Recenti, M.; Ricciardi, C.; Edmunds, K.; Jacob, D.; Gambacorta, M.; Gargiulo, P. Testing Soft Tissue Radiodensity Parameters Interplay with Age and Self-Reported Physical Activity. Eur. J. Transl. Myol. 2021, 31, 9929. [Google Scholar] [CrossRef]
  20. Gatos, I.; Tsantis, S.; Spiliopoulos, S.; Karnabatidis, D.; Theotokas, I.; Zoumpoulis, P.; Loupas, T.; Hazle, J.D.; Kagadis, G.C. A Machine-Learning Algorithm Toward Color Analysis for Chronic Liver Disease Classification, Employing Ultrasound Shear Wave Elastography. Ultrasound Med. Biol. 2017, 43, 1797–1810. [Google Scholar] [CrossRef]
  21. Mao, Y.-J.; Lim, H.-J.; Ni, M.; Yan, W.-H.; Wong, D.W.-C.; Cheung, J.C.-W. Breast Tumour Classification Using Ultrasound Elastography with Machine Learning: A Systematic Scoping Review. Cancers 2022, 14, 367. [Google Scholar] [CrossRef] [PubMed]
  22. Zhang, B.; Tian, J.; Pei, S.; Chen, Y.; He, X.; Dong, Y.; Zhang, L.; Mo, X.; Huang, W.; Cong, S.; et al. Machine Learning–Assisted System for Thyroid Nodule Diagnosis. Thyroid 2019, 29, 858–867. [Google Scholar] [CrossRef] [PubMed]
  23. Wildeboer, R.R.; Mannaerts, C.K.; van Sloun, R.J.G.; Budäus, L.; Tilki, D.; Wijkstra, H.; Salomon, G.; Mischi, M. Automated Multiparametric Localization of Prostate Cancer Based on B-Mode, Shear-Wave Elastography, and Contrast-Enhanced Ultrasound Radiomics. Eur. Radiol. 2020, 30, 806–815. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Ciliberti, F.K.; Cesarelli, G.; Guerrini, L.; Gunnarsson, A.E.; Forni, R.; Aubonnet, R.; Recenti, M.; Jacob, D.; Jónsson, H.; Cangiano, V.; et al. The Role of Bone Mineral Density and Cartilage Volume to Predict Knee Cartilage Degeneration. Eur. J. Transl. Myol. 2022, 32, 10678. [Google Scholar] [CrossRef]
  25. Orhan, K.; Yazici, G.; Kolsuz, M.E.; Kafa, N.; Bayrakdar, I.S.; Çelik, Ö. An Artificial Intelligence Hypothetical Approach for Masseter Muscle Segmentation on Ultrasonography in Patients With Bruxism. J. Adv. Oral Res. 2021, 12, 206–213. [Google Scholar] [CrossRef]
  26. Weber, K.A.; Abbott, R.; Bojilov, V.; Smith, A.C.; Wasielewski, M.; Hastie, T.J.; Parrish, T.B.; Mackey, S.; Elliott, J.M. Multi-Muscle Deep Learning Segmentation to Automate the Quantification of Muscle Fat Infiltration in Cervical Spine Conditions. Sci. Rep. 2021, 11, 16567. [Google Scholar] [CrossRef]
  27. Loram, I.; Siddique, A.; Sanchez, M.B.; Harding, P.; Silverdale, M.; Kobylecki, C.; Cunningham, R. Objective Analysis of Neck Muscle Boundaries for Cervical Dystonia Using Ultrasound Imaging and Deep Learning. IEEE J. Biomed. Health Inform. 2020, 24, 1016–1027. [Google Scholar] [CrossRef]
  28. Van Rossum, G.; Drake, F.L. Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, USA, 2009; ISBN 1-4414-1269-7. [Google Scholar]
  29. Harris, C.R.; Millman, K.J.; van der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J.; et al. Array Programming with NumPy. Nature 2020, 585, 357–362. [Google Scholar] [CrossRef]
  30. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef] [Green Version]
  31. Hunter, J.D. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng. 2007, 9, 90–95. [Google Scholar] [CrossRef]
  32. McKinney, W. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July 2010; Volume 445, pp. 51–56. [Google Scholar]
  33. Clark, A. Pillow (PIL Fork) Documentation 2015. Available online: https://pillow.readthedocs.io/en/stable/ (accessed on 15 November 2022).
  34. van der Walt, S.; Schönberger, J.L.; Nunez-Iglesias, J.; Boulogne, F.; Warner, J.D.; Yager, N.; Gouillart, E.; Yu, T. Scikit-Image: Image Processing in Python. PeerJ 2014, 2, e453. [Google Scholar] [CrossRef] [PubMed]
  35. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  36. Palm, C.; Lehmann, T.M. Classification of Color Textures by Gabor Filtering. Mach. Graph. Vis. Int. J. 2002, 11, 26. [Google Scholar]
  37. Bianconi, F.; Fernández, A. Evaluation of the Effects of Gabor Filter Parameters on Texture Classification. Pattern Recognit. 2007, 40, 3325–3335. [Google Scholar] [CrossRef] [Green Version]
  38. Idrissa, M.; Acheroy, M. Texture Classification Using Gabor Filters. Pattern Recognit. Lett. 2002, 23, 1095–1102. [Google Scholar] [CrossRef]
  39. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  40. Breiman, L.; Friedman, J.; Olshen, R.; Stone, C.J. Classification and Regression Trees; Wadsworth: Belmont, CA, USA, 1984. [Google Scholar]
  41. Chang, C.-C.; Lin, C.-J. LIBSVM: A Library for Support Vector Machines. ACM Trans. Intell. Syst. Technol. 2011, 2, 27:1–27:27. [Google Scholar] [CrossRef]
  42. Fix, E.; Hodges, J.L. Discriminatory Analysis. Nonparametric Discrimination: Consistency Properties. Int. Stat. Rev. Rev. Int. Stat. 1989, 57, 238–247. [Google Scholar] [CrossRef]
  43. LIBLINEAR—A Library for Large Linear Classification. Available online: https://www.csie.ntu.edu.tw/~cjlin/liblinear/ (accessed on 18 November 2021).
  44. The Optimality of Naive Bayes. Available online: https://www.aaai.org/Library/FLAIRS/2004/flairs04-097.php (accessed on 18 November 2021).
  45. Cawley, G.C.; Talbot, N.L.C. On Over-Fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation. J. Mach. Learn. Res. 2010, 11, 2079–2107. [Google Scholar]
  46. The Generalization of Student’s Ratio|SpringerLink. Available online: https://link.springer.com/chapter/10.1007/978-1-4612-0919-5_4 (accessed on 5 December 2022).
  47. Kozinc, Ž.; Šarabon, N. Shear-Wave Elastography for Assessment of Trapezius Muscle Stiffness: Reliability and Association with Low-Level Muscle Activity. PLoS ONE 2020, 15, e0234359. [Google Scholar] [CrossRef]
  48. Bouillard, K.; Nordez, A.; Hug, F. Estimation of Individual Muscle Force Using Elastography. PLoS ONE 2011, 6, e29261. [Google Scholar] [CrossRef] [PubMed]
  49. Yavuz, A.; Bora, A.; Bulut, M.D.; Batur, A.; Milanlioglu, A.; Göya, C.; Andic, C. Acoustic Radiation Force Impulse (ARFI) Elastography Quantification of Muscle Stiffness over a Course of Gradual Isometric Contractions: A Preliminary Study. Med. Ultrason. 2015, 17, 49–57. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. van der Hulst, M.; Vollenbroek-Hutten, M.M.; Rietman, J.S.; Schaake, L.; Groothuis-Oudshoorn, K.G.; Hermens, H.J. Back Muscle Activation Patterns in Chronic Low Back Pain during Walking: A “Guarding” Hypothesis. Clin. J. Pain 2010, 26, 30–37. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Color-coded elastography shear wave image with muscle layer organization.
Figure 1. Color-coded elastography shear wave image with muscle layer organization.
Diagnostics 13 00007 g001
Figure 2. A sample of shear wave elastography images from the control and pain group, recorded during neck extension while lying prone; (a) The difference between classes is visually detectable, with increased stiffness in the bottom part of the image from the pain group; (b) The assigning to both classes by visual inspection is challenging.
Figure 2. A sample of shear wave elastography images from the control and pain group, recorded during neck extension while lying prone; (a) The difference between classes is visually detectable, with increased stiffness in the bottom part of the image from the pain group; (b) The assigning to both classes by visual inspection is challenging.
Diagnostics 13 00007 g002
Figure 3. Top 20 most important features extracted from trained Random Forest model, sorted by Gini importance. The names of the features are explained in more detail in the supplement Table S1.
Figure 3. Top 20 most important features extracted from trained Random Forest model, sorted by Gini importance. The names of the features are explained in more detail in the supplement Table S1.
Diagnostics 13 00007 g003
Figure 4. Summary mean images of the elastograms’ RGB values comparison between resting and active state of prone neck extension; Hotelling’s T2 test map—green color p-value < 0.05—yellow color p-value < 0.01.
Figure 4. Summary mean images of the elastograms’ RGB values comparison between resting and active state of prone neck extension; Hotelling’s T2 test map—green color p-value < 0.05—yellow color p-value < 0.01.
Diagnostics 13 00007 g004
Figure 5. Summary mean images of the elastograms’ RGB values of the neck muscles with graded neck extension; Hotelling’s T2 test map—green color p-value < 0.05—yellow color p-value < 0.01. In participants with neck pain, we note higher stiffness in the upper half of the image (superficial muscles) during the resting state in sitting, and less stiffness in the lower half of the images (deep muscles) during active neck extension.
Figure 5. Summary mean images of the elastograms’ RGB values of the neck muscles with graded neck extension; Hotelling’s T2 test map—green color p-value < 0.05—yellow color p-value < 0.01. In participants with neck pain, we note higher stiffness in the upper half of the image (superficial muscles) during the resting state in sitting, and less stiffness in the lower half of the images (deep muscles) during active neck extension.
Diagnostics 13 00007 g005
Table 1. Demographic summary: mean ± SD, median (IQR). Abbreviations: MVIC, maximal voluntary isometric contraction; NRS, numerical rating scale; SD, standard deviation; IQR, interquartile range.
Table 1. Demographic summary: mean ± SD, median (IQR). Abbreviations: MVIC, maximal voluntary isometric contraction; NRS, numerical rating scale; SD, standard deviation; IQR, interquartile range.
Neck Pain, n = 20Control, n = 18
Age (years)52.5 (12.0)48.5 (9.0)
Body Mass Index (kg/m2)23.8 ± 3.222.2 ± 2.4
Range of Motion Neck Flexion50.6° ± 12.4°69.4° ± 10.8°
Range of Motion Neck Rotation
(left + right)
115.5° ± 14.3°145.5° ± 26.6°
MVIC neck extension (N)56.3 ± 18.564.0 ± 19.2
Pain Today (NRS)3.6 ± 2.20
Pain Duration (years)8.0 (16.3)-
Neck Disability Index %
(scored 0%–100%)
32.5% ± 12.3%-
Table 2. The description of the features used in the training of machine learning models.
Table 2. The description of the features used in the training of machine learning models.
Feature NameDescription
meanGrand mean of all pixels from all color channels
medianGrand median of all pixels from all color channels
sdGrand standard deviation of all pixels from all color channels
red_layer_value_meanGrand mean of the intensities in the red image layer
red_layer_value_medianGrand median of the intensities in the red image layer
red_layer_value_sumGrand sum of the intensities in the red image layer
blue_layer_value_meanGrand mean of the intensities in the blue image layer
blue_layer_value_medianGrand median of the intensities in the blue image layer
blue_layer_value_sumGrand sum of the intensities in the blue image layer
green_layer_value_meanGrand mean of the intensities in the green image layer
green_layer_value_medianGrand median of the intensities in the green image layer
green_layer_value_sumGrand sum of the intensities in the green image layer
gray_valueGrand sum of the grayscale intensities
gray_value_meanGrand mean of the grayscale intensities
gray_value_medianGrand median of the grayscale intensities
gray_value_minMinimum intensity in the grayscale image
gray_value_maxMaximum intensity in the grayscale image
blue_pixelsNumber of low-level stiffness in the elastography image
green_pixelsNumber of medium-level stiffness in the elastography image
red_pixelsNumber of high-level stiffness in the elastography image
gabor_kernel1_meanGrand mean of the convoluted image with 45° Gabor kernel
gabor_kernel1_varGrand variance of the convoluted image with 45° Gabor kernel
gabor_kernel2_meanGrand mean of the convoluted image with 90° Gabor kernel
gabor_kernel2_varGrand variance of the convoluted image with 90° Gabor kernel
gabor_kernel3_meanGrand mean of the convoluted image with 135° Gabor kernel
gabor_kernel3_varGrand variance of the convoluted image with 135° Gabor kernel
gabor_kernel4_meanGrand mean of the convoluted image with 180° Gabor kernel
gabor_kernel4_varGrand variance of the convoluted image with 180° Gabor kernel
[feature_name]_hsegment_[n]/10Naming of the features extracted from the horizontal segments; n—the horizontal segment number (1—close to the surface, 10—close to the spine)
Table 3. Summary of the cross-validation during training of the classification machine learning algorithms (mean, confidence interval 95%) Abbreviations: KNN—K-Nearest Neighbor; SVM—Support Vector Machine; AUC—Area Under Curve.
Table 3. Summary of the cross-validation during training of the classification machine learning algorithms (mean, confidence interval 95%) Abbreviations: KNN—K-Nearest Neighbor; SVM—Support Vector Machine; AUC—Area Under Curve.
MetricsKNNLogistic RegressionNaïve BayesSVMDecision TreesRandom Forest
Accuracy0. 789
(0.721–0.863)
0.696
(0.609–0.773)
0.621
(0.540–0.690)
0.814
(0.755–0.877)
0.708
(0.618–0.795)
0.812
(0.755–0.873)
Balanced Accuracy *0.790
(0.719–0.863)
0.695
(0.607–0.767)
0.620
(0.537–0.689)
0.814
(0.754–0.876)
0.707
(0.619–0.798)
0.811
(0.750–0.872)
Sensitivity0.798
(0.701–0.890)
0.671
(0.526–0.818)
0.600
(0.455–0.736)
0.802
(0.708–0.889)
0.689
(0.543–0.833)
0.782
(0.686–0.872)
Specificity0.782
(0.687–0.868)
0.720
(0.612–0.824)
0.640
(0.527–0.741)
0.825
(0.744–0.907)
0.726
(0.568–0.816)
0.840
(0.770–0.924)
AUC Score0.886
(0.834–0.929)
0.771
(0.682–0.849)
0.682
(0.588–0.755)
0.884
(0.836–0.937)
0.707
(0.619–0.798)
0.898
(0.842–0.946)
Brier Score0.144
(0.112–0.174)
0.206
(0.161–0.255)
0.364
(0.297–0.451)
0.137
(0.107–0.169)
0.292
(0.382–0.205)
0.149
(0.128–0.166)
* arithmetic mean of sensitivity and specificity; The best results are in bold.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Paskali, F.; Simantzik, J.; Dieterich, A.; Kohl, M. Specification of Neck Muscle Dysfunction through Digital Image Analysis Using Machine Learning. Diagnostics 2023, 13, 7. https://doi.org/10.3390/diagnostics13010007

AMA Style

Paskali F, Simantzik J, Dieterich A, Kohl M. Specification of Neck Muscle Dysfunction through Digital Image Analysis Using Machine Learning. Diagnostics. 2023; 13(1):7. https://doi.org/10.3390/diagnostics13010007

Chicago/Turabian Style

Paskali, Filip, Jonathan Simantzik, Angela Dieterich, and Matthias Kohl. 2023. "Specification of Neck Muscle Dysfunction through Digital Image Analysis Using Machine Learning" Diagnostics 13, no. 1: 7. https://doi.org/10.3390/diagnostics13010007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop