Gray Level Co-Occurrence Matrix, Fractal and Wavelet Analyses of Discrete Changes in Cell Nuclear Structure following Osmotic Stress: Focus on Machine Learning Methods

Pantic, Igor; Valjarevic, Svetlana; Cumic, Jelena; Paunkovic, Ivana; Terzic, Tatjana; Corridon, Peter R.

doi:10.3390/fractalfract7030272

Open AccessArticle

Gray Level Co-Occurrence Matrix, Fractal and Wavelet Analyses of Discrete Changes in Cell Nuclear Structure following Osmotic Stress: Focus on Machine Learning Methods

by

Igor Pantic

^1,2,3,*

,

Svetlana Valjarevic

⁴

,

Jelena Cumic

⁵,

Ivana Paunkovic

⁶

,

Tatjana Terzic

⁷ and

Peter R. Corridon

^8,9,10

¹

Department of Medical Physiology, Faculty of Medicine, University of Belgrade, Višegradska 26/2, RS-11129 Belgrade, Serbia

²

University of Haifa, 199 Abba Hushi Blvd, Mount Carmel, Haifa IL-3498838, Israel

³

Department of Pharmacology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi P.O. Box 127788, United Arab Emirates

⁴

Clinical Hospital Center “Zemun”, Faculty of Medicine, University of Belgrade, Vukova 9, RS-11080 Belgrade, Serbia

⁵

University Clinical Centre of Serbia, Faculty of Medicine, University of Belgrade, Dr. Koste Todorovića 8, RS-11129, Belgrade, Serbia

⁶

Department of Histology and Embryology, Faculty of Medicine, University of Belgrade, Višegradska 26/2, RS-11129 Belgrade, Serbia

⁷

Department of Pathology, Faculty of Medicine, University of Belgrade, Dr. Subotića 1, RS-11129 Belgrade, Serbia

⁸

Department of Immunology and Physiology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi P.O. Box 127788, United Arab Emirates

⁹

Biomedical Engineering, Healthcare Engineering Innovation Center, Khalifa University of Science and Technology, Abu Dhabi P.O. Box 127788, United Arab Emirates

¹⁰

Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi P.O. Box 127788, United Arab Emirates

^*

Author to whom correspondence should be addressed.

Fractal Fract. 2023, 7(3), 272; https://doi.org/10.3390/fractalfract7030272

Submission received: 30 January 2023 / Revised: 14 March 2023 / Accepted: 16 March 2023 / Published: 20 March 2023

Download

Browse Figures

Versions Notes

Abstract

:

In this work, we demonstrate that it is possible to create supervised machine-learning models using a support vector machine and random forest algorithms to separate yeast cells exposed to hyperosmotic stress from intact cells. We performed fractal, gray level co-occurrence matrix (GLCM), and discrete wavelet transform analyses on digital micrographs of nuclear regions of interest of a total of 2000 Saccharomyces cerevisiae cells: 1000 exposed to hyperosmotic environments and 1000 control cells. For each nucleus, we calculated values for fractal dimension, angular second moment, inverse difference moment, textural contrast, correlation feature, textural variance, and discrete wavelet coefficient energy. The support vector machine achieved an acceptable classification accuracy of 71.7% in predicting whether the cell belonged to the experimental or control group. The random forest model performed better than the support vector machine, with a classification accuracy of 79.8%. These findings can serve as a starting point for developing AI-based methods that use GLCM, fractal, and wavelet data to classify damaged and healthy cells and make predictions about various physiological and pathological phenomena associated with osmotic stress.

Keywords:

cell; signal analysis; nucleus; stress; supervised learning

1. Introduction

In recent years, many new machine learning (ML) algorithms have been developed that can analyze and model data obtained from two-dimensional signals [1,2,3,4]. Many of these algorithms are based on supervised machine learning (SML) techniques such as binomial logistic regression, decision trees, support vector machines, and artificial neural networks [5,6,7]. These techniques and the resulting sensing systems have the potential to greatly improve image recognition and classification in various biological and medical fields. The use of SML for analyzing microscopic data in histology, pathology, and cell biology is a relatively new area of research, but it holds great promise for being integrated into future diagnostic protocols and procedures. Additionally, AI can automate and speed up decision-making in these fields by quickly processing large amounts of data. Furthermore, some supervised machine learning models may be able to detect patterns in microscopic data related to tissue and cell structure that are not visible to even the most experienced professionals [8,9,10,11,12].

There are many ways to train and test a machine learning model applicable to microscopy. RGB (red, green, blue) pixel intensities obtained from micrographs in JPG and BMP formats can sometimes be used as input data for AI training, such as in convolutional neural networks [13,14,15]. Another way is to perform a two-dimensional signal analysis and obtain a set of quantifications that are used as inputs. Recently different forms of texture analysis were suggested as an objective and efficient way to generate these quantifications that can be used for machine learning. An example would be gray-level co-occurrence matrix (GLCM) analysis, where pairs of resolution units with the assigned gray-level intensity quantifications are analyzed using second-order statistics [1,16,17,18]. This way, several important textural features can be calculated, such as angular second moment as an indicator of textural uniformity or inverse difference moment as an indicator of local homogeneity. Approaches based on GLCM have been used on numerous occasions to detect subtle morphological alterations in cells and tissues in various experimental settings, both in vivo and in vitro. It was suggested that GLCM as a method may detect changes in nuclear chromatin following the induction of cell damage. Sometimes, discrete wavelet transform (DWT), as a form of mathematical image analysis, is used as an addition to GLCM to provide useful insight into the changes in GLCM features [1,18]. Both GLCM and DWT indicators can be utilized to develop a machine learning model for the classification of microscopic phenomena or for predicting pathological processes.

Fractal analysis of microscopic data is also a way to obtain quantifications for machine learning that can later be used for classification or prediction. Fractal analysis enables us to indirectly measure the complexity of a signal, typically by determining the fractal dimension value. This technique is often used in microscopy in binarized or grayscale images of biological structures and can sometimes be useful in evaluating different regions of interest (ROIs) in micrographs representing parts of tissue or cellular components. Fractal analysis of nuclear structure was shown to be a potentially useful predictor of some pathological processes, with fractal dimension also being a potentially valuable prognostic indicator for the outcome of some diseases [19,20,21].

Osmotic stress is a potentially important contributing factor to the development of many diseases, particularly the ones associated with inflammation [22]. Protective biochemical responses and signaling pathways that are activated as the result of osmotic stress, although not entirely understood, are present in almost all organisms. Mild stress in some cells usually does not lead to substantial and microscopically-visible morphological changes; however, this does not necessarily imply that the cell remains structurally and functionally intact [23,24,25]. So far, to the best of our knowledge, fractal, GLCM, and wavelet analyses have not been used for the development of a computer sensing system capable of evaluating morphological changes associated with osmotic stress. Recently, machine learning models have been developed on several occasions for the prediction and classification of various physiological and pathological phenomena associated with osmotic stress. Some examples would be the prediction of ethanol yields in yeast fermentation cultures during high-sugar osmotic stress [26] or the classification of electrophysiological responses resulting from such stress in plants [27]. However, none of these or similar machine learning methods used fractal, GLCM, and wavelet data as inputs.

Our research presented in this article indicates that exposure of yeast cells to hyperosmotic stress causes notable alterations in the GLCM, fractal, and wavelet parameters of their nuclear structure. We also demonstrate that it is possible to use these indicators as input data for machine learning models, such as the ones based on random forest and support vector machine algorithms, to identify cells that have been exposed to hyperosmotic stress with acceptable accuracy. These findings provide a foundation for initiating the development of AI-based techniques that leverage GLCM, fractal, and wavelet data to distinguish between damaged and healthy cells and to forecast diverse physiological and pathological phenomena linked with osmotic stress.

2. Materials and Methods

Saccharomyces cerevisiae yeast cells similar to the ones described previously [8], previously purchased from commercially available sources, were kept in Yeast Extract Peptone–Dextrose (YPD) broth in an orbital shaker at 25 °C and pH 6.5 ± 0.2 with agitation at 200 rpm. The cell samples for the experiments were later transferred to special tissue chamber/slides, as mentioned in our previous publication [8]. The genetic information of Saccharomyces cerevisiae—a widely studied microorganism in molecular genetics and cell biology—can be found at The European Nucleotide Archive. The cells were exposed to a hyperosmotic environment by adding NaCl to reach 0.8 M concentration for 2 h, after which normal tonicity was swiftly restored. We created digital micrographs of the treated and control cells in JPG format using a TCA1000-C instrument equipped with an Aptina MT9J003 CMOS sensor mounted on OPTIC900TH Trinocular Biological Microscope (COLO LabExperts, Novo Mesto, Slovenia). The size of the micrographs was set to 3584 (width) × 2748 (height) resolution units, and the bit depth equaled 24. A similar approach was applied in our previously published work [8], although, in our study, we modified the values of color temperature, saturation, hue and other parameters in order to make the micrographs even more suitable for GLCM analysis. The micrographs were converted to an 8-bit grayscale format for the calculation of GLCM parameters.

For GLCM evaluation, we used our modification of plugins previously developed by Julio E. Cabrera and Toby C. Cornish for the ImageJ software (National Institutes of Health, Bethesda, MD, USA, version 1.53e based on 64-bit Java 1.8.0_172). We analyzed a total of 2000 circular nuclear regions of interest (ROI): 1000 ROIs of the cells exposed to a hyperosmotic environment and 1000 ROIs from control cells (Figure 1). As in our previous work, for each ROI, the values of 5 GLCM indicators were determined: angular second moment (ASM), inverse difference moment (IDM), contrast (CON), correlation (COR), and textural variance (VAR). The standard GLCM method is performed on gray-scale images, where each pixel is given a value based on its gray intensity. After that, value pairs are analyzed using second-order statistics, and GLCM features are calculated.

Considering that p(i,j) is the (i,j)th entry of the normalized co-occurrence matrix, the value of the inverse difference moment as the measure of local homogeneity was calculated as follows:

IDM = \sum_{i} \sum_{j} \frac{1}{{1 + (i - j)}^{2}} p (i, j)

Angular second moment describing the uniformity (orderliness) within the distribution of gray levels was determined as follows:

ASM = \sum_{i} \sum_{j} {\{p (i, j)\}}^{2}

Considering that μ and σ are the mean and the standard deviation, respectively, of rows x and y, within the normalized GLCM, the contrast and correlation features were determined as follows:

CO N = \sum_{i} \sum_{j} {(i - j)}^{k} P_{d} {[i, j]}^{n}

C O R = \frac{\sum_{i} \sum_{j} (i j) p (i, j) - μ_{x} μ_{y}}{σ_{x} σ_{y}}

The contrast in these terms essentially relates to the degree of variation of gray level intensities in the two-dimensional signal, whereas the correlation represents the linear dependencies of gray levels on the other levels of the neighboring resolution units [8,28]. The level of dispersion of the gray level intensity distribution, when considering the value of the GLCM mean, was quantified as variance:

VAR = \sum_{i = 1}^{N_{g}} \sum_{j = 1}^{N_{g}} {(i - μ)}^{2} p (i, j)

All the quantifications were analyzed as a part of a data frame (a two-dimensional data structure) in the Python Data Analysis Library (pandas)—an open-source platform for data analysis and manipulation.

Discrete wavelet transform (DWT) analysis of nuclear ROIs was performed in “Mazda” software previously prepared for the COST B21 European project “Physiological modelling of MR Image formation” and COST B11 AQ6 European project “Quantitative Analysis of Magnetic Resonance Image Texture”. The platform was created by Dr. Michal Strzelecki and Dr. Piotr Szczypinski (Institute of Electronics, Technical University of Lodz, Poland) and can perform a variety of tasks related to texture analysis [29,30,31,32]. For the purpose of our study, we calculated wavelet coefficient energy during a filtering cascade of rows and columns of data using high-pass filtering (Figure 2). Briefly, the linear transformation was performed on data vectors which had the length of an integer power of two. The vectors were transformed to the same length but numerically different vectors, after which the data was separated into various frequency components depending on the scale. Factor 2 subsampling was performed after a cascade of filterings was implemented on the data. We used different combinations of low-pass (L) and high-pass filters (H). For the details on the procedure, the reader is referred to the previously published works on the method [29,33].

The energy (En) was calculated as follows:

E n = \frac{\sum_{x, y \in R O I} {(d_{x, y}^{s u b b a n d})}^{2}}{n}

where subband locations are marked as x and y, and n represents the number of ROI pixels.

Fractal analysis was carried out in FracLac, V. 2.5—a platform designed for ImageJ software by A. Karperien, Charles Sturt University, Australia/Canada—previously used on numerous occasions for description and quantification of complex biological structures and phenomena [34,35]. For each ROI, after binarization, we calculated the value of the fractal dimension using the box-counting method. As explained in previous publications, this method applies a number of boxes over the structure at different scales (ε), after which a graph is created representing the logarithmic value of the number of boxes (N) at least partially filled with the structure (Figure 3). The fractal dimension is calculated based on the slope of the linear regression of log(1/ε) versus log N(ε) for all ε [20,35]. Fractal dimension may be viewed as an indirect measure of complexity and level of detail and previously was used to detect small, microscopic alterations in biological structures that are generally not visible using conventional means.

Raw data obtained from fractal, GLCM, and DWT analyses were statistically analyzed in SPSS (v. 25.0, IBM Corporation, Chicago, IL, USA). The multivariate analysis of variance (MANOVA) was used to determine whether there were any differences between the two groups of cells. Regarding the ML models, the raw data were later used as inputs for training and testing ML models. The first model was based on a support vector machine algorithm, a supervised learning non-probabilistic binary linear classifier. This model regards individual data points as a mathematical p-dimensional vector, and its main task is developing the ability to separate the data along a (p-1)-dimensional hyperplane(s). Support vector machines are commonly used both for the classification and regression of data in biological sciences, with frequent application in image analysis and prediction of biological phenomena based on two-dimensional signals.

The second machine learning model involved the development of random decision forests classifier, an ensemble method in which multiple decision trees are constructed and averaged for their prediction output. Generally, random forests are more capable of classification compared to individual trees, although the interpretability of the model may be reduced in some circumstances. As with the support vector machine, random forests are a form of supervised learning, meaning that the model learns when given a series of examples with known input and output. During the training, the model reveals a pattern of data organization or a rule that connects inputs with output.

Both models were trained in scikit-learn machine learning library for the Python programming language using Google Colaboratory—a platform which enables the scientist to write and execute Python code in a browser. The Colaboratory includes a hosted Jupyter notebook service which can be used to import various libraries and modules for machine learning. The target data of both trained models were the class of the cell which was set to either ‘0’ for the controls or ‘1’ for the cells exposed to the hyperosmotic environment. Approximately 80% of the sample was used for training, and 20% was used for model testing. Classification accuracy for both models was quantified using the scikit-learn “metrics” module [36]. Optimization of hyperparameters was performed with Grid Search (GridSearchCV module in scikit-learn). For the SVM classifier, it was determined that the optimal hyperparameters were radial basis function (RBF) kernel, “C” hyperparameter of 1, and “gamma” set to “scale”. For the random forests model, the “entropy” value was found to be the optimal “criterion” and “log 2” optimal for the maximal number of features. The optimal number of estimators was found to be 100. For the evaluation of the discriminatory power of the models, we used receiver operating characteristics (ROC) analysis, also in the scikit-learn “metrics” module, and the area under the ROC curve was determined. Furthermore, in scikit-learn, we calculated the classification accuracies of the models.

3. Results

3.1. Results of GLCM, Fractal, and DWT Analyses

One of the important objectives of our study was to determine to what extent exposure to a hyperosmotic environment leads to the changes in GLCM, fractal, and DWT indicators of yeast nuclear structure. The mean values of the nuclear angular second moment and inverse difference moment in the control group of cells (untreated cells) were 0.0015 ± 0.0011 and 0.167 ± 0.013, respectively (Table 1). In the experimental group (cells exposed to hyperosmotic stress conditions), a statistically significant reduction of ASM was observed (p < 0.01, Figure 4) to the average value of 0.00076 ± 0.00092, indicating the decrease of nuclear textural uniformity in the hyperosmotic environment. A similar reduction was noticed in the mean value of IDM, which equaled 0.154 ± 0.012 in the stressed cells. This result implied that osmotic stress leads to the reduction of local textural homogeneity in cell nuclei.

The correlation feature in GLCM analysis was also reduced from 0.0041 ± 0.0037 in the control group to 0.0019 ± 0.0023 in the experimental group (p < 0.01). This result was in line with the observed changes in ASM and IDM and indicated the increase of linear dependencies of gray levels on the other levels of the neighboring resolution units. On the other hand, there was a statistically highly significant increase in both contrast and variance. The mean value of the contrast textural feature in the control group was 51.03 ± 9.17, while in the experimental group, it was 59.87 ± 8.78 (p < 0.01). The average textural variance of the cell nuclei in controls was 507.17 ± 434.37; in the cells exposed to hyperosmotic conditions, it was 799.39 ± 363.96. The variance values showed the highest degree of variability of all quantified textural features.

The average value of DWT coefficient energy in controls was 0.250 ± 0.084, while in the experimental group, it significantly increased to 0.369 ± 0.089 (p < 0.05). On the other hand, the fractal dimension of the nuclear structure was reduced from 1.538 ± 0.142 to 1.454 ± 0.155 (p < 0.05). This result indicated that exposure to a hyperosmotic environment might be associated with the reduction of fractal complexity of nuclear structure. The scale of changes in DWT coefficient energy and fractal dimension values was much less pronounced when compared to the changes in GLCM features.

3.2. Machine Learning Models

Based on the fractal, wavelet, and GLCM data, both the support vector machine and random forests model were successfully trained and tested. The support vector machine had an acceptable classification accuracy of 71.7% in predicting whether the cell belonged to the experimental or the control group. The area under the ROC curve for this model was 0.74, indicating acceptable, although not excellent discriminatory power in separating treated from intact cells (Figure 5). The random forests model outperformed the support vector machine model since its classification accuracy was determined to be 79.8%. The area under the ROC curve for this model equaled 0.85, indicating a relatively good discriminatory power (Figure 6).

4. Discussion

Our study shows that exposing yeast cells to hyperosmotic stress results in significant changes in the nuclear texture, which can be quantified using GLCM and DWT methods, as well as significant changes in the nuclear fractal dimension. We also propose using GLCM, fractal, and DWT indicators as input data for machine learning models, such as support vector machines and random forests. The models trained on a relatively small sample achieved a decent level of classification accuracy and discriminatory power when distinguishing between treated and healthy cells. These findings and models can serve as a foundation for further developing AI-based methods for detecting osmotic shock injury in cells and their components.

In the field of cell biology, the use of computational methods, such as GLCM, DWT, and fractal analysis, to analyze the texture of cell nuclei is relatively new and has not been widely tested. There are several limitations and concerns regarding the sensitivity and validity of these methods. They can be applied to various cell populations, both in vivo and in vitro, and can provide information about structural homogeneity in cell and tissue micrographs [17,20,35]. However, it should be noted that textural homogeneity does not always correlate with homogeneity in histological terms. The use of DWT indicators for quantifying textural heterogeneity also requires further validation by future studies. Likewise, fractal analysis is a mathematical and biophysical method that can be used to infer the complexity of a signal, whether one-dimensional or two-dimensional, as in our study, but its potential applications in this field also remain to be confirmed by future research.

This is not the first time we have used Saccharomyces cerevisiae to create machine learning models for assessing cell damage. In a recent study, we examined the impact of sublethal doses of ethanol on GLCM indicators such as angular second moment and inverse difference [8]. Ethanol caused similar changes as hyperosmotic stress caused by NaCl, with a reduction in textural uniformity and local homogeneity of cell nuclei. In addition to GLCM analysis, we proposed machine learning models based on random trees, multilayer perceptron, and binomial logistic regression. All three models showed high classification accuracy, and the neural network performed best in terms of the area under the receiver operating characteristics curve (the AUC equaled 0.87). The changes in GLCM indicators are somewhat in accordance with the results of our current study since alcohol can cause hyperosmotic stress under certain conditions. However, we should note that these alterations in the nuclear structure are more likely to result from ethanol-induced damage to the genetic material of the cells or the reorganization of chromatin in nuclei due to the activation of specific signaling pathways associated with ethanol damage.

Fractal analysis has previously been used to indirectly quantify structural complexity and level of detail in micrographs of cells and tissues [20,21]. The analysis of neurons in the central and peripheral nervous system is perhaps the most extensive application of this method in microscopy, as fractal dimension can aid in the assessment of branching patterns of axons and dendrites. [37,38,39]. However, there have been several studies that tried to quantify fractal dimension and other fractal indicators in cell nuclei and chromatin. The fractal dimension of euchromatin and heterochromatin seems to differ. In recent years, there have been attempts to introduce the so-called “fractal globule” model of chromatin organization as an alternative to the conventional equilibrium model. Chromatin, as a macromolecule, and DNA possess certain self-similarity traits, and their fractality remains to be fully investigated. In our study, we applied fractal analysis solely to identify subtle morphological alterations in cell nuclei and to generate data for the ML model training.

Exposure to a hyperosmotic environment in Saccharomyces cerevisiae yeast cells leads to a significant reduction of cell volume and activation of numerous adaptation mechanisms [40,41,42]. These cells are exceptionally resilient to high tonicity; the viability is generally preserved, and the cells have numerous well-preserved genes that are involved in osmoprotection. Severe osmotic shock in yeast often leads to cell cycle arrest, growth inhibition, and reduced metabolic activity. Glycerol, trehalose, and erythritol as compatible solutes to NaCl are being produced to counteract the high osmolarity of the extracellular space. In addition, during hyperosmotic stress caused by NaCl or other osmotically active compounds, a number of stress-response pathways are activated, such as the high osmolarity glycerol (HOG) pathway or the stress-activated protein kinase pathway (SAPK). This all leads to significant changes in DNA transcription and gene expression and possibly to the reorganization of nuclear chromatin patterns. Furthermore, hyperosmotic stress may be associated with the increased production of reactive oxygen species and oxidative stress to the cell genetic material. While it is currently unclear which of these processes is responsible for the observed changes in computational texture indicators, it is plausible to speculate that the primary factor contributing to these changes is the epigenetic alterations that occur as part of the cell’s adaptation mechanisms to damage.

The most interesting aspect of our study is probably the fact that computational methods were able to detect subtle changes in cell morphology that were not clearly visible during standard microscopy. The analyzed cells did not show visible signs of programmed cell death, necrosis, or substantial nuclear injury. Even to the researcher with wide previous experience in the fields of microscopy and cell biology, cells from both groups appeared morphologically identical, and there was no subjective way of adequately separating and classifying them into the two groups. Some minor alterations in nuclear structure, such as the darkened areas on the nuclear periphery (visible in Figure 1), were not specific to the experimental group and could easily be a physiological variation characteristic of this cell type or a minor variation in light exposure and white balance during microscopy and cell acquisition. On the other hand, computational methods showed significant differences in fractal, textural, and wavelet indicators between the groups. This potentially demonstrates the power of these methods in detecting discrete morphological phenomena, increasing their scientific value in the field of pathology. In the future, these methods may be useful as part of computer-aided diagnostic systems in pathology for classifying different types of pathological cells or separating pathologically changed cells from intact cells in various experimental and clinical conditions. However, the various steps and other limitations of these methods will need to be addressed before this can happen. There are a few significant limitations of our study that may hamper its impact in the fields of microscopy and cell biology. As mentioned earlier, all the applied computational methods are relatively new in this area of research. They have not undergone rigorous testing for their validity, accuracy, and quality assurance in general. Inter- and intra-observer reliability of the methods is undetermined for most cell populations and tissues. Despite some efforts in the past to perform quality assurance, this remains a significant obstacle to the use of the techniques in contemporary histology and pathology. The second limitation is the relatively high degree of variability of the values of fractal and GLCM indicators across different softer platforms and under different experimental settings. For example, the values of the angular second moment and inverse difference moment can drastically differ when micrographs are created in different sizes and resolutions or when a different image acquisition system is used. The same applies to various microscope settings such as exposure, hue, saturation, and white balance, which may greatly depend on the type of microscope, the imaging device and default preferences set within imaging software. Finally, the fact that we were able to observe changes in computational indications in this yeast culture does not necessarily imply that the changes are present in other cultures, particularly when various fixation and staining protocols are applied. To draw definitive conclusions on the changes in chromatin textural patterns during hyperosmotic stress and the potential usefulness of pattern recognition approaches in this field, future studies should utilize specific staining procedures aimed at visualizing chromatin structure in yeast. In our research, we only quantified 5 GLCM indicators: angular second moment, inverse difference moment, textural contrast, correlation, and variance. However, GLCM and other similar forms of textural analysis can be used to obtain many more textural features. The examples include entropy, sum entropy, difference entropy, information measures of correlation, maximal correlation coefficient and other quantifications. According to the original work of Haralick et al. [43], a total of 28 textural features can be theoretically extracted from gray-tone spatial-dependence matrices, and today different computing platforms can be used to quantify the majority of them. Future studies would need to use all possible features to design the ML models with the best performance before this approach can be included in contemporary cell biology and pathology practice. The same reasoning applies to fractal analysis, where other features can also be quantified apart from the fractal dimension. The most important example would be the lacunarity feature, a measure of the level of “gappiness” within a fractal, which is frequently calculated alongside the fractal dimension to provide better insight into the changes in complexity. In the future, it would be interesting to see the classification accuracy and discriminatory power of the RF and SVM models constructed with the combination of lacunarity and GLCM data as inputs.

The machine learning approach applied in our study also has certain limitations that need to be discussed. Support vector machine and random forest are just two of many supervised ML algorithms that can be trained by presenting a series of examples of input and (correct) output or target data [44]. Other models that are also potentially valuable include neural networks, various decision trees other than random forest, as well as models that rely on binomial logistic regression analysis. Some of these approaches may be better for the identification of specific patterns within the GLCM and DWT data and may yield higher discriminatory power and classification accuracy when trained in this setting. In the future, it would be advisable to develop and compare all possible ML models, after which the best one could be deployed as a web or other application. In our study, the samples used for training and testing were relatively low. It is possible that with a larger amount of especially GLCM data, one could develop a complex model (i.e., the one based on a multilayer perceptron network) that would have outstanding performance in cell classification. Finally, another important limitation concerning ML relates to the fact that ML models, in general, suffer from low interpretability. Random forest and support vector machines are no exception, and although we may obtain interesting results on their performance, it is still difficult to explain how the model actually functions and what actual inner mechanisms lead to its decisions in cell classification. This, along with the abovementioned general lack of quality assurance of computational techniques for data generation, greatly limits the overall reproducibility of the results. Future works will have to be focused on how to resolve these issues before these types of models become ready to be integrated into contemporary research and diagnostic protocols in pathology and other fields.

5. Conclusions

In conclusion, machine learning models such as those based on random forests and support vector machines can be trained using GLCM and DWT data to identify yeast cells that have been previously exposed to a hyperosmotic environment. Osmotic stress induces significant changes in nuclear textural indicators, suggesting that GLCM and DWT methods can detect subtle structural alterations in cell nuclei under these experimental conditions. Our study highlights the significance of textural analysis computational methods and machine learning approaches in the morphological assessment of yeast cells and presents potentially useful data for future research in the fields of cellular physiology and pathology.

Author Contributions

Conceptualization, I.P. (Igor Pantic), J.C., S.V., P.R.C., T.T. and I.P. (Ivana Paunkovic); Methodology, I.P. (Igor Pantic); Software, I.P. (Igor Pantic); Validation, I.P. (Igor Pantic); Resources, I.P. (Igor Pantic), J.C., S.V., P.R.C., T.T. and I.P. (Ivana Paunkovic); Writing—Original Draft Preparation, I.P. (Igor Pantic), J.C., S.V., P.R.C., T.T. and I.P. (Ivana Paunkovic); Writing—Review & Editing, I.P. (Igor Pantic), J.C., S.V., P.R.C., T.T. and I.P. (Ivana Paunkovic); Funding Acquisition, I.P. (Igor Pantic) All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Science Fund of the Republic of Serbia, grant No. 7739645, “Automated sensing system based on fractal, textural and wavelet computational methods for detection of low-level cellular damage”, SensoFracTW and the Ministry of Education and Science of the Republic of Serbia, grant no. 200110. Support for this project was also provided by Khalifa University of Science and Technology, Grant Numbers: FSU-2020-25 and RC2-2018-022 (HEIC), and the College of Medicine and Health Sciences, Abu Dhabi, United Arab Emirates.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest. The funding sponsors had no role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

References

AlKubeyyer, A.; Ben Ismail, M.M.; Bchir, O.; Alkubeyyer, M. Automatic detection of the meningioma tumor firmness in MRI images. J. X-Ray Sci. Technol. 2020, 28, 659–682. [Google Scholar] [CrossRef] [PubMed]
Althubiti, S.A.; Paul, S.; Mohanty, R.; Mohanty, S.N.; Alenezi, F.; Polat, K. Ensemble Learning Framework with GLCM Texture Extraction for Early Detection of Lung Cancer on CT Images. Comput. Math. Methods Med. 2022, 2022, 2733965. [Google Scholar] [CrossRef]
Alyami, J.; Sadad, T.; Rehman, A.; Almutairi, F.; Saba, T.; Bahaj, S.A.; Alkhurim, A. Cloud Computing-Based Framework for Breast Tumor Image Classification Using Fusion of AlexNet and GLCM Texture Features with Ensemble Multi-Kernel Support Vector Machine (MK-SVM). Comput. Intell. Neurosci. 2022, 2022, 7403302. [Google Scholar] [CrossRef]
Anand, L.; Mewada, S.; Shamsi, W.; Ritonga, M.; Aflisia, N.; KumarSarangi, P.; NdoleArthur, M. Diagnosis of Prostate Cancer Using GLCM Enabled KNN Technique by Analyzing MRI Images. BioMed. Res. Int. 2023, 2023, 3913351. [Google Scholar] [CrossRef] [PubMed]
Anwar, S.M.; Majid, M.; Qayyum, A.; Awais, M.; Alnowami, M.; Khan, M.K. Medical Image Analysis using Convolutional Neural Networks: A Review. J. Med. Syst. 2018, 42, 226. [Google Scholar] [CrossRef] [Green Version]
Yu, K.H.; Beam, A.L.; Kohane, I.S. Artificial intelligence in healthcare. Nat. Biomed. Eng. 2018, 2, 719–731. [Google Scholar] [CrossRef]
Pantic, I.; Cumic, J.; Dugalic, S.; Petroianu, G.; Corridon, P. Gray level co-occurrence matrix and wavelet analyses reveal discrete changes in proximal tubule cell nuclei after mild acute kidney injury. Sci. Rep. 2022, 13, 4025. [Google Scholar] [CrossRef]
Davidovic, L.M.; Cumic, J.; Dugalic, S.; Vicentic, S.; Sevarac, Z.; Petroianu, G.; Corridon, P.; Pantic, I. Gray-Level Co-occurrence Matrix Analysis for the Detection of Discrete, Ethanol-Induced, Structural Changes in Cell Nuclei: An Artificial Intelligence Approach. Microsc. Microanal. Off. J. Microsc. Soc. Am. Microbeam Anal. Soc. Microsc. Soc. Can. 2021, 28, 265–271. [Google Scholar] [CrossRef]
Dimitriadis, I.; Zaninovic, N.; Badiola, A.C.; Bormann, C.L. Artificial intelligence in the embryology laboratory: A review. Reprod. Biomed. Online 2021, 44, 435–448. [Google Scholar] [CrossRef]
Hudson, I.L. Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology. Methods Mol. Biol. 2021, 2190, 167–184. [Google Scholar] [CrossRef] [PubMed]
Shah, S.M.; Khan, R.A.; Arif, S.; Sajid, U. Artificial intelligence for breast cancer analysis: Trends & directions. Comput. Biol. Med. 2022, 142, 105221. [Google Scholar] [CrossRef]
Pantic, I.V.; Shakeel, A.; Petroianu, G.A.; Corridon, P.R. Analysis of Vascular Architecture and Parenchymal Damage Generated by Reduced Blood Perfusion in Decellularized Porcine Kidneys Using a Gray Level Co-occurrence Matrix. Front. Cardiovasc. Med. 2022, 9, 797283. [Google Scholar] [CrossRef] [PubMed]
Alkhodari, M.; Fraiwan, L. Convolutional and recurrent neural networks for the detection of valvular heart diseases in phonocardiogram recordings. Comput. Methods Programs Biomed. 2021, 200, 105940. [Google Scholar] [CrossRef] [PubMed]
Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the 3rd International Conference on Learning Representations, ICLR Conference 2015, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
Zhang, T.; Zeng, Y.; Zhang, Y.; Zhang, X.; Shi, M.; Tang, L.; Zhang, D.; Xu, B. Neuron type classification in rat brain based on integrative convolutional and tree-based recurrent neural networks. Sci. Rep. 2021, 11, 7291. [Google Scholar] [CrossRef]
Tan, J.; Gao, Y.; Liang, Z.; Cao, W.; Pomeroy, M.J.; Huo, Y.; Li, L.; Barish, M.A.; Abbasi, A.F.; Pickhardt, P.J. 3D-GLCM CNN: A 3-Dimensional Gray-Level Co-Occurrence Matrix-Based CNN Model for Polyp Classification via CT Colonography. IEEE Trans. Med. Imaging 2020, 39, 2013–2024. [Google Scholar] [CrossRef] [PubMed]
Tan, T.C.; Ritter, L.J.; Whitty, A.; Fernandez, R.C.; Moran, L.J.; Robertson, S.A.; Thompson, J.G.; Brown, H.M. Gray level Co-occurrence Matrices (GLCM) to assess microstructural and textural changes in pre-implantation embryos. Mol. Reprod. Dev. 2016, 83, 701–713. [Google Scholar] [CrossRef]
Vidya, K.S.; Ng, E.Y.; Acharya, U.R.; Chou, S.M.; Tan, R.S.; Ghista, D.N. Computer-aided diagnosis of Myocardial Infarction using ultrasound images with DWT, GLCM and HOS methods: A comparative study. Comput. Biol. Med. 2015, 62, 86–93. [Google Scholar] [CrossRef]
Gupta, S.; Savala, R.; Gupta, N.; Dey, P. Fractal dimension and chromatin textural analysis to differentiate follicular carcinoma and adenoma on fine needle aspiration cytology. Cytopathol. Off. J. Br. Soc. Clin. Cytol. 2020, 31, 491–493. [Google Scholar] [CrossRef]
Mattos, A.C.; Florindo, J.B.; Adam, R.L.; Lorand-Metze, I.; Metze, K. The Fractal Dimension Suggests Two Chromatin Configurations in Small Cell Neuroendocrine Lung Cancer and Is an Independent Unfavorable Prognostic Factor for Overall Survival. Microsc. Microanal. Off. J. Microsc. Soc. Am. Microbeam Anal. Soc. Microsc. Soc. Can. 2022, 28, 522–526. [Google Scholar] [CrossRef]
Metze, K.; Adam, R.; Florindo, J.B. The fractal dimension of chromatin—A potential molecular marker for carcinogenesis, tumor progression and prognosis. Expert Rev. Mol. Diagn. 2019, 19, 299–312. [Google Scholar] [CrossRef]
Brocker, C.; Thompson, D.C.; Vasiliou, V. The role of hyperosmotic stress in inflammation and disease. Biomol. Concepts 2012, 3, 345–364. [Google Scholar] [CrossRef] [PubMed]
Colin, L.; Ruhnow, F.; Zhu, J.K.; Zhao, C.; Zhao, Y.; Persson, S. The cell biology of primary cell walls during salt stress. Plant Cell 2023, 35, 201–217. [Google Scholar] [CrossRef]
Reiling, J.H.; Sabatini, D.M. Stress and mTORture signaling. Oncogene 2006, 25, 6373–6383. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Sadowska, A.; Kameda, T.; Krupkova, O.; Wuertz-Kozak, K. Osmosensing, osmosignalling and inflammation: How intervertebral disc cells respond to altered osmolarity. Eur. Cells Mater. 2018, 36, 231–250. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Itto-Nakama, K.; Watanabe, S.; Ohnuki, S.; Kondo, N.; Kikuchi, R.; Nakamura, T.; Ogasawara, W.; Kasahara, K.; Ohya, Y. Prediction of ethanol fermentation under stressed conditions using yeast morphological data. J. Biosci. Bioeng. 2023, 135, 210–216. [Google Scholar] [CrossRef] [PubMed]
Pereira, D.R.; Papa, J.P.; Saraiva, G.F.R.; Souza, G.M. Automatic classification of plant electrophysiological responses to environmental stimuli using machine learning and interval arithmetic. Comput. Electron. Agric. 2018, 145, 35–42. [Google Scholar] [CrossRef] [Green Version]
Santos, T.A.; Maistro, C.E.; Silva, C.B.; Oliveira, M.S.; Franca, M.C., Jr.; Castellano, G. MRI Texture Analysis Reveals Bulbar Abnormalities in Friedreich Ataxia. AJNR Am. J. Neuroradiol. 2015, 36, 2214–2218. [Google Scholar] [CrossRef] [Green Version]
Kociołek, M.; Materka, A.; Strzelecki, M.; Szczypinski, P. Discrete wavelet transform—Derived features for digital image texture analysis. In Proceedings of the International Conference on Signals and Electronic Systems, Lodz, Poland, 18–21 September 2001; pp. 163–168. [Google Scholar]
Strzelecki, M.; Szczypinski, P.; Materka, A.; Klepaczko, A. A software tool for automatic classification and segmentation of 2D/3D medical images. Nucl. Instrum. Methods Phys. Res. A 2013, 702, 137–140. [Google Scholar] [CrossRef]
Szczypinski, P.; Strzelecki, M.; Materka, A. MaZda—A Software for Texture Analysis. In Proceedings of the 2007 International Symposium on Information Technology Convergence, ISITC 2007, Jeonju, Republic of Korea, 23–24 November 2007; pp. 245–249. [Google Scholar]
Szczypinski, P.; Strzelecki, M.; Materka, A.; Klepaczko, A. MaZda-A software package for image texture analysis. Comput. Methods Programs Biomed. 2009, 94, 66–76. [Google Scholar] [CrossRef]
Mallat, S. A Wavelet Tour of Signal Processing; Academic Press: San Diego, CA, USA, 1998. [Google Scholar]
Karperien, A. FracLac for ImageJ. Available online: http://rsb.info.nih.gov/ij/plugins/fraclac/FLHelp/Introduction.htm (accessed on 28 January 2023).
Dincic, M.; Todorovic, J.; Nesovic Ostojic, J.; Kovacevic, S.; Dunderovic, D.; Lopicic, S.; Spasic, S.; Radojevic-Skodric, S.; Stanisavljevic, D.; Ilic, A.Z. The Fractal and GLCM Textural Parameters of Chromatin May Be Potential Biomarkers of Papillary Thyroid Carcinoma in Hashimoto’s Thyroiditis Specimens. Microsc. Microanal. Off. J. Microsc. Soc. Am. Microbeam Anal. Soc. Microsc. Soc. Can. 2020, 26, 717–730. [Google Scholar] [CrossRef]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Kim, J.; Kwon, N.; Chang, S.; Kim, K.T.; Lee, D.; Kim, S.; Yun, S.J.; Hwang, D.; Kim, J.W.; Hwu, Y.; et al. Altered branching patterns of Purkinje cells in mouse model for cortical development disorder. Sci. Rep. 2011, 1, 122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Cozzini, T.; Piona, C.; Marchini, G.; Merz, T.; Brighenti, T.; Bonetto, J.; Marigliano, M.; Olivieri, F.; Maffeis, C.; Pedrotti, E. In vivo confocal microscopy study of corneal nerve alterations in children and youths with Type 1 diabetes. Pediatr. Diabetes 2021, 22, 780–786. [Google Scholar] [CrossRef] [PubMed]
Li, Q.; Wang, X.; Wang, Z.H.; Lin, Z.; Yang, J.; Chen, J.; Wang, R.; Ye, W.; Li, Y.; Wu, Y.; et al. Changes in dendritic complexity and spine morphology following BCG immunization in APP/PS1 mice. Hum. Vaccines Immunother. 2022, 18, 2121568. [Google Scholar] [CrossRef] [PubMed]
Blomberg, A. Yeast osmoregulation—Glycerol still in pole position. FEMS Yeast Res. 2022, 22, foac035. [Google Scholar] [CrossRef]
Saxena, A.; Sitaraman, R. Osmoregulation in Saccharomyces cerevisiae via mechanisms other than the high-osmolarity glycerol pathway. Microbiology 2016, 162, 1511–1526. [Google Scholar] [CrossRef]
de Nadal, E.; Posas, F. The HOG pathway and the regulation of osmoadaptive responses in yeast. FEMS Yeast Res. 2022, 22, foac013. [Google Scholar] [CrossRef]
Haralick, R.M.; Shanmugam, K.; Dinstein, I. Textural Features for Image Classification. IEEE Trans. Syst. Man Cybern. 1973, SMC-3, 610–621. [Google Scholar] [CrossRef] [Green Version]
Pantic, I.; Paunovic, J.; Cumic, J.; Valjarevic, S.; Petroianu, G.A.; Corridon, P.R. Artificial neural networks in contemporary toxicology research. Chem. Biol. Interact. 2023, 369, 110269. [Google Scholar] [CrossRef]

$Fractalfract 07 00272 g001 550$

Figure 1. Saccharomyces cerevisiae yeast cells exposed to a hyperosmotic environment (right half of the image, 6 cells) and control cells (left half of the image, 6 cells). Although microscopically, no significant morphological differences can be observed, nuclear ROIs of these cells have different values of GLCM, fractal and wavelet indicators. For example, the first cell of the control group (upper left cell, marked with the black arrow) had a nuclear IDM of 0.163 and a fractal dimension of 1.551. The morphologically similar cell exposed to a hyperosmotic environment (right half of the image, marked with the white arrow) had an IDM value of 0.149 and a fractal dimension of 1.434.

$Fractalfract 07 00272 g001$

$Fractalfract 07 00272 g002 550$

Figure 2. Process of a cascade of filterings during DWT analysis.

$Fractalfract 07 00272 g002$

$Fractalfract 07 00272 g003 550$

Figure 3. The fractal dimension of ROIs was calculated based on the slope of the regression line of log(1/ε) versus log N(ε) for all scales. The value of the fractal dimension in this example is 1.6142. The standard deviations for the experimental and control groups of cells were 0.155 and 0.142, respectively.

$Fractalfract 07 00272 g003$

$Fractalfract 07 00272 g004 550$

Figure 4. Average values of nuclear GLCM indicators in cells exposed to hyperosmotic environment and controls.

$Fractalfract 07 00272 g004$

$Fractalfract 07 00272 g005 550$

Figure 5. Receiver operating characteristics curve for the support vector machine model.

$Fractalfract 07 00272 g005$

$Fractalfract 07 00272 g006 550$

Figure 6. Receiver operating characteristics curve for the random forests model.

$Fractalfract 07 00272 g006$

Table 1. Average values of GLCM, fractal and DWT indicators of nuclear ROIs on cells exposed to hyperosmotic environment and controls. * p < 0.05 ** p < 0.01.

	Osmotic Stress	Controls
Angular second moment	0.00076 ± 0.00092 **	0.0015 ± 0.0011
Inverse difference moment	0.154 ± 0.012 **	0.167 ± 0.013
Contrast	59.87 ± 8.78 **	51.03 ± 9.17
Correlation	0.0019 ± 0.0023 **	0.0041 ± 0.0037
Variance	799.39 ± 363.96 **	507.17 ± 434.37
Fractal dimension	1.454 ± 0.155 **	1.538 ± 0.142
DWT coefficient energy	0.369 ± 0.089 *	0.250 ± 0.084

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pantic, I.; Valjarevic, S.; Cumic, J.; Paunkovic, I.; Terzic, T.; Corridon, P.R. Gray Level Co-Occurrence Matrix, Fractal and Wavelet Analyses of Discrete Changes in Cell Nuclear Structure following Osmotic Stress: Focus on Machine Learning Methods. Fractal Fract. 2023, 7, 272. https://doi.org/10.3390/fractalfract7030272

AMA Style

Pantic I, Valjarevic S, Cumic J, Paunkovic I, Terzic T, Corridon PR. Gray Level Co-Occurrence Matrix, Fractal and Wavelet Analyses of Discrete Changes in Cell Nuclear Structure following Osmotic Stress: Focus on Machine Learning Methods. Fractal and Fractional. 2023; 7(3):272. https://doi.org/10.3390/fractalfract7030272

Chicago/Turabian Style

Pantic, Igor, Svetlana Valjarevic, Jelena Cumic, Ivana Paunkovic, Tatjana Terzic, and Peter R. Corridon. 2023. "Gray Level Co-Occurrence Matrix, Fractal and Wavelet Analyses of Discrete Changes in Cell Nuclear Structure following Osmotic Stress: Focus on Machine Learning Methods" Fractal and Fractional 7, no. 3: 272. https://doi.org/10.3390/fractalfract7030272

Article Menu

Gray Level Co-Occurrence Matrix, Fractal and Wavelet Analyses of Discrete Changes in Cell Nuclear Structure following Osmotic Stress: Focus on Machine Learning Methods

Abstract

1. Introduction

2. Materials and Methods

3. Results

3.1. Results of GLCM, Fractal, and DWT Analyses

3.2. Machine Learning Models

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI