Article

Monitoring Disease Severity of Mild Cognitive Impairment from Single-Channel EEG Data Using Regression Analysis

1
Department of Electrical and Computer Engineering, University of Memphis, Memphis, TN 38152, USA
2
Computer Science Department, Texas Tech University, Lubbock, TX 79409, USA
3
School of Communication Sciences and Disorders, University of Memphis, Memphis, TN 38152, USA
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(4), 1054; https://doi.org/10.3390/s24041054
Submission received: 22 December 2023 / Revised: 28 January 2024 / Accepted: 2 February 2024 / Published: 6 February 2024
(This article belongs to the Special Issue Deep Learning-Based Neural Networks for Sensing and Imaging)

Abstract

A deviation in the soundness of cognitive health is known as mild cognitive impairment (MCI), and it is important to monitor it early to prevent complicated diseases such as dementia, Alzheimer’s disease (AD), and Parkinson’s disease (PD). Traditionally, MCI severity is monitored with manual scoring using the Montreal Cognitive Assessment (MoCA). In this study, we propose a new MCI severity monitoring algorithm based on regression analysis of features extracted from single-channel electroencephalography (EEG) data, automatically generating severity scores equivalent to MoCA scores. We evaluated both multi-trial and single-trial analyses during algorithm development. For the multi-trial analysis, 590 features were extracted from the prominent event-related potential (ERP) points and the corresponding time-domain characteristics, and the lasso regression technique was used to select the best feature set. The 13 best features were used in the classical regression techniques: multivariate regression (MR), ensemble regression (ER), support vector regression (SVR), and ridge regression (RR). The best overall results were observed for ER, with an RMSE of 1.6, as confirmed by residual analysis. In the single-trial analysis, we extracted a time–frequency plot image from each trial and fed it as input to a constructed convolutional deep neural network (CNN). This deep CNN model achieved an RMSE of 2.76. To our knowledge, this is the first attempt to generate automated MCI severity scores equivalent to MoCA from single-channel EEG data using both multi-trial and single-trial data.

1. Introduction

Memory disorders bring difficulties in performing Basic Activities of Daily Living (BADL), such as walking, eating, dressing, showering, feeding, continence, and entertainment, and Instrumental Activities of Daily Living (IADL), such as preparing meals, solving everyday situations, managing finances, doing laundry, and taking medications [1]. Mild cognitive impairment (MCI) is considered to be the state between normal cognition and dementia [2]. There are many benefits to early detection of MCI. MCI has a 54% chance of progressing to Alzheimer’s disease (AD) or related dementia [3]. MCI reduces the ability of elderly people to perform daily activities and to live an independent life. In some cases, patients suffering from complicated diseases (e.g., Parkinson’s disease) may develop MCI later [4], and monitoring the progression of cognitive impairment is crucial for their caregivers, doctors, and families. MCI may be associated with cardiovascular disease, metabolic syndrome, type 2 diabetes, sedentary activity, obesity, excess alcohol, and smoking [5].
According to the Alzheimer’s Association, persons who already have cognitive impairment based on clinical observations, or who have self-reported concerns, should undergo a cognitive impairment assessment. For cognitive health screening, they are recommended to take any one of the following cognitive screening tests: (a) Montreal Cognitive Assessment (MoCA), (b) Mini-Mental State Examination (MMSE), (c) St. Louis University Mental Status Exam (SLUMS), (d) General Practitioner Assessment of Cognition (GPCOG), (e) Mini-Cog, (f) Memory Impairment Screen (MIS), (g) AD8, or (h) Informant Questionnaire on Cognitive Decline in the Elderly (short-IQCODE) [6]. These tests are manual and are conducted by experienced clinicians or caregivers. Studies comparing the performance of these cognitive assessment tests have shown that when a patient’s cognitive impairment extends beyond memory impairment, MoCA should be used [7]. Furthermore, some studies suggest that MoCA is more sensitive than the widely used MMSE [7,8]. Researchers have predicted cognitive scores based on physiological data, such as an electroencephalogram (EEG), in a motivation setting, to help monitor a person’s cognitive health easily and at times when a patient is unable to undergo a manual cognitive assessment test. However, all prior research on measuring the severity of cognitive impairment has been based on features extracted from multi-channel EEG systems: an EEG power ratio from a 16-channel system by Bennys et al. [9], EEG power spectra and other information from a 16-channel system by Kowalski et al. [10], EEG power of prominent bands from a 20-channel EEG system [11], spectral characteristics of EEG from a 20-channel system [12], an EEG power ratio from a 19-channel system [13], the grand total EEG score from a 16-channel system [14], etc. EEG rhythms have been shown to capture brain neural synchronization and functional coupling in Alzheimer’s disease [15] and to reveal cortical sources of resting-state activity in Parkinson’s disease related dementia and Alzheimer’s disease [16]. Also, brain oscillatory patterns in MCI due to Alzheimer’s and Parkinson’s disease have been detected with high-density EEG [17]. Thus, this work on MCI severity detection, which generates MoCA-equivalent scores from a single-channel EEG system in unsupervised settings, is novel.
In this work, we have performed objective generation of MCI severity scores based on single-channel EEG data collected from the Fz location. Our analysis is divided into two parts: (a) multi-trial and (b) single-trial. In the multi-trial analysis, we used classical regression techniques with features extracted from the grand average event-related potential (ERP). In the single-trial analysis, we determined MCI severity with a convolutional deep neural network, using the time–frequency plot image of each trial as the network input. Our key contribution is a pair of proposed models: a multi-trial approach (best suited to an offline system) that uses multiple trials in aggregate, and a single-trial approach (best suited to a real-time system with streaming data) that uses each new trial in isolation, both predicting the severity of cognitive impairment so that cognitive health can be assessed continually.

2. Method

2.1. Participants

Twenty-three older adults were recruited in a study designed to observe the effect of aging on the auditory system and to evaluate the cognitive performance of the subjects. EEG data were collected from these twenty-three older adults (ages 52 to 86 years; mean ± standard deviation: 70.2 ± 7.2 years) while they received auditory stimuli. None of the participants had a known history of neurological or psychiatric illness, and all were strongly right handed [18]. We anticipate that the outcome would not change for left-handed patients; however, no left-handed subjects were available at the time of recruitment, and this can be a focus of a future study. During the participant recruitment process, exclusion criteria were applied based on age, hearing loss, musical training, and handedness [19]. Written consent under the protocol approved by the Baycrest Centre Ethics Committee (IRB number: REB #06-31) was collected from each participant before data collection, and participants were compensated for their time afterwards. The cognitive health status of the participants was assessed with the well-established cognitive screening tool, the Montreal Cognitive Assessment (MoCA) test [20]. In this screening test, fifteen participants (8 male, 7 female) were found to have normal cognition (MoCA score ≥ 26 points; mean ± standard deviation: 27.6 ± 1.18; range: 26–30), and eight participants (4 male, 4 female) were assessed to have MCI (MoCA score < 26; mean ± standard deviation: 23.0 ± 1.85; range: 20–25). It is also worth noting that, in general, patients with Alzheimer’s disease or more severe dementia generate MoCA scores in the range of 11.4 to 21 [20].

2.2. Study Design

This study was designed so that the subjects would hear five auditory stimuli. Stimuli were constructed along a perceptual phonetic continuum from /u/ to /a/ by varying the first formant (F1) frequency parametrically between 430 and 730 Hz over five equal steps (for further stimulus details, see [21]). The synthetic five-step vowel continuum (denoted hereafter as “vw1-5”) was built so that each 100 ms token would differ minimally in its acoustics yet still be perceived categorically [22,23]. Because hearing loss due to aging may alter auditory-evoked potentials from the cortical region [23] and may affect the participants’ responses, audiometric testing was performed. The audiometric testing demonstrated that hearing thresholds were not distinguishable between the groups (Normal and MCI) at octave frequencies between 250 and 4000 Hz [19], which is well outside of the bandwidth of the stimuli.

2.3. Collecting Data and Processing Event-Related Potentials

The data collection technique and response evaluation employed in this study are similar to the methodologies reported in previous research [24]. During EEG recording, the participants underwent 200 trials for each sound token within an electro-acoustically shielded chamber (Industrial Acoustics, Inc., North Aurora, IL, USA). Stimuli were delivered through earphones (ER-3A, Etymotic Research, Elk Grove Village, IL, USA) in both ears at an intensity of 83 dB SPL. To mitigate electromagnetic stimulus artifacts, extended acoustic tubing (50 cm) was utilized [25]. Sound tokens were presented randomly, and participants were tasked with rapidly categorizing each with a binary response (“u” or “a”). Each sound token had a duration of 100 ms, with 10 ms rise and fall times to minimize spectral splatter [23]. Following the participant’s response, a randomly jittered inter-stimulus interval (ISI) of 400–600 ms (20 ms steps, rectangular distribution) was inserted to prevent subjects from anticipating subsequent stimuli [26]. EEG data were captured using SynAmps RT EEG amplifiers (Compumedics Neuroscan, Charlotte, NC, USA), a system that supports 64-channel EEG recording. The participants were in a magnetically shielded room while data collection occurred. EEG data were recorded from an electrode placed high on the forehead at the hairline, referenced to linked mastoids; for recording auditory-evoked potentials of cortical origin, the Fpz–A1/A2 montage was considered optimal [27,28]. Throughout the data collection procedure, contact impedances were maintained below 3 kΩ, the EEG signal was sampled at 20 kHz, and raw signals were band-pass filtered with a passband of 0.05–3500 Hz.
The EEG data then underwent processing using ERPLAB, an open-source toolbox operating within the MATLAB environment [29]. Each EEG epoch consisted of a 700 ms (100 ms pre-stimulus and 600 ms post-stimulus) interval, as depicted in Figure 1 [3]. The pre-stimulus region was used as the reference, and the subtraction method was used for baseline correction [30]. Trials surpassing ±50 μV were excluded from analysis by the threshold method, as they were likely tainted by various artifacts, such as eye blinks and eye movements, as mentioned in previous studies [31,32]. Artifact-free epochs from each auditory stimulus were utilized to compute the grand average, called event-related potentials (ERPs), representing the average across 200 trials. Subsequently, the grand average ERP underwent bandpass filtering within the range of 0 to 30 Hz, aligning with a priori knowledge of ERP bandwidths and the stimuli [33]. A visual representation of the extracted ERP from multiple trials of a representative subject (ID 3610) is illustrated in Figure 1.
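A minimal sketch of this epoching, baseline-correction, artifact-rejection, and averaging pipeline is given below. It uses Python/NumPy/SciPy rather than the ERPLAB/MATLAB toolchain actually used in the study; the epoch boundaries, the ±50 μV rejection threshold, and the 0–30 Hz ERP band follow the description above, while the function name, the filter order, and the array conventions are assumptions.

```python
import numpy as np
from scipy.signal import butter, filtfilt

FS = 20_000                      # sampling rate (Hz), as reported above
PRE, POST = 0.100, 0.600         # 100 ms pre-stimulus, 600 ms post-stimulus

def extract_erp(eeg, stim_onsets_s, reject_uv=50.0):
    """Epoch, baseline-correct, reject artifacts, and average into an ERP.

    eeg           : 1-D array of single-channel EEG in microvolts
    stim_onsets_s : stimulus onset times in seconds (one per trial)
    """
    n_pre, n_post = int(PRE * FS), int(POST * FS)
    epochs = []
    for t0 in stim_onsets_s:
        i = int(t0 * FS)
        ep = eeg[i - n_pre : i + n_post].astype(float)
        ep -= ep[:n_pre].mean()               # subtraction baseline correction
        if np.abs(ep).max() <= reject_uv:     # drop trials exceeding +/-50 uV
            epochs.append(ep)
    erp = np.mean(epochs, axis=0)             # average over artifact-free trials
    # 30 Hz low-pass approximating the 0-30 Hz ERP band; filter order is assumed
    b, a = butter(4, 30 / (FS / 2), btype="low")
    return filtfilt(b, a, erp)
```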

3. Multi-Trial Analysis

3.1. Extraction and Ranking of Features

Multi-trial analysis uses multiple trial data to predict the MoCA score offline, while single-trial analysis uses single trial data to predict MoCA in real time. We extracted a total of 590 candidate features from the prominent points of the ERP and the characteristics in both the time and spectral domains. From this pool, the top 25 features were selected for integration into the classification models, determined by ranking using the random forest algorithm. The cortical auditory-evoked responses indicated that prominent ERP points possess discriminatory power in distinguishing between normal and MCI stages among older adults [34]. Hence, we included ERP prominent points, such as Pa, P1, N1, and P2, in the candidate feature vector (CFV). These ERP prominent points were defined as peaks within the intervals {25–35 ms}, {60–80 ms}, {90–110 ms}, and {150–250 ms}, respectively [3], as illustrated in Figure 2. The CFV incorporates not only the peak amplitudes and their corresponding latencies but also the mean amplitudes of the intervals containing these prominent points. This inclusion is based on their recognized significance in distinguishing between groups in experiments involving evoked responses [35]. Furthermore, the CFV encompasses the relative powers in EEG bands (i.e., delta, theta, alpha, and beta rhythms), known to be valuable in classifying normal, mild cognitive impaired, and Alzheimer’s disease groups [36]. This addition results in a total of 16 features calculated from the ERP prominent points for each stimulus, contributing to a cumulative 80 feature points in the CFV.
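To illustrate how the prominent-point portion of the CFV could be assembled, the hedged sketch below extracts the peak amplitude, latency, and interval mean for each component window and adds the relative band powers. The window limits and band definitions follow the text; the function names, the Welch-style spectral estimate, and the treatment of N1 as a trough are assumptions rather than the study's exact implementation.

```python
import numpy as np
from scipy.signal import welch

FS_ERP = 20_000  # ERP sampling rate (Hz); assumed to match the raw recording
WINDOWS = {"Pa": (0.025, 0.035), "P1": (0.060, 0.080),
           "N1": (0.090, 0.110), "P2": (0.150, 0.250)}
BANDS = {"delta": (0.5, 4), "theta": (4, 8), "alpha": (8, 13), "beta": (13, 30)}

def prominent_point_features(erp, t):
    """erp: grand-average ERP (uV); t: time axis in seconds (0 = stimulus onset)."""
    feats = {}
    for name, (lo, hi) in WINDOWS.items():
        mask = (t >= lo) & (t <= hi)
        seg, seg_t = erp[mask], t[mask]
        k = np.argmin(seg) if name == "N1" else np.argmax(seg)  # N1 treated as a trough
        feats[f"{name}_amp"] = seg[k]          # peak amplitude
        feats[f"{name}_lat"] = seg_t[k]        # peak latency
        feats[f"{name}_mean"] = seg.mean()     # mean amplitude of the interval
    # relative power in the classical EEG bands from the ERP spectrum
    f, pxx = welch(erp, fs=FS_ERP, nperseg=len(erp))
    total = pxx[(f >= 0.5) & (f <= 30)].sum()
    for band, (lo, hi) in BANDS.items():
        m = (f >= lo) & (f <= hi)
        feats[f"rel_{band}"] = pxx[m].sum() / total
    return feats  # 12 + 4 = 16 feature points per stimulus, as in the text
```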
Given the rapid temporal changes in ERPs during the application of a stimulus, it is crucial to capture the high-resolution variations in time-domain characteristics. To achieve this, we implemented a 25 ms window with a 50% overlap across the entire ERP signal, as illustrated in Figure 2. The sliding window methodology enables the observation of time-domain property variations, a factor previously shown to be significant in distinguishing between MCI and normal subjects in earlier studies [19]. A total of 107 time-domain characteristics, including signal statistics, correlation properties, and entropies, were computed from each window using the open-source software package “HCTSAtool” [29] in MATLAB (r2018b). For details on the extracted time-domain characteristics, please refer to [31,32,33]. To assess the variation in time-domain characteristics over time, we calculated the slope and the coefficient of variation (CV) from two feature points at each time stamp for every stimulus. In the selection process for the CFV, feature points demonstrating both intra-class similarity and inter-class variability, determined through visual inspection, were included. For instance, Figure 2a,b display the slope and CV of a time-domain characteristic (computed by HCTSAtool as the “proportion of data within two standard deviations of the mean” [34,35,36]) for the normal and MCI groups, revealing intra-class similarity within the shaded regions but inter-class variability, thereby making it a CFV feature point. Similarly, the slope and CV in Figure 2c,d at different time frames exhibit both intra-class similarity and inter-class variability, meeting the criteria for inclusion in the CFV. Out of all time-domain feature points, 510 fulfilled the conditions mentioned above, securing their place in the CFV.
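The sliding-window computation can be sketched as follows, using "proportion of data within two standard deviations of the mean" as the example characteristic. The 25 ms window and 50% overlap follow the text; the pairwise slope/CV bookkeeping is our reading of the description, and the helper names are assumptions (the study itself used the HCTSA toolbox in MATLAB).

```python
import numpy as np

def sliding_characteristic(erp, fs, win_ms=25.0, overlap=0.5):
    """Example time-domain characteristic in 25 ms windows with 50% overlap:
    proportion of samples within two standard deviations of the window mean."""
    win = int(win_ms * 1e-3 * fs)
    hop = max(1, int(win * (1.0 - overlap)))
    vals = []
    for start in range(0, len(erp) - win + 1, hop):
        w = erp[start:start + win]
        vals.append(np.mean(np.abs(w - w.mean()) <= 2 * w.std()))
    return np.asarray(vals)

def slope_and_cv(vals):
    """Slope and coefficient of variation from consecutive pairs of
    window-level feature points (our interpretation of the description)."""
    slopes, cvs = [], []
    for a, b in zip(vals[:-1], vals[1:]):
        slopes.append(b - a)                        # change per window step
        m, s = np.mean([a, b]), np.std([a, b])
        cvs.append(s / m if m != 0 else 0.0)        # CV of the pair
    return np.asarray(slopes), np.asarray(cvs)
```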
The CFV, comprising a total of 590 features (80 from prominent points and 510 from time-domain characteristics), underwent ranking by the lasso regression algorithm. From this comprehensive set, the top 13 features were selected for integration into the regression methods. The lasso regression algorithm was implemented in MATLAB. Lasso is a regularization technique designed to identify optimal features in a regression problem, thereby reducing the number of features within a large feature set [37]. For a given value of λ, lasso solves the following minimization problem:

$$\min_{\beta_0,\,\beta}\left(\frac{1}{2N}\sum_{i=1}^{N}\bigl(y_i-\beta_0-x_i^{T}\beta\bigr)^{2}+\lambda\sum_{j=1}^{p}\lvert\beta_j\rvert\right)$$

Here, $N$ is the number of observations, $y_i$ is the response at observation $i$, $x_i$ is the data (a vector of $p$ values) at observation $i$, $\lambda$ is a positive regularization parameter, $\beta_0$ is a scalar intercept, and $\beta$ is a vector of $p$ coefficients.
According to Figure 3a, when λ is close to 10⁻¹, the cross-validated deviance is at its lowest, and the number of features with non-zero coefficients at that λ is 13.
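For readers without MATLAB, the selection step can be approximated with scikit-learn's LassoCV; this is a hedged sketch under that substitution, and the standardization step and variable names are assumptions.

```python
import numpy as np
from sklearn.linear_model import LassoCV
from sklearn.preprocessing import StandardScaler

def lasso_select(X, y, n_folds=5):
    """X: (subjects x 590) candidate feature matrix; y: MoCA scores.
    Returns indices of features with non-zero coefficients at the
    cross-validated lambda (13 features in the study)."""
    Xs = StandardScaler().fit_transform(X)
    lasso = LassoCV(cv=n_folds, random_state=0).fit(Xs, y)
    selected = np.flatnonzero(lasso.coef_)
    return selected, lasso.alpha_   # alpha_ plays the role of lambda
```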

3.2. Regression

Because MoCA scores are bounded between 0 and 30, instead of plain linear regression we analyzed multivariate regression with a logit link function (MR), ensemble regression (ER), support vector regression (SVR), and ridge regression (RR). We performed five-fold cross-validation to ensure the robustness of the models and the reliability of the results.
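A hedged sketch of this comparison is shown below using scikit-learn stand-ins (a tree ensemble for ER, epsilon-SVR for SVR, and ridge regression for RR); the original models were implemented in MATLAB and R, so the specific estimators and hyperparameters here are assumptions, and MR with a logit link is omitted.

```python
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import Ridge
from sklearn.svm import SVR
from sklearn.model_selection import cross_val_score

def compare_regressors(X13, y, n_folds=5):
    """X13: (subjects x 13) selected features; y: MoCA scores (0-30).
    Reports five-fold cross-validated RMSE for each stand-in model."""
    models = {
        "ER (tree ensemble)": RandomForestRegressor(n_estimators=200, random_state=0),
        "SVR": SVR(kernel="rbf", C=10.0),
        "RR": Ridge(alpha=1.0),
    }
    rmse = {}
    for name, model in models.items():
        scores = cross_val_score(model, X13, y, cv=n_folds,
                                 scoring="neg_root_mean_squared_error")
        rmse[name] = -scores.mean()   # flip sign: sklearn reports negated RMSE
    return rmse
```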

4. Single-Trial Analysis

4.1. Feature Extraction

For the single-trial analysis, we considered each trial from the onset of the stimulus to the response of the subject. We excluded responses shorter than 0.2 s or longer than 1.5 s [19], as well as trials in which the range of the signal magnitude exceeded 100 µV, as these might have been contaminated by artifacts [38]. We then performed a time–frequency (TF) analysis of each individual trial over frequencies from 0 to 100 Hz. We saved the generated TF plots as PNG images and used them as inputs for the deep neural network analysis. The pspectrum function of MATLAB (2018a) was used to extract the TF features.
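A minimal sketch of this per-trial screening and time–frequency image export is given below, using scipy.signal.spectrogram and matplotlib in place of MATLAB's pspectrum; the spectrogram window length, image size, and output naming are assumptions, while the response-time and 100 µV exclusion criteria follow the text.

```python
import os
import numpy as np
import matplotlib
matplotlib.use("Agg")                       # render to files without a display
import matplotlib.pyplot as plt
from scipy.signal import spectrogram

FS = 20_000  # sampling rate (Hz)

def save_tf_image(trial, trial_id, out_dir="tf_images",
                  rt_s=None, max_range_uv=100.0):
    """Save a 0-100 Hz time-frequency image of one trial as a PNG.
    Returns False if the trial is excluded by the criteria described above."""
    if rt_s is not None and not (0.2 <= rt_s <= 1.5):
        return False                               # response-time exclusion
    if np.ptp(trial) > max_range_uv:
        return False                               # amplitude-range exclusion
    f, t, sxx = spectrogram(trial, fs=FS, nperseg=1024, noverlap=768)
    keep = f <= 100.0
    os.makedirs(out_dir, exist_ok=True)
    plt.figure(figsize=(2, 2))
    plt.pcolormesh(t, f[keep], 10 * np.log10(sxx[keep] + 1e-12), shading="auto")
    plt.axis("off")
    plt.savefig(os.path.join(out_dir, f"trial_{trial_id}.png"), bbox_inches="tight")
    plt.close()
    return True
```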

4.2. Deep Neural Regression and Bayesian Optimization

We had a total of 2848 samples as inputs to the deep learning network. We constructed the deep neural network with the Keras package in Python. We used a sequential model consisting of three sets of a 2D convolutional layer and a max-pooling layer with ‘relu’ activations, followed by a flatten layer and two dense layers with a ‘tanh’ activation between them. To find the optimal convolutional filter size, dropout rate, and number of epochs, we used Gaussian process-based Bayesian optimization. The whole process of the Gaussian process-based Bayesian optimization can be divided into four steps (Figure 4). For convenience, we explain the third and fourth steps together, giving three stages: (a) pre-sample, (b) kernel comparison, and (c) exploration and exploitation. In the pre-sample stage, we defined a uniform distribution for each of our hyperparameters (convolutional filter size, dropout rate, and number of epochs). Each time, we randomly sampled from the uniform distributions without repetition and used the sample in the deep neural network to obtain the cross-validated root mean square error. We repeated this process five times in the pre-sample stage. We then entered the kernel comparison stage, in which we compared the performance of fitting the samples with a Gaussian process under five kernels: (i) Matern, (ii) Radial Basis Function, (iii) Rational Quadratic, (iv) Exponential Sine-squared, and (v) Dot Product. The kernel performances in terms of the sum of squared errors (SSE) are given in Table 1.
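Before turning to the optimization details, the following hedged Keras sketch shows a network of the kind described above, parameterized by the hyperparameters that the Bayesian optimization tunes; the filter counts, input image size, dropout placement, and dense-layer width are assumptions, while the layer ordering and the ‘adadelta’ compilation follow the text.

```python
from tensorflow import keras
from tensorflow.keras import layers

def build_cnn(filter_size=3, dropout_rate=0.25, input_shape=(128, 128, 3)):
    """Three conv/max-pool blocks with ReLU, then flatten and two dense
    layers with a tanh activation in between, ending in one regression
    output for the MoCA score. Filter counts and input size are assumed."""
    model = keras.Sequential([
        keras.Input(shape=input_shape),
        layers.Conv2D(16, filter_size, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(32, filter_size, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(64, filter_size, activation="relu"),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dropout(dropout_rate),
        layers.Dense(64, activation="tanh"),
        layers.Dense(1),                      # predicted MoCA score
    ])
    model.compile(optimizer="adadelta", loss="mse", metrics=["mae"])
    return model
```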
We used the Rational Quadratic kernel for the rest of the Bayesian optimization process. In the exploration stage, we defined the Gaussian process with the y kernel. We fitted it with pre-sampled data. Then, we used the following acquisition function to calculate our next sample.
$$EI(x)=\begin{cases}\bigl(\mu(x)-f(x^{+})\bigr)\,\Phi(Z)+\sigma(x)\,\phi(Z), & \text{if } \sigma(x)>0\\[4pt] 0, & \text{if } \sigma(x)=0\end{cases}$$

$$Z=\frac{\mu(x)-f(x^{+})}{\sigma(x)}$$
where x ranges over the possible samples that can be taken in the next step, μ(x) and σ(x) are the mean and standard deviation of the output values returned by the Gaussian process at x, f(x⁺) is the lowest RMSE obtained so far, and Φ(Z) and ϕ(Z) are the CDF and PDF of the standard normal distribution, respectively [39]. We took the sample with the highest EI value for the next deep neural network evaluation. We repeated this process fifteen times and then concluded the exploration stage. As many samples obtained in the exploration stage performed similarly, we did not explore additional samples. Finally, we selected the convolutional filter size, dropout rate, and number of epochs. The detailed parameters of the model are depicted in Figure 5. In the Bayesian optimization phase and in the final phase, after constructing the deep neural network model, we compiled it with the ‘adadelta’ optimizer. We evaluated the model with five-fold cross-validation.
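A small sketch of the expected-improvement computation and the arg-max selection is given below. Because the objective (cross-validated RMSE) is minimized, the improvement term is written as f(x⁺) − μ(x), i.e., the sign-flipped counterpart of the maximization form above; the GP prediction call and variable names are assumptions.

```python
import numpy as np
from scipy.stats import norm

def expected_improvement(mu, sigma, f_best):
    """EI at candidate points, given the GP posterior mean/std (mu, sigma)
    and the lowest cross-validated RMSE observed so far (f_best).
    Written for minimization, so improvement means a lower RMSE."""
    ei = np.zeros_like(mu, dtype=float)
    ok = sigma > 0
    z = (f_best - mu[ok]) / sigma[ok]
    ei[ok] = (f_best - mu[ok]) * norm.cdf(z) + sigma[ok] * norm.pdf(z)
    return ei

# Picking the next hyperparameter sample from candidate rows X_cand
# (mu, sigma would come from, e.g., sklearn's GaussianProcessRegressor):
# mu, sigma = gp.predict(X_cand, return_std=True)
# next_x = X_cand[np.argmax(expected_improvement(mu, sigma, best_rmse))]
```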

5. Results

In the multi-trial analysis, we used the top 13 features selected by the lasso regression-based feature selection technique in each of the regression methods described in the regression section to predict MoCA scores from the neural measures (Figure 6). To compare the regression techniques used in the multi-trial analysis, we used the root mean square error (RMSE). The scores are reported in Table 2.
The regression methods MR, ER, SVR, and RR generated five-fold cross-validated RMSE values of 30.9, 1.6, 0.27, and 2.61, respectively. Based on the RMSE alone, support vector regression (SVR) performs the best of all the techniques in our study, with ensemble regression (ER) second. We performed residual analysis to check the robustness of these models.
In Figure 7, we depict the residual analysis results of the two best-performing regression models (SVR and ER). For each regression model, we present two graphical analyses: (a) sample quantiles vs. theoretical quantiles of the residuals, i.e., a quantile–quantile plot (qqplot), and (b) residuals vs. the values fitted by the regression model. The qqplot of the residuals is generally used to verify the assumption that the residuals are normally distributed. As both models show approximately straight lines in the qqplot, they pass the normality check. We then analyzed the residuals vs. fitted plot. A good regression model should be homoscedastic, meaning its residuals have roughly constant spread and appear uncorrelated and random over the fitted (predicted) values. We assessed both models against this criterion.
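The two diagnostics described above can be reproduced with a short sketch such as the following, using SciPy and matplotlib instead of the MATLAB plots used in the study; the function name and figure layout are assumptions.

```python
import matplotlib.pyplot as plt
from scipy import stats

def residual_diagnostics(y_true, y_pred, title=""):
    """QQ plot of residuals (normality check) and residuals vs. fitted
    values (homoscedasticity/randomness check), as described in the text."""
    residuals = y_true - y_pred
    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 3))
    stats.probplot(residuals, dist="norm", plot=ax1)   # sample vs. theoretical quantiles
    ax1.set_title(f"QQ plot {title}")
    ax2.scatter(y_pred, residuals)
    ax2.axhline(0.0, linestyle="--")
    ax2.set_xlabel("Fitted value")
    ax2.set_ylabel("Residual")
    ax2.set_title(f"Residuals vs. fitted {title}")
    fig.tight_layout()
    return fig
```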
The plot for SVR shows a pattern, whereas the plot for ER shows randomness in the residuals. From this analysis, we conclude that the ER model is superior to SVR for these data, even though SVR gave a lower RMSE than ER (Figure 7), because the ER model shows robustness across all of the analyses we performed. For the implementation of ER and RR, we used MATLAB, and for implementing MR and SVR, we used several R packages such as “e1071” and “caret”. For the residual analysis, we also used MATLAB.
In the single-trial analysis, we calculated the cross-validated RMSE and mean absolute error (MAE) for the constructed convolutional deep neural network. To check the robustness of the model, we also carried out a residual analysis, which is presented in Figure 8. We tried different combinations of convolutional, max-pooling, and activation layers and calculated the RMSE and MAE for each; the best result is reported here (Table 3). Our deep neural network regression model achieved an RMSE of 2.76 and an MAE of 1.81.
Finally, in Figure 8, we show the residual analysis of the deep neural network. We observed that the qqplot (sample quantiles vs. theoretical quantiles) of the residuals can be approximated as linear with only a small deviation, so the model approximately maintains the normality assumption. Also, the residuals vs. fitted plot shows that the residuals do not follow any pattern as the fitted value changes; thus, they appear uncorrelated.
In comparison, the results demonstrate that multi-trial analysis can be used for MoCA score prediction with sufficient confidence; in particular, SVR has a very small RMSE (0.27). Single-trial analysis can also be used to predict the MoCA score with a deep neural network, although the RMSE is higher (2.76). Nonetheless, the single-trial approach may be more suitable in practical cases, where patient profiles can be updated objectively during each visit. This can be further explored for in-home MoCA score prediction applications that provide more granular data points.

6. Conclusions

Although MCI detection has been investigated at great length by researchers in recent years, MCI severity measurement has not been studied as intensely, even though the ability to measure such a metric objectively and automatically could improve MCI early diagnosis, prognosis, and patient management. In the current study, we generated the MoCA score from single-channel EEG data using unsupervised algorithms, which can help to assess cognitive impairment automatically and continually with a low-cost, portable, and less burdensome EEG system. We carried out multi-trial and single-trial analyses, and the results in both cases were promising. Multi-trial analysis uses multiple trial data, while single-trial analysis uses single trial data, to predict the MoCA score. In the multi-trial analysis, we applied several classical regression models and found that ensemble regression (ER) performed best, with an RMSE of 1.6. In the single-trial analysis, our constructed convolutional deep neural network generated the MoCA score with an RMSE of 2.76. To the best of our knowledge, we are the first to generate automated scores equivalent to (manual) MoCA scores from single-channel EEG data, and we have done so with both multi-trial and single-trial data. We believe that our current study has broad scope in portable cognitive health studies, precision medicine, mobile health (mHealth), and smart and connected community (SCC) research. For instance, access to objective and more granular MoCA scores can help improve the diagnosis, prognosis, and therapy-efficacy monitoring of Alzheimer’s disease, and can enable at-home or remote monitoring of MoCA scores beyond traditional clinical settings. However, the implications of the availability of more granular MoCA scores should be examined with consideration of privacy, disparity, and ethics.

Author Contributions

Conceptualization, B.I.M. and G.M.B.; methodology, S.K. and B.I.M.; software, S.K.; validation, S.K.; resources, S.K.; data curation, S.K.; writing—original draft preparation, S.K.; writing—review and editing, B.I.M. and G.M.B.; supervision, B.I.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Russo, M.J.; Cohen, G.; Chrem Mendez, P.; Campos, J.; Martín, M.E.; Clarens, M.F.; Tapajoz, F.; Harris, P.; Sevlever, G.; Allegri, R.F. Utility of the Spanish version of the Everyday Cognition scale in the diagnosis of mild cognitive impairment and mild dementia in an older cohort from the Argentina-ADNI. Aging Clin. Exp. Res. 2018, 30, 1167–1176. [Google Scholar] [CrossRef]
  2. Tsai, C.-L.; Pai, M.-C.; Ukropec, J.; Ukropcová, B. The Role of Physical Fitness in the Neurocognitive Performance of Task Switching in Older Persons with Mild Cognitive Impairment. J. Alzheimer’s Dis. 2016, 53, 143–159. [Google Scholar] [CrossRef]
  3. Khatun, S.; Morshed, B.I.; Bidelman, G.M. Single channel EEG time-frequency features to detect Mild Cognitive Impairment. In Proceedings of the 2017 IEEE International Symposium on Medical Measurements and Applications, MeMeA 2017–Proceedings, Rochester, MN, USA, 7–10 May 2017. [Google Scholar]
  4. Roalf, D.R.; Rupert, P.; Mechanic-Hamilton, D.; Brennan, L.; Duda, J.E.; Weintraub, D.; Trojanowski, J.Q.; Wolk, D.; Moberg, P.J. Quantitative assessment of finger tapping characteristics in mild cognitive impairment, Alzheimer’s disease, and Parkinson’s disease. J. Neurol. 2018, 265, 1365–1375. [Google Scholar] [CrossRef]
  5. Maskill, L. Mild cognitive impairment: A quiet epidemic with occupation at its heart. Br. J. Occup. Ther. 2018, 81, 485–486. [Google Scholar] [CrossRef]
  6. Lin, J.S.; O’Connor, E.; Rossom, R.C.; Perdue, L.A.; Burda, B.U.; Thompson, M.; Eckstrom, E. Screening for Cognitive Impairment in Older Adults: An Evidence Update for the US Preventive Services Task Force. 2013. Available online: https://pubmed.ncbi.nlm.nih.gov/24354019/ (accessed on 1 November 2020).
  7. Tsoi, K.K.; Chan, J.Y.; Hirai, H.W.; Wong, A.; Mok, V.C.; Lam, L.C.; Kwok, T.C.; Wong, S.Y. Recall Tests Are Effective to Detect Mild Cognitive Impairment: A Systematic Review and Meta-analysis of 108 Diagnostic Studies. J. Am. Med. Dir. Assoc. 2017, 18, 807.e17–807.e29. [Google Scholar] [CrossRef]
  8. Roalf, D.R.; Moberg, P.J.; Xie, S.X.; Wolk, D.A.; Moelter, S.T.; Arnold, S.E. Comparative accuracies of two common screening instruments for classification of Alzheimer’s disease, mild cognitive impairment, and healthy aging. Alzheimer’s Dement. 2013, 9, 529–537. [Google Scholar] [CrossRef]
  9. Bennys, K.; Rondouin, G.; Vergnes, C.; Touchon, J. Diagnostic value of quantitative EEG in Alzheimer’s disease. Neurophysiol. Clin. 2001, 31, 153–160. [Google Scholar] [CrossRef]
  10. Kowalski, J.W.; Gawel, M.; Pfeffer, A.; Barcikowska, M. The diagnostic value of EEG in Alzheimer disease: Correlation with the severity of mental impairment. J. Clin. Neurophysiol. 2001, 18, 570–575. [Google Scholar] [CrossRef]
  11. Bennys, K.; Rondouin, G.; Vergnes, C.; Touchon, J. Quantitative EEG findings in different stages of Alzheimer’s disease. Neurophysiol. Clin. Clin. Neurophysiol. 2001, 23, 457–462. [Google Scholar]
  12. Fraga, F.J.; Falk, T.H.; Kanda, P.A.M.; Anghinah, R. Characterizing Alzheimer’s Disease Severity via Resting-Awake EEG Amplitude Modulation Analysis. PLoS ONE 2013, 8, e72240. [Google Scholar] [CrossRef]
  13. Brunovsky, M.; Matousek, M.; Edman, A.; Cervena, K.; Krajca, V. Objective assessment of the degree of dementia by means of EEG. Neuropsychobiology 2003, 48, 19–26. [Google Scholar] [CrossRef]
  14. Song, Y.; Zang, D.; Wang, Z.; Guo, J.; Gong, Z.; Yao, Y. Grand Total EEG Analyses: A Promising Method to Predict the Severity of Cognitive Impairment in Alzheimer’s Disease. J. Neurol. Sci. 2014, 31, 40. [Google Scholar]
  15. Babiloni, C.; Lizio, R.; Marzano, N.; Capotosto, P.; Soricelli, A.; Triggiani, A.I.; Cordone, S.; Gesualdo, L.; Del Percio, C. Brain neural synchronization and functional coupling in Alzheimer’s disease as revealed by resting state EEG rhythms. Int. J. Psychophysiol. 2016, 103, 88–102. [Google Scholar] [CrossRef]
  16. Babiloni, C.; De Pandis, M.F.; Vecchio, F.; Buffo, P.; Sorpresi, F.; Frisoni, G.B.; Rossini, P.M. Cortical sources of resting state electroencephalographic rhythms in Parkinson’s disease related dementia and Alzheimer’s disease. Clin. Neurophysiol. 2011, 122, 2355–2364. [Google Scholar] [CrossRef]
  17. Polverino, P.; Ajčević, M.; Catalan, M.; Mazzon, G.; Bertolotti, C.; Manganotti, P. Brain oscillatory patterns in mild cognitive impairment due to Alzheimer’s and Parkinson’s disease: An exploratory high-density EEG study. Clin. Neurophysiol. 2022, 138, 1–8. [Google Scholar] [CrossRef]
  18. Oldfield, R.C. The assessment and analysis of handedness: The Edinburgh inventory. Neuropsychologia 1971, 9, 97–113. [Google Scholar] [CrossRef]
  19. Bidelman, G.M.; Lowther, J.E.; Tak, S.H.; Alain, C. Mild Cognitive Impairment Is Characterized by Deficient Brainstem and Cortical Representations of Speech. J. Neurosci. 2017, 37, 3610–3620. [Google Scholar] [CrossRef]
  20. Nasreddine, Z.S.; Phillips, N.A.; Bédirian, V.; Charbonneau, S.; Whitehead, V.; Collin, I.; Cummings, J.L.; Chertkow, H. The Montreal Cognitive Assessment, MoCA: A Brief Screening Tool for Mild Cognitive Impairment. J. Am. Geriatr. Soc. 2005, 53, 695–699. [Google Scholar] [CrossRef]
  21. Bidelman, G.M.; Moreno, S.; Alain, C. Tracing the emergence of categorical speech perception in the human auditory system. Neuroimage 2013, 79, 201–212. [Google Scholar] [CrossRef]
  22. Pisoni, D.B. Auditory and phonetic memory codes in the discrimination of consonants and vowels. Percept. Psychophys. 1973, 13, 253–260. [Google Scholar] [CrossRef]
  23. Karunathilake, I.D.; Dunlap, J.L.; Perera, J.; Presacco, A.; Decruy, L.; Anderson, S.; Kuchinsky, S.E.; Simon, J.Z. Effects of aging on cortical representations of continuous speech. J. Neurophysiol. 2023, 129, 1359–1377. [Google Scholar] [CrossRef]
  24. Bidelman, G.M.; Alain, C. Musical Training Orchestrates Coordinated Neuroplasticity in Auditory Brainstem and Cortex to Counteract Age-Related Declines in Categorical Vowel Perception. J. Neurosci. 2015, 35, 1240–1249. [Google Scholar] [CrossRef] [PubMed]
  25. Aiken, S.J.; Picton, T.W. Envelope and spectral frequency-following responses to vowel sounds. Hear. Res. 2008, 245, 35–47. [Google Scholar] [CrossRef] [PubMed]
  26. Luck, S. An Introduction to the Event Related Potential Technique; MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
  27. Krishnan, A.; Suresh, C.H.; Gandour, J.T. Tone language experience-dependent advantage in pitch representation in brainstem and auditory cortex is maintained under reverberation. Hear. Res. 2019, 377, 61–71. [Google Scholar] [CrossRef] [PubMed]
  28. Musacchia, G.; Strait, D.; Kraus, N. Relationships between behavior, brainstem and cortical encoding of seen and heard speech in musicians and non-musicians. Hear. Res. 2008, 241, 34–42. [Google Scholar] [CrossRef] [PubMed]
  29. Lopez-Calderon, J.; Luck, S.J. ERPLAB: An open-source toolbox for the analysis of event-related potentials. Front. Hum. Neurosci. 2014, 8, 213. [Google Scholar] [CrossRef] [PubMed]
  30. Hu, L.; Xiao, P.; Zhang, Z.G.; Mouraux, A.; Iannetti, G.D. Single-trial time-frequency analysis of electrocortical signals: Baseline correction and beyond. Neuroimage 2014, 84, 876–887. [Google Scholar] [CrossRef] [PubMed]
  31. Khatun, S.; Mahajan, R.; Morshed, B.I. Comparative Study of Wavelet-Based Unsupervised Ocular Artifact Removal Techniques for Single-Channel EEG Data. IEEE J. Transl. Eng. Health Med. 2016, 4, 1–8. [Google Scholar] [CrossRef] [PubMed]
  32. Khatun, S.; Mahajan, R.; Morshed, B.I. Comparative analysis of wavelet-based approaches for reliable removal of ocular artifacts from single channel EEG. In Proceedings of the IEEE International Conference on Electro Information Technology, Dekalb, IL, USA, 21–23 May 2015. [Google Scholar]
  33. Bidelman, G.M. Towards an optimal paradigm for simultaneously recording cortical and brainstem auditory evoked potentials. J. Neurosci. Methods 2015, 241, 94–100. [Google Scholar] [CrossRef]
  34. Lister, J.J.; Bush, A.L.H.; Andel, R.; Matthews, C.; Morgan, D.; Edwards, J.D. Cortical auditory evoked responses of older adults with and without probable mild cognitive impairment. Clin. Neurophysiol. 2016, 127, 1279–1287. [Google Scholar] [CrossRef]
  35. Blankertz, B.; Lemm, S.; Treder, M.; Haufe, S.; Mueller, K.-R. Single-trial analysis and classification of ERP components—A tutorial. Neuroimage 2011, 56, 814–825. [Google Scholar] [CrossRef]
  36. Van der Hiele, K.; Vein, A.A.; Van Der Welle, A.; Van Der Grond, J.; Westendorp, R.G.J.; Bollen, E.L.E.M.; Van Buchem, M.A.; Van Dijk, J.G.; Middelkoop, H.A.M. EEG and MRI correlates of mild cognitive impairment and Alzheimer’s disease. Neurobiol. Aging 2007, 28, 1322–1329. [Google Scholar] [CrossRef] [PubMed]
  37. Tibshirani, R. Regression shrinkage and selection via the lasso: A retrospective. J. R. Stat. Soc. Ser. B Stat. Methodol. 2011, 73, 273–282. [Google Scholar] [CrossRef]
  38. Aurlien, H.; Gjerde, I.O.; Aarseth, J.H.; Eldøen, G.; Karlsen, B.; Skeidsvoll, H.; Gilhus, N.E. EEG background activity described by a large computerized database. Clin. Neurophysiol. 2004, 115, 665–673. [Google Scholar] [CrossRef] [PubMed]
  39. Brochu, E.; Cora, V.M.; De Freitas, N. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv 2010, arXiv:1012.2599. [Google Scholar]
Figure 1. A representative multi-trial ERP from a subject with auditory stimuli of vw1, vw2, vw3, vw4, and vw5.
Figure 2. Schematic of the feature extraction process: (a,c) are slopes and (b,d) are coefficients of variation (CV) for two different time windows from two timeframes as shown.
Figure 3. (a) Lasso regression-based feature selection: five-fold cross-validated mean squared error/deviance vs. lambda. (b) time–frequency image feature sample.
Figure 4. Deep learning classification steps: (a) Pre-sample processing, (b) Kernel comparison (see Table 1), and (c) Exploration and Exploitation stage.
Figure 5. Convolutional deep neural network for regression: layers and their parameters.
Figure 6. Flow Diagram of the method for single- and multi-trial analysis for individual or aggregate MoCA score prediction.
Figure 7. Residual analysis for support vector regression (SVR) and ensemble regression (ER) used in multi-trial analysis.
Figure 8. Residual analysis for deep neural network regression model used in single-trial analysis.
Table 1. Kernel performance of data fitting in the Gaussian process.

Kernel                      SSE
Matern                      5.45 × 10⁻¹⁹
Radial Basis Function       5.45 × 10⁻¹⁹
Rational Quadratic          3.59 × 10⁻¹⁹
Exponential Sine-squared    2.48 × 10⁰
Dot Product                 54.14 × 10⁰
Table 2. RMSE for regression methods used for multi-trial analysis.

Method                                                   RMSE
Multivariate Regression (link function = logit) (MR)    30.9
Ensemble Regression (ER)                                 1.6
Support Vector Regression (SVR)                          0.27
Ridge Regression (RR)                                    2.61
Table 3. RMSE and MAE for deep neural network used in single-trial analysis.

RMSE    MAE
2.76    1.81
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

