Next Article in Journal
Synthetic MRI Generation from CT Scans for Stroke Patients
Next Article in Special Issue
OutSplice: A Novel Tool for the Identification of Tumor-Specific Alternative Splicing Events
Previous Article in Journal
Explainable Machine Learning (XAI) for Survival in Bone Marrow Transplantation Trials: A Technical Report
Previous Article in Special Issue
Effective Feature Engineering and Classification of Breast Cancer Diagnosis: A Comparative Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Towards an Affordable Means of Surgical Depth of Anesthesia Monitoring: An EMG-ECG-EEG Case Study

1
Nsugbe Research Labs, Swindon SN1 3LG, UK
2
Hereford County Hospital, Wye Valley NHS Trust, Hereford HR1 2BN, UK
*
Author to whom correspondence should be addressed.
BioMedInformatics 2023, 3(3), 769-790; https://doi.org/10.3390/biomedinformatics3030049
Submission received: 19 June 2023 / Revised: 19 July 2023 / Accepted: 30 August 2023 / Published: 4 September 2023
(This article belongs to the Special Issue Feature Papers in Computational Biology and Medicine)

Abstract

:
The anesthetic dosing procedure is a key element of safe surgical practice, where it is paramount to ensure sufficient dosing of the anesthetic agent to the patient in order to reach the desired depth of sedation for the necessary procedure. One means of monitoring the depth of anesthesia (DoA) involves the use of the bispectral index (BIS), which decodes electroencephalography (EEG) signals acquired from the frontal cortex in a continuous fashion. The shortcomings of this include the complexity of the decoding of EEG signals, insensitivity to certain anesthetic agents, and the costly nature of the technology, which limits its adoption in resource-constrained settings. In this paper, we investigate an alternative source of physiological measurement modalities that can track DoA sufficiently while being much more affordable. Thus, we investigate this notion with the use of the University of Queensland database, which comprises EEG-EMG-ECG physiological data from patients going through a variety of surgical procedures. As part of this, select patient datasets were utilized in addition to a variety of signal decomposition and machine learning models—which totaled around 200 simulations—in order to investigate the most optimal combination of algorithms to track DoA using different physiological measurement modalities. The results showed that under certain algorithmic combinations and modeling processes, the ECG measurement (a ubiquitous monitor in anesthetic practice) can rival and occasionally surpass the accuracy of the EEG for DoA monitoring. In addition to this, we also propose a 2-phase modeling process that involves an algorithmic selection stage followed by a model deployment stage. Subsequent work in this area is advised to involve the acquisition of more physiological data from a broader mix of patients in order to further validate the consistency of the findings made in this study.

1. Introduction and Background

One of the key components of surgical processes is the dosing of anesthesia, which is managed by anesthetists and allows for the surgery itself to take place in a relatively painless fashion [1,2,3,4]. There are four recognized components of anesthesia: hypnosis, analgesia, amnesia, and muscle relaxation. An applied state of general anesthesia chiefly acts on the central nervous system, with the optimal dosing of the anesthetic agents being of key importance [4]. The measurement of the depth of anesthesia (DoA) has been an area for continued study amongst anesthesia researchers, where common metrics used for the measurement include the bispectral index (BIS), entropy measurement, auditory evoked potentials (AEP), and the surgical stress index. Other avenues for quantifying the degree of pain include the analgesia nociception index (ANI) and the surgical pleth index (SPI) [2]. In the absence of specific DoA monitors, anesthesia dosing is based on the physiological response of the patient, and due to physiological differences, there is substantial variation of the sedation index amongst patients, making it difficult to foster a single index for the measurement of the DoA and sedation level [4].
Nevertheless, as part of safer surgical practices, continuous work has been conducted in optimal monitoring and DoA tracking to prevent complications from over or underdosing of anesthesia [5,6,7,8]. The most widely used monitor is the BIS, whose full model architecture is protected due to intellectual property and works with a combination of electroencephalography (EEG) alongside what is believed to be a spectrum of high-order statistics from the acquired EEG time series [5,6,7,8]. Indices spanning 0–100 (from isoelectric/low brain activity to awake) are presented in real-time, and its introduction into the surgical theater in 1996, when it received FDA clearance, has helped to significantly reduce anesthetic dosing-related medical errors [5,6,7,8].
The BIS monitor uses fine-scale neural oscillations from the frontal cortex of the brain as input for computing the DoA metrics, and with a more reflective DoA measure, better recovery and wake-ups have now been noted after major surgery [9,10]. Nevertheless, the BIS monitor itself is only as accurate as its physiological measurement source, which in this case is EEG signals from the frontal cortex [11]. However, EEG has limitations, including: complexity and being difficult to decode; being non-reflective of cognitive state when drugs like ketamine and nitrous oxide are administered; sensitivity to interference; and being generally more expensive than other forms of physiological sensors [4,9,10]. This has prompted alternate exploration of physiological signals such as electrocardiograms (ECG) for DoA monitoring, which has been investigated by a number of authors, from whom it has been steadily observed that anesthetic information can be seen from rhythmic to nonrhythmic observations from an accompanying ECG signal [12,13,14,15,16,17,18].
The appeal of this form of physiological measurement as an inference for DoA is amplified by the fact that ECG signals and recording mechanisms are cheap to acquire and not as complex to process, which makes the adoption of this modality appealing to resource-constrained environments. However, previous studies have shown that the monitoring prowess and accuracy of ECG relative to EEG are slightly lower, which forms part of the investigation in this work [4]. For the first time in the published literature, bespoke advanced signal processing methods are adopted in the DoA prediction pipeline in an attempt to enhance the prediction accuracy of the candidate model as part of a potential low-cost anesthesia DoA monitoring platform. As part of this, we utilize physiological data from patients under various depths of anesthesia to perform a comparison study amongst modalities such as EEG, ECG, and an auxiliary physiological monitor, namely, electromyography (EMG). Data from several patients are utilized alongside various advanced signal processing methods using machine learning methods of different architectures [19,20].
The aforementioned advanced signal processing mechanism comprises a signal decomposition preprocessing mechanism and is followed by the extraction of a select list of features that form the core of a feature vector that is eventually used to train a machine learning model that serves as the prediction model [20]. Signal decomposition methods are a type of multiresolution approach typically utilized for the systematic separation of a candidate signal into subcomponents in order to find an optimal region within the signal, which minimizes noise and redundancies and boosts the modeling prowess of the signal. Common examples that have been applied within the area of DoA include the empirical mode decomposition and the wavelet decomposition, to name a few.
This work showcases, for the first time in the published literature, the application of the metaheuristically driven linear series decomposition learner (LSDL) and the deep wavelet scattering (DWS) method towards the decomposition of ECG and EMG physiological signals for anesthesia depth estimation [21,22]. Despite both of these being decomposition algorithms, they are underpinned by different decomposition configurations towards the systematic separation of the signal into component parts; hence, their respective performances are contrasted as part of the work done in this paper.
Once the features are extracted from a signal, a machine-learning model is typically utilized for the pattern recognition of the input data in association with an output label. In medicine, machine learning has played the role of data-driven prediction machines, which have been applied and deployed in various capacities. Machine learning models span various architectures, from statistically driven discriminant analysis and logistic regression to decision tree and nearest neighbor models, all the way to models with more complex architectures such as support vector machines and deep learning models, to name a few [14]. A variety of these models have been researched and applied to serve as decision support in areas such as diabetes care, psychiatry, oncology, pregnancy care, and, of course, anesthesia DoA, where the introduction of these models has contributed towards a human-machine interaction platform that allows for enhanced clinical decision making and better patient care [23,24,25,26].
With respect to the work being presented in this paper, machine learning modeling represents the final stage of the signal processing pipeline where metrics and predictions are output and are meaningfully comprehensible to the end operator and clinician. In this work, seven different machine learning models were trained and utilized for the analysis in order to understand which combination of physiological sensing modality/signal decomposition method/machine learning yields optimal DoA estimation performance. Thus, a cumulative total of around 200 simulations were carried out as part of the work done in this study.
The contributions presented as part of this manuscript are as follows:
-
Investigation of alternative physiological monitors that could contribute towards forming a low-cost avenue towards DoA monitoring by comparing the performance of ECG and EMG monitors with that of the traditionally used EEG;
-
Premier use of signal decomposition methods such as the LSDL and DWS for the preprocessing and decomposition of ECG and EMG signals from anesthetized patients, with a view towards enhancing the DoA information that can be decoded and inferred from the signal;
-
Comparison of the DoA estimation prowess across multiple classification models of varied model architecture complexities.
All the analytics yielded around 200 simulations across the pilot group of patients’ data, covering the various physiological measurement modalities. Given the proposed contributions of the work, it is hypothesized that the groundwork would be laid for an affordable pipeline for DoA monitoring comprising a physiological monitor as well as the postprocessing algorithms and machine learning models. This would have a higher level of appeal to developing nations as well as resource-constrained environments.

2. Materials and Methods

2.1. Dataset and Information

The dataset utilized as part of this paper is from the University of Queensland Vital Signs Dataset, which has received ethical approval from both the Royal Adelaide Hospital and the University of Queensland and has the goal of acquiring physiological measurements from patients during various surgical procedures [27]. A number of supporting modules were used as part of the data acquisition process, as can be seen in the source publication, where data were acquired at what is believed to be around 100 Hz and stored in CSV format. The acquisitions were cropped to ensure that the data recordings reflected the commencement of the induction phase, while the end also reflected the removal of the sensor from the head after emergence [27].
It needs to be mentioned that all cases recorded as part of the database represent mostly minor surgeries with no traumatic events, while the effects of the prior medication before the surgery and anesthetic procedure were not taken into consideration in order to avoid a subjective view of the effect of the anesthetic agent on the patient [27].
Based on the research needs of the paper, three unique case studies were selected from the data on the basis of the patient having spent a long enough time under anesthesia and having data present for the BIS, EMG, ECG, and EEG. The BIS was selected as the ground truth label to support all subsequent modeling activities in consistency with the authors’ prior work in this area. The classification case studies were then subsequently created, as seen in Table 1.
The windowing of the signal was chosen to be between 6000 and 9000 samples to account for 60–90 s worth of anesthetic dosing time, which is considered to be a robust window and segment time for all the modalities being considered. The classification exercise was formulated based on the dynamic range of the BIS monitor in order to establish a suitable DoA region where enough samples could be collected, all of which ensured minutes’ worth of samples were used as part of all modeling exercises. A hierarchical flow diagram of the various analytics done as part of the work presented in this paper for a sample patient can be seen in Figure 1.
Uncertainties: no patient demographic information; no details on the types of surgery done; a relatively small BIS variation range; only a small list of patients met the selection criteria.

2.2. Physiological Measurement Instrumentations

Although the work presented is not solely based upon theoretical simulations, the underpinning mathematical physics models behind the anatomical and physiological manifestations under certain given conditions are articulated and described as part of this subsection. In essence, these models quantify mathematical extracellular action potentials, which are acquired in a real-world scenario using physiological sensors.

2.2.1. EMG

The concept of electrophysiological signals is themed around a transient flow of bio-electricity within a segment of muscle fibers. This flow is measured using a set of EMG electrodes, which are typically placed some distance from the source.
Using the principles of biophysics, this electrophysiological manifestation can be framed from the perspective of volume conduction and the 3-dimensional Ohm’s law formulation in a biological tissue [28,29,30,31]. Given the conductivity σ , the 3-dimensional acquisition point P 0 ( x 0 , y 0 , z 0 ) , with uniform conductivity σ i , from a source current I s , at point P ( x , y , z ) , can be formulated as Equation (1):
V p 0 = 1 4 π σ i I s r i
where V p 0 is a voltage potential, and r i is the shortest distance between P 0 and P . From Equation (1), an acquired voltage from a given point carries proportionality to the intensity of the source current, with the voltage potential being particularly inhibited by the distance and conductivity factors.
The dipole is frequently used as a physics-based approximation to represent a source generator, which is surrounded by a field of sorts [28,29,30,31]. For a fiber of length d x in the range of a given action potential, as the current flows into an extracellular region with the hypothetical focus of the magnitude being along the axis, this can be modeled as p . d x , where p is the dipole current per unit length [28,29,30,31]. The source current permeates into free space in a dipole propagation characteristic, which thus leads to Equation (2), which can be said to hold for an extracellular potential:
d e = 1 4 π σ e . d ( 1 r ) d x . p ( x , t ) d x
where σ e is the conductivity of the extracellular mechanism and r is the source distance from the point of excitation to the instrumentation recording point P 0 . For an element p . d x along the coordinates ( x , y , z ) and P 0 at ( x 0 , y 0 , z 0 ) , the distance can then be formulated as Equation (3):
r = [ ( x x 0 ) 2 + ( y y 0 ) 2 + ( z z 0 ) 2 ] 1 2  
from which the total field can be estimated using the summations of the series of potentials from the resulting dipole current element, as can be seen in Equation (4):
e ( x 0 , y 0 , z 0 , t ) = x = x = p ( x , t ) 4 π σ e [ ( x x 0 ) 2 + y 0 2 + z 0 2 ] 3 2 d x
where t is time.
Note that in reality, the excitation source is likely to comprise a series of dipoles from a surface area, which represents an acquisition area.

2.2.2. EEG

EEG signals refer to mostly passive measurements that acquire electric field activities from the brain, which are said to encode information on the neurological state of the patient [32,33]. The mathematical models used to quantitatively describe EEG manifestations are based on the forward problem, with assumptions made for the model for the head and a conductivity overlay of the tissues within the skull [32,33]. As with the descriptions of the EMG, dipoles are useful approximations for the theoretical modeling of bio-electric manifestations within the brain. Using this basis and a series-based expansion approach, a 3-dipole approximation in a homogenous sphere has been utilized for the solution to a 4-layer spherical head model [32,33]. Thus, a multilayer head model L with concentric spheres with radii spanning 0 < r 1 < r 2 r L , with anisotropic conductivities σ 1 , σ L . For a singular dipole at a point S within proximity of a sphere of a radius r s < r 1 with a moment q , an electric potential u measured at a point x , located at the farthest sphere | | x | | = r L can thus be expressed as Equation (5):
u ( r , q , x ) = | | q | | 4 π σ L r L 2 = n = 1 2 n + 1 n ( r s r L ) n 1 f n [ n c o s α P n ( c o s γ ) + c o s β s i n α P n 1 ( cos γ ) ]
where α is the angle between S and the measurement acquisition point x , and γ is an angle between two planar vectors S and q on a specific side, and S and x on the other, while P n and P n 1 are the Legendre polynomial coefficients in the series. The case of f n , which corresponds to the n t h element of the EEG measurement, can be calculated as seen in Equation (6):
f n = n n m 22 + ( 1 + n ) m 21
where the coefficients m 22 and m 21 of the matrix m i j 1 ≤ I, j ≤ 2 can be obtained from Equation (7):
[ m 11 m 12 m 21 m 22 ] = 1 ( 2 n + 1 ) L 1 i = 1 L 1 [ n + ( n + 1 ) σ i σ i + 1 ( n + 1 ) ( σ i σ i + 1 1 ) ( r L r i ) 2 n + 1 n ( σ i σ i + 1 1 ) ( n + 1 ) + n σ i σ i + 1 ]

2.2.3. ECG

The ECG is a widely used tool for the recording of heart activity, but despite its widespread use, there continue to be apparent areas of uncertainty centered around it. The use of quantitative mathematical models has always provided an alternate insight into the mathematical and biophysical descriptions of a particular phenomenon [34]. The process of said quantitative modeling of a candidate set of ECGs is regarded as the forward modeling process, for which three assumptions and components are considered, namely: a model of the electrical activity of the heart; a model of the extracardiac regions of the torso; and some heart–torso conditions [34]. The complexity of the subsequent mathematical models hinges on the depth of the assumptions made as part of the forward modeling process. In this work, a 12-lead ECG model is described based on the work done by Boulakia et al. [34]. The components of this model are bidomain equations, phenomenological cell models, and generalized Laplacian equations to account for the torso [34].
-
Heart Tissue
The electrical dynamics of the heart are accounted for using a bidomain model and work with the postulation that, on a cellular scale, cardiac tissue acts as an ohmic conducting medium that is separated by a cell membrane, both intracellular and extracellular, comprising cardiac and the space between them, respectively [35,36,37]. All of this occupies a whole volume Ω H , alongside intracellular and extracellular density j i  and j e , conductivity tensors  σ i  and σ e , and electric potential u i and u e . The conservation of the electrical charge thus becomes d i v ( j i + j e ) = 0 in Ω H , where the homogenous representation of the electrical activity of the cell membrane is given as A m ( C m V m t + I o n ( V m , w ) ) + d i v ( j i ) = A m I a p p in Ω H , with the Ohm’s law representation j i = σ i u i and j e = σ e u e V m represents the transmembrane potential represented as V m = u i u e .
A m represents a constant of the rate of membrane area per unit volume, and C m is the membrane, capacitance per unit area. I i o n ( V m , w ) is the ionic current across the membrane and I a p p is a given applied current stimulus, both of which are measured in terms of the membrane area unit [35,36,37]. The ionic current is a vectorized quantity w that fulfills a system of ODEs of the format w t + g ( V m , w ) = 0 , in Ω H . g and I i o n are dependent upon the cell ionic current, where their levels of complexity are either phenomenological or physiological [35,36,37].
A scaled phenomenological 2-variable model is considered part of this model and expressed as Equations (8) and (9):
I i o n ( V m , w ) = w τ i n ( V m V m i n ) ^ 2 ( V m a x V i n ) V m a x V m i n + 1 τ o u t V i n V m i n V m a x V m i n
g ( V m , w ) = { w τ o p e n 1 τ o p e n ( V m a x V m i n ) 2   i f   V m < V g a t e w τ c l o s e   i f   V m > V g a t e
where τ i n , τ o u t ,   τ o p e n , τ c l o s e , and V g a t e are fixed parameters, while V m i n   and   V m a x represent scaling constants. w , the gating variable, is stringent upon the V g a t e voltage and associated time constants τ o p e n and τ c l o s e . The timing constants τ i n and τ c l o s e are linked to the length of both the depolarization and repolarization stages [35,36,37]. Thus, the system of equations that represents the electrical activity of the heart can be expressed as Equation (10):
{ A m ( C m V m t + I i o n ( V m , w ) ) d i v ( σ i V m ) d i v ( σ i u e ) = A m I a p p , i n   Ω H d i v ( ( σ i + σ e ) u e ) d i v ( σ i V m ) = 0 , i n   Ω H V m t + g ( V m , w ) = 0 , i n   Ω H
As far as boundary conditions go, given V m 0 and w 0 for the transmembrane potential and gate variable respectively, an initial condition of the form V m ( x , 0 ) = V m 0 ( x ) , w ( x , 0 ) = w 0 ( x ) x Ω H would be in place. With regards to the boundary conditions at the point = Ω H , with the assumption that the intracellular current does not propagate outside of the heart, we can then say j i . n = σ i u i . n = 0 , f o r , where n is the outward unit to the normal of Ω H [35,36,37].
Figure 2 shows a summary of the various physiological organs, their basic operating principles, and their physiological sensing modalities, along with the associated pros and cons and a sample time series segment from a patient under anesthesia [38,39,40]. From the time series, the different dynamics in their respective physiological measurements can be noted. The EMG signal represents an enveloped variant of a raw EMG time series segment characterized by discrete fluctuations. The ECG signal is characterized by overlapping traditional QRS waveforms, which show the cyclical patterns of the human heart alongside its biophysical tendencies to be regarded as a biological pump mechanism. The EEG time series signal shows a continued series of neural oscillations that are highly superimposed, largely due to the multitude of oscillatory frequencies that brainwaves span, where there can also be seen to exist a high degree of noise as a result of the overall complexity of the organ itself [38,39,40].
The set of plots in Figure 3 show the waveforms for a candidate patient across the various BIS values, i.e., over and under, where the left column represents the various time series projections of the physiological signals and the right column showcases the time-frequency view of the signals via the short time Fourier transform (STFT) spectrogram [41]. From the top down, the first two signals represent the ECG measurements, which are characterized by a stream of overlapping sharp peaks in the time domain and are reflected in the spectrogram as a cluster of information in the lower-mid frequency regions. From a visual perspective of the ECG-based spectrogram, it can be assumed that the rich content of the signal is in and around the earlier portion of the frequency spectrum. The subsequent set of plots is for the EMG signals, where it can be noted that an enveloped variant of the EMG signals was stored as opposed to the raw time series, which in general represents a fairly high amplitude signal and whose STFT representation yields a very low frequency transformation of the signal. Despite this, the resulting frequency representation appears to reflect a manifestation that is typically adjourned in the signal processing literature, in that the bulk of the frequency content related to neuromuscular EMG signals tends to be very low-frequency-based.
The final set of plots belongs to the EEG signals, which can be seen to represent a rather noisy and stochastic time series plot. This is largely due to the fact that the brain is physiologically viewed as a more complex organ than both the heart and muscle tissue but also operates on a varied set of frequency states that range from delta, theta, alpha, beta, and gamma [42]. Thus, from an abstraction standpoint, a sample resulting EEG waveform could comprise components and superpositions of the various brain states. Due to this, it can be seen that the frequency manifestations of the signal mostly symbolize a broadband phenomenon, which, although suggesting that a span of frequency components is a part of the signal itself, also shows that the bulk of the signal’s frequency information is intensely within the early portion of the frequency spectra.

2.3. Signal Decomposition

2.3.1. LSDL

As mentioned, the use of signal decomposition methods aims towards the systematic separation of a candidate signal into component parts as a way of boosting the overall quality of a signal, where the LSDL is one that is embedded upon metaheuristic reasoning from the area of artificial intelligence [43,44,45]. The process involves using a set of defined heuristics as well as a linear basis function as part of the decomposition process to split the signal into subsections, which are assessed using an embedded cost function [43,44,45]. The application of the LSDL was initially for source separation of signals from powder mixtures, from which it was seen that the performance of the LSDL superseded that of the classical wavelet transform from both prediction accuracy and computational time perspectives when decomposing highly nonlinear and stochastic signals [43,44,45]. Subsequent application of the LSDL has seen its use as a key preprocessing tool in an array of areas within clinical medicine where physiological time series are acquired, including rehabilitation, preterm pregnancy, and psychiatric medicine; and very recently, it has also been applied to infrared spectroscopy waveforms in the early prediction of reproductive cancers in both males and females [25,46,47,48].
The LSDL comprises an in-built performance index that is able to assess the information quality of each of the decomposed regions obtained from the heuristic-based segmentation of the signal. The normalized Euclidean distance metric was utilized as the embedded performance index of choice for the LSDL, for which, given two candidate signals from two distinct classes, the normalized Euclidean distance provides a dimensionless metric that represents the distance and, therein, class separability between the two sample sets [49]. A flow diagram of this can be seen in Figure 4, which showcases the various stages involved in the LSDL’s optimal region selection phase. For a full list of the designated heuristics as part of the LSDL, as well as the threshold parameters, Nsugbe et al. [46] and Nsugbe and Sanusi [47] contain more details. n was selected as 3 for the work done in this paper.

2.3.2. Optimal Threshold Search Results

As patient-specific DoA estimation models were designed as part of this paper, a specific DoA model was designed per patient, for which there were two physiological measurement modalities as the EMG could not be decomposed with the LSDL due to the format in which the data was recorded and captured. For each set of result tables, 12 optimization simulations were carried out in order to yield six sets of threshold “goodness” metrics. This was done for the two different measurement modalities to yield 24 optimization simulations per patient, for a total of 72 across all three patients.
The LSDL optimization results for the various patients and their associated modalities can be seen in Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7, from which it can be seen that the optimal decomposition for all patients and all modalities appears to be within the 3rd iteration of the upper threshold region, which indicates that the optimal information of the signal is within the high amplitude and low-frequency region.
  • Case Study 1
Table 2. LSDL optimization results for ECG.
Table 2. LSDL optimization results for ECG.
Threshold Region1st Iteration2nd Iteration3rd Iteration
Upper Threshold
Region
2.00262.00932.2619
Lower Threshold
Region
2.00002.00022.0046
Table 3. LSDL optimization results for EEG.
Table 3. LSDL optimization results for EEG.
Threshold Region1st Iteration2nd Iteration3rd Iteration
Upper Threshold
Region
2.00922.03902.1723
Lower Threshold
Region
2.02122.00132.0066
  • Case Study 2
Table 4. LSDL optimization results for ECG.
Table 4. LSDL optimization results for ECG.
Threshold Region1st Iteration2nd Iteration3rd Iteration
Upper Threshold
Region
2.57502.71902.8217
Lower Threshold
Region
2.01112.00202.0080
Table 5. LSDL optimization results for EEG.
Table 5. LSDL optimization results for EEG.
Threshold Region1st Iteration2nd Iteration3rd Iteration
Upper Threshold
Region
2.29812.44542.4794
Lower Threshold
Region
2.03712.06542.1704
  • Case Study 3
Table 6. LSDL optimization results for ECG.
Table 6. LSDL optimization results for ECG.
Threshold Region1st Iteration2nd Iteration3rd Iteration
Upper Threshold
Region
2.78642.79012.7996
Lower Threshold
Region
2.70402.69942.7071
Table 7. LSDL optimization results for EEG.
Table 7. LSDL optimization results for EEG.
Threshold Region1st Iteration2nd Iteration3rd Iteration
Upper Threshold
Region
2.58422.59562.6032
Lower Threshold
Region
2.28222.14942.0004
The threshold parameters used to obtain the optimal threshold regions shown in Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7 were used to perform any subsequent decomposition performed, when the LSDL algorithm was utilized.
-
DWS
The DWS presents an unsupervised signal decomposition and multiresolution approach that uniquely combines characteristics from both the classical wavelet decomposition and the deep learning architectures [50,51,52]. Its features are continuous as well as robust to translations, with preset wavelets and filters as opposed to the traditional iterative learning from data [50,51,52]. One of the strengths of the DWS is its ability to work with a small set of data. Figure 5 shows a flow diagram summarizing the various sub-stages and key properties of the DWS, which include convolutions via wavelets, modulus nonlinearity, and averaging via a scaling function [50,51,52].
For a signal f ( t ) being analyzed with Ø being a low pass filter and a wavelet function of Ψ for filtering and decomposition, the wavelet family indices possess an octave frequency resolution Q k and are denoted as k , alongside multiscale high pass filter banks { Ψ j k } j k k , which are assembled via dilation of the wavelet Ψ , for which Ø J ( t ) represents a low pass filter that is able to provide a localized translation invariance of the signal f ( t ) at a defined T [50,51,52].
The DWS is typically implemented using a convolutional neural network (CNN) which, based on its architecture, first performs convolutions through classical wavelets, which are followed by a nonlinear modulus, and finally by the averaging process [50,51,52]. The convolutional process is indicated as S 0 f ( t ) = f Ø J ( t ) , for which S 0 represents the zero-order scattering coefficients, which produce locally translation-invariant features for the signal f, which initially conveys a loss in high-frequency information but is duly recouped with the wavelet modulus | W 1 | , as expressed in Equation (11) [50,51,52]:
| W 1 | f = { S 0 f ( t ) , | f Ψ j 1 ( t ) | } j 1 1
In a hierarchical tree-like fashion, the first set of scattering coefficients emanates from an averaging process of the wavelet modulus coefficient Ø J ( t ) , as seen in Equation (12):
S 1 f ( t ) = { | f Ψ j 1 ( t ) | Ø J ( t ) } j 1 1
Information is lost due to the effect of the averaging process, but this can be subsequently recovered via the use of a wavelet modulus as part of the information recovery process, as expressed in Equation (13):
| W 2 | | f Ψ j 1 | = { S 1 f ( t ) , | | f Ψ j 1 | Ψ j 2 ( t ) | } j 2 2
This is succeeded by the set of second-order coefficients, which can be defined as S 2 f ( t ) = { | | f Ψ j 1 | Ψ j 2 | Ø J ( t ) } j 1 1 i = 1 , 2 , and continuously iterated via the described process to yield Equation (14):
U m f ( t ) = { | | f Ψ j 1 | | Ψ j m | } j 1 1 , i = 1 , 2 , m .
for which U m is an mth-order modulus. The mth order scattering coefficients for U m f ( t ) with Ø J can be obtained via Equation (15):
S m f ( t ) = { | | f Ψ j 1 | | Ψ j m | Ø J ( t ) } j 1 1 , i = 1 , 2 , m
The described approach is applied in an iterative fashion to obtain a final scatter matrix S f ( t ) = { S m f ( t ) } 0 m l , which represents a concatenation of all the prior scatter coefficients, as a means of characterizing the input signal, for which l indicates the maximum decomposition level [50,51,52].
A tree-like visualization of the scattering decomposition network can be seen in Figure 6.
Since the DWS is comprised of the dual characteristics of the classical wavelets and the CNN, it is robust to translation invariance as well as stable to local deformations. The main difference between the DWS and the CNN is that the final output of the DWS is a combination of the output from a multitude of preceding layers and not primarily from an output layer [50,51,52,53].
The physics of the energy dissipation within the DWS networks suggest that the energy attenuates with an increasing number of layers, which is why only the first two layers within the network are utilized for modeling purposes and have thus been adopted in this work [50,51,52,53]. The remainder of the configuration parameters used as part of the DWS include the Gabor wavelet as the mother wavelet and basis function, an invariance scale of 1 s, and 8 wavelets per octave in the primary filter bank, followed by 1 wavelet per octave in the second filter bank [54].

2.3.3. Feature Extraction

Several features were extracted from the EMG, ECG, and EEG signals, comprising a unique ensemble and concatenation of linear and nonlinear features, which can contribute towards the effective modeling of the various physiological signals [31]. The list of features is as follows: mean absolute value (MAV), waveform length (WL), zero-crossing (ZC), slope sign change (SSC), root mean square (RMS), fourth-order autoregression (AR), sample entropy (SampEN), cepstrum (Ceps), maximum fractal length (MFL), median frequency (MedFrq), peak frequency (PeakFrq), number of peaks (NP), simple squared integral (SSI), and variance (VAR) [31]. A threshold of 1 µv was used for all features that required a threshold, and the values of 2 and 0.2 were used for the m and r variables for the sample entropy feature [31].

2.3.4. Machine Learning Models

Decision Tree (DT): refers to a class of gray-box models that hinge upon Boolean logic as a systemic means towards class separation for the various sample sets in a hierarchical flow fashion [55]. Due to the configuration of the model, its decision-making process carries a degree of interpretability [55].
Logistic Regression (LR): is a form of statistically underpinned parametric model that outputs class scores in the range of 0–1 and then interpolates based on a threshold score to find out which class the sample set belongs to [56]. A sigmoidal function is used for interpolation purposes and provides an extra layer of robustness when dealing with outliers [56].
K-Nearest Neighbor (KNN): is a nonparametric classification model that utilizes a combination of majority voting and nearest neighbor criteria for the assignment of samples into various classes. In this work, the value for k was selected to be 1 in the interest of computational efficiency, while the Euclidean distance was adopted as the distance metric of choice [57].
Support Vector Machine (SVM): is based around the assumption that higher dimensions are better suited for the class separation of data classes that are within proximity of each other in a lower-dimensional space [58]. Class boundaries are placed in an iterative fashion in a higher-dimensional subspace, and their boundary coordinates are retained during the downscaling of the data projection, a feat regarded as a “kernel trick” [58]. The model is viewed as computationally intensive due to the requirement for coordinate transformation and iterative fitting of class boundaries [58]. In this work, a number of variants of the decision boundaries were used to separate the various classes, namely, linear SVM-LSVM, quadratic SVM-QSVM, cubic SVM-CSVM, and fine Gaussian SVM-FGSVM.
All models were built and validations were done using the MATLAB Classification Learner Application, which tunes models for the most appropriate hyperparameters given the pattern recognition task. All models were validated using the K-fold cross validation method, with K chosen as 10.

3. Results and Discussion

3.1. Case Study 1

Classification Problem: BIS over 40 and under 40

Table 8 shows the results for the raw signal modeling for Case Study 1.
For the modeling with the raw signal, it can be seen for the EMG that the results are fairly mediocre, with the best prediction accuracy being the QSVM model at 52.6%, while the same trend appears to happen for the ECG, with the DT and QSVM providing the best prediction performance. The EEG produced the best accuracy in this scenario, with an accuracy metric of 76.3% for the LSVM, which was also seen to supersede the fusion of the features from the three various modalities. Table 9 shows the results for the LSDL.
For the LSDL, the ECG provided a best accuracy of 89.5%, while the EEG provided 86.8%, both for the logistic regression model, which further showcases the compatibility between the LSDL and the logistic regression, as has been noted in a previous study [47]. However, it needs to be noted that although the ECG provided the best overall accuracy, the mean of all classifiers across the board for the EEG was seen to be greater than the ECG. Nevertheless, the combination of the LSDL for preprocessing and the logistic regression model showed that the ECG produces an exceedingly better performance when compared with that of the EEG. The results for the DWS can be seen in Table 10.
The DWS provided better prediction accuracy for the EMG signal, as a best performance metric of 67.8% was recorded. Strong metrics were seen to be provided for both the ECG and EEG modalities at 90.8% and 98.0%, respectively, albeit for different models, i.e., CSVM and KNN. This provides strong evidence for the decomposition power of the DWS when encountering physiological signals of this kind.
Table 11 shows a summary of the best prediction accuracies across the various preprocessing approaches and classification models. It can be seen that for this patient and under the observed conditions, the DWS was the optimal preprocessing mechanism, where the ECG was within 10% of the EEG, which further shows its appeal and potential as a low-cost anesthesia monitor, and also that the optimal classification model varied based on the modality being used for the patient.

3.2. Case Study 2

Classification Problem: BIS over 20 and under 20

The results for the raw signal can be seen in Table 12, from which it can be noted that the EMG prediction accuracy is a high value of 84.2% for the DT and is believed to be largely due to the simplified classification problem relative to the prior patient’s case study. Both the ECG and the EEG produced accuracies of 81.6% for the QSVM and DT, respectively, while the case of the modality feature fusion produced an accuracy of 84.2% for the DT model.
The results for the LSDL (Table 13) showed comparable metrics between the ECG and EEG, with the ECG delivering a slightly higher accuracy at 81.6% when compared with the EEG at 78.9%. The optimal classification model for both the ECG and EEG was the LR model.
The results for the DWS can be seen in Table 14, showing a strong set of prediction metrics across all three modalities and once again indicating the compatibility of the DWS with these kinds of physiological datasets. The kernel-based SVM was the best-performing classification model, with prediction accuracies of 89.8%, 99.7%, and 98.5% for the EMG, ECG, and EEG modalities, respectively.
Table 15 shows a summary of the best prediction accuracies across the various preprocessing approaches and classification models. It can be seen that the DWS was the best-performing preprocessing algorithm, which produced a 90%+ classification accuracy performance across all three measurement modalities, alongside the kernel-based SVM model. Once again, it can be noted that the ECG performance is within range of the EEG, thus showing its potential for being utilized to greater effect in this area, while the EMG also showed a high prediction performance for the DWS at this level of a deep sedation state.

3.3. Case Study 3

Classification Problem: BIS over 40 and under 40

The results for this case study show a generally low value for the EMG prediction accuracy, with the best performance being 55.3% for the case of the LSVM, as seen in Table 16. The ECG and the EEG both provided a prediction accuracy of 71.1% for the DT and CSVM, respectively, while the fusion of all modalities provided an accuracy of 76.3%.
Table 17 shows that the LSDL provided a strong set of classification metrics for both the ECG and EEG for this case study, with the best prediction accuracy being in the region of 94.7% for the ECG, while the EEG produced a perfect 100% prediction accuracy across a select number of classifiers. Due to the number of uncertainties about the dataset itself, it remains unclear if the high metrics attained in this case were due to the patient demographic, the kind of surgery, or anesthetic constituents.
The DWS provided a best prediction accuracy of 64% for the EMG modality alongside the DT model (see Table 18), while the ECG and EEG provided accuracies of 89.1% and 98.5%, respectively, showcasing high prediction accuracies for both modalities alongside the LSDL.
Table 19 shows the best performances across all three modalities for Case Study 3, where it can be seen that the LSDL alongside the LR models prevail as the optimal means of signal preprocessing and classification. In the case of the EMG, the best performance was seen to be a combination of the DWS and DT.
As mentioned, despite the classification problem for this case study being similar to Case Study 1, the optimal processing and classification models appear to be different, as do the accuracies. It remains unclear as to whether this is due to the anesthetic agents used or the kind of surgery conducted, as it is difficult to comment in this direction due to the uncertainties in the description of the dataset itself.
The results in the various case studies provide a degree of numerical and statistical evidence that the ECG signals—when preprocessed and modeled with an effective processing mechanism—can indeed provide a comparable accuracy to that of the traditionally favored EEG monitor. However, it needs to be said that this inference has been made from a relatively concise sample set, which contains uncertainties and sparse information around the nature of the anesthetic dosing, the surgery, and the patients themselves. Nevertheless, using a data-driven lens to interpret the results, the accuracy figures for the ECG and EEG are proven to be comparable. While the EMG occasionally provides strong figures, it is simply not sufficient to work as a primary DoA monitor given the requirements for things in this setting.
Further insights from the results obtained suggest that, given the confines of the investigations carried out, there does not appear to be a “one size fit all” in terms of the best decomposition method and best model to utilize for the processing of the physiological signals; thus, the proposition made involves a 2-phase, multi-stage process for an affordable DoA monitoring platform. This is envisaged to have a strong level of appeal to developing countries and under-resourced areas.
The modeling flow can be seen in Figure 7, for which, given a patient-specific anesthetic DoA monitor, the first stage involves physiological data acquisition using an affordable modality in the form of an ECG monitor, followed by the application of various signal decomposition methods as part of the selection process, feature extraction, and then an iterative sorting using the candidate machine learning models. Once this is complete, the candidate modeling methods can then be deployed to perform a real-time DoA estimation using the candidate decomposition and machine learning models, as indicated in the second phase of the flow diagram.

4. Conclusions and Future Work

Sufficient depth of anesthesia is crucial for most surgical processes, where it is of paramount importance to be able to infer and estimate the degree of patient sedation. This is typically done with the use of physiological monitors, the EEG in particular, which monitors brainwaves from the frontal cortex as part of the BIS model. Despite its wide array of uses, EEG has limitations with regards to the ability to closely monitor sedation extent, for example, when certain drugs are used, in addition to being expensive and complex to decode. This has raised the appeal and potential of other modalities, such as EMG and ECG, as surrogate and affordable means towards DoA monitoring. Thus, this work utilized a physiological database from the University of Queensland [27], alongside an array of signal decomposition algorithms and machine learning models, to observe the extent to which other modalities compare to the likes of EEG when tasked with DoA monitoring. Around 200 simulations were run across three different patient datasets, alongside a comparative use of various signal decomposition methods and machine learning models.
Different classification challenges were constructed for each of the three patients, while the various associated physiological datasets were postprocessed. It was seen across the three patients that a singular “one size fits all” algorithmic signal processing pipeline does not exist for the postprocessing of the signals due to the varied physiological make-up of each patient, the anesthetic agent, and surgery procedures. Nevertheless, it was seen on multiple occasions that the ECG data was on par and within range of the EEG, providing further evidence as to how this modality could serve as a potential physiological monitor for DoA purposes in resource-constrained settings. The modeling and postprocessing pipeline for the dataset itself includes a selection stage due to the lack of a single optimal method across all patients.
Due to the dataset itself, there continue to be a number of areas of uncertainty; despite the results presented, there is no detailed information on the patient demographics, the surgical process itself, or the anesthetic agents used. All of this, plus a concise sample set, leads the authors to believe that more work would need to be done to validate the findings from the paper, with emphasis on the areas highlighted.

Author Contributions

All authors contributed equally to the article. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data is available from a cited repository within the manuscript.

Acknowledgments

The authors would like to thank Brian Kerr for proofreading the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alwardt, C.M.; Redford, D.; Larson, D.F. General Anesthesia in Cardiac Surgery: A Review of Drugs and Practices. J. Extra Corpor. Technol. 2005, 37, 227–235. [Google Scholar]
  2. Lan, J.-Y.; Abbod, M.F.; Yeh, R.-G.; Fan, S.-Z.; Shieh, J.-S. Review: Intelligent Modeling and Control in Anesthesia. J. Med. Biol. Eng. 2012, 32, 293–307. [Google Scholar]
  3. Gruenewald, M.; Ilies, C.; Herz, J.; Schoenherr, T.; Fudickar, A.; Höcker, J.; Bein, B. Influence of Nociceptive Stimulation on Analgesia Nociception Index (ANI) during Propofol-Remifentanil Anaesthesia. Br. J. Anaesth. 2013, 110, 1024–1030. [Google Scholar] [CrossRef]
  4. Roy Chowdhury, M.; Madanu, R.; Abbod, M.F.; Fan, S.-Z.; Shieh, J.-S. Deep Learning via ECG and PPG Signals for Prediction of Depth of Anesthesia. Biomed. Signal Process. Control 2021, 68, 102663. [Google Scholar] [CrossRef]
  5. Kissin, I. Depth of Anesthesia and Bispectral Index Monitoring. Anesth. Analg. 2000, 90, 1114–1117. [Google Scholar] [CrossRef] [PubMed]
  6. Medical Advisory Secretariat. Bispectral Index Monitor: An Evidence-Based Analysis. Ont. Health Technol. Assess. Ser. 2004, 4, 1–70. [Google Scholar]
  7. Liu, Q.; Chen, Y.-F.; Fan, S.-Z.; Abbod, M.F.; Shieh, J.-S. Quasi-Periodicities Detection Using Phase-Rectified Signal Averaging in EEG Signals as a Depth of Anesthesia Monitor. IEEE Trans. Neural Syst. Rehabil. Eng. 2017, 25, 1773–1784. [Google Scholar] [CrossRef]
  8. Liu, Q.; Ma, L.; Chiu, R.-C.; Fan, S.-Z.; Abbod, M.F.; Shieh, J.-S. HRV-Derived Data Similarity and Distribution Index Based on Ensemble Neural Network for Measuring Depth of Anaesthesia. PeerJ 2017, 5, e4067. [Google Scholar] [CrossRef]
  9. Nsugbe, E.; Connelly, S. Multiscale Depth of Anaesthesia Prediction for Surgery Using Frontal Cortex Electroencephalography. Healthc. Technol. Lett. 2022, 9, 43–53. [Google Scholar] [CrossRef] [PubMed]
  10. Nsugbe, E.; Connelly, S. A Pilot on Intelligence Fusion for Anesthesia Depth Prediction during Surgery Using Frontal Cortex Neural Oscillations. Biomed. Eng. Adv. 2022, 4, 100051. [Google Scholar] [CrossRef]
  11. Ponde, V. Recent Trends in Paediatric Regional Anaesthesia. Indian J. Anaesth. 2019, 63, 746–753. [Google Scholar] [CrossRef]
  12. Tyers, M.R.; Russell, W.J.; Runciman, W.B. Electrocardiographic Monitoring in Anaesthesia. Anaesth. Intensive Care 1988, 16, 66–69. [Google Scholar] [CrossRef]
  13. Iohom, G. Basic Patient Monitoring during Anesthesia—UpToDate. Available online: https://www.uptodate.com/contents/basic-patient-monitoring-during-anesthesia (accessed on 10 June 2023).
  14. Zhou, Z.-H. Machine Learning; Springer Nature: Berlin, Germany, 2021; ISBN 9789811519673. [Google Scholar]
  15. Adarsh, S.L.; Venugopal Syam, K.; Philip, L.M.; Martin John, K.D.; Dileepkumar, K.M.; Krathiayini, K.; Ajithkumar, S.; Devanand, C.B. Electrocardiographic Evaluation of Balanced General Anaesthesia in Adult Domestic Cats (Felis Catus). Indian J. Canine Pract. 2022, 14, 22–24. [Google Scholar] [CrossRef]
  16. Jo, Y.-Y.; Jang, J.-H.; Kwon, J.; Lee, H.-C.; Jung, C.-W.; Byun, S.; Jeong, H.-G. Predicting Intraoperative Hypotension Using Deep Learning with Waveforms of Arterial Blood Pressure, Electroencephalogram, and Electrocardiogram: Retrospective Study. PLoS ONE 2022, 17, e0272055. [Google Scholar] [CrossRef] [PubMed]
  17. Obert, D.P.; Schweizer, C.; Zinn, S.; Kratzer, S.; Hight, D.; Sleigh, J.; Schneider, G.; García, P.S.; Kreuzer, M. The Influence of Age on EEG-Based Anaesthesia Indices. J. Clin. Anesth. 2021, 73, 110325. [Google Scholar] [CrossRef]
  18. Zhan, J.; Wu, Z.; Duan, Z.; Yang, G.; Du, Z.; Bao, X.; Li, H. Heart Rate Variability-Derived Features Based on Deep Neural Network for Distinguishing Different Anaesthesia States. BMC Anesthesiol. 2021, 21, 66. [Google Scholar] [CrossRef] [PubMed]
  19. Orfanidis, S.J. Introduction to Signal Processing; Prentice Hall Signal Processing Series; Prentice Hall: Englewood Cliffs, NJ, USA, 1996; ISBN 978-0-13-209172-5. [Google Scholar]
  20. Mallat, S.G. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 11, 674–693. [Google Scholar] [CrossRef]
  21. Stallone, A.; Cicone, A.; Materassi, M. New Insights and Best Practices for the Successful Use of Empirical Mode Decomposition, Iterative Filtering and Derived Algorithms. Sci. Rep. 2020, 10, 15161. [Google Scholar] [CrossRef] [PubMed]
  22. Soltani, S. On the Use of the Wavelet Decomposition for Time Series Prediction. Neurocomputing 2002, 48, 267–277. [Google Scholar] [CrossRef]
  23. Kavakiotis, I.; Tsave, O.; Salifoglou, A.; Maglaveras, N.; Vlahavas, I.; Chouvarda, I. Machine Learning and Data Mining Methods in Diabetes Research. Comput. Struct. Biotechnol. J. 2017, 15, 104–116. [Google Scholar] [CrossRef]
  24. Nsugbe, E. Enhanced Recognition of Adolescents with Schizophrenia and a Computational Contrast of Their Neuroanatomy with Healthy Patients Using Brainwave Signals. Appl. AI Lett. 2023, 4, e79. [Google Scholar] [CrossRef]
  25. Nsugbe, E.; Ser, H.-L.; Ong, H.-F.; Ming, L.C.; Goh, K.-W.; Goh, B.-H.; Lee, W.-L. On an Affordable Approach towards the Diagnosis and Care for Prostate Cancer Patients Using Urine, FTIR and Prediction Machines. Diagnostics 2022, 12, 2099. [Google Scholar] [CrossRef] [PubMed]
  26. Nsugbe, E. Towards the Use of Cybernetics for an Enhanced Cervical Cancer Care Strategy. Intell. Med. 2022, 2, 117–126. [Google Scholar] [CrossRef]
  27. Liu, D.; Görges, M.; Jenkins, S.A. University of Queensland Vital Signs Dataset: Development of an Accessible Repository of Anesthesia Patient Monitoring Data for Research. Anesth. Analg. 2012, 114, 584–589. [Google Scholar] [CrossRef]
  28. Rodriguez-Falces, J.; Navallas, J.; Malanda, A. Computational Intelligence in Electromyography Analysis—A Perspective on Current Applications and Future Challenges. In Computational Intelligence in Electromyography Analysis—A Perspective on Current Applications and Future Challenges; Naik, G.R., Ed.; IntechOpen: London, UK, 2012; ISBN 978-953-51-0805-4. [Google Scholar]
  29. Cram, J.R.; Kasman, G.S.; Holtz, J. Introduction to Surface Electromyography; Aspen Publishers: Gaithersburg, MD, USA, 1998; ISBN 978-0-8342-0751-6. [Google Scholar]
  30. Petersen, E.; Rostalski, P. A Comprehensive Mathematical Model of Motor Unit Pool Organization, Surface Electromyography, and Force Generation. Front. Physiol. 2019, 10, 176. [Google Scholar] [CrossRef]
  31. Nsugbe, E.; Samuel, O.W.; Asogbon, M.G.; Li, G. Phantom Motion Intent Decoding for Transhumeral Prosthesis Control with Fused Neuromuscular and Brain Wave Signals. IET Cyber-Syst. Robot. 2021, 3, 77–88. [Google Scholar] [CrossRef]
  32. Darbas, M.; Lohrengel, S. Review on Mathematical Modelling of Electroencephalography (EEG). Jahresber. Dtsch. Math. Ver. 2019, 121, 3–39. [Google Scholar] [CrossRef]
  33. Doschoris, M.; Kariotou, F. Mathematical Foundation of Electroencephalography. In Electroencephalography; Sittiprapaporn, P., Ed.; IntechOpen: London, UK, 2017; ISBN 978-953-51-3638-5. [Google Scholar]
  34. Boulakia, M.; Cazeau, S.; Fernández, M.A.; Gerbeau, J.-F.; Zemzemi, N. Mathematical Modeling of Electrocardiograms: A Numerical Study. Ann. Biomed. Eng. 2010, 38, 1071–1097. [Google Scholar] [CrossRef]
  35. Pullan, A.J.; Cheng, L.K.; Buist, M.L. Mathematically Modelling the Electrical Activity of the Heart: From Cell to Body Surface and Back Again; World Scientific: Hackensack, NJ, USA, 2005; ISBN 978-981-256-373-6. [Google Scholar]
  36. Sundnes, J.; Lines, G.; Cai, X.; Nielsen, B.F.; Mardal, K.-A.; Tveito, A. Computing the Electrical Activity in the Heart; Springer: Berlin/Heidelberg, Germany, 2006; ISBN 978-3-540-33432-3. [Google Scholar]
  37. Tung, L. A Bi-Domain Model for Describing Ischemic Myocardial d-c Potentials. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 1978. [Google Scholar]
  38. De Luca, C.J.; Adam, A.; Wotiz, R.; Gilmore, L.D.; Nawab, S.H. Decomposition of Surface EMG Signals. J. Neurophysiol. 2006, 96, 1646–1657. [Google Scholar] [CrossRef]
  39. Sarazan, R.D. The QT Interval of the Electrocardiogram. In Encyclopedia of Toxicology, 3rd ed.; Wexler, P., Ed.; Academic Press: Oxford, UK, 2014; pp. 10–15. ISBN 978-0-12-386455-0. [Google Scholar]
  40. Eeg and Brainwaves. Bright Brain—London’s Eeg, Neurofeedback and Brain Stimulation Centre. Available online: https://www.brightbraincentre.co.uk/electroencephalogram-eeg-brainwaves/ (accessed on 29 August 2023).
  41. Durak, L.; Arikan, O. Short-Time Fourier Transform: Two Fundamental Properties and an Optimal Implementation. IEEE Trans. Signal Process. 2003, 51, 1231–1242. [Google Scholar] [CrossRef]
  42. Klimesch, W. The Frequency Architecture of Brain and Brain Body Oscillations: An Analysis. Eur. J. Neurosci. 2018, 48, 2431–2453. [Google Scholar] [CrossRef] [PubMed]
  43. Nsugbe, E.; Starr, A.; Ruiz-Carcel, C. Monitoring the Particle Size Distribution of a Powder Mixing Process with Acoustic Emissions: A Review. Eng. Technol. Ref. 2016, 1–12. [Google Scholar] [CrossRef]
  44. Nsugbe, E. Particle Size Distribution Estimation of a Powder Agglomeration Process Using Acoustic Emissions. Ph.D. Thesis, Cranfield University, Cranfield, UK, 2017. [Google Scholar]
  45. Nsugbe, E.; Starr, A.; Jennions, I.; Ruiz-Carcel, C. Estimation of Online Particle Size Distribution of a Particle Mixture in Free Fall with Acoustic Emission. Part. Sci. Technol. 2019, 37, 953–963. [Google Scholar] [CrossRef]
  46. Nsugbe, E.; Williams Samuel, O.; Asogbon, M.G.; Li, G. Contrast of Multi-Resolution Analysis Approach to Transhumeral Phantom Motion Decoding. CAAI Trans. Intell. Technol. 2021, 6, 360–375. [Google Scholar] [CrossRef]
  47. Nsugbe, E.; Sanusi, I. Towards an Affordable Magnetomyography Instrumentation and Low Model Complexity Approach for Labour Imminency Prediction Using a Novel Multiresolution Analysis. Appl. AI Lett. 2021, 2, e34. [Google Scholar] [CrossRef]
  48. Nsugbe, E. On the Use of Spectroscopy, Prediction Machines and Cybernetics for an Affordable and Proactive Care Approach for Endometrial Cancer. Biomed. Eng. Adv. 2022, 4, 100057. [Google Scholar] [CrossRef]
  49. Gower, J.C. Properties of Euclidean and Non-Euclidean Distance Matrices. Linear Algebra Its Appl. 1985, 67, 81–97. [Google Scholar] [CrossRef]
  50. Wavelet Scattering. Available online: https://uk.mathworks.com/help/wavelet/ug/wavelet-scattering.html (accessed on 29 August 2023).
  51. Mallat, S. Group Invariant Scattering. Commun. Pure Appl. Math. 2012, 65, 1331–1398. [Google Scholar] [CrossRef]
  52. Bruna, J.; Mallat, S. Invariant Scattering Convolution Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1872–1886. [Google Scholar] [CrossRef]
  53. Liu, Z.; Yao, G.; Zhang, Q.; Zhang, J.; Zeng, X. Wavelet Scattering Transform for ECG Beat Classification. Comput. Math. Methods Med. 2020, 2020, e3215681. [Google Scholar] [CrossRef] [PubMed]
  54. Nsugbe, E. On the Application of Metaheuristics and Deep Wavelet Scattering Decompositions for the Prediction of Adolescent Psychosis Using EEG Brain Wave Signals. Digit. Technol. Res. Appl. 2022, 1, 9–24. [Google Scholar] [CrossRef]
  55. Charbuty, B.; Abdulazeez, A. Classification Based on Decision Tree Algorithm for Machine Learning. J. Appl. Sci. Technol. Trends 2021, 2, 20–28. [Google Scholar] [CrossRef]
  56. LaValley, M.P. Logistic Regression. Circulation 2008, 117, 2395–2399. [Google Scholar] [CrossRef] [PubMed]
  57. Kramer, O. K-Nearest Neighbors. In Dimensionality Reduction with Unsupervised Nearest Neighbors; Intelligent Systems Reference Library; Springer: Berlin/Heidelberg, Germany, 2013; Volume 51, pp. 13–23. ISBN 978-3-642-38651-0. [Google Scholar]
  58. Noble, W.S. What Is a Support Vector Machine? Nat. Biotechnol. 2006, 24, 1565–1567. [Google Scholar] [CrossRef]
Figure 1. A hierarchical flow diagram of the various analytics done as part of the work presented in this paper for a sample patient.
Figure 1. A hierarchical flow diagram of the various analytics done as part of the work presented in this paper for a sample patient.
Biomedinformatics 03 00049 g001
Figure 2. Summary of the various physiological organs, their basic operating principles, and physiological sensing modality, alongside the associated pros and cons, and a sample time series segment from a patient under anesthesia [38,39,40].
Figure 2. Summary of the various physiological organs, their basic operating principles, and physiological sensing modality, alongside the associated pros and cons, and a sample time series segment from a patient under anesthesia [38,39,40].
Biomedinformatics 03 00049 g002
Figure 3. Waveforms for a candidate patient across the various BIS values, i.e., over and under. Left side: the various time series projections of the physiological signals. Right side: the time-frequency view of the signals via the short-time Fourier transform (STFT) spectrogram. Top two rows: ECG signals. Middle two rows: EMG signals. Bottom two rows: EEG signals.
Figure 3. Waveforms for a candidate patient across the various BIS values, i.e., over and under. Left side: the various time series projections of the physiological signals. Right side: the time-frequency view of the signals via the short-time Fourier transform (STFT) spectrogram. Top two rows: ECG signals. Middle two rows: EMG signals. Bottom two rows: EEG signals.
Biomedinformatics 03 00049 g003
Figure 4. The various stages involved in the LSDL’s optimal region selection phase.
Figure 4. The various stages involved in the LSDL’s optimal region selection phase.
Biomedinformatics 03 00049 g004
Figure 5. Stages of the deep wavelet scattering (DWS) method [52].
Figure 5. Stages of the deep wavelet scattering (DWS) method [52].
Biomedinformatics 03 00049 g005
Figure 6. A tree-like visualization of the scattering decomposition network.
Figure 6. A tree-like visualization of the scattering decomposition network.
Biomedinformatics 03 00049 g006
Figure 7. 2-phase process for an affordable DoA monitoring platform.
Figure 7. 2-phase process for an affordable DoA monitoring platform.
Biomedinformatics 03 00049 g007
Table 1. Summary of classification case studies.
Table 1. Summary of classification case studies.
Case StudySignal WindowingClassification ExerciseTotal Time Utilized (Minutes)
Case 19000 samples × 19
(for all modalities)
BIS Over 40 and Under 4028.5
Case 26000 samples × 19
(for all modalities)
BIS Over 20 and Under 2019
Case 39000 samples × 19
(for all modalities)
BIS Over 40 and Under 4028.5
Table 8. Results for modeling with the raw signal for BIS over 40 and under 40.
Table 8. Results for modeling with the raw signal for BIS over 40 and under 40.
Machine Learning ModelRaw EMG
Accuracy (%)
Raw ECG
Accuracy (%)
Raw EEG
Accuracy (%)
Raw EMG-ECG-EEG Accuracy (%)
Decision Tree44.752.665.831.6
Logistic Regression42.147.444.734.2
K-Nearest Neighbor47.452.647.447.4
Linear Support Vector Machine34.236.876.357.9
Quadratic Support Vector Machine52.652.650.057.9
Cubic Support Vector Machine44.734.265.860.5
Fine Gaussian Support Vector Machine42.150.057.950.0
Table 9. Results for modeling with the LSDL for BIS over 40 and under 40.
Table 9. Results for modeling with the LSDL for BIS over 40 and under 40.
Machine Learning ModelLSDL ECG Accuracy (%)LSDL EEG Accuracy (%)
Decision Tree55.360.5
Logistic Regression89.586.8
K-Nearest Neighbor47.476.3
Linear Support Vector Machine44.773.7
Quadratic Support Vector Machine44.778.9
Cubic Support Vector Machine47.476.3
Fine Gaussian Support Vector Machine26.365.8
Table 10. Results for modeling with the DWS for BIS over 40 and under 40.
Table 10. Results for modeling with the DWS for BIS over 40 and under 40.
Machine Learning ModelDWS EMG
Accuracy (%)
DWS ECG
Accuracy (%)
DWS EEG
Accuracy (%)
Decision Tree67.874.966.9
Logistic Regression61.276.270.5
K-Nearest Neighbor66.388.698.0
Linear Support Vector Machine58.974.269.1
Quadratic Support Vector Machine53.388.587.4
Cubic Support Vector
Machine
51.890.894.7
Fine Gaussian Support Vector Machine59.584.285.8
Table 11. Summary of the best prediction accuracies across the various preprocessing approaches and classification models for Case Study 1.
Table 11. Summary of the best prediction accuracies across the various preprocessing approaches and classification models for Case Study 1.
Best ModalityBest Prediction Accuracy (%)
EMGDWS DT: 67.8
ECGDWS CSVM: 90.8
EEGDWS KNN: 98.0
Table 12. Results for modeling with the raw signal for BIS over 20 and under 20.
Table 12. Results for modeling with the raw signal for BIS over 20 and under 20.
Machine Learning ModelRaw EMG
Accuracy (%)
Raw ECG
Accuracy (%)
Raw EEG
Accuracy (%)
Raw EMG-ECG-EEG Accuracy (%)
Decision Tree84.271.181.684.2
Logistic Regression50.071.173.763.2
K-Nearest Neighbor52.647.447.447.4
Linear Support Vector Machine50.076.373.778.9
Quadratic Support Vector Machine50.081.678.978.9
Cubic Support Vector Machine50.078.981.678.9
Fine Gaussian Support Vector Machine50.073.771.147.4
Table 13. Results for modeling with the LSDL for BIS over 20 and under 20.
Table 13. Results for modeling with the LSDL for BIS over 20 and under 20.
Machine Learning ModelLSDL ECG Accuracy (%)LSDL EEG Accuracy (%)
Decision Tree44.752.6
Logistic Regression81.678.9
K-Nearest Neighbor65.839.5
Linear Support Vector Machine47.452.6
Quadratic Support Vector Machine63.255.3
Cubic Support Vector Machine65.865.8
Fine Gaussian Support Vector Machine50.044.7
Table 14. Results for modeling with the DWS for BIS over 20 and under 20.
Table 14. Results for modeling with the DWS for BIS over 20 and under 20.
Machine Learning ModelDWS EMG
Accuracy (%)
DWS ECG
Accuracy (%)
DWS EEG
Accuracy (%)
Decision Tree85.796.590.6
Logistic Regression73.798.893.0
K-Nearest Neighbor87.498.898.0
Linear Support Vector Machine89.895.993.6
Quadratic Support Vector
Machine
88.699.497.7
Cubic Support Vector Machine88.699.798.5
Fine Gaussian Support Vector Machine86.895.984.8
Table 15. Summary of the best prediction accuracies across the various preprocessing approaches and classification models for Case Study 2.
Table 15. Summary of the best prediction accuracies across the various preprocessing approaches and classification models for Case Study 2.
Best ModalityBest Prediction Accuracy (%)
Best EMGDWS LSVM: 89.8
Best ECGDWS CSVM: 99.7
Best EEGDWS CSVM: 98.5
Table 16. Results for modeling with the raw signal for BIS over 40 and under 40.
Table 16. Results for modeling with the raw signal for BIS over 40 and under 40.
Machine Learning ModelRaw EMG
Accuracy (%)
Raw ECG
Accuracy (%)
Raw EEG
Accuracy (%)
Raw EMG-ECG-EEG Accuracy (%)
Decision Tree42.171.160.552.6
Logistic Regression42.152.660.555.3
K-Nearest Neighbor42.160.563.276.3
Linear Support Vector Machine55.342.157.950.0
Quadratic Support Vector Machine47.447.468.471.1
Cubic Support Vector
Machine
47.450.071.176.3
Fine Gaussian Support Vector Machine52.652.665.863.2
Table 17. Results for modeling with the LSDL for BIS over 40 and under 40.
Table 17. Results for modeling with the LSDL for BIS over 40 and under 40.
Machine Learning ModelLSDL ECG Accuracy (%)LSDL EEG Accuracy (%)
Decision Tree94.797.4
Logistic Regression94.7100
K-Nearest Neighbor89.5100
Linear Support Vector Machine94.7100
Quadratic Support Vector Machine94.7100
Cubic Support Vector Machine92.1100
Fine Gaussian Support Vector Machine84.292.1
Table 18. Results for modeling with the DWS for BIS over 40 and under 40.
Table 18. Results for modeling with the DWS for BIS over 40 and under 40.
Machine Learning ModelDWS EMG
Accuracy (%)
DWS ECG
Accuracy (%)
DWS EEG
Accuracy (%)
Decision Tree64.072.368.0
Logistic Regression57.175.771.7
K-Nearest Neighbor63.489.198.5
Linear Support Vector Machine56.876.569.8
Quadratic Support Vector
Machine
51.287.987.6
Cubic Support Vector Machine50.450.295.1
Fine Gaussian Support Vector Machine59.977.989.5
Table 19. Summary of the best prediction accuracies across the various preprocessing approaches and classification models for Case Study 3.
Table 19. Summary of the best prediction accuracies across the various preprocessing approaches and classification models for Case Study 3.
Best ModalityBest Prediction Accuracy (%)
Best EMGDWS DT: 64.0
Best ECGLSDL Logistic Regression: 94.7
Best EEGLSDL Logistic Regression: 100
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nsugbe, E.; Connelly, S.; Mutanga, I. Towards an Affordable Means of Surgical Depth of Anesthesia Monitoring: An EMG-ECG-EEG Case Study. BioMedInformatics 2023, 3, 769-790. https://doi.org/10.3390/biomedinformatics3030049

AMA Style

Nsugbe E, Connelly S, Mutanga I. Towards an Affordable Means of Surgical Depth of Anesthesia Monitoring: An EMG-ECG-EEG Case Study. BioMedInformatics. 2023; 3(3):769-790. https://doi.org/10.3390/biomedinformatics3030049

Chicago/Turabian Style

Nsugbe, Ejay, Stephanie Connelly, and Ian Mutanga. 2023. "Towards an Affordable Means of Surgical Depth of Anesthesia Monitoring: An EMG-ECG-EEG Case Study" BioMedInformatics 3, no. 3: 769-790. https://doi.org/10.3390/biomedinformatics3030049

Article Metrics

Back to TopTop