1. Introduction
Cochlear implant (CI) surgery is becoming a more viable surgical option for children and adults with substantial sensorineural hearing loss and who have not benefited from hearing aids [
1]. The CI system is composed of several parts (
Figure 1), such as the external audio processor, microphone, external coil, internal implant, radio frequency (RF) internal coil, and the electrode array that is supposed to be inserted into the scala tympani of the cochlea. The magnetic attraction between the implant magnet and the audio processor coil magnet is used to secure the audio processor coil in place on top of the implant coil. The RF receiver coil also provides the implant with electric power that is emitted from the audio processor coil [
2].
The function of the CI microphone is similar to that of the human pinna—it detects and captures the audible sound signals from the area around the patient. The audio processor subsequently uses signal processing algorithms to convert the sound signal into detailed digital signals and transmit them to the internal part (implantable electronics) via the inductive link between the external and internal coil. These digital signals are then converted into electric stimuli/impulses by the electronic system of the internal implant. Thereafter, the electrode array delivers these stimuli to the scala tympani of the cochlea and then to the auditory nerve [
3]. The auditory/cochlear nerve should be present in CI candidates as it helps to restore their hearing abilities properly [
4]. This nerve acts as a conductor, carrying electrical stimulation generated and decoded by the CI system to the brainstem and then on to the auditory cortex.
CI surgery is typically indicated for pre-lingual and post-lingual deafness. Recently, it has also been approved for patients with single-sided deafness (SSD). Patients who received implants early, before the age of two years, showed better audiological and speech performance [
5]. On the other hand, pre-lingually deafened patients of an older age may not show satisfactory outcomes, and they might sometimes be non-users of implants. The explanation behind this is that the brain may utilize the region for auditory sensation at the level of the auditory cortex for other somatosensory inputs after a lengthy period of deafness [
6].
The cochlear implant’s (CI) function and the proper placement of the electrode array can be evaluated using different methods, one of which is electrical impedance (EI) measurement [
1]. Cochlear implant function is largely based on the device’s ability to reliably send electrical signals to the auditory nerve fibers [
7]. This electrical signal can spread easily and widely since the perilymph and endolymph (i.e., intracochlear fluids) are essential electrolytes [
2,
8]. Therefore, electrical impedance (EI) can be defined as the measure of the reluctance that is expressed by the perilymph to the current flow when voltage is applied [
1]. A normal measured impedance range is indicative of a good current flow in the fluid and tissue of the cochlea. Any big variation from the normal range can reflect an abnormal current flow [
9].
The value of the impedance is mainly affected by the relationship between the electrode–tissue interface and the surrounding tissues around the electrode. After CI surgery, fibrotic tissue formation is induced by the foreign-body reaction to the electrode [
2,
10]. The amount of fibrotic tissue surrounding the electrode array has been found to be significantly correlated with EI value. EI reaches its highest value after 4 weeks due to an increase in fibrous tissue formation; then, it stabilizes after 8 weeks. EI declines after the activation of the device due to the disrupted adhesion of the fibrous tissue to the electrode [
11].
In clinical settings, testing the EI has the following main uses: for the evaluation of the electrode’s overall function, measurement of the impedance level, detection of problems such as short- and/or open-circuits, guidance for audio processor fitting, and determination of the power consumption level [
12]. Clinicians’ familiarity with EI variation trends at different time points after device activation is crucial. This familiarity could also help non-experienced clinicians with any problems they might have with their patients, in case irregular EI values are found. However, there is still a lack of objective tools for predicting the EI at different time points after CI surgery. Therefore, the primary aim of this work is to develop a machine learning model that can utilize specific characteristics of each CI patient to predict the EI at different time points after the surgery and up to twelve months post-activation.
3. Material and Methods
3.1. Subjects
This is a retrospective study that was conducted at a tertiary CI center after obtaining ethical approval from the institutional review board. Our inclusion criteria were all pre-lingual children with severe hearing loss who received the same type of CI device (MED-EL, Innsbruck, Austria) at our tertiary center between 2016 and 2018, who had a complete and smooth intraoperative insertion that was assured by post-op X-ray, and who recorded normal EI during the surgery and throughout the follow-up after the device activation. All enrolled patients were full-time CI users, with an average daily use of at least 8.5 h/day and had used their devices for at least 2 years. Patients were excluded if they had cochlear ossifications, inner ear anomalies, or deficient language skills.
3.2. Machine Learning Model
The dataset used in the current study includes 80 patients. For each patient, we compiled the data for age at implantation and the electrode impedance at each electrode contact (from Channel 1 to Channel 12) during the surgery.
Dataset preparation required some formatting standardization processes. These processes were automated using programming scripts written in the Ring programming language [
21]. These scripts were generated using the Programming Without Coding Technology (PWCT) software, which is a free/open-source visual programming language for the development of applications and systems [
22]. We selected Ring because of its powerful GUI tools in addition to its capabilities, which are comparable to those of Python and Ruby [
23].
This study used various algorithms for the regression analyses. We selected some of the popular machine learning algorithms in the literature [
24,
25,
26], such as:
Neural Networks Regression (NNR);
Linear Regression (LR);
Modern decision tree algorithms;
- 3.1
Decision Forest Regression (DFR);
- 3.2
Boosted Decision Tree Regression (BDTR);
Bayesian Linear Regression (BLR).
Picking one algorithm could lead to limited results. Additionally, we cannot try every possible algorithm to save computational cost and time. So, we decided to pick five different algorithms. Algorithm selection could be based on different factors like the size and the structure of the data, Training time, Accuracy, etc. The survey paper in [
18] indicates that the most popular machine learning algorithm used with cochlear implantation datasets in the literature review is the Neural Networks (47.5%). In this survey we notice that the other algorithms do not have huge popularity like the neural networks and the difference between most of them with respect to the usage percentage is not huge. So based on the literature we decided to select (Neural Networks) as our first choice in our list of algorithms that we will use with our dataset. Since we have a regression problem, we decided to select one of the popular regression algorithms in general like linear regression.
After looking at our first two choices (Neural Networks, Linear Regression), We noticed that the function representation of these models is classified as Numerical functions. To have a variety of algorithms, we decided to extend our list with algorithms that use symbolic functions too. and we picked decision trees which is one of the popular algorithms that are useful when working with small dataset as we have. We picked modern decision trees algorithms that uses Ensemble learning like Decision Forest Regression and Boosted Decision Tree Regression (BDTR). Finally, we added the Bayesian Linear Regression algorithm to our list of algorithms as an algorithm that uses (Probabilistic Graphical Models).
We created ML models using Microsoft Azure Machine Learning. We selected this tool to reduce the development time and to facilitate the performance of huge experiments with different parameters that make use of visual tools [
27].
Since the dataset is small, and the number of features is limited (from 1 to 13), we used the default settings for each algorithm as provided by Microsoft Azure ML. For neural-network regression the model hyperparameters are (Three layers, Number of hidden nodes = 100, learning rate = 0.005, learning iterations = 100 and the type of normalizer is Min-Max normalizer). For decision forest regression the model hyperparameters are (Number of decision trees = 8, the maximum depth of the decision tree = 32, the number of random splits per node = 128 and the resampling method is bagging). For the boosted decision tree regression, the model hyperparameters are (maximum number of leaves per tree = 20, minimum number of samples per leaf node = 10, the learning rate = 0.2 and the total number of trees constructed = 100). For linear regression the solution method is ordinary least squares and the L2 regularization weight = 0.001.
Since this is a regression problem, we used the Root Mean Squared Error (RMSE) to compare between the different algorithms with the winning algorithm being the one that provides the minimum RMSE. We published the dataset used in this study and the design of the experiments using Microsoft Azure ML in the GitHub website [
28].
The following steps were used to build and test the proposed model:
Preparation of the dataset using the Ring language scripts (generated by PWCT);
Uploading the dataset to Microsoft Azure Machine Learning;
Splitting the data (training data and test data);
- 3.1
A total of 70% of the data was used for training (56 patients);
- 3.2
A total of 30% of the data was used for testing (24 patients);
- 3.3
Random seed was used for the splitting process;
Selecting columns (using features group 1 or 2);
Selecting the algorithm (from a choice of 5 different algorithms);
Training the model using the training data and the selected algorithm;
Evaluating the model (calculating the root mean squared error—RMSE);
Repeating the experiment and changing the features group and/or the algorithm;
Comparing the results of different groups of features and algorithms.
3.3. Outcome Measures
The ML-predicted electrode impedance values were compared to those that were objectively measured and recorded from the included patients. This comparison was carried out for each electrode from the 12 contacts of the tested electrode array at the following time points: one, three, six-, and twelve months post-activation.
3.4. Data Analyses
GraphPad Prism version 9.3.0 was used for all statistical analyses (GraphPad Software, La Jolla, CA, USA). The mean, standard deviation, and range (minimum and maximum values) were used to describe the characteristics of the participants. The normality of the data was first checked before comparing the preoperative and postoperative data. To test the significance of the group data with a normal distribution, a parametric paired t-test was used, and the Wilcoxon non-parametric test was used for the rest of the data. The statistical significance level was set at p = 0.05. The inter-rater reliability, i.e., the degree of agreement among predicted and recorded EI values, was computed using Cohen’s Kappa.
5. Discussion
A growing body of evidence has demonstrated the utility of machine learning algorithms in predicting and optimizing auditory outcomes following CI. By analyzing large amounts of data, machine learning algorithms can be used to create personalized sound processing strategies for each patient and help in improving the performance of cochlear implants over time. In [
23], Alohali et al. evaluated several machine learning algorithms (
Table 1) to predict EI one month after surgery. The results showed that the performance of machine learning algorithms was based on the electrode channels, with an accuracy ranging from 66 to 100% [
10].
Our experiments indicate that the electrode impedance of CI devices could be predicted at the electrode contacts (from 1 to 12) at different time points using different machine-learning algorithms.
Table 6 indicates that the Bayesian linear regression (BLR) provides the best results, followed by the neural networks (NN), which have been used in most research papers related to cochlear implantation [
18]. The BLR algorithm provides the best results for seven electrodes when predicting the electrode impedance (EI) after one month of implantation. Furthermore, it provides the best results for three electrodes when predicting the EI after three months. The algorithm provides the best results for six electrodes when predicting the EI after six months, while it provides the best results for seven electrodes when predicting the EI after one year. Therefore, on average, the BLR algorithm provided the best results for half of the electrodes. On the other hand, we noticed that the linear regression algorithm did not provide good results as compared to the other algorithms. When predicting the EI after six months or one year, we noticed that each group of features provided the best results for half of the channels, as demonstrated in
Table 7.
Figure 2 provides a deeper look at the feature groups for predicting the EI of each channel. We noticed that some electrodes provide the best results with a specific group of features, while the other channels shift between the groups based on the period of time. For example, electrode number 11 always provides the best prediction results when using group 1 of the features. However, electrode numbers 1, 3, and 7 always provide the best prediction results when using group 2 of the features. Concerning electrode number 10, after one month, the best prediction results were observed when using feature group 2, while after 3, 6, and 12 months, the best results were achieved when using feature group 1.
When looking at the features that were used to predict EI at different time points, we found that age at implantation and intraoperative EI were the best predictors of EI, with variable accuracies according to the electrode number and time point. Intraoperative EI values can play a significant role in predicting array positioning and helping correct the anatomic and functional placement of the implants [
29]. However, the role of intraoperative EI in predicting postoperative function is unclear. Previous reports have shown that abnormal intraoperative EI can significantly predict postoperative electrode function. In [
7], there was a trend of persistent abnormalities in the intraoperative EI, which extended to the immediate postoperative period; however, the trend was not statistically significant. Carlson et al. showed that nearly 58% of postoperative electrode abnormalities were observed during the intraoperative period [
30]. In the present study, we found that intraoperative EI values can predict EI at different postoperative points.
Electrode numbers 4, 5, and 6 provided the best results for EI prediction using features group 2 after 1, 3, and 6 months, while they provided the best prediction for EI after 12 months using features group 1. On the other hand, electrode numbers 2, 8, and 9 provided the best results for EI prediction using features group 2 after 1, 3, and 12 months, while they provided the best prediction for EI after 6 months using features group 1.
The statistical comparison of the ML-model prediction results and the objectively measured values showed non-significant differences for most electrode contacts at different time points. This could confirm the accuracy and sensitivity of the proposed model. Furthermore, the statistically significant differences could be attributed to the low sample size and the limited number of characteristics that were used to predict the post-op EI levels. However, the application of such a model needs more investigations with a larger sample size before generalizing the findings of the current study. This is due to the small sample size of the current work which could be considered as a study limitation in addition to its retrospective nature. Another limitation is the probability of the existence of some other factors that might influence EI despite the clear inclusion/exclusion criteria of the current study. On the other hand, this model could be used as an initial step for having an objective tool for predicting EI at different time points after CI surgery. Such a tool could assure that the clinicians, even the inexperienced person, will be conversant with EI variation trends at various time points following device activation, and help them to discover any unusual EI values with their patients as earlier as it happens.
6. Conclusions
The results of this study prove that machine learning is an efficient and effective tool in the field of cochlear implantation. Using data processing, feature selection, and machine learning models that employ modern and advanced algorithms, we can predict the electrode impedance of different channels to help professionals make early decisions that would improve the hearing quality of cochlear patients [
13,
14,
15]. In terms of predicting the EI following CI, in this article, we presented a machine learning model that predicted the cochlear impedance in 12 different channels using different algorithms such as linear regression, Bayesian linear regression, random forest regression, boosted decision tree regression, and neural networks. We performed several experiments to evaluate the performance of each model and to determine which model provided the best results at one month, three months, six months, and one year after the surgery.
We developed the model using the Microsoft Azure Machine Learning tool after preparing the dataset using scripts written in the Ring programming language. Through feature selection, we discovered that patient’s age could be used alone to provide the best prediction results for half of the channels at six months or one year after the cochlear implant surgery. Our results also showed that the use of a specific prediction algorithm for each channel provided better results. Furthermore, the accuracy level was between 83% and 100% in half of the channels one year after the surgery, when an error range between 0 and 3 KO was set as an acceptable threshold.