Next Article in Journal
Amide Proton Transfer–Chemical Exchange Saturation Transfer Imaging of Intracranial Brain Tumors and Tumor-like Lesions: Our Experience and a Review
Previous Article in Journal
A Collaborative Learning Model for Skin Lesion Segmentation and Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Whale Optimization Algorithm with a Hybrid Relation Vector Machine: A Highly Robust Respiratory Rate Prediction Model Using Photoplethysmography Signals

1
School of Life and Environmental Sciences, Guilin University of Electronic Technology, Guilin 541004, China
2
Guangxi Colleges and Universities Key Laboratory of Biomedical Sensors and Intelligent Instruments, Guilin 541004, China
3
Guangxi Engineering Technology Research Center of Human Physiological Information Noninvasive Detection, Guilin 541004, China
*
Authors to whom correspondence should be addressed.
Diagnostics 2023, 13(5), 913; https://doi.org/10.3390/diagnostics13050913
Submission received: 15 January 2023 / Revised: 18 February 2023 / Accepted: 25 February 2023 / Published: 28 February 2023
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

:
Due to the simplicity and convenience of PPG signal acquisition, the detection of the respiration rate based on the PPG signal is more suitable for dynamic monitoring than the impedance spirometry method, but it is challenging to achieve accurate predictions from low-signal-quality PPG signals, especially in intensive-care patients with weak PPG signals. The goal of this study was to construct a simple model for respiration rate estimation based on PPG signals using a machine-learning approach fusing signal quality metrics to improve the accuracy of estimation despite the low-signal-quality PPG signals. In this study, we propose a method based on the whale optimization algorithm (WOA) with a hybrid relation vector machine (HRVM) to construct a highly robust model considering signal quality factors to estimate RR from PPG signals in real time. To detect the performance of the proposed model, we simultaneously recorded PPG signals and impedance respiratory rates obtained from the BIDMC dataset. The results of the respiration rate prediction model proposed in this study showed that the MAE and RMSE were 0.71 and 0.99 breaths/min, respectively, in the training set, and 1.24 and 1.79 breaths/min, respectively, in the test set. Compared without taking signal quality factors into account, MAE and RMSE are reduced by 1.28 and 1.67 breaths/min, respectively, in the training set, and reduced by 0.62 and 0.65 breaths/min in the test set. Even in the nonnormal breathing range below 12 bpm and above 24 bpm, the MAE reached 2.68 and 4.28 breaths/min, respectively, and the RMSE reached 3.52 and 5.01 breaths/min, respectively. The results show that the model that considers the PPG signal quality and respiratory quality proposed in this study has obvious advantages and application potential in predicting the respiration rate to cope with the problem of low signal quality.

1. Introduction

Impedance spirometry is a clinically important method for measuring the respiratory rates of patients; however, it is not very convenient and comfortable for patients and is especially not suitable for dynamic monitoring [1]. To overcome these restrictions, studies have given more attention to physiological signals, such as electrocardiogram (ECG) and photoplethysmography (PPG) signals. However, PPG signals are more attractive than ECG signals because of their simplicity, portability, and low number of sensors [2,3].
Due to the low cost and portability of PPG signal acquisition, the continuous, non-invasive, and reliable monitoring of patients’ respiratory rate based on PPG signals has attracted many researchers in recent years [4], which will help monitor primary health status and contribute to the diagnosis of cardiorespiratory diseases, including pneumonia [5] and obstructive sleep apnoea (OSA) [6], and physiological situations, such as hypercarbia [7] and pulmonary embolism [8]. However, for critically ill patients in hospital care situations, where human respiration is relatively weaker and venous blood is at a very low pressure, flowing with the frequency of movement [9], the pulsatile alternating current (AC) component is also easily affected by physiological activity and movement, causing the PPG signals to contain more noise than physiological information [10], which would make estimating the respiratory rate based on PPG signals specifically challenging.
In the 1990s, the method of estimating the respiration rate (RR) based on PPG signals was investigated by Nakajima et al. [11], who developed a digital filter technology to estimate the RR from PPG signals. This method proved the feasibility of RR estimation based on PPG, but lacked universality. To solve this problem, Karlen et al. [12] used an intelligent fusion algorithm to fuse three respiration-modulated changes, including pulse amplitude modulation (AM), frequency modulation (FM), and baseline wander (BW), from PPG, which estimated a relatively accurate RR. Similarly, Meredith et al. [13] explained that respiratory components are reflected by the AM, FM, and BW of PPG signals. To reduce the effect of poor respiratory modulation on the accuracy of the final estimated respiration rate, Birrenkott et al. [14] proposed three respiratory quality indices (RQIs), which set an adjustable threshold to fuse the respiratory rate estimated by three respiratory modulations, on the dataset of elective surgery or routine anaesthesia (CapnoBase Dataset), which showed good results with a mean absolute error (MAE) of only 0.71 ± 0.89 bpm. However, for critically ill patients during hospital care (BIDMC dataset), the MAE reached 3.12 ± 4.39 bpm. Selvakumar et al. [3] also showed that the BIDMC dataset was more challenging.
Faint physiological conditions and the more complex pulse wave morphology in critically ill patients lead to the inaccurate extraction of respiratory modulation [15]. To enhance the robustness of the respiration rate estimation algorithm. Ambekar el al. [16] used a data-driven algorithm, ensemble empirical mode decomposition (EEMD), to obtain the RR from PPG signals. Compared to the empirical mode decomposition algorithm (EMD), EEMD overcomes the problem of modal aliasing [17]. Adami et al. [18] and Mohammod et al. [19] compared the performance of EMD and its derivative algorithms, such as EEMD, CEEMD, CEEMDAN, and ICEEMDAN, to decompose the respiratory component from PPG signals and demonstrated that both EEMD and CEEMDAN have better performance but that the EEMD method has a lower computational cost than CEEMDAN. To further isolate the respiratory components from the selected inherent modal functions (IMF), a Kalman filter (KF) was used to exclude other components [20]. Considering the nonlinearity of the PPG signal, Adami et al. [18] introduced the PPG signal quality indicator as a way to adjust the Kalman gain to implement extended Kalman filtering (EKF). However, these methods are dependent on the stability of the previous window, and if the respiratory rate error in the first window is large, it will significantly increase the overall estimation error level. The principal component analysis (PCA) method avoids this problem and does not require parameter adjustment [21]. Mohammod et al. proposed an EEMD-PCA method, demonstrated its advantages for respiration rate estimation based on PPG signals [22], and explained the failure as high-amplitude noise in the RR band range [23]. Machine-learning methods can be used to correct respiration rate estimation errors due to these high-amplitude noises [24], For example, the RRWaveNet [25] and CAGAB [26] methods demonstrate the feasibility of using machine-learning methods to improve the accuracy of PPG-based respiratory rate detection.
In summary, both PPG signal quality and respiratory signal quality influence the accuracy of the PPG-based estimation of respiration rate to further reduce their effect on respiration rate estimation. In this paper, we use machine-learning methods to construct a respiration rate estimation model by fusing PPG signal quality indices and respiratory quality indices (RQIs) to limit the influences of other physiological activities and noise on respiration rate estimation based on PPG, developing a respiration rate estimation model with high accuracy and robustness.
The remainder of this study is summarized as follows: Section 2 introduces the dataset and respiration rate prediction model construction process with model performance evaluation metrics. In Section 3, the performance of the model proposed in this study on the BIDMC dataset is reported, followed by a discussion of the advantages and disadvantages of the model in Section 4. Section 5 concludes the study and provides recommendations for future work.

2. Materials and Methods

Figure 1 shows a clear block diagram of the respiration rate model construction process. The whole process is divided into three stages: (a) pre-extracted respiratory wave and respiratory rate, (b) signal quality index calculation, and (c) respiratory rate prediction model construction.
In the first stage, the EEMD-PCA method was selected to pre-extract the respiration wave and respiration rate. In the second stage, the PPG signal quality indices and RQIs were calculated for further processing. The HRVM method was employed to fuse the respiration rate estimated by the EEMD-PCA method (RRP) and the signal quality indicators that have an impact on the estimation of RRP for RR prediction model construction, and the kernel parameters were optimized by the WOA algorithm to prevent falling into a local optimum. To evaluate the performance of the constructed model, it is tested with the BIDMC dataset.

2.1. Database

The BIDMC dataset was collected from 53 subjects (20 males and 33 females; age range: 19–90 years old) and acquired from critically ill patients during hospital care at the Beth Israel Deaconess Medical Centre (Boston, MA, USA). For each subject, over an 8 min duration, each subject contains physiological signals that are sampled at 125 Hz, such as PPG signals, impedance respiratory signals, and electrocardiogram (ECG) signals; simultaneously, reference physiological parameters such as respiratory rate (RR) and heart rate (HR) are sampled at 1 Hz. Two annotators manually annotated the start and end time points of each single respiration in all recordings via the impedance respiration signal, and the corresponding PPG signal segment with a 1-s difference in the duration of single respiration annotated by the two annotators was removed. Due to a severe loss of the reference respiration rate in the 13th subject, the remaining 52 subjects were retained with the PPG signal split into 8-s nonoverlapping windows with a 32-s length. This process resulted in 2719 (93.4%) windows being retained. The distribution of the impedance respiration rate (RRI) values for all windows is shown in Figure 2, which reveals that the distribution of the reference respiration rate ranges from 3 to 30 bpm, mainly between 16 and 20 bpm, follows a regular distribution, and reflects well the real-world respiration rate distribution. In this study, the dataset was randomly divided into a training set with a validation part (70%) and a test set (30%).

2.2. Pre-Extracted Respiratory Wave and Respiratory Rate by the EEMD-PCA Method

EEMD-PCA is a novel, data-driven method for estimating the respiration rate based on PPG signals that was proposed by Mohammod et al. [22]. In this study, the pre-extracted respiratory waves and RRP will be extracted by this method for use in the next stage. The pre-extraction process is subdivided into four steps: (a) EEMD is applied to PPG signals to separate the respiratory components and other components, (b) intrinsic mode functions (IMFs) dominated by respiratory components are selected for further processing, (c) the selected IMFs are used to reconstruct the respiratory waves and are further denoised with principal component analysis (PCA), and (d) fast Fourier transform (FFT) is applied to the pre-extracted respiratory waves from the previous step to calculate the RRP. Figure 3 shows the time domain (Figure 3a) and frequency domain (Figure 3b) of the impedance respiratory signals and the pre-extracted respiratory waves for the first 32-s window of subject BIDMC 10. They have strong consistency in the waveform period in the time domain, and the main frequency components are similar in the frequency domain.
FFT was applied to the pre-extracted respiratory signals, which are dominated by respiration, and the frequency corresponding to the maximum peak of the spectrogram is expressed as the frequency corresponding to the respiration rate and then converted to the R R P using Formula (1).
R R P = f R R P 60 b r e a t h s / m i n

2.3. Signal Quality Index Calculation

Both PPG signal quality indices [18,20,25] and respiration quality indices (RQIs) [14,26] affect the accuracy of RR estimation based on PPG signals. An optimal PPG signal quality index ( S S Q I ) and three typical RQIs ( Q R 1 , Q R 2 , and Q R 3 ) are calculated in this section to fuse the RRP to reduce the error of RR estimation and enhance the robustness of the algorithm proposed in this paper.

2.3.1. PPG Signal Quality Index (SQI) Calculation

Skewness is a measure of the symmetry of the probability distribution. Mohamed et al. [27] discovered that the skewness value of a 2-s PPG signal significantly varies with the change in the quality of the PPG signals, with an accuracy of 82.86% in determining between high-quality PPG signals and damaged unusable PPG signals, which is calculated by Formula (2).
S S Q I = 1 N i = 1 N x i μ ^ x / σ 3
where x i is the i t h sample point value of PPG, μ ^ x and σ are the empirical estimates of the mean and standard deviation of x i , respectively, and N is the number of samples in the PPG signals.
For each 32-s PPG signal with a two-second nonoverlapping sliding window, a total of 16 skewness values are calculated, and the average of 16 S S Q I represents the overall quality level of the 32-s PPG signal for that segment. The specific process is expressed as follows:
S S Q I ¯ = 1 n w = 1 n S S Q I w
where S S Q I w denotes the S S Q I of the PPG signals for the w t h 2-s window and n is the number of windows. S S Q I ¯ denotes the quality level of the PPG signal for each 32-s window.

2.3.2. Respiratory Quality Index (RQI) Calculation

The autocorrelation RQI, FFT RQI, and autoregression RQI ( Q R 1 , Q R 2 , and Q R 3 ) were proposed by Birrenkott et al. [14], who directly calculated their RQIs on the PPG signal after filter processing with a fixed cut-off frequency and down-sampled to 4 Hz, which still contains much low-frequency motion noise and cardiac components. In contrast, this paper will calculate the three RQIs on the pre-extracted respiration waves down-sampled to 4 Hz, which are dominated by respiratory components that are more reflective of respiratory signal quality.

2.4. WOA-HRVM Model

In the previous stage, the RRP, S S Q I ¯ , and three RQIs were obtained and used as features in this stage, and the RRI corresponding to each window was applied as labels. The WOA-HRVM [28] method was applied to the training set to construct an RR prediction model, and the testing set was employed to evaluate the performance of the model.
As a highly sparse model that provides probabilistic predictions by Bayesian inference, the central idea of related vector machines (RVMs) is to obtain the correlation vectors and weights by maximizing the marginal likelihood [29]. RVMs are often utilized as a machine-learning method for regression prediction, and its kernel function and kernel parameters are adjusted according to the requirements, which are also important parameters affecting the final regression performance. To improve the performance of the regression, a hybrid relation vector machine (HRVM) was employed in this paper. Since convex combinations of finitely many elementary kernel functions can always generate optimal kernels, hybrid kernel learning methods are more efficient than single kernel learning methods [30]. The multiple heterogeneous kernel learning method is defined as
K x i , x = m = 1 M d m K m x i , x
where d m is the weight of the m t h kernel function with d m ≥ 0, and K x i , x denotes the m t h kernel function, Gaussian kernel function, sigmoid kernel function, polynomial kernel function, and Laplacian kernel function as common kernel functions used in this study.
In addition, the initial values of the kernel parameters are highly random; the convergence of the regression model constructed based on HRVM will be greatly affected as a result, and it is easy to fall into the local optimum. Aimed at the limitations of the HRVM algorithm, the whale optimization algorithm (WOA) [31] has the advantages of few adjustment parameters, simple operation, and strong ability for a global search. The optimal parameters and weights of the kernel function are obtained by continuous iteration of the WOA algorithm to prevent local optimality, so the respiration rate model proposed in this study based on the WOA-HRVM algorithm can be represented by Formula (4), it is a hybrid function consisting of Gaussian kernel function, sigmoid kernel function, polynomial kernel function and Laplace kernel function.

2.5. Performance Measurement

The ability of our RR prediction model was assessed using three methods: (i) Bland–Altman plot: the plot visualizes the consistency of the predicted respiration rate by the model proposed in this study (RRM) with the RRI; (ii) mean absolute error (MAE): the accuracy of the model was demonstrated by averaging the absolute value of the difference between RRM and RRI over all windows; and (iii) root-mean-square error (RMSE): RMSE is used to reflect the precision of the model proposed in this study; it is very sensitive to the very large or very small errors of the RRM compared to RRI.

3. Results

The model in this study was constructed and tested based on MATLAB 2020a (MathWorks, Natick, MA, USA). The Bland–Altman plot visualizes the relationship among RRP (RR estimated by the EEMD-PCA method), RRM (RR estimated by the prediction model proposed in this study), and RRI in Figure 4. In the training set, the difference in RRP and RRI was 0.07 bpm, with limits of agreement from −5.138 to 5.278 bpm, and the difference in RRM and RRI was almost 0 bpm, with smaller limits of agreement from −1.930 to 1.930 bpm. In the test set, the difference in RRP and RRI was 0.121 bpm, with limits of agreement from −4.660 to 4.906 bpm. The difference in RRM and RRI is only −0.015 bpm, with narrowed limits of agreement from −3.564 to 3.533 bpm.
The figure includes a total of three lines for each method, with the middle line indicating the mean of the differences, and the upper and lower lines showing the upper and lower limits, respectively, of the 95% consistency limits (mean ± 1.96SD). The closer the line showing the mean of the differences is to 0 bpm, the higher the agreement between the two measurement methods and the smaller the 95% confidence interval. The closer the method is to the impedance respiration test, the higher the clinical acceptability. Therefore, Figure 4 and Table 1 show that, compared to the EEMD-PCA method, the respiration rate estimated by the proposed method is more consistent with the respiration rate measured by the impedance spirometry.
Table 2 shows the MAE and RMSE of the PPG-derived RR (RRP and RRM) with RRI in the training set and test set. Even in the test set, the MAE and RMSE of the respiration rate prediction model proposed in this study are only 1.24 and 1.79 bpm, respectively, which are 0.62 lower and 0.65 bpm lower, respectively, than those of the EEMD-PCA method.
Figure 5 illustrates the performance of the proposed method for the continuous monitoring of RR based on PPG signal segments at different reference respiration rates in different people. The top half shows the training set results, and the bottom half shows the test set results, both for an 8 min duration.
The PPG signal segments from different people are mixed and have many sudden changes in RR, so we can check the capability of the respiration rate prediction model proposed in this study in tracking sudden changes and adaptability among different people. As shown in Figure 5, the proposed method shows good performance for the continuous detection of the respiration rate with mixed PPG signals at different respiration rates in different people. The proposed method is capable of estimating the sharp change in RR better than the EEMD-PCA algorithm.
Considering the performance in different respiratory rate ranges, Table 3 shows the performance in different respiratory rate ranges in the training set and testing set of the proposed model. According to Table 3, both the training set and the test set show good performance in the normal respiratory rate ranges of 12–16 bpm, 17–20 bpm, and 21–24 bpm. Even in the test set, the MAE is less than 2 bpm for the respiration rate prediction model proposed in this study, especially in the ranges of 17–20 bpm and 21–24 bpm, and the MAE decreases nearly twofold. In the range of human respiratory rates that are too fast or too slow (<12 bpm and >24 bpm), the MAE on the test set reaches 2.68 bpm and 4.28 bpm, respectively. However, compared to the EEMD-PCA method, the MAE still decreased by 3.57 bpm and 2.34 bpm.

4. Discussion

In this study, we consider both the PPG signal quality and respiration signal quality to estimate the respiration rate based on PPG signals and validate its accuracy and robustness on the BIDMC dataset. This proposed model is developed based on the HRVM and WOA algorithm. The use of hybrid kernel functions allows an exploration of the relationship between the error in the respiration rate estimated by the EEMD-PCA method and the signal quality indicators in a wider range of dimensions, and the WOA algorithm avoids falling into the local optimum by continuously iterating to identify the most suitable kernel function width and weight parameters. The respiration rate prediction model proposed in this study has the advantages of both a local kernel function and nonlocal kernel function. Therefore, the method has a higher accuracy in RR detection compared with other methods.
Previous studies used fixed threshold filters to pre-process PPG signals, which will inevitably filter out some respiration information. Therefore, we did not use any filters to pre-process the PPG signal. Instead, the data-driven EEMD-PCA method was directly utilized to exclude motion and cardiac noise and to pre-extract respiratory waves. In addition, the EEMD method is robust to noise, and PCA further reduces noise and cardiovascular signal interference based on variance, making the initially extracted respiratory signal highly reliable. Therefore, RQIs are calculated based on the pre-extracted respiratory waves better than the pre-processed PPG signal.
The complexity of the physiological condition of critically ill patients and the uncertainty of external noise produce complex changes in the error rate of the RR estimated by the EEMD-PCA method. Due to the differences in the human body, for different subjects, there is a significantly different Pearson correlation coefficient between the error rate of the RR estimated by the EEMD-PCA method and the four signal quality indices. Table 4 presents the R1, R2, R3, and R4 for 52 subjects, from which we observe that the R1, R2, R3 and R4 are significantly different for different subjects. For example, the correlation of four signal quality indicators of Subject 01 with the error in the respiration rate estimated based on the EEMD-PCA method is clearly more relevant than that of Subject 02. In addition, the sensitivity of different signal quality indicators differed for the same subject; for example, for Subject 04, compared to other signal quality indices, Q R 2 was not as relevant, while for Subject 01, it was Q R 1 that was less relevant. However, the mean values of these four correlation coefficients show that none of these four signal quality indicators is better than the other three signal quality indicators. To reduce the influence of signal quality indicators with a low correlation with the respiration rate estimation, the current state-of-the-art approach is to improve the accuracy of the estimates at the expense of discarding unusable data by ‘intelligent fusion’ methods. RQI Fusion reduces the percentage of discards by setting an adjustable signal quality indicator threshold. In this paper, we use the sparsity of the HRVM algorithm to select a certain percentage of data from the training set for the model construction and optimization by the WOA algorithm. This method is data-driven to determine the percentage of discards without setting a threshold parameter, and it is only necessary to give an objective function that yields an estimated respiration rate that is closest to the impedance respiration rate, which provides better robustness than other RR estimation methods.
In this paper, an end-to-end respiration rate prediction model is constructed. The advantages of the model proposed in this paper, in comparison with the end-to-end respiration rate prediction methods based on PPG signals proposed by other researchers, are shown in Table 5. According to our results and those of other authors on the BIDMC dataset in recent years, the MAE and RMSE of the model proposed in this paper on the test set are only 1.24 bpm and 1.79 bpm, respectively, which are much lower than those of other methods. As shown in Table 6, although the framework proposed in [18] and the EEMD + KF method [20] both show better results, the framework is too complicated to calculate; each time, it needs to use the EMD and DWT methods to calculate seven different predicted respiratory waves to fuse, and it takes 30 s to update the RR, 22 s more than the model proposed in this paper. The reason for choosing 8 s to update the breathing rate in this paper is based on matlab2020a with the EEMD-PCA method to decompose a 32-s PPG signal to extract the predicted respiratory rate and respiratory waves, plus the time to calculate four signal quality indicators is close to 8 s. Although the method in [20] is simpler to calculate, it relies on the signal quality of the first PPG window, and the error in estimating the respiration rate in the first window can lead to large errors in all subsequent windows. Neither the conventional respiratory-modulation-based methods in [14,32] in the paper showed good results, which is caused by the challenging respiratory modulation extraction when the signal quality is poor. For some advanced machine-learning methods or deep learning methods [26,27,32,33], they are affected by the accuracy of feature extraction or feature selection, resulting in their outcomes not being better than this paper’s. This paper uses the EEMD-PCA method to pre-extract the respiration rate and respiration wave, which not only avoids the challenging and inaccurate extraction based on the traditional respiration modulation, but also improves the accuracy and robustness of the respiration rate prediction by incorporating signal quality factors into the respiration prediction model using machine-learning methods. The RMSE of the CAGBA method was much lower than that of the other methods as only 20 subjects were selected.
To balance the continuity of the respiration rate detection and the accuracy of the respiration rate estimation, the appropriate PPG signal length is also important. It is evident from recent literature that the performance of respiration rate detection algorithms decreases as the PPG signal data length decreases. It is well-known that short data lengths are important for real-time respiratory rate detection in critical care or wearable devices, but a PPG signal that is too short is not conducive to accurate respiratory rate detection. In [34], the authors concluded that a length of 32 s is the most stable and shortest length for extracting respiratory signals based on PPG signals. Table 6 compared with other respiration rate estimation methods in recent years, the model proposed in this study showed better robustness and accuracy in estimating RR than other existing methods. A limitation of this study is the method for calculating PPG signal quality indicators and respiratory signal quality indicators. We calculated four signal quality indicators and tested them on different people. For some subjects, the sensitivity was poor, and a more sensitive signal quality index should be investigated. The choice of kernel function and the number of iterations of the WOA algorithm are also key factors affecting the accuracy of the final respiration rate prediction model. Other kinds of kernel functions and larger numbers of iterations should continue to be explored. In addition, the model was only tested on the BIDMC dataset; other datasets or an autonomous collection of real-world data should be collected by the latter to further validate the stability of the proposed method. The advantage of the proposed method is the end-to-end estimation of the respiration rate based only on PPG signals without a complex parameter adjustment, and the performance is significantly improved compared to other respiration rate estimation methods.
Table 6. Performance comparison of different PPG-signal-based respiration rate detection methods on BIDMC dataset.
Table 6. Performance comparison of different PPG-signal-based respiration rate detection methods on BIDMC dataset.
ReferencesMethodLength
(sec)
SubjectsOverlap (sec)MAERMSEDis (%)
This studyWOA-HRVM3253241.241.796.6%
Adami [18]EMD and DWT + EKF6053300.73--
Pongpanut [25]RRWaveNet3253-1.62-1.9%
Sharma [20]EEMD + KF3253291.90--
Aqajari [33]CycleGAN3053-1.90--
Lee [26]CAGBA322001.940.6162.26%
Bian [32]Deep learning6053592.50--
Karlen [12]SmartQualityFusion method6053-2.60--
Birrenkott [14]RQI calculation and fusion3253173.124.3923.2%
Notes: Dis (%) indicates the percentage of discarded data in the entire dataset.

5. Conclusions

In this paper, we used the WOA-HRVM method to fuse the PPG signal quality and respiratory signal quality with the respiratory rate estimated based on the EEMD-PCA method to construct a highly accurate and robust respiratory rate prediction model based on the PPG signal. The method is data-driven and does not require complex parameter tuning, which affects the stability of the respiratory rate prediction model, and overcomes the problem of difficult and inaccurate extraction when the signal quality is poor with traditional respiratory modulation methods. It also does not require the extraction of PPG morphological features and screening with feature selection methods as with other machine-learning methods or deep learning methods, and the final performance is affected by the accuracy of feature recognition and the performance of feature selection methods. After comparing the performance of the PPG-signal-based estimation of the respiration rate on the BIDMC dataset with that of previous investigators, the proposed methods showed more accurate results in estimating the RR than other existing methods for subjects from the BIDMC dataset with a short data length and a 32-s PPG signal. In future studies, we will validate the method using other datasets or an autonomous collection of real-world data in large cohorts of short data lengths while exploring more effective PPG and respiratory signal quality metrics to further improve the accuracy of the respiratory rate prediction model so that it can eventually be applied to the real-time detection of the respiratory rate on wearable devices or be utilized instead of impedance detection for the real-time detection of the respiratory rate in patients under intensive care. It promotes the development of portability for the real-time respiratory detection of patients under intensive care, and has a very important theoretical value for realizing the real-time detection of the respiratory rate in wearable devices and telemedicine, and improving the accuracy of respiratory rate measurement.

Author Contributions

X.D. designed the study. Z.W., L.C., Y.L. and Z.C. conceived the study, provided directions, feedback, and/or revised the manuscript. X.D. led the investigation and drafted the manuscript for submission with revisions and feedback from the contributing authors. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China (62101148), Natural Science Foundation of Guangxi (2020GXNSFBA297156, 2021GXNSFBA220051) and Guangxi Innovation Driven Development Project (Guike AA19254003).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this manuscript can be downloaded from this link https://www.physionet.org/content/bidmc/1.0.0/ (accessed on 14 January 2023).

Conflicts of Interest

The authors declare that there are no competing interest.

References

  1. AL-Khalidi, F.Q.; Saatchi, R.; Burke, D.; Elphick, H.; Tan, S. Respiration rate monitoring methods: A review. Pediatr. Pulmonol. 2011, 46, 523–529. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Orphanidou, C. Derivation of respiration rate from ambulatory ECG and PPG using ensemble empirical mode decomposition: Comparison and fusion. Comput. Biol. Med. 2017, 81, 45–54. [Google Scholar] [CrossRef] [PubMed]
  3. Selvakumar, K.; Kumar, E.V.; Sailesh, M.; Varun, M.; Allan, A.; Biswajit, N.; Namrata, P.; Upasana, S. Realtime PPG based respiration rate estimation for remote health monitoring applications. Biomed. Signal Process. Control. 2022, 77, 103746. [Google Scholar] [CrossRef]
  4. Alian, A.A.; Shelley, K.H. Photoplethysmography. Best Pract. Res. Clin. Anaesthesiol. 2014, 28, 395–406. [Google Scholar] [CrossRef]
  5. Ginsburg, A.S.; Lenahan, J.L.; Izadnegahdar, R.; Ansermino, J.M. A systematic review of tools to measure respiratory rate in order to identify childhood pneumonia. Am. J. Respir. Crit. Care Med. 2018, 197, 1116–1127. [Google Scholar] [CrossRef]
  6. Zhang, X.; Kassem MA, M.; Zhou, Y.; Shabsigh, M.; Wang, Q.; Xu, X. A brief review of non-invasive monitoring of respiratory condition for extubated patients with or at risk for obstructive sleep apnea after surgery. Front. Med. 2017, 4, 26. [Google Scholar] [CrossRef] [Green Version]
  7. Rolfe, S. The importance of respiratory rate monitoring. Br. J. Nurs. 2019, 28, 504–508. [Google Scholar] [CrossRef]
  8. Ruangsetakit, C.; Nuntiwattana, N.; Hahtapornsawan, S.; Hongku, K.; Wangwanit, C.; Chinsakchai, K.; Sermthanasawadi, N.; Mutirangura, P. Digital photoplethysmography in the diagnosis of deep vein thrombosis in Thai patients. J. Med. Assoc. Thai. 2017, 100, S155–S161. [Google Scholar]
  9. Li, D.; Wang, X.; Liu, D. Association between elevated central venous pressure and outcomes in critically ill patients. Ann. Intensive Care 2017, 7, 1–7. [Google Scholar] [CrossRef] [Green Version]
  10. Park, S.H.; Choi, S.J.; Park, K.S. Advance continuous monitoring of blood pressure and respiration rate using denoising auto encoder and LSTM. Microsyst. Technol. 2022, 28, 2181–2190. [Google Scholar] [CrossRef]
  11. Nakajima, K.; Tamura, T.; Miike, H. Monitoring of heart and respiratory rates by photoplethysmography using a digital filtering technique. Med. Eng. Phys. 1996, 18, 365–372. [Google Scholar] [CrossRef]
  12. Karlen, W.; Raman, S.; Ansermino, J.M.; Dumont, G.A. Multiparameter respiratory rate estimation from the photoplethysmogram. IEEE Trans. Biomed. Eng. 2013, 60, 1946–1953. [Google Scholar] [CrossRef]
  13. Meredith, D.J.; Clifton, D.; Charlton, P.; Brooks, J.; Pugh, C.W.; Tarassenko, L. Photoplethysmographic derivation of respiratory rate: A review of relevant physiology. J. Med. Eng. Technol. 2012, 36, 1–7. [Google Scholar] [CrossRef] [Green Version]
  14. Birrenkott, D.A.; Pimentel, M.A.; Watkinson, P.J.; Clifton, D.A. A robust fusion model for estimating respiratory rate from photoplethysmography and electrocardiography. IEEE Trans. Biomed. Eng. 2017, 65, 2033–2041. [Google Scholar] [CrossRef]
  15. Charlton, P.H.; Birrenkott, D.A.; Bonnici, T.; Pimentel, M.A.F.; Johnson, A.E.W.; Alastruey, J.; Tarassenko, L.; Watkinson, P.J.; Beale, R.; Clifton, D.A.; et al. Breathing rate estimation from the electrocardiogram and photoplethysmogram: A review. IEEE Rev. Biomed. Eng. 2017, 11, 2–20. [Google Scholar] [CrossRef] [Green Version]
  16. Ambekar, M.R.; Prabhu, S. A novel algorithm to obtain respiratory rate from the PPG signal. Int. J. Comput. Appl. 2015, 126, 9–12. Available online: https://www.researchgate.net/profile/Sapna-Prabhu-2/publication/281910024_A_Novel_Algorithm_to_Obtain_Respiratory_Rate_from_the_PPG_Signal/links/59f0122b0f7e9baeb26ad4bc/A-Novel-Algorithm-to-Obtain-Respiratory-Rate-from-the-PPG-Signal.pdf (accessed on 14 January 2023).
  17. Colominas, M.A.; Schlotthauer, G.; Torres, M.E. Improved complete ensemble EMD: A suitable tool for biomedical signal processing. Biomed. Signal Process. Control. 2014, 14, 19–29. [Google Scholar] [CrossRef]
  18. Adami, A.; Boostani, R.; Marzbanrad, F.; Charlton, P.H. A new framework to estimate breathing rate from electrocardiogram, photoplethysmogram, and blood pressure signals. IEEE Access 2021, 9, 45832–45844. [Google Scholar] [CrossRef]
  19. Motin, M.A.; Karmakar, C.K.; Palaniswami, M. Selection of empirical mode decomposition techniques for extracting breathing rate from PPG. IEEE Signal Process. Lett. 2019, 26, 592–596. [Google Scholar] [CrossRef]
  20. Sharma, H. Extraction of respiratory rate from PPG using ensemble empirical mode decomposition with Kalman filter. Electron. Lett. 2020, 56, 650–653. [Google Scholar] [CrossRef]
  21. Madhav, K.V.; Raghuram, M.; Krishna, E.H.; Komalla, N.R.; Reddy, K.A.; Komalla, A.R. Use of multi scale PCA for extraction of respiratory activity from photoplethysmographic signals. In Proceedings of the 2012 IEEE International Instrumentation and Measurement Technology Conference Proceedings, Graz, Austria, 13–16 May 2012; pp. 1784–1787. [Google Scholar]
  22. Motin, M.A.; Karmakar, C.K.; Palaniswami, M. Ensemble empirical mode decomposition with principal component analysis: A novel approach for extracting respiratory rate and heart rate from photoplethysmographic signal. IEEE J. Biomed. Health Inform. 2017, 22, 766–774. [Google Scholar] [CrossRef] [PubMed]
  23. Prathyusha, B.; Rao, T.S.; Asha, D. Extraction of respiratory rate from PPG signals using PCA and EMD. Int. J. Res. Eng. Technol. 2012, 1, 164–184. [Google Scholar]
  24. Shuzan, M.N.I.; Chowdhury, M.H.; Hossain, M.S.; Chowdhury, M.E.H.; Reaz, M.B.I.; Monir Uddin, M.; Khandakar, A.; Mahbub, Z.B.; Hamid, M.; Ali, S. A novel non-invasive estimation of respiration rate from motion corrupted photoplethysmograph signal using machine learning model. IEEE Access 2021, 9, 96775–96790. [Google Scholar] [CrossRef]
  25. Osathitporn, P.; Sawadwuthikul, G.; Thuwajit, P.; Ueafuea, K.; Mateepithaktham, T.; Kunaseth, N.; Choxtchawathi, T.; Punyabukkana, P.; Mignot, E.; Wilaiprasitporn, T. RRWaveNet: A Compact End-to-End Multi-Scale Residual CNN for Robust PPG Respiratory Rate Estimation. arXiv 2022, arXiv:2208.08672. [Google Scholar]
  26. Lee, S.; Moon, H.; Son, C.-H.; Lee, G. Respiratory Rate Estimation Combining Autocorrelation Function-Based Power Spectral Feature Extraction with Gradient Boosting Algorithm. Appl. Sci. 2022, 12, 8355. [Google Scholar] [CrossRef]
  27. Elgendi, M. Optimal signal quality index for photoplethysmogram signals. Bioengineering 2016, 3, 21. [Google Scholar] [CrossRef] [Green Version]
  28. Gao, P.; Yue, M.; Wu, Z. A novel intrusion detection method based on WOA optimized hybrid kernel RVM[C]//2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS). In Proceedings of the 2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS), Chengdu, China, 23–26 April 2021; pp. 1063–1069. [Google Scholar]
  29. Rafi, M.; Shaikh, M.S. A comparison of SVM and RVM for Document Classification. arXiv 2013, arXiv:1301.2785. [Google Scholar]
  30. Zhang, C.; He, Y.; Yuan, L.; Xiang, S. A multiple heterogeneous kernel RVM approach for analog circuit fault prognostic. Clust. Comput. 2019, 22, 3849–3861. [Google Scholar] [CrossRef]
  31. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  32. Bian, D.; Mehta, P.; Selvaraj, N. Respiratory rate estimation using PPG: A deep learning approach. In Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 20–24 July 2020; pp. 5948–5952. [Google Scholar]
  33. Aqajari, S.A.H.; Cao, R.; Zargari AH, A.; Rahmani, A.M. An end-to-end and accurate ppg-based respiratory rate estimation approach using cycle generative adversarial networks. In Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Mexico, 1–5 November 2021; pp. 744–747. [Google Scholar]
  34. Kyriacou, P.A.; Allen, J. (Eds.) Photoplethysmography: Technology, Signal Analysis and Applications; Academic Press: Cambridge, MA, USA, 2021. [Google Scholar]
Figure 1. Flow chart of respiration rate prediction model construction.
Figure 1. Flow chart of respiration rate prediction model construction.
Diagnostics 13 00913 g001
Figure 2. Distribution of each 32-s PPG window corresponding to the RRI. The horizontal axis indicates the value of the RRI for each 32-s window, and the vertical axis indicates the number of windows at the corresponding RRI.
Figure 2. Distribution of each 32-s PPG window corresponding to the RRI. The horizontal axis indicates the value of the RRI for each 32-s window, and the vertical axis indicates the number of windows at the corresponding RRI.
Diagnostics 13 00913 g002
Figure 3. (a) Time domain comparison of impedance respiratory signals with pre-extracted respiratory signals based on the EEMD-PCA method. (b) Frequency domain comparison of impedance respiratory signals with pre-extracted respiratory signals based on the EEMD-PCA method. The blue line represents the pre-extracted respiratory signals in the time-frequency domain, and the red line represents the impedance respiratory signals in the time-frequency domain.
Figure 3. (a) Time domain comparison of impedance respiratory signals with pre-extracted respiratory signals based on the EEMD-PCA method. (b) Frequency domain comparison of impedance respiratory signals with pre-extracted respiratory signals based on the EEMD-PCA method. The blue line represents the pre-extracted respiratory signals in the time-frequency domain, and the red line represents the impedance respiratory signals in the time-frequency domain.
Diagnostics 13 00913 g003
Figure 4. Bland–Altman plot of the RRM (RR estimated by the prediction model proposed in this study), RRP (RR estimated by the EEMD-PCA method), and RRI for the training set and test set. The figure includes a total of 3 lines for each method, with the middle line indicating the mean of the differences, and the upper and lower lines showing the upper and lower limits, respectively, of the 95% consistency limits. The horizontal axis represents the average of RRP and RRI or the average of RRM and RRI, and the vertical axis represents the difference, RRP-RRI or RRM-RRI: (a) Bland–Altman plot of the RRM and RRI in the training set, (b) Bland–Altman plot of the RRp and RRI in the training set, (c) Bland–Altman plot of the RRM and RRI in the test set, and (d) Bland–Altman plot of the RRp and RRI in the test set.
Figure 4. Bland–Altman plot of the RRM (RR estimated by the prediction model proposed in this study), RRP (RR estimated by the EEMD-PCA method), and RRI for the training set and test set. The figure includes a total of 3 lines for each method, with the middle line indicating the mean of the differences, and the upper and lower lines showing the upper and lower limits, respectively, of the 95% consistency limits. The horizontal axis represents the average of RRP and RRI or the average of RRM and RRI, and the vertical axis represents the difference, RRP-RRI or RRM-RRI: (a) Bland–Altman plot of the RRM and RRI in the training set, (b) Bland–Altman plot of the RRp and RRI in the training set, (c) Bland–Altman plot of the RRM and RRI in the test set, and (d) Bland–Altman plot of the RRp and RRI in the test set.
Diagnostics 13 00913 g004
Figure 5. The EEMD-PCA method and respiration rate prediction model proposed in this study with different subjects and time windows for the estimation of mutant respiration rates were assessed for consistency with impedance respiration rates. (The red dotted line represents the referenced impedance respiration rate (RRI), the blue dotted line denotes the respiration rate predicted by the respiration rate prediction model proposed in this study (RRM), and the green dotted line indicates the respiration rate estimated by the EEMD-PCA method (RRP)).
Figure 5. The EEMD-PCA method and respiration rate prediction model proposed in this study with different subjects and time windows for the estimation of mutant respiration rates were assessed for consistency with impedance respiration rates. (The red dotted line represents the referenced impedance respiration rate (RRI), the blue dotted line denotes the respiration rate predicted by the respiration rate prediction model proposed in this study (RRM), and the green dotted line indicates the respiration rate estimated by the EEMD-PCA method (RRP)).
Diagnostics 13 00913 g005
Table 1. A literature review of respiratory rate detection based on PPG.
Table 1. A literature review of respiratory rate detection based on PPG.
ReferencesDatabaseSubjectsMethodologyInnovationDrawbacks
Nakajima [11]Self-collection11 Digital filtering techniqueReal-time PPG-based respiration rate detectionLack of universal applicability
Karlen [12]CapnoBase94 Smart fusion methodSet discard thresholds with PPG signal quality metricsDiscard 45% of data
Birrenkott [14]CapnoBase42 RQI calculation and fusionAdjustable threshold value to change accuracyInaccurate estimation of low-quality PPG signals
Selvakumar [3]CapnoBase42 RIAV based on FFTRespiration rate detection on low-cost hardwareLow accuracy in detecting too-fast breathing
Sharma [20]BIDMC53 EEMD + KFKalman filtering is applied to the reconstructed signalKF is not suitable for non-linear PPG signals
Adami [18]BIDMC53 CEEMDAN +
DWT + EKF
Leverage time and frequency domain informationFramework calculation is too complicated
Mohammad [19]MIMIC121 EMD family and PCAFree from parameter selectionSensitivity to high-amplitude noise in the respiratory range
Shuzan [24]VORTAL39 Machine-learning modelHyperparameter optimizationTested only on resting young people
Pongpanut [25]BIDMC53 RRWaveNetImprove model robustness using transfer learning Discarded low-quality signals by SQI metric
Table 2. Mean of the differences, and the upper and lower limits of the 95% consistency limits for the RRM and RRP with the RRI in the training set and test set.
Table 2. Mean of the differences, and the upper and lower limits of the 95% consistency limits for the RRM and RRP with the RRI in the training set and test set.
DatasetMethodMeanMean +1.96SDMean −1.96SD
training setThis study01.930−1.930
EEMD-PCA0.0705.278−5.138
test setThis study−0.0153.533−3.564
EEMD-PCA0.1214.906−4.660
Table 3. Mean absolute error (MAE) and root mean square error (RMSE) for the RRM (RR estimated by the prediction model proposed in this study) and RRP (RR estimated by the EEMD-PCA method) with the RRI in the training set and testing set.
Table 3. Mean absolute error (MAE) and root mean square error (RMSE) for the RRM (RR estimated by the prediction model proposed in this study) and RRP (RR estimated by the EEMD-PCA method) with the RRI in the training set and testing set.
DatasetMethodMAERMSE
training setThis study0.710.99
EEMD-PCA1.992.66
test setThis study1.241.79
EEMD-PCA1.862.44
Table 4. Prediction performance in different ranges of respiratory rates.
Table 4. Prediction performance in different ranges of respiratory rates.
RRI (bpm)Training SetTest Set
NMAEthis study
[MAEEEMD-PCA]
RMSEthis study
[RMSEEEMD-PCA]
NMAEthis study
[MAEEEMD-PCA]
RMSEthis study
[RMSEEEMD-PCA]
below 12611.08 [6.40]1.56 [6.57]162.68 [6.25]3.52 [6.34]
12–165630.79 [2.34]1.08 [2.65]2021.45 [2.21]1.90 [2.50]
17–2011180.63 [1.05]0.82 [1.36]4070.91 [1.03]1.21 [1.32]
21–242110.75 [3.72]1.16 [3.89]881.80 [3.41]2.45 [3.56]
above 24400.98 [7.17]2.14 [7.47]134.28 [6.62]5.01 [6.77]
Notes: MAEthis study and RMSEthis study denote the MAE and RMSE of RRM and RRI, MAEEEMD-PCA and RMSEEEMD-PCA denote the MAE and RMSE of RRP and RRI, and N denotes the number of windows.
Table 5. Pearson correlation coefficient (PCC) between signal quality indices and error rate of respiration rate estimated based on the EEMD-PCA method for different subjects.
Table 5. Pearson correlation coefficient (PCC) between signal quality indices and error rate of respiration rate estimated based on the EEMD-PCA method for different subjects.
SubjectR1R2R3R4
Subject 010.140.70−0.24−0.70
Subject 020.30−0.100.070.08
Subject 03−0.220.07−0.27−0.54
Subject 04−0.070.43−0.16−0.43
Subject 53−0.32−0.29−0.29−0.16
Average0.280.240.270.28
Notes: R1 denotes the Pearson correlation coefficient between Q R 1 and the error rate of respiration rate estimated based on the EEMD-PCA method; R2 denotes the Pearson correlation coefficient between Q R 2 and the error rate of respiration rate estimated based on the EEMD-PCA method; R3 denotes the Pearson correlation coefficient between Q R 3 and the error rate of respiration rate estimated based on the EEMD-PCA method; R4 denotes the Pearson correlation coefficient between S S Q I ¯ and the error rate of respiration rate estimated based on the EEMD-PCA method.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dong, X.; Wang, Z.; Cao, L.; Chen, Z.; Liang, Y. Whale Optimization Algorithm with a Hybrid Relation Vector Machine: A Highly Robust Respiratory Rate Prediction Model Using Photoplethysmography Signals. Diagnostics 2023, 13, 913. https://doi.org/10.3390/diagnostics13050913

AMA Style

Dong X, Wang Z, Cao L, Chen Z, Liang Y. Whale Optimization Algorithm with a Hybrid Relation Vector Machine: A Highly Robust Respiratory Rate Prediction Model Using Photoplethysmography Signals. Diagnostics. 2023; 13(5):913. https://doi.org/10.3390/diagnostics13050913

Chicago/Turabian Style

Dong, Xuhao, Ziyi Wang, Liangli Cao, Zhencheng Chen, and Yongbo Liang. 2023. "Whale Optimization Algorithm with a Hybrid Relation Vector Machine: A Highly Robust Respiratory Rate Prediction Model Using Photoplethysmography Signals" Diagnostics 13, no. 5: 913. https://doi.org/10.3390/diagnostics13050913

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop