Next Article in Journal
Optical Frequency Domain Reflectometry Based on Multilayer Perceptron
Next Article in Special Issue
A Frequency Estimation Scheme Based on Gaussian Average Filtering Decomposition and Hilbert Transform: With Estimation of Respiratory Rate as an Example
Previous Article in Journal
Design and Simulation of Au/SiO2 Nanospheres Based on SPR Refractive Index Sensor
Previous Article in Special Issue
Motion Smoothness-Based Assessment of Surgical Expertise: The Importance of Selecting Proper Metrics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Convolutional Neural Network for Individual Identification Using Phase Space Reconstruction of Electrocardiogram

1
Department of Electrical Engineering, Chang Gung University, Taoyuan 333, Taiwan
2
Biomedical Engineering Research Center, Chang Gung University, Taoyuan 333, Taiwan
3
Neuroscience Research Center, Chang Gung Memorial Hospital, Linkou, Taoyuan 333, Taiwan
4
Department of Neurology, Cardinal Tien Hospital Yung Ho Branch, New Taipei City 234, Taiwan
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(6), 3164; https://doi.org/10.3390/s23063164
Submission received: 10 February 2023 / Revised: 8 March 2023 / Accepted: 14 March 2023 / Published: 16 March 2023
(This article belongs to the Special Issue Sensors and Signal Processing for Biomedical Application)

Abstract

:
Electrocardiogram (ECG) biometric provides an authentication to identify an individual on the basis of specific cardiac potential measured from a living body. Convolutional neural networks (CNN) outperform traditional ECG biometrics because convolutions can produce discernible features from ECG through machine learning. Phase space reconstruction (PSR), using a time delay technique, is one of the transformations from ECG to a feature map, without the need of exact R-peak alignment. However, the effects of time delay and grid partition on identification performance have not been investigated. In this study, we developed a PSR-based CNN for ECG biometric authentication and examined the aforementioned effects. Based on a population of 115 subjects selected from the PTB Diagnostic ECG Database, a higher identification accuracy was achieved when the time delay was set from 20 to 28 ms, since it produced a well phase-space expansion of P, QRS, and T waves. A higher accuracy was also achieved when a high-density grid partition was used, since it produced a fine-detail phase-space trajectory. The use of a scaled-down network for PSR over a low-density grid with 32 × 32 partitions achieved a comparable accuracy with using a large-scale network for PSR over 256 × 256 partitions, but it had the benefit of reductions in network size and training time by 10 and 5 folds, respectively.

1. Introduction

Electrocardiogram (ECG) has been widely used for physiological monitoring and clinical diagnosis. ECG also discloses specific individual characteristics, which can be used for biometric authentication with an advantage in security because it is acquired from a living body. Other people cannot easily obtain it from the daily environment, unlike a fingerprint, which can be taken from the devices that have been touched by fingers. In addition, the acquired ECG during telecare monitoring provides a biometric authentication to know the identity of the patient [1]. On the other hand, ECG data taken for biometric authentication are beneficial for the individual’s daily healthcare recording.
A typical ECG is composed of P, QRS complex, T, and U waves which are generated by a sequence of atrial depolarization, ventricular depolarization, and repolarization. Because each person has a respective cardiac conduction system and projection of the generated electrical activity on body surface, the waveform of the recorded ECG varies from person to person. Early ECG biometrics use several morphological parameters, such as widths, amplitudes, slopes, and intervals on the basis of the characteristic waves [2,3,4,5,6,7]. However, some morphological information remains undisclosed by these reduced features, but it is useful for individual identification. A good quality of the acquired ECG signals is essential, especially for the early ECG biometric methods. The identifications of small-amplitude characteristics such as P, Q, and S waves are commonly affected by the presence of noise contaminations or when the characteristic waves are difficult to discern.
Direct use of waveform [5,8,9] or transformed coefficients [7,10,11] as input features solves the aforementioned problem. An alternative approach is to directly transform ECG to a three-dimensional phase-space trajectory using a time-delay technique [12,13]. Pattern match [5,9,13] and morphological difference [7,8,9,10,11,12,13] are mostly used to quantify similarity and dissimilarity between two ECGs. Even though the waveform-based methods do not need the detections of the small-amplitude waves, QRS detection is still needed for the segmentation of a cardiac cycle for individual identification. Digital filtering is commonly used to attenuate low-frequency baseline wandering and high-frequency noises [14]. Advanced methods using multi-resolution wavelet analysis [15,16] and kalman filtering [17,18] are also proposed to enhance ECG waveforms for QRS detection and extract ECG features.
Convolutional neural networks (CNNs) outperform the aforementioned methods in terms of feature extraction because convolutions can produce discernible features from ECG. In the convolutional layers, the network weights are derived using machine learning on the basis of multiple individuals’ data. Several transformations of raw ECG are used to generate input features for the CNNs. A simple method is to put single-beat ECG samples in a vector, then one-dimensional (1D) convolutional networks are used to produce distinguishing features for a succeeding connected network to identify individuals [19]. The other methods transform the 1D ECG signals to two-dimensional (2D) images by plotting each ECG beat as an individual grayscale image (time–amplitude representation) [20], decomposing each ECG beat on various scales using continuous wavelet transform (time–scale representation) [21], and embedding each ECG beat onto a 2D phase-space image using a time-delay technique [22]. In addition, multi-beat ECG is also transformed to an image by stacking beat-aligned amplitudes [23,24] or merging multiple phase-space trajectories [25]. The 2D CNNs are therefore used to extract discernible features for succeeding connected networks to identify individuals.
Phase space reconstruction (PSR) uses a time-delay technique to reconstruct the phase-space trajectory of a signal. The signal is embedded into a multi-dimensional phase space by a plot of each sample x(t) versus its respective samples after fixed time delays x(t + τ), x(t + 2τ), …, x(t + (m − 1)τ), which was proposed initially for the purpose of disclosing nonlinear dynamics of the signal on the basis of chaos physics [26]. PSR was firstly used to analyze ECG for examining whether ventricular fibrillation is an instance of deterministic chaos [27]. Moreover, the reconstructed phase portraits reveal various morphologies associated with normal and pathological ECGs. Thus, several studies employed the PSR with m = 2 to detect QRS complex [28], recognize ventricular extrasystoles [29], and classify the type of ventricular arrhythmia [30] because the 2D reconstruction displays a concise phase space trajectory and lends itself more readily to feature extraction. In addition, PSR transforms the temporal patterns of the ECG signal (P, QRS, and T waves) to specific spatial loops in phase space without the need of characteristic wave detections, extending its application to individual identification [12,13,22,24].
The value of time delay determines the morphology of the reconstructed phase-space trajectory and whether the ECG characteristics are sufficiently expanded in the phase space. However, the time delay is empirically determined in the majority of studies without an investigation to explore the effect of various time delays on the accuracy of the ECG biometric authentication. On the other hand, the phase space is partitioned by a series of intersecting vertical and horizontal lines. The grid partition generates a 2D image array to record the absence or presence of the reconstructed trajectory in each partitioned square. The density of the grid partition determines whether the generated portrait image is fine-grained or coarse-grained, which may affect the accuracy of individual identification, but it has not been reported in previous works. In this study, we propose a novel ECG biometric model which utilizes a deep CNN to learn and identify individuals’ patterns over the reconstructed phase portrait. We also investigate the aforementioned effects on the accuracy of the proposed ECG biometric model using the underlying combinations of various time delays (τ = 2, 8, 16, 24, and 36 ms) and various-density grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256) on the basis of the data from the PTB Diagnostic ECG Database, with a purpose of determining an optimal time delay and a lower-complexity model with the maintenance of identification performance.
The rest of this paper is organized as follows. The method section describes ECG biometric data, phase space reconstruction of ECG, and ECG biometric network. In the result section we present the accuracies of individual identification on the basis of phase-space features using various time delays and various-density grid partitions. In the discussion section we talk over beat-to-beat variation, the effects of time delay and grid partition on identification accuracy, and the scale-down of the proposed network.

2. Materials and Methods

2.1. ECG Biometric Data

This study used the data from the PTB Diagnostic ECG Database established by the Physikalisch-Technische Bundesanstalt (PTB), National Metrology Institute of Germany [31]. The PTB database contains 549 records from 290 subjects, with different time durations of around 2 min. Only 52 subjects are healthy, 148 subjects suffer from myocardial infarction, and 18 have cardiomyopathy or heart failure. Each signal is digitized at 1 kSa/s (kilo samples per second) with a resolution of 16 bits. ECG recordings which had a time duration of less than 100 s or were seriously corrupted by noises or artifacts were excluded, so that the included recordings contained more than 125 heart beats with less noise contaminations for network training and testing. Accordingly, a total of 115 ECG recordings covering 17 healthy subjects, 81 subjects with myocardial infarction, and 15 subjects with cardiomyopathy or heart failure were selected to develop our ECG biometric model.
Compared to lead augmented vector left/right (aVL/aVR) that is electrical voltage difference between left/right arm electrode and a combination of right/left arm electrode and left leg electrode, lead I ECG is defined as the voltage difference between the electrodes on left and right arms, which can also be obtained through the fingers of both hands and is the most practical measurement for ECG biometric authentication. Therefore, we chose lead I ECG as an input for the ECG biometric model in this study.

2.2. Data Processing and Phase Space Reconstruction

Since a good quality of the ECG signals is important for recognizing individuals’ ECG patterns, we employed digital filtering to keep a high signal-to-noise ratio in the processed ECGs prior to the application to the ECG biometric model. The following two anti-causal Butterworth filters were used: (1) a high-pass filter with a cutoff frequency of 1 Hz to remove low-frequency baseline wander, and (2) a lowpass filter with a cutoff frequency of 40 Hz to suppress high-frequency noises. The segmentation of individual ECG signal into heartbeats was performed using the following steps:
(1)
QRS complex was detected using a Python ECG QRS detector [32] based on the Pan-Tompkins algorithm [33].
(2)
Each detected QRS was further verified to exclude wrong detections and ventricular premature contraction beats.
(3)
For each inclusive heartbeat, the preceding 35% and the succeeding 65% of the samples around the QRS fiducial point in a cardiac cycle were extracted.
After the aforementioned processing, 125 available heartbeats could be extracted for network training and testing.
We used PSR to delineate ECG’s morphological features by the following steps: the ECG was upsampled by a factor of 10 using Gaussian interpolation, then normalized to a series of values from 0 to 1 by
x n ( t ) = x ( t ) x min x max x min
where xmax and xmin were the maximum and minimum of x(t). The normalized, upsampled ECG was embedded into a series of 2D vectors with time delay τ.
X ( t ) = [ x n ( t )   x n ( t + τ ) ]
Plotting these vectors on a 2D coordinate plane produced a phase-space trajectory. As shown in Figure 1, the phase-space trajectory characterizes the morphological relationship of samples related to their delayed samples. QRS complex is transformed to an outer loop, which is the major loop in the phase space. T wave forms an inner loop and P wave, a minor loop. A phase portrait was finally generated as input features for ECG biometric network.

2.3. ECG-Biometric Network

A PSR-based CNN was developed for ECG biometric authentication. Single-cycle ECG was transformed to a phase portrait used as input features to the CNN. As listed in Table 1, five convolutional layers, modified from the AlexNet [34], were used to extract low- and high-level features from the phase portrait. The output of the last convolution layer was flattened and fed into a fully connected network, which contained two hidden layers and one output layer of 115 nodes corresponding to 115 individuals. The activation function was chosen as ReLu function in the hidden layers, where the output layer used softmax function to represent a categorical probability distribution.

2.4. Network Training and Testing

The proposed ECG biometric network was implemented in the Visual Studio Code Ver. 1.57.0 (Microsoft Corp., Redmond, WA, USA) using Python Ver. 3.8.10 (Python Software Foundation, Wilmington, DE, USA) and Pytorch Ver. 1.9.0 (Pytorch Org., USA). Networks were then trained and validated in a server computer with an eight-core Intel CoreTM i9-9900K CPU (Intel, Santa Clara, CA, USA) and a 24-GB Titan RTX GPU (Nvidia, Santa Clara, CA, USA) using CUDA technology.
Each ECG recording was partitioned into the following two parts: the former 100 heartbeats were included as training data, and the latter 25 heartbeats were used to validate the trained network. A total of 75 epochs were applied to train the network using an Adam optimization algorithm based on cross entropy loss function. To prevent overfitting and force the network to learn general and robust patterns from the data, a fraction of the hidden units in the fully connected network were randomly dropped at every iteration at a probability of 0.5. At every training epoch, the trained network was tested by the validation data. The accuracy was defined as the percentage of correct identifications among all validation data.
To examine the effect of time delay and grid partition on the accuracy of individual identification, a total of 50 combinations of ten time delays (τ = 2, 4, 8, 12, 16, 20, 24, 28, 32, and 36 ms) and five types of grids (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256 partitions) were used to generate phase portraits for individual identifications. To have all examinations be performed on the basis of the same network architecture, all phase portraits were interpolated to 2D images of the same size (256 × 256 pixels).

3. Results

Figure 2 shows the phase portraits transformed from the single-cycle ECG using various time delays over various-density grid partitions. Its QRS, T, and P waves are respectively transformed to a major loop, a secondary loop, and a minor loop on each portrait image, but no visible U-related loops are observed because of unapparent U wave in Figure 1. Using a small time delay (τ = 2 or 4 ms) narrows QRS loops and condenses P and T loops. When the time delay is set to 8, 12, and 16 ms, smooth, well-expanded QRS loops are produced. As the time delay is further increased, T and P loops are better expanded, but QRS loops are flattened (τ = 20 and 24 ms) and twisted (τ ≥ 28 ms). On the other hand, ECG phase portraits on the basis of a 256 × 256 grid partition disclose the most distinct phase-space trajectory. As the grid partition is reduced, phase-space trajectories becomes ambiguous, in particular, the parts attributed to P and T waves.
Figure 3a shows the accuracy of individual identification over the first 30 epochs on the underlying phase portraits embedded by various time delays. No matter which time delay was adopted, more training epochs were required to reach a saturated accuracy when a low-density grid partition was used compared to the use of high-density grid partition. That is, more epochs were required for learning discernible features from coarse-phase portraits. As shown in Figure 3b, more training epochs were also needed to reach a saturated accuracy when individual identification was performed on the basis of phase portraits embedded using a small time delay than those using a large time delay, no matter which density was adopted in grid partition. That is, more epochs were required for learning discernible features from poorly expanded phase portraits.
Figure 4 shows the average accuracy of individual identification on the underlying phase portraits using the combinations of five grid partitions (r = 16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256 partitions) and ten time delays (τ = 2, 4, 8, 12, 16, 20, 24, 28, 32, and 36 ms). The representative accuracies are listed in Table 2. In the early training stage (epochs 16~30), identification performances were poor when a small time delay and low-density grid partition were used (97.05% at τ = 2 ms and r = 16 × 16 partitions) because of insufficient phase-space expansion and poor portrait details. Even though more training epochs (epochs 61~75) had been executed, its accuracy (98.14%) was still lower than those using a high-density grid partition (99.05% at τ = 2 ms and r = 256 × 256 partitions), large time delay (99.26% at τ = 36 ms and r = 16 × 16 partitions), or both (99.51% at τ = 36 ms and r = 256 × 256 partitions). The average accuracies are also plotted in Figure 5. Higher identification accuracies were achieved when the time delays were set to 20, 24 and 28 ms.

4. Discussion

The intervals between ECG characteristics, particularly between R and T, vary as heart period, creating shifts of the associated samples in the featured vector. Fortunately, the phase-space reconstruction maps ECG characteristics to specific loops in the phase space, which are less affected by interval variations. In addition, respiration-induced movements of the thoracic wall related to the heart typically cause beat-to-beat variations of QRS morphology, thereby producing scaling and shifting of phase-space loops [35]. Normalization of ECG amplitudes between 0 and 1 can fix the problem of QRS morphological variation, as shown in Figure 6a. Even so, some loop variations are still observed among consecutive normalized ECG beats, as shown in Figure 6b. In this study, we used 100 consecutive single-beat phase portraits from each individual to train the network. The existence of loop variations in the training data also provided a kind of data augmentation to increase the robustness of CNN to the loop variation.
PSR coverts morphological characteristics (temporal patterns) in an ECG signal into phase-space loops (spatial patterns). The converted spatial patterns are different among individuals, leading to good performances in ECG biometrics [12,13,22,25]. In particular, each spatial pattern has specific locations in the phase space, allowing 2D CNN to efficiently capture the phase-space fingerprints. The adopted time delay in these studies was chosen to produce a transformed phase-space trajectory that is as smooth as possible. In this study, we examined the effect of time delay on ECG biometrics. The use of a small time delay caused insufficient expansions of ECG characteristic waves, whereby more training epochs were required to achieve a saturated accuracy, but the accuracy was still lower than that achieved from using a large time delay. In addition, QRS loop was flattened or twisted, in addition to a better expansion of P and T waves, when the time delay was increased to 20 ms or larger. This study demonstrated a better identification performance when the time delays were set as 20, 24 and 28 ms. We regarded that this was because P, QRS, and T waves were well expanded in these configurations.
The PSR of ECG samples over the grid with a high number of partitions depicts a phase-space trajectory in more details, but it also increases the size of the generated portrait images; accordingly, the scale of the convolution layers should be enlarged to learn the portrait features. In this study, all phase portraits were interpolated to 256 × 256 pixels no matter how many partitions were used to generate ECG phase portrait; therefore, the achieved accuracy could be compared on the basis of the same-scale neural network. As a result, phase space expansion based on 16 × 16 partitions produced the lowest accuracy and needed more training epochs to reach a saturated accuracy. The use of 32 × 32 partitions improved this situation, and the obtained accuracy was close to those using 64 × 64, 128 × 128, or 256 × 256 partitions. A similar, scaled-down network, as illustrated in Table 3, was designed especially for portrait images with 32 × 32 pixels. Figure 7 shows the comparison of identification accuracy using the scaled-down network and original network which were designed for portrait images with 32 × 32 and 256 × 256 pixels, respectively, but the portrait images were derived on the basis of PSR over 32 × 32 partitions. The scaled-down network produced comparable accuracies to the original network. The scaled-down network could largely reduce the number of weights from 72,362,963 to 6,711,251, and the training time from 26 min 32 s to 5 min 37 s.
Pathological heartbeats caused by extra-systolic ventricular contractions are commonly different from the healthy heartbeats in the same individual, so they cannot be recognized as the individual’s pattern by the trained networks on the basis of healthy heartbeats. Fortunately, extra-systolic heartbeats commonly occur intermittently or interlaced (bigeminy or trigeminy). There are still healthy heartbeats that can be recognized by the trained network. That is why most ECG biometric authentication methods select healthy heartbeats to develop their biometric networks.
Table 4 shows the state of the art in ECG biometric authentication methods using two-dimensional convolutional neural networks. Two studies used three-layer CNN to extract discernible features from ECG images [22,25]. Two studies [20,21] and the present study used deeper neural networks, such as VGG [36], AlexNet [34], GoogleNet [37], and ResNet [38], to capture deep-learning ECG features. A study used the ensemble of two three-layer CNNs and a recurrent neural network to identify individuals’ ECGs [24]. As shown in a study by Byeon et al. [39], the accuracy of individual identification was slightly increased as the network went deeper in sequence of AlexNet, GoogleNet, and ResNet. Although, it is not easy to compare the accuracy of individual identification among various studies because of various data selections in different datasets. The proposed architecture on the basis of the AlexNet and PSR could achieve high accuracy in individual identification. The effect of network depth and architecture on individual identification over the ECG phase-space features can be further studied in future work.

5. Conclusions

PSR of ECG discloses individual trajectory patterns no matter which time delay was adopted. An appropriate time delay from 20 to 28 ms was suggested since it produced a good phase-space expansion of P, QRS, and T waves and a better performance in individual identification using the proposed CNN. In addition, using a high-density grid with 256 × 256 partitions depicted a detailed phase-space trajectory and yielded a high identification accuracy, but needed a large-scale neural network to deal with the augmented information. The use of a scaled-down network on the basis of PSR over a low-density grid with 32 × 32 partitions achieved a comparable accuracy with the benefit of reductions in network size and training time by 10 and 5 folds, respectively. In summary, temporal-to-spatial mapping through PSR directly produces 2D images with disclosed ECG characteristics, enabling an efficient learning of individual fingerprints by the 2D CNN. The use of the scaled-down CNN with appropriate time delay largely reduces the complexity of the model with a maintenance of identification performance, which is beneficial for practical implementation of the ECG biometric on authentication systems.

Author Contributions

Conceptualization, H.-L.C. and S.-C.F.; methodology, H.-W.C. and S.-C.F.; software, H.-W.C. and W.-Y.H.; validation, W.-Y.H. and P.-J.H.; writing—original draft preparation, H.-L.C.; writing—review and editing, H.-L.C. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to acknowledge the support and grants from the Chang Gung Memorial Hospital, Taoyuan, Taiwan (CMRPD2I0033) and the National Science and Technology Council, Taiwan (107-2221-E-182-008-MY3).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Conflicts of Interest

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

  1. Huang, P.; Guo, L.; Li, M.; Fang, Y. Practical privacy-preserving ECG-based authentication for IoT-based healthcare. IEEE Internet Things J. 2019, 6, 9200–9210. [Google Scholar] [CrossRef]
  2. Biel, L.; Pettersson, O.; Philipson, L.; Wide, P. ECG analysis: A new approach in human identification. IEEE Trans. Instrum. Meas. 2001, 50, 808–812. [Google Scholar] [CrossRef] [Green Version]
  3. Israel, S.A.; Irvine, J.M.; Cheng, A.; Wiederhold, M.D.; Wiederhold, B.K. ECG to identify individuals. Pattern Recognit. 2005, 38, 133–142. [Google Scholar] [CrossRef]
  4. Kyoso, M.; Uchiyama, A. Development of an ECG Identification System. In Proceedings of the 2001 Conference 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Istanbul, Turkey, 25–28 October 2001; Volume 3724, pp. 3721–3723. [Google Scholar]
  5. Shen, T.W.; Tompkins, W.J.; Hu, Y.H. One-Lead ECG for Identity Verification. In Proceedings of the Second Joint 24th Annual Conference and the Annual Fall Meeting of the Biomedical Engineering Society Engineering in Medicine and Biology, Houston, TX, USA, 23–26 October 2002; Volume 61, pp. 62–63. [Google Scholar]
  6. Palaniappan, R.; Krishnan, S.M. Identifying Individuals Using ECG beats. In Proceedings of the 2004 International Conference on Signal Processing and Communications, 2004. SPCOM’ 04., Bangalore, India, 11–14 December 2004; pp. 569–572. [Google Scholar]
  7. Tan, R.; Perkowski, M. Toward improving electrocardiogram (ECG) biometric verification using mobile sensors: A two-stage classifier approach. Sensors 2017, 17, 410. [Google Scholar] [CrossRef]
  8. Wübbeler, G.; Stavridis, M.; Kreiseler, D.; Bousseljot, R.-D.; Elster, C. Verification of humans using the electrocardiogram. Pattern Recognit. Lett. 2007, 28, 1172–1175. [Google Scholar] [CrossRef]
  9. Lee, W.; Kim, S.; Kim, D. Individual biometric identification using multi-cycle electrocardiographic waveform patterns. Sensors 2018, 18, 1005. [Google Scholar] [CrossRef] [Green Version]
  10. Chan, A.D.C.; Hamdy, M.M.; Badre, A.; Badee, V. Wavelet distance measure for person identification using electrocardiograms. IEEE Trans. Instrum. Meas. 2008, 57, 248–253. [Google Scholar] [CrossRef]
  11. Irvine, J.M.; Israel, S.A.; Todd Scruggs, W.; Worek, W.J. eigenPulse: Robust human identification from cardiovascular function. Pattern Recognit. 2008, 41, 3427–3435. [Google Scholar] [CrossRef]
  12. Fang, S.-C.; Chan, H.-L. Human identification by quantifying similarity and dissimilarity in electrocardiogram phase space. Pattern Recognit. 2009, 42, 1824–1831. [Google Scholar] [CrossRef]
  13. Fang, S.-C.; Chan, H.-L. QRS detection-free electrocardiogram biometrics in the reconstructed phase space. Pattern Recognit. Lett. 2013, 34, 595–602. [Google Scholar] [CrossRef]
  14. Luo, S.; Johnston, P. A review of electrocardiogram filtering. J. Electrocardiol. 2010, 43, 486–496. [Google Scholar] [CrossRef]
  15. Adochiei, N.; David, V.; Adochiei, F.; Tudosa, I. ECG Waves and Features Extraction Using Wavelet Multi-Resolution Analysis. In Proceedings of the 2011 E-Health and Bioengineering Conference (EHB), Iasi, Romania, 24–26 November 2011; pp. 1–4. [Google Scholar]
  16. Zhang, Q.; Zhou, D.; Zeng, X. HeartID: A multiresolution convolutional neural network for ECG-based biometric human identification in smart health applications. IEEE Access 2017, 5, 11805–11816. [Google Scholar] [CrossRef]
  17. Akhbari, M.; Shamsollahi, M.B.; Jutten, C.; Armoundas, A.A.; Sayadi, O. ECG denoising and fiducial point extraction using an extended Kalman filtering framework with linear and nonlinear phase observations. Physiol. Meas. 2016, 37, 203. [Google Scholar] [CrossRef] [Green Version]
  18. Chee-Ming, T.; Salleh, S.H. ECG based Personal Identification Using Extended Kalman Filter. In Proceedings of the 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), Kuala Lumpur, Malaysia, 10–13 May 2010; pp. 774–777. [Google Scholar]
  19. Lynn, H.M.; Pan, S.B.; Kim, P. A Deep bidirectional GRU network model for biometric electrocardiogram classification based on recurrent neural networks. IEEE Access 2019, 7, 145395–145405. [Google Scholar] [CrossRef]
  20. Hammad, M.; Liu, Y.; Wang, K. Multimodal biometric authentication systems using convolution neural network based on different level fusion of ECG and fingerprint. IEEE Access 2019, 7, 26527–26542. [Google Scholar] [CrossRef]
  21. Byeon, Y.-H.; Pan, S.-B.; Kwak, K.-C. Intelligent deep models based on scalograms of electrocardiogram signals for biometrics. Sensors 2019, 19, 935. [Google Scholar] [CrossRef] [Green Version]
  22. Ciocoiu, I.B.; Cleju, N. Off-the-Person ECG Biometrics Using Convolutional Neural Networks. In Proceedings of the 2019 International Symposium on Signals, Circuits and Systems (ISSCS), Iasi, Romania, 11–12 July 2019; pp. 1–4. [Google Scholar]
  23. Ranjan, A. Permanence of ECG Biometric: Experiments Using Convolutional Neural Networks. In Proceedings of the 2019 International Conference on Biometrics (ICB), Crete, Greece, 4–7 June 2019; pp. 1–6. [Google Scholar]
  24. Kim, M.-G.; Ko, H.; Pan, S.B. A study on user recognition using 2D ECG based on ensemble of deep convolutional neural networks. J. Ambient. Intell. Humaniz. Comput. 2020, 11, 1859–1867. [Google Scholar] [CrossRef] [Green Version]
  25. Zhang, Q.; Zhou, D. Deep arm/ear-ECG image learning for highly wearable biometric human identification. Ann. Biomed. Eng. 2018, 46, 122–134. [Google Scholar] [CrossRef]
  26. Takens, F. Detecting Strange Attractors in Turbulence. In Dynamical Systems and Turbulence, Warwick 1980; Springer: Berlin/Heidelberg, Germany, 2006; pp. 366–381. [Google Scholar]
  27. Kaplan, D.T.; Cohen, R.J. Is fibrillation chaos? Circ. Res. 1990, 67, 886–892. [Google Scholar] [CrossRef] [Green Version]
  28. Lee, J.-W.; Kim, K.-S.; Lee, B.; Lee, B.; Lee, M.-H. A real time QRS detection using delay-coordinate mapping for the microcontroller implementation. Ann. Biomed. Eng. 2002, 30, 1140–1151. [Google Scholar] [CrossRef]
  29. Chan, H.-L.; Wang, C.-L.; Fang, S.-C.; Chao, P.-K.; Wei, J.-D. Recognition of ventricular extrasystoles over the reconstructed phase space of electrocardiogram. Ann. Biomed. Eng. 2010, 38, 813–823. [Google Scholar] [CrossRef]
  30. Chen, H.; Das, S.; Morgan, J.M.; Maharatna, K. Prediction and classification of ventricular arrhythmia based on phase-space reconstruction and fuzzy c-means clustering. Comput. Biol. Med. 2022, 142, 105180. [Google Scholar] [CrossRef] [PubMed]
  31. Bousseljot, R.; Kreiseler, D.; Schnabel, A. Nutzung der EKG-Signaldatenbank CARDIODAT der PTB über das Internet. J. Biomed. Tech. Biomed. Eng. 1995, 40, 317–318. [Google Scholar] [CrossRef]
  32. Sznajder, M.; Łukowska, M. Python Online and Offline ECG QRS Detector based on the Pan-Tomkins algorithm (Version v1.1.0). Zenodo 2017. [Google Scholar] [CrossRef]
  33. Pan, J.; Tompkins, W.J. A real-time QRS detection algorithm. IEEE Trans. Biomed. Eng. 1985, BME–32, 230–236. [Google Scholar] [CrossRef]
  34. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef] [Green Version]
  35. Sornmo, L. Vectorcardiographic loop alignment and morphologic beat-to-beat variability. IEEE Trans. Biomed. Eng. 1998, 45, 1401–1413. [Google Scholar] [CrossRef]
  36. Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
  37. Szegedy, C.; Wei, L.; Yangqing, J.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going Deeper with Convolutions. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
  38. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  39. Byeon, Y.-H.; Lee, J.-N.; Pan, S.-B.; Kwak, K.-C. Multilinear EigenECGs and FisherECGs for individual identification from information obtained by an electrocardiogram sensor. Symmetry 2018, 10, 487. [Google Scholar] [CrossRef] [Green Version]
  40. da Silva, H.P.; Lourenço, A.; Fred, A.; Raposo, N.; Aires-de-Sousa, M. Check Your Biosignals Here: A new dataset for off-the-person ECG biometrics. Comput. Methods Programs Biomed. 2014, 113, 503–514. [Google Scholar] [CrossRef]
  41. Wahabi, S.; Pouryayevali, S.; Hari, S.; Hatzinakos, D. On evaluating ECG biometric systems: Session-dependence and body posture. IEEE Trans. Inf. Forensics Secur. 2014, 9, 2002–2013. [Google Scholar] [CrossRef]
  42. Goldberger, A.L.; Amaral, L.A.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.K.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation 2000, 101, E215–E220. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Zhang, Q.; Zhou, D.; Zeng, X. Highly wearable cuff-less blood pressure and heart rate monitoring with single-arm electrocardiogram and photoplethysmogram signals. Biomed. Eng. Online 2017, 16, 23. [Google Scholar] [CrossRef] [Green Version]
Figure 1. (a) A single-cycle ECG with P, QRS complex, and T waves. (b) The transformed phase portrait using a time delay of 20 ms.
Figure 1. (a) A single-cycle ECG with P, QRS complex, and T waves. (b) The transformed phase portrait using a time delay of 20 ms.
Sensors 23 03164 g001
Figure 2. Phase portraits transformed from single-cycle ECG, using various time delays over various-density grid partitions. QRS complex, T, and P waves are transformed to a major outer loop, a secondary inner loop, and a minor loop, respectively, in each portrait. The distinctness of the phase portrait decreases as its grid partition is reduced from 64 × 64 to 16 × 16 partitions. For a visual comparison, all phase portraits are displayed with the same picture size.
Figure 2. Phase portraits transformed from single-cycle ECG, using various time delays over various-density grid partitions. QRS complex, T, and P waves are transformed to a major outer loop, a secondary inner loop, and a minor loop, respectively, in each portrait. The distinctness of the phase portrait decreases as its grid partition is reduced from 64 × 64 to 16 × 16 partitions. For a visual comparison, all phase portraits are displayed with the same picture size.
Sensors 23 03164 g002
Figure 3. Accuracy of individual identification over the first 30 epochs on the underlying phase portraits (a) embedded using five time delays (τ = 2, 8, 16, 24, and 36 ms) and (b) generated over five grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256).
Figure 3. Accuracy of individual identification over the first 30 epochs on the underlying phase portraits (a) embedded using five time delays (τ = 2, 8, 16, 24, and 36 ms) and (b) generated over five grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256).
Sensors 23 03164 g003
Figure 4. Map of average accuracy of individual identification on the underlying phase portraits using the combinations of five grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256) and ten time delays (2, 4, 8, 12, 16, 20, 24, 28, 32, and 36 ms) during four training stages (epochs 16~30, 31~45, 46~60, and 61~75).
Figure 4. Map of average accuracy of individual identification on the underlying phase portraits using the combinations of five grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256) and ten time delays (2, 4, 8, 12, 16, 20, 24, 28, 32, and 36 ms) during four training stages (epochs 16~30, 31~45, 46~60, and 61~75).
Sensors 23 03164 g004
Figure 5. Plot of average accuracies of individual identification on the underlying phase portraits using the combinations of five grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256) and ten time delays (2, 4, 8, 12, 16, 20, 24, 28, 32, and 36 ms) during four training stages: (a) epochs 16~30, (b) epochs 31~45, (c) epochs 46~60, and (d) epochs 61~75.
Figure 5. Plot of average accuracies of individual identification on the underlying phase portraits using the combinations of five grid partitions (16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256) and ten time delays (2, 4, 8, 12, 16, 20, 24, 28, 32, and 36 ms) during four training stages: (a) epochs 16~30, (b) epochs 31~45, (c) epochs 46~60, and (d) epochs 61~75.
Sensors 23 03164 g005
Figure 6. (a) Waveforms of 100 consecutive single-beat ECG in a subject. (b) The transformed phase space trajectories using a time delay of 20 ms.
Figure 6. (a) Waveforms of 100 consecutive single-beat ECG in a subject. (b) The transformed phase space trajectories using a time delay of 20 ms.
Sensors 23 03164 g006
Figure 7. The identification accuracy using the scaled-down network and original network which were designed for portrait images with 32 × 32 and 256 × 256 pixels, respectively, but the portrait images were derived on the basis of phase-space reconstruction over 32 × 32 grid partition.
Figure 7. The identification accuracy using the scaled-down network and original network which were designed for portrait images with 32 × 32 and 256 × 256 pixels, respectively, but the portrait images were derived on the basis of phase-space reconstruction over 32 × 32 grid partition.
Sensors 23 03164 g007
Table 1. The network architecture used as ECG biometric based on ECG phase portrait with 256 × 256 pixels.
Table 1. The network architecture used as ECG biometric based on ECG phase portrait with 256 × 256 pixels.
LayerInputOutputKernelPaddingStride
Feature extractionConv2d256 × 256 × 162 × 62 × 9612 × 12 4
Max pooling62 × 62 × 9631 × 31 × 962 × 2 2
Conv2d31 × 31 × 9631 × 31 × 2565 × 521
Max pooling31 × 31 × 25615 × 15 × 2563 × 3 2
Conv2d15 × 15 × 25615 × 15 × 3843 × 311
Conv2d15 × 15 × 38415 × 15 × 3843 × 311
Conv2d15 × 15 × 25615 × 15 × 2563 × 311
Max pooling15 × 15 × 2567 × 7 × 2563 × 3 2
ClassifierFCN12,5444096
Dropout p = 0.5
FCN40964096
Dropout p = 0.5
FCN4096115
The activation function is chosen as ReLu function in the hidden layers, where the output layer uses softmax function to represent a categorical probability distribution.
Table 2. Average accuracy of individual identification on the underlying phase portraits using the combinations of five grid partitions and five time delays during four training stages.
Table 2. Average accuracy of individual identification on the underlying phase portraits using the combinations of five grid partitions and five time delays during four training stages.
16 × 1632 × 3264 × 64128 × 128256 × 256
Epochs 16~30
   τ = 2 ms97.0597.4198.5598.5398.62
   τ = 8 ms98.1498.9199.1999.1198.83
   τ = 16 ms98.7699.0399.2899.2599.33
   τ = 24 ms99.1799.3999.2799.5199.41
   τ = 36 ms98.8499.1699.1899.3899.07
Epochs 31~45
   τ = 2 ms97.1897.9298.8198.8098.81
   τ = 8 ms98.6599.2499.0199.2699.07
   τ = 16 ms99.1499.1399.2699.4499.36
   τ = 24 ms99.4299.4999.4799.4599.42
   τ = 36 ms99.0499.0999.1799.3599.36
Epochs 46~60
   τ = 2 ms97.6398.4399.1598.8898.71
   τ = 8 ms98.8999.0399.1799.0199.13
   τ = 16 ms99.2899.3499.3499.5099.55
   τ = 24 ms99.3999.5299.5999.4499.39
   τ = 36 ms99.2299.3799.2399.4699.34
Epochs 61~75
   τ = 2 ms98.1498.6299.0498.9399.05
   τ = 8 ms98.8299.3199.3799.299.17
   τ = 16 ms99.3699.3799.5999.5499.58
   τ = 24 ms99.4599.6399.5899.5899.54
   τ = 36 ms99.2699.4999.4999.5199.51
Table 3. The scaled-down network used as ECG biometric on the basis of phase portrait with 32 × 32 pixels.
Table 3. The scaled-down network used as ECG biometric on the basis of phase portrait with 32 × 32 pixels.
LayerInputOutputKernelPaddingStride
Feature extractionConv2d32 × 32 × 128 × 28 × 325 × 5 1
Max pooling28 × 28 × 3214 × 14 × 322 × 2 2
Conv2d14 × 14 × 3213 × 13 × 322 × 2 1
ClassifierFCN54081024
Dropout p = 0.5
FCN10241024
Dropout p = 0.5
FCN1024115
The activation function is chosen as ReLu function in the hidden layers, where the output layer uses softmax function to represent a categorical probability distribution.
Table 4. The state of the art in ECG biometric authentication methods using two-dimensional convolutional neural networks.
Table 4. The state of the art in ECG biometric authentication methods using two-dimensional convolutional neural networks.
AuthorsDatasetSubject No.Input ECGTransformationModelAccuracy, %
Hammad et al. [20]PTB [31]
CYBHi [40]
100
65
single beatplot of amplitude vs. timeVGG [36]96.8
97.2
Byeon et al. [21]PTB, CU [39] 211
100
single beatcontinuous wavelet transformAlexNet [34] GoogleNet [37]
ResNet [38]
97.4, 92.3
97.8, 93.1
98.1, 93.2
Ciocoiu and Cleju [22]UofT [41]20single beatphase-space
reconstruction
3-layer
CNN
97.2
Kim et al. [24]MIT-BIH NSRDB [42]31multiple beatsplot of amplitude vs. time with beats stackingensemble of
2 CNNs and RNN
98.9
Zhang and Zhou [25]SADB [43]10multiple beatsphase-space
reconstruction
3-layer
CNN
98.8
Present studyPTB115single beatphase-space
reconstruction
AlexNet99.5
All models are constructed using convolutional neural networks (CNN) with or without recurrent neural network (RNN).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chan, H.-L.; Chang, H.-W.; Hsu, W.-Y.; Huang, P.-J.; Fang, S.-C. Convolutional Neural Network for Individual Identification Using Phase Space Reconstruction of Electrocardiogram. Sensors 2023, 23, 3164. https://doi.org/10.3390/s23063164

AMA Style

Chan H-L, Chang H-W, Hsu W-Y, Huang P-J, Fang S-C. Convolutional Neural Network for Individual Identification Using Phase Space Reconstruction of Electrocardiogram. Sensors. 2023; 23(6):3164. https://doi.org/10.3390/s23063164

Chicago/Turabian Style

Chan, Hsiao-Lung, Hung-Wei Chang, Wen-Yen Hsu, Po-Jung Huang, and Shih-Chin Fang. 2023. "Convolutional Neural Network for Individual Identification Using Phase Space Reconstruction of Electrocardiogram" Sensors 23, no. 6: 3164. https://doi.org/10.3390/s23063164

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop