Next Article in Journal
High-Performance Motion Estimation for Image Sensors with Video Compression
Previous Article in Journal
Low Power Wireless Smoke Alarm System in Home Fires
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

ECG Sensor Card with Evolving RBP Algorithms for Human Verification

1
Department of Computer Science and Technology, Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen 518055, China
2
Department of Mathematics, Tunghai University, Taichung 40704, Taiwan
3
Department of Mathematics, University of Michigan, Flint 48502, MI, USA
*
Authors to whom correspondence should be addressed.
Sensors 2015, 15(8), 20730-20751; https://doi.org/10.3390/s150820730
Submission received: 14 June 2015 / Revised: 28 July 2015 / Accepted: 6 August 2015 / Published: 21 August 2015
(This article belongs to the Section Physical Sensors)

Abstract

:
It is known that cardiac and respiratory rhythms in electrocardiograms (ECGs) are highly nonlinear and non-stationary. As a result, most traditional time-domain algorithms are inadequate for characterizing the complex dynamics of the ECG. This paper proposes a new ECG sensor card and a statistical-based ECG algorithm, with the aid of a reduced binary pattern (RBP), with the aim of achieving faster ECG human identity recognition with high accuracy. The proposed algorithm has one advantage that previous ECG algorithms lack—the waveform complex information and de-noising preprocessing can be bypassed; therefore, it is more suitable for non-stationary ECG signals. Experimental results tested on two public ECG databases (MIT-BIH) from MIT University confirm that the proposed scheme is feasible with excellent accuracy, low complexity, and speedy processing. To be more specific, the advanced RBP algorithm achieves high accuracy in human identity recognition and is executed at least nine times faster than previous algorithms. Moreover, based on the test results from a long-term ECG database, the evolving RBP algorithm also demonstrates superior capability in handling long-term and non-stationary ECG signals.

1. Introduction

Access control systems using fingerprint, face, and iris biometric authentications [1] are extensively applied, but some issues, such as privacy and cost, still need to be addressed. Recently, security-related studies have shown a tendency to focus on realistic electrocardiogram (ECG or EKG) identifications/verifications for access control.
An ECG is a voltage variation signal that detects the electrical changes of the heart on the skin which are caused when the heart muscle depolarizes during each heartbeat.
The purpose of this paper is to propose a statistical-based algorithm using the reduced binary pattern (RBP) for human identity recognition. The suggested algorithm meets the accuracy and cost requirements in an ECG verification system. Any ECG signal will first be converted into concise binary patterns and then statistical counting and ranking processes follow for verification purposes. Its advantages will be illustrated as follows:
  • Signal preprocessing, such as de-noising, adjusting signal means, and so on, can be totally neglected. The noise can simply be viewed as one verification feature.
  • Unlike other published methods, ECG QRS (Quantitative Regression Swedish) detection can be waived completely in our scheme. The algorithm can be performed directly and still maintains robustness to dynamic variation of ECG signals.
  • Variations in length and sampling rates of matching signals are absolutely allowed.
  • The algorithm requires less ECG information content and performs in a timely manner with low computational complexity. It does not need ECG information content like R-R intervals, mean and variation of ECG signals, and so on.
The remaining parts of the paper are organized as follows. An overview of related works on ECG verification is presented in Section 2; the outline of the proposed algorithm is introduced in Section 3, followed by a detailed description; the experimental results are shown and discussed in Section IV; and some concluding remarks are drawn in Section V.

2. Related Studies

The classification of ECG algorithms is shown in Figure 1. In this figure, FFT and DCT denote the fast Fourier transform and discrete cosine transform, respectively; SVM denotes the support vector machine; ICA, PCA, and LDA denote independent component analysis, principal component analysis, and linear discriminant analysis, respectively; GA and PSO denote genetic algorithm and particle swarm optimization, respectively. In addition to ECG-based diagnostics for heart diseases [2,3], ECG has also been applied to data compression [4,5,6], information watermarking [7,8], and human verification/identification [9,10,11,12,13,14,15,16,17,18,19,20,21,22,23].
Signal pre-processing, feature extraction, data classification, data reduction, and intelligence optimization are the key research focuses in human identification/verification. High- and low-pass filters and QRS detection are the main schemes employed in signal preprocessing, while fuzzy rule [24], SVM [13], neural network [2], Bayesian [25], and rule-based [26] algorithms are frequently adopted in data classification. For data reduction, ICA [27], PCA [17], and LDA [28] have been used in ECG signal processing. Intelligence optimization techniques such as GA [29], PSO, ant-colony, and so on are commonly utilized for tuning the parameters of the aforementioned algorithms.
Figure 1. The classification of ECG algorithms.
Figure 1. The classification of ECG algorithms.
Sensors 15 20730 g001
Most ECG-based human identification/verification methods rely on feature extraction derived from the ECG signals. The features are usually extracted according to three models: transform-based, waveform-based, and statistical-based.
The transform-based algorithms consist of wavelet transforms [9,10,11,12,13,14] and frequency domain transforms, including Fourier transform [15] or DCT [16]. Since the wavelet transform contains information in the time and frequency domain, it is more popular than the frequency domain transform.
Correlation coefficients and measurements of wavelet distances have been used in matching acquired ECG signals and modules [9]. Since this identification/verification method requires heavy calculation, its implementation is restricted in the statistical sense. The feature selection algorithm in [12] applied the feature set evaluation (FSE) k-nearest neighbor (k-NN) algorithm to improve low recognition rates and used the eigenspace method to reduce data dimensions; however, this approach is both complicated and time-consuming. In [13], morphological characteristics are first extracted through the wavelet transform and the independent component analysis; SVM follows for identification/verification purposes. Although a high identification rate could be reached, a lengthy feature extraction process seems unavoidable.
Waveform-based algorithms [17,18,19,20,21,22,23] extract different time domain characteristics (distance, height, and area) from fiducial points inside the ECG waveform. These waveform descriptors will be used to match or classify ECG signals in the identification/verification process. These algorithms usually have good accuracy in recognizing regular ECG signals but show opposite results for irregular data.
Some researchers combined a precision-matched result with a waveform neural network in the signal preprocessing stage [18]. This model extracted seven features from the ECG signals based on their amplitude and the interval to be analyzed by the decision-based neural network. The computational complexity depends heavily on the forms of those time-domain ECG signals and the level of difficulty of the matching process carried out by the neural network. Nineteen characteristics are extracted from the time interval, amplitude, and angle of deflection and studied [22]; the identification is examined using Euclidian distances and an adaptive threshold. The eigenvectors used in feature-matching take time but are necessary for all band waves in the ECG signals.
An ECG signal can be described as a non-stationary time series that presents some irregularities in the waveform. Unlike the waveform-based algorithms, the transform-based algorithms analyze the non-stationary information based on the signal’s presentation in the frequency domain. Not only is this process slow, but it is also difficult to extract good features for the purpose of identification.
Statistical-based algorithms usually depend on statistical evaluations (count, mean, and variance) of human identification. They are usually less time-consuming but definitely need a well-designed statistical model to assure high-quality accuracy. A method based on rank order statistics was presented to analyze the human heart beat [30].
The non-stationary behavior of ECG has been utilized in many studies. The fetal ECG was reconstructed with higher-order statistical tools exploiting ECG non-stationary properties associated with post-de-noising wavelets [27]. A de-trended fluctuation analysis to quantify the correlation property in the non-stationary physiological time series was presented [31]. Our previous work for an ECG card access control system [32] focused on the architecture in ECG human identification.
Compared with algorithms presented in the literature, our proposed scheme is capable of providing secure and accurate results with a user-possessed controller. Moreover, it can be easily embedded into the field application structure to ensure the implementation of a feasible ECG identification hardware.

3. System Architecture and Application Example

Even though some ECG biometric identifications have been demonstrated, there is a serious issue regarding the use of a centralized ECG database. Due to implementation cost and accuracy issues, there is not yet a feasible application. In our previous work [32], we put this idea into practice and introduced a portable ECG card for access control. This small ECG card provides a cheap and convenient way to enhance door access security.
An ECG access control system consists of a personal ECG sensor card and an access control device. An ECG card is a small device for storing personal ECG data and will be useful for identity recognition. As suggested in Figure 2, applications of ECG cards include secure personal keys to open cars, houses, deposit boxes, and mobile phones.
Figure 2. Application diagram of ECG sensor card for human verification.
Figure 2. Application diagram of ECG sensor card for human verification.
Sensors 15 20730 g002
The following advantages mean that the ECG approach outperforms the most popular fingerprint approach.
  • The ECG signal is not visible and photo-copying is impossible. Replication of ECG signals is more difficult.
  • ECG data can be measured only by a smaller, low-power-consumption, low-cost, simple circuit.
The blueprint of the architecture of an ECG sensor card is shown on the left of Figure 3. Data are obtained through a contact pad (denoted by “DOT”), and the processing unit, the integrated chip “INA321A”, is in charge of common mode noise removal from the original signal. The main processing unit, the integrated microprocessor “MSP430FG439”, controls and transmits all data to the ZigBee module, a short-range wireless transmission module that communicates with the access control device. The other modules include “SBLCDA4” and “JTAG”, which will be used for LCD (Liquid Crystal Display) display and debugging the microprocessor, respectively.
Figure 3. Architecture and hardware implementation of ECG sensor card.
Figure 3. Architecture and hardware implementation of ECG sensor card.
Sensors 15 20730 g003
One real implementation of the ECG sensor card is shown on the right of Figure 3. An ECG card contains two voltage-sensitive contacts, noise filter modules, a microprocessor, and a wireless transmission module. This small and low-cost device allows practical ECG identification in real life.
A door access control, as shown in Figure 4, serves as one real application of ECG access cards. This card checks whether an ECG signal provided by the user matches that stored inside the card. The controller is an embedded system or personal computer connected with an ECG card via the wireless transmission module. The flowchart in Figure 5 shows the ECG verification process in the door access control.
Figure 4. Door access control: Application of ECG sensor card.
Figure 4. Door access control: Application of ECG sensor card.
Sensors 15 20730 g004
Figure 5. Flow chart of ECG verification processing.
Figure 5. Flow chart of ECG verification processing.
Sensors 15 20730 g005

4. Algorithm

4.1. The Basic RBP Algorithm

The idea of our proposed algorithm, RBP for ECG verification, is related to Yang’s [30] and Kumar’s [33] works, but we expand it to a different field of application. The differences between Yang’s model and our model are as follows:
  • Their approach focuses on the human heartbeats; ours focuses on just the bare ECG signals.
  • They convert, count, and rank P waves in the ECG signals only; we perform these procedures on every sample of ECG data to obtain the reduced binary pattern.
  • They aim for heart disease classification; we focus on human identity recognition through ECG signals.
The processing in our design can be roughly divided into three necessary steps that will be illustrated as follows.
  • Step 1: Reduced Binary Pattern Conversion 
All ECG signals are non-stationary. Consider an ECG signal as   x = { x 1 ,   x 2 ,   x 3 , , x N } , where the real-valued x i corresponds to the i th input datum. Each pair of consecutive input signals is compared and the data are categorized into one of the two cases: a decrease or increase in x i . A preliminary reduced function then maps these two cases to 0 or 1, respectively, according to the rule:
y i = { 0   ,   x i + 1     x i 1   ,     x i + 1   >   x i
This procedure converts the ECG signal of length N to a binary sequence   Y = { y 1 ,     y 2 , , y N 1 } of length   N 1 . Every   m   bits in Y are grouped to construct a reduced binary sequence of length   m , referred to as an m-bit word, and then all such words are collected to form a reduced binary pattern B = { b 1 ,     b 2 , , b N m } where   b k = {   y k ,     y k + 1 ,   , y k + m 1   } . We then convert each m-bit word b k   to its decimal expansion w k .
An example of the reduced binary pattern conversion for   m = 4   is depicted in Figure 6. For instance, the first four-bit word b 1 = { 0001 }   is labeled as w 1 = 1 , which equates to   0 × 2 3 + 0 × 2 2 + 0 × 2 1 + 1 × 2 0 .
Figure 6. Process of the reduced binary pattern conversion.
Figure 6. Process of the reduced binary pattern conversion.
Sensors 15 20730 g006
  • Step 2: Counting and Ranking Processes 
The main theme in this step, as shown in Figure 7, is to count the occurrences of all   w k   and sort them in order of descending frequency.
Let w k be an integer for k = 1 ,   2 ,   ,   N m . It is obvious that values of   w k range from 0 to   2 m 1 . Let integer j w k , and let p ( j ) and n j be the corresponding relative frequency and occurrence of   j . To be exact, p ( j ) = n j N m   and j = 0 ( 2 m 1 ) n j = N m . Next, j   is ranked according to its frequency n j from the largest to the smallest. For example, R ( j ) = 1 means the m -bit words b k , which converts to the same j as those that appear the most frequently in the reduced binary pattern.
Figure 7. The counting and ranking process.
Figure 7. The counting and ranking process.
Sensors 15 20730 g007
  • Step 3: Measurement of Similarity 
Consider two segments of ECG data S 1 and S 2 , which may belong to two distinct subjects. To understand how closely they are related, the measurement of similarity needs to be defined. We incorporate a weighted distance formula [30] to measure the similarity between S 1 and S 2 :
d ( S 1 , S 2 ) = j = 0 ( 2 m 1 )   | R 1 ( j ) R 2 ( j ) |   p 1 ( j )   p 2 ( j ) ( 2 m 1 ) · j = 0 ( 2 m 1 ) p 1 ( j ) p 2 ( j )
where the segment means a sequence of sampled ECG data of 10 sample periods which serves as a basic unit for our analysis. Each sample period denotes the ECG signal in an R-R interval. p ( j ) and   R ( j ) represent the relative frequency and rank of   j   in the sequence S i ,   i = 1  or  2 . The absolute difference between two ranks is multiplied by the normalized probabilities as a weighted sum; the factor   1 2 m 1   assures that all values of measurements lie within the scope of (0, 1).
Consider two groups of ECG data, S L and S K , each containing   m L and m K segments, respectively. We define the measurement of similarity between these two groups:
D ( S L , S K ) = 1 m L · m K S 1 S L S 2 S K d ( S 1 , S 2 )
where S 1 and S 2 are the corresponding segments from S L and S K , respectively; d ( S 1 , S 2 ) denotes the associated distance between these segments. D ( S L , S K ) is the average distance of all segments from S L and S K . If   S L = S K , D ( S L , S K ) is referred to as the intra-group distance; otherwise it is the inter-group distance.

4.2. The Advanced RBP Algorithm

Next, we consider a new scaling factor   α , which is an increment in the length of the interval, in the RBP algorithm. All steps in this algorithm are similar to those performed in the basic design, except that the binary sequence will be obtained from the comparison of x ( α · i + 1 ) and x ( α · i 1 ) instead of x i + 1 and x i , where   x i   represents the raw ECG data. The reduced function (1) is now replaced with:
y ( α · i 1 ) = { 0   ,   x ( α · i + 1 )     x ( α · i 1 ) 1   ,   x ( α · i + 1 )   >   x ( α · i 1 )
Figure 8 represents the process of the modified RBP conversion of m = 4   and α = 2 . The first four-bit word b 1 = { 0111 }   is now labeled as w 1 = 7 = 2 2 + 2 1 + 2 0   .
Figure 8. Process of the modified reduced binary pattern conversion.
Figure 8. Process of the modified reduced binary pattern conversion.
Sensors 15 20730 g008
If we are concerned with the effect of the size of variations in amplitude, the reduced function will be:
y i = { 0   ,   x i + 1     β + x i 1   ,     x i + 1   >   β + x i
where β denotes the jump in amplitude. This model allows variation (noises) from two consecutive data points to differ by β units, which in turn reduces a small variation in amplitude by requiring a large rise in amplitude. In the case where variations in time and amplitude are allowed, the general reduced function can be set as:
y ( α · i 1 ) = { 0   ,   x ( α · i + 1 )   β +   x ( α · i 1 ) 1   ,     x ( α · i + 1 )   > β +   x ( α · i 1 )
A proper choice of   β may reduce impacts from noises and a suitable scaling factor α could result in an ideal reduced binary pattern. Therefore, appropriate tuning of either parameter will improve the verification accuracy.
Table 1 and Algorithm 1 include notations and the main pseudo code of the advanced RBP algorithm. The main pseudo code consists of two code segments: an ECG datum x is converted to the statistical counter C T and the rank values are sorted to obtain R K . To expedite our computation process, an unsigned   σ of length   m   is created and each bit of σ stores the corresponding value of b k = { y k + 1 ,     y k + 2 , , y k + m 1 } . We also set   C T , of length 2 m , to accumulate the repeated number of m -bit words in the reduced binary pattern.
Table 1. Notation for the advanced RBP (Reduced Binary Pattern) pseudo code.
Table 1. Notation for the advanced RBP (Reduced Binary Pattern) pseudo code.
SymbolsDescription
CTm-bit word counter of size 2m for the reduced binary patterns
NMaximum length of the ECG segment for verification
pRelative frequency array
RKRank array
mNumber of bits in a word for the reduced binary pattern
xInput ECG data
αScaling factor of interval for the reduced binary sequence
βGranularity of amplitude for reduced binary function
θCTThe total number of m-bit words for counting
σUnsigned integer with m-bits for the purpose of shifting
τPosition variable
Algorithm 1 Main Pseudo Code of the Advanced RBP Algorithm
1: LOOP i UNTIL N BY α
2:    σ = 1
3:   IF x α · i + 1 - x α · i 1 > β
4:     σ + +
5:   IF i m  
6:     θ C T ++
7:     C T σ + +
8: LOOP j UNTIL θ C T
9:   τ = 1
10:   LOOP k UNTIL θ C T
11:     IF j < k
12:      IF C T j   C T k
13:        τ + +

4.3. The Evolving RBP Algorithm

Since ECG signals change slightly day by day, modifying our algorithm to handle this issue seems crucial. This model utilizes an incremental learning process to improve the advanced version. The advanced RBP algorithm evolves an incremental-update mechanism for the rank order of the m -bit word   j = w k . If both ECG signals, the obtained one and the original one, come from the same individual, the identity match passes. The original relative frequency p ( j ) is now replaced by the new relative frequency p n e w ( j ) from the new input ECG.
p ( j ) = ( 1 γ ) · p ( j ) + γ · p n e w ( j )
where γ is the weighted factor controlling the degree of impact from the new frequency p n e w ( j ) , j = 0 , 1 , , 2 m 1 . The value of γ is affected by the degree of non-stationarity in the old and new ECG signals. A larger γ indicates that the new data are more non-stationary and the rank R ( j ) , j = 0 , 1 , , 2 m 1 will then be recalculated and updated. This non-stationary behavior can be modeled by verifying the cross-correlation between these two ECG signals in the future.

5. Evaluation and Discussion

5.1. Compared Algorithms

Next, we will compare the proposed algorithm with two other schemes from the same feature extraction category. It is noted that both selected waveform-based and wavelet-based algorithms require R-R detection and noise preprocessing, which can be totally bypassed in our model.
In a waveform-based study [13], a total of 19 features are extracted from the four classes: amplitude (PQ, RQ, TQ, RT, PS, RP, TS, RS, PT, QS), duration (QS, PR, QR, ST, QT), slope (RS, ST, and QR), and area (area of the QRS triangle). Descriptions of these features are presented in Figure 9. These features form a feature vector S . The closeness between two feature vectors S 1 and S 2 is considered as their distance d ( S 1 , S 2 ) ; the intra- and inter-group distances can be evaluated through Equation (3).
Figure 9. Waveform feature extraction for verification.
Figure 9. Waveform feature extraction for verification.
Sensors 15 20730 g009
The procedures of the wavelet-based algorithm [12] in comparison include the following: each R-R cardiac cycle is obtained through R-R detection; an interpolation is performed on the R-R interval so each R-R cardiac cycle holds 284 data points; every R-R cycle is cut into three parts, each containing 85, 156, and 43 points; the first 85 and the last 43 points in each R-R cycle are assembled to form a 128-point segment; every four segments are grouped and an n-level discrete wavelet transform (DWT) is performed to obtain the corresponding wavelet coefficients. Four of the computed wavelet coefficients are gathered as a wavelet vector and expressed as:
S = [ a n , d n , d n 1 , d 1 ]
The Euclidean distance between two wavelet vectors S 1 and S 2 is regarded as their distance d ( S 1 , S 2 ) ; the intra- and inter-group distances can then be calculated through Equation (3). An example with n = 9 is illustrated in Figure 10.
Figure 10. Procedure of the wavelet-based algorithm.
Figure 10. Procedure of the wavelet-based algorithm.
Sensors 15 20730 g010

5.2. ECG Database

We conducted a comprehensive experiment on three public ECG databases: the MIT-BIH Arrhythmia Database, the MIT-BIH Normal Sinus Rhythm Database, and a Long-Term Database. Descriptions of these three databases are given below.
  • MIT-BIH Normal Sinus Rhythm Database [34]: This database contains 18 long-term ECG recordings from five men aged 26 to 45 and 13 women aged 20 to 50 who have no significant arrhythmias. These ECG signals were sampled at a rate of 128 Hz and represented by a 12-bit binary sequence.
  • MIT-BIH Arrhythmia Database [35,36]: The database includes 48 groups, each comprising a half-hour two-lead ECG recording, giving a total of 24 hours of information. The data contains 47 individuals’ ECG information (dataset IDs 201 and 202 are duplicated); subjects consist of 25 men aged between 32 and 89 and 22 women aged from 23 to 89. These ECG data were sampled at 360 Hz and use a 12-bit binary storage.
  • Long-Term Database [33]: The evolving RBP algorithm was tested on a database consisting of different segments of ECG signals collected for up to six months. These 310 ECG recordings, containing a 20-second Lead-I signal digitized at 500 Hz and a 12-bit binary expression, are obtained from 44 men and 46 women aged 13 to 75. The number of recordings for each person varies from two to 20.

5.3. Measurement Approaches

Two approaches are used to evaluate our implemented algorithms:
  • Success rate: This is a metric used for accuracy measurement. Based on the results of comparisons between the individuals, when the inter-subject distance is smaller than the average inter-subject distance, we considered it an identification error. Summing up these errors gives us the total number of errors; then we divided this figure by the total number of comparisons to give the success rate.
  • False Acceptance (FA) and False Rejection (FR) rates: These are also the metrics used for accuracy performance. The FR denotes the relative ratio of subjects that should be accepted but are actually rejected by the classifier; similarly, the FA is the ratio of subjects that should be rejected but are actually accepted by the classifier. The threshold for FA/FR is obtained from the training set, which aims to minimize F A + F R 2 .

5.4. Experimental Results

5.4.1. Basic RBP Algorithm

To verify how efficient this algorithm is in human verification by ECG, two types of comparisons are considered: self- and subject-comparison.
  • Self-comparison: Two eight-segment data are arbitrarily selected from one individual and their corresponding distances are measured using Equation (2). Each segment contains 3600 data points (10 s). All 64 intra-subject distances obtained from segments 1 to 8 for the subject ID number 100 in the MIT-BIH Arrhythmia Database are listed in Table 2. It is noted that all entries are symmetric with diagonal entries being zero since they denote the distances between two identical segments.
Table 2. Self-comparison for subject ID 100.
Table 2. Self-comparison for subject ID 100.
12345678
100.0330.0330.0350.0400.0370.0360.033
20.03300.0280.0390.0470.0450.0360.045
30.0330.02800.0330.0420.0420.0310.039
40.0350.0390.03300.0350.0410.0300.034
50.0400.0470.0420.03500.0320.0370.023
60.0370.0450.0420.0410.03200.0430.033
70.0360.0360.0310.0300.0370.04300.034
80.0330.0450.0390.0340.0230.0330.0340
2
Subject-comparison: Within the same database, two eight-segment data are chosen from two distinct individuals, one from each, and the distance between each pair of subjects is evaluated. The results of all 64 inter-subject distances for the pair of subjects with ID numbers 100 and 101 from the MIT-BIH arrhythmia database are shown in Table 3.
Table 3. Subject-comparison between subject IDs 100 and 101.
Table 3. Subject-comparison between subject IDs 100 and 101.
12345678
10.0450.0400.0510.0570.0470.0460.0450.044
20.0510.0410.0550.0550.0520.0480.0490.052
30.0520.0410.0560.0600.0520.0460.0430.048
40.0490.0450.0510.0540.0500.0530.0370.047
50.0430.0410.0380.0600.0370.0450.0370.042
60.0520.0390.0520.0650.0460.0480.0450.040
70.0520.0460.0570.0560.0540.0550.0440.052
80.0430.0410.0430.0600.0390.0470.0370.041
Next, we measure the average distance between two subjects from the same database using Equation (3). Table 4 lists all 64 average intra-group distances for subject IDs 100 to 107 from the MIT-BIH Arrhythmia Database. For example, the first row and first column record the average of all entries in Table 2; the average of all values in Table 3 is listed in the first row and second column and in the second row and first column.
Table 4. Basic RBP (Reduced Binary Pattern) comparison result for eight subject IDs: 100–107.
Table 4. Basic RBP (Reduced Binary Pattern) comparison result for eight subject IDs: 100–107.
100101102103104105106107
1000.0320.0480.0450.0610.0580.0590.0560.062
1010.0480.0280.0400.0440.0410.0360.0390.031
1020.0450.0400.0310.0480.0440.0420.0480.032
1030.0610.0440.0480.0160.0340.0300.0430.009
1040.0580.0410.0440.0340.0290.0290.0390.013
1050.0590.0360.0420.0300.0290.0170.0310.008
1060.0560.0390.0480.0430.0390.0310.0270.022
1070.0620.0310.0320.0090.0130.0080.0220.001
Intuition suggests that a strong association should exist between certain patterns with each individual; therefore, the intra-subject distances must be smaller than the intra-group distances. Thus, it makes sense to treat the opposite cases as verification errors. Similarly, the RBP algorithm is applied to subjects in the MIT-BIH normal sinus rhythm database as well. The experimental results show that the success rates for the two groups of people, with and without significant arrhythmias, are 95.791% and 90.196%, respectively.
To seek better accuracy, the effect of the length of the m -bit word is considered. Using 10 s periods as the duration of the input data should be reasonable for a fair evaluation. A self-comparison experiment is conducted to examine whether the value of   m and the stability of identity detection are related. Two 31-segment data, each containing a 10 s period, are selected from subject ID 100 in the MIT-BIH Arrhythmia Database. The results of all intra-subject distances obtained using Equation (3) are measured for different values of   m , as shown in Table 5. It is clear that the distances vary abruptly when m = 4 and become more stable as   m increases. However, a bigger   m leads to computational and space complications. To balance the trade-offs, we decide to set m = 8 in this study.
Table 5. Mean and standard deviation of 30 distances for each m .
Table 5. Mean and standard deviation of 30 distances for each m .
Intra-Subject Distances for Each m
m 4567891011
Mean0.02730.03550.04440.03730.03610.03450.03220.0292
Standard deviation0.01730.01100.01040.00450.00360.00280.00250.0023

5.4.2. Advanced RBP Algorithm

Two parameters, interval and amplitude, are considered in this advanced design. Therefore, the pair of data points x ( α · i + 1 ) and x ( α · i 1 ) , instead of x i + 1 and x i , are compared to obtain the reduced binary pattern via Equation (4), and the ECG data are examined not only locally but globally.
Experiments with 1 to 36 intervals were conducted; Figure 11 and Figure 12 show their effects on the total number of verification errors in the two databases.
Figure 11. Total number of errors versus interval size for Arrhythmia Database.
Figure 11. Total number of errors versus interval size for Arrhythmia Database.
Sensors 15 20730 g011
Figure 12. Total number of errors versus interval size for Normal Database.
Figure 12. Total number of errors versus interval size for Normal Database.
Sensors 15 20730 g012
The total number of errors shows a sharp drop followed by a short stable zone for α = 15   in the Arrhythmia Database and for α = 5   in the Normal Database. Since these two databases are sampled at 360 Hz and 128 Hz, respectively, α = 15 and α = 5 correspond to down-sample signals at 24 Hz and 25.6 Hz, respectively. These frequencies are quite close to the 25 Hz bandwidth. This new finding is analogous to the bandwidth of digitized ECG data in [4]. If a signal is sampled at a higher or lower rate, it may cause unnecessary noise or provide insufficient information, respectively. Therefore, 25 Hz seems to be a suitable sampling rate.
In addition to interval tuning, an experiment using the MIT-BIH Arrhythmia Database was conducted to check how recognition accuracy changes when the amplitude is adjusted. Table 6 and Table 7 record the results of both algorithms with α = 10 . The data show that the success rate in the advanced model is a lot better than that in the basic model. When the increase in amplitude β = 1 , the impact of the signal noise is reduced to give the results showing the best performance; when β > 1 , certain distinguishing features may be removed for verification and result in a lower success rate. β of different amplitudes using basic RBP algorithm with α = 10 .
Table 6. Performance for β of different amplitudes using advanced RBP algorithm with α = 10 .
Table 6. Performance for β of different amplitudes using advanced RBP algorithm with α = 10 .
Amplitude β 12345101520
Total number of errors71256329396437410388422
Success rate0.9670.8810.8470.8160.7970.8100.8200.804
Table 7. Performance for β of different amplitudes using advanced RBP algorithm with α = 10 .
Table 7. Performance for β of different amplitudes using advanced RBP algorithm with α = 10 .
Amplitude β 12345
Total number of errors 293373159222
Success rate0.9870.9850.9660.9260.897
The evaluation of our algorithms, the basic and the advanced RBP with α = 5 an β = 1 , will depend on comparisons with two other feature extraction algorithms: a waveform-based algorithm with 19 waveform features extracted [13] and a transform-based scheme with wavelet feature extraction [12]. It is worth mentioning that R-R detection and noise preprocessing are required in both of the other algorithms but can be completely bypassed in ours.
In the evaluation using the MIT-BIH Arrhythmia and Normal databases, it is obvious from the comparison of outcomes shown in Table 8 that the waveform-based algorithm with 19 features performs well, but our advanced RBP algorithm still excels, having an extremely high success rate in both public databases.
Table 8. Total number of errors and success rate for comparison.
Table 8. Total number of errors and success rate for comparison.
AlgorithmArrhythmiaNormal
Total Number of ErrorsSuccess RateTotal Number of ErrorsSuccess Rate
Waveform1499.352%199.673%
Wavelet Transform1498.242%1993.791%
Basic RBP9795.513%3092.484%
Advanced RBP799.676%0100.000%
The FA and FR ratios for the normal sinus rhythm and arrhythmia databases are listed in Table 9. Here the associated parameters for the normal sinus rhythm database are m = 8 and α = 5 with a sampling rate of 128 Hz and m = 8 and α = 15 with a sampling rate of 360 Hz. For the purpose of personal verification, the false rejection and acceptance rates should be as small as possible. The advanced RBP algorithm has been tested 18 × 8 = 144 times and 47 × 3 = 147 times for Normal and Arrhythmia databases, respectively, and advanced RBP has a false rejection rate of around 1.67% and a false acceptance rate of 1.43% for the Normal Database. Thus the performance of our algorithm should be acceptable.
Table 9. False rejection and acceptance ratios of advanced RBP algorithm.
Table 9. False rejection and acceptance ratios of advanced RBP algorithm.
MIT-BIH DatabaseFRFA
Normal0.0167480.014319
Arrhythmia0.0689180.059783
Average0.0428330.037051
Table 10 provides the information on execution time for all algorithms under comparison. It is clear that the execution time for the advanced scheme is shorter than that for the basic RBP algorithm, whose performance is definitely at least nine times faster than those of the waveform- and transform-based algorithms.
Table 10. Execution times of the compared algorithms.
Table 10. Execution times of the compared algorithms.
AlgorithmsExecution Time Per Cycle
Waveform0.113675 s
Wavelet0.225854 s
Basic RBP0.013305 s
Advanced RBP0.008565 s

5.4.3. Evolving RBP Algorithm

The performance of the evolving RBP algorithm is evaluated on the long-term ECG database, where 20 individuals’ ECG data recorded over 54 days were selected. All subjects had their ECGs measured on a minimum of two and a maximum of six days. Table 11 contains the data for the subject ID person_01. In this study, the evolving RBP algorithm, tested on the long-term database, is implemented with α = 13 and β = 1   in the advanced RBP model.
Table 11. Records of distribution for person_01 on five days.
Table 11. Records of distribution for person_01 on five days.
DateRecord Number
2004-12-071, 2
2004-12-283, 4, 5, 6, 7, 8, 9
2005-03-1510, 11, 12, 13
2005-04-0514, 15, 16, 17
2005-04-2618, 19, 20
Not all subjects have six days’ worth of ECG data; for most of them, only two or three days’ worth of data is on record. Each recorded ECG is 20 s long and is cut into two equal-length segments of 10 s each. Each segment is sorted into rank statistics after reduced binary conversion and m -bit word counting and, finally, we get the mean rank statistic R K m e a n of rank statistics R K 1 and R K 2 for the two segments, respectively. This approach is illustrated in Figure 13. Table 12 gives us the result for person_01 by applying the non-evolving advanced RBP approach to calculate the similarity distance using Equation (2) between the first record (record No. 1) and other records.
Figure 13. Mean rank statistic for a new ECG signal.
Figure 13. Mean rank statistic for a new ECG signal.
Sensors 15 20730 g013
Table 12. Similarity distance of non-evolving RBP for person_01.
Table 12. Similarity distance of non-evolving RBP for person_01.
DateRecords
2004-12-070.079
2004-12-280.1200.0930.1040.1020.0920.1170.226
2005-03-150.1030.1000.1050.103
2005-04-050.1000.0960.0980.112
2005-04-260.1010.0940.071
The evolving RBP scheme Equation (6) models the rank statistics on the average relative frequency array of the first record on the first day from the same individual, and follows with an update of the first record on another day. The weight distribution between the model and the new record is 0.5:0.5. To be precise, each day, we simply use the first valid record to update the model, while the other record serves for comparison purposes only and is not used for the update. The similarity distance between the model and recorded data is shown in Table 13, where cells with grey backgrounds indicate that the corresponding recorded data are updated into the model. Comparing these results with those presented in Table 11, it can be seen that the evolving RBP approach has a much smaller similarity distance than the advanced RBP one.
Table 13. Similarity distance when applying the evolving RBP to person_01.
Table 13. Similarity distance when applying the evolving RBP to person_01.
DateRecords
2004-12-070.079
2004-12-280.1190.0470.0550.0440.0490.0710.201
2005-03-150.0490.2270.01740.019
2005-04-050.0150.0120.0180.017
2005-04-260.0120.0120.018
In order to obtain a whole valid evaluation between the evolving and advanced RBP algorithms, we sum all the similarity distances after the update points on each date as Sarbp and Serbp for the advanced RBP algorithm and the evolving RBP algorithm, respectively, which are the non-grey records in the second-to-fifth rows in Table 13. Their values are shown in Table 14, which demonstrates an improved rate of evolving RBP to advanced RBP is 26.47%, which is obtained by (SarbpSerbp)/Sarbp, confirming that the evolving process is effective. This result reveals that the evolving process of the RBP algorithm does improve the verification performance for the non-stationary behavior in long-term ECG signals.
Table 14. Improved rate of the evolving process.
Table 14. Improved rate of the evolving process.
SarbpSerbpImproved Rate
4.05162.979026.47%
As shown in Table 15, we use the same database and a procedure similar to the similarity distances evaluation to measure the false acceptance (FA) and false rejection (FR) rates for the evolving RBP, and the result shows that evolving RBP has improved rates of 43.77% and 9.57% for FA and FR, respectively. The average improved rate is 25.25%, which is quite close to the improvement of the success rate, which was 26.47%.
Table 15. FA (false acceptance)/FR (false rejection) of the evolving process.
Table 15. FA (false acceptance)/FR (false rejection) of the evolving process.
ItemAdvanced RBPEvolving RBPImproved Rate
FA0.35560.19990.4377
FR0.42000.37970.0957
( F A + F R ) / 2 0.38780.28980.2525

6. Conclusions

In this paper, a novel ECG card architecture and algorithm for ECG human verification are proposed. Verifications tested on subjects from the two public MIT-BIH databases confirm that the RBP algorithm performs in a timely manner with low computational complexity and is rather efficient in ECG human identity recognition. Moreover, the RBP scheme is enhanced by tuning the parameters, interval, and amplitude between sample points. The advanced RBP design demonstrates good accuracy with much shorter execution duration than those of the waveform- and transform-based algorithms. Furthermore, the modified evolving RBP algorithm cannot only easily merge the new rank data into the old one, but it is also capable of handling non-stationary ECG signals.

Supplementary Materials

Supplementary materials can be accessed at: https://www.mdpi.com/1424-8220/15/8/20730/s1.

Acknowledgments

This work of Huang-Nan Huang was partially supported by Ministry of Science and Technology of Taiwan under the grand MOST No. 104-2115-M-029-004.

Author Contributions

The following statements are the author contributions that “Kuo-Kun Tseng, Huang-Nan Huang and Fufu Zeng conceived and designed the experiments; Fufu Zeng performed the experiments; Kuo-Kun Tseng, Huang-Nan Huang and Fufu Zeng analyzed the data; Kuo-Kun Tseng, Huang-Nan Huang, Fufu Zeng and Shu-Yi Tu wrote the paper”.
For research this articles, the following paragraph statements giving the individual contribution of each co-author to the reported research and writing of the paper.
  • Kuo-Kun Tseng conceived and designed the experiments, analyzed the data and wrote the paper.
  • Huang-Nan Huang conceived and designed the experiments, analyzed the data and wrote the paper.
  • Fufu Zeng conceived and designed the experiments, performed the experiments, analyzed the data and wrote the paper.
  • Shu-Yi Tu wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Prabhakar, S.; Pankanti, S.; Jain, A.K. Biometric recognition: Security and privacy concerns. IEEE Secur. Priv. 2003, 1, 33–42. [Google Scholar] [CrossRef]
  2. Stamkopoulos, T.; Diamantaras, K.; Maglaveras, N.; Strintzis, M. ECG analysis using nonlinear PCA neural networks for ischemia detection. IEEE Trans. Signal Process. 1998, 46, 3058–3067. [Google Scholar] [CrossRef]
  3. Zhang, H.; Zhang, L.-Q. ECG analysis based on PCA and support vector machines. In Proceedings of the International Conference on Neural Networks and Brain, Beijing, China, 13–15 October 2005; pp. 743–747.
  4. Abenstein, J.P.; Tompkins, W.J. A new data-reduction algorithm for real-time ECG analysis. IEEE Trans. Biomed. Eng. 1982, BME-29, 43–48. [Google Scholar] [CrossRef] [PubMed]
  5. Miaou, S.-G.; Yen, H.-L.; Lin, C.-L. Wavelet-based ECG compression using dynamic vector quantization with tree codevectors in single codebook. IEEE Trans. Biomed. Eng. 2002, 49, 671–680. [Google Scholar] [CrossRef] [PubMed]
  6. Ranjeet, K.; Kumar, A.; Pandey, R.K. ECG signal compression using different techniques. In Proceedings of the International Conference on Advances in Computing, Communication and Control, Mumbai, India, 28–29 January 2011.
  7. Nambakhsh, M.S.; Ahmadian, A.; Ghavami, M.; Dilmaghani, R.S.; Karimi-Fard, S. A novel blind watermarking of ECG signals on medical images using EZW algorithm. In Proceedings of the 28th Annual International Conference on the IEEE Engineering in Medicine and Biology Society, New York, NY, USA, 30 August–3 September 2006; pp. 3274–3277.
  8. Ibaida, A.; Khalil, I.; van Schyndel, R. A low complexity high capacity ECG signal watermark for wearable sensor-net health monitoring system. In Proceedings of the Computing in Cardiology, Hangzhou, China, 18–21 September 2011; pp. 393–396.
  9. Chan, A.D.C.; Hamdy, M.M.; Badre, A.; Badee, V. Person identification using electrocardiograms. In Proceedings of the Canadian Conference on Electrical and Computer Engineering, Ottawa, ON, Canada, 7–10 May 2006; pp. 1–4.
  10. Chan, A.D.C.; Hamdy, M.M.; Badre, A.; Badee, V. Wavelet distance measure for person identification using electrocardiograms. IEEE Trans. Instrum. Meas. 2008, 57, 248–253. [Google Scholar] [CrossRef]
  11. Wan, Y.B.; Yao, J.C. A neural network to identify human subjects with electrocardiogram signals. In Proceedings of the World Congress on Engineering and Computer Science 2008, San Francisco, CA, USA, 22–24 October 2008; pp. 1–4.
  12. Chiu, C.-C.; Chuang, C.-M.; Hsu, C.-Y. Discrete wavelet transform applied on personal identity verification with ECG signal. Int. J. Wavelets, Multiresolut. Inf. Process. 2009, 7, 341–335. [Google Scholar] [CrossRef]
  13. Ye, C.; Coimbra, M.T.; Kumar, B.V.K.V. Investigation of human identification using two-lead electrocardigram (ECG) signals. In Proceedings of the Fourth IEEE International Conference on Biometrics: Theory Applications and Systems, Washington, DC, USA, 27–29 September 2010; pp. 1–8.
  14. Sasikala, P.; Wahidabanu, R.S.D. Identification of individuals using electrocardiogram. Int. J. Comput. Sci. Netw. Security 2010, 10, 147–153. [Google Scholar]
  15. Saechia, S.; Koseeyaporn, J.; Wardkein, P. Human identification system based ECG signal. In Proceedings of the IEEE Region 10 Tencon 2005, Melbourne, Australia, 21–24 November 2005; pp. 1–4.
  16. Wang, Y.J.; Agrafioti, F.; Hatzinakos, D.; Plataniotis, K.N. Analysis of human electrocardiogram for biometric recognition. EURASIP J. Adv. Signal Process. 2008, 2008. [Google Scholar] [CrossRef] [PubMed]
  17. Biel, L.; Pettersson, O.; Philipson, L.; Wide, P. ECG analysis: A new approach in human identification. IEEE Trans. Instrum. Meas. 2001, 50, 808–812. [Google Scholar] [CrossRef]
  18. Shen, T.W.; Tompkins, W.J.; Hu, Y.H. One-lead ECG for identity verification. In Proceedings of the 2nd Joint Engineering in Medicine and Biology, 24th Annual Conference and the Annual Fall Meeting of the Biomedical Engineering Society (EMBS/BMES’02), Houston, TX, USA, 23–26 October 2002; pp. 62–63.
  19. Palaniappan, R.; Krishnan, S.M. Identifying individuals using ECG beats. In Proceedings of the International Conference on Signal Processing and Communications (SPCOM’04), Bangalore, India, 11–14 December 2004; pp. 569–572.
  20. Israel, S.A.; Irvine, J.M.; Cheng, A.; Wiederhold, M.D.; Wiederhold, B.K. ECG to identify individuals. Pattern Recognit. 2005, 38, 133–142. [Google Scholar] [CrossRef]
  21. Silva, H.; Gamboa, H.; Fred, A. One lead ECG based personal identification with feature subspace ensembles. In Proceedings of the 5th International Conference on Machine Learning and Data Mining in Pattern Recognition (MLDM 2007), Leipzig, German, 18–20 July 2007; pp. 770–783.
  22. Singh, Y.N.; Gupta, P. ECG to individual identification. In Proceedings of the 2nd IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS 2008), Arlington, VA, USA, 29 September–1 October 2008; pp. 1–8.
  23. Shen, T.-W.; Tompkins, W.J.; Hu, Y.H. Implementation of a one-lead ECG human identification system on a normal population. J. Eng. Comput. Innov. 2011, 21, 12–21. [Google Scholar]
  24. Lei, W.K.; Li, B.N.; Dong, M.C.; Vai, M.I. AFC-ECG: An adaptive fuzzy ECG classifier. Soft Comput. Ind. Appl. 2007, 39, 189–199. [Google Scholar]
  25. Sameni, R.; Shamsollahi, M.B.; Jutten, C.; Clifford, G.D. A nonlinear bayesian filtering framework for ECG denoising. IEEE Trans. Biomed. Eng. 2007, 54, 2172–2185. [Google Scholar] [CrossRef] [PubMed]
  26. Murugan, S.; Radhakrishnan, S. Rule based classification of ischemic ECG beats using ant-miner. Int. J. Eng. Sci. Technol. 2010, 2, 3929–3935. [Google Scholar]
  27. Vigneron, V.; Paraschiv-Ionescu, A.; Azancot, A.; Sibony, O.; Jutten, C. Fetal electrocardiogram extraction based on non-stationary ICA and wavelet denoising. In Proceedings of the 7th International Symposium on Signal Processing and Its Applications, Paris, France, 1–4 July 2003; pp. 69–72.
  28. Yeh, Y.-C.; Wang, W.-J.; Chiou, C.W. Cardiac arrhythmia diagnosis method using linear discriminant analysis on ECG signals. Measurement 2009, 42, 778–789. [Google Scholar] [CrossRef]
  29. Pan, S.-T.; Hong, T.-P. Robust speech recognition by DHMM with a codebook trained by genetic algorithm. J. Inf. Hiding Multimed. Signal Process. 2012, 3, 306–319. [Google Scholar]
  30. Yang, A.C.-C.; Hseu, S.-S.; Yien, H.-W.; Goldberger, A.L.; Peng, C.-K. Linguistic analysis of the human heartbeat using frequency and rank order statistics. Phys. Rev. Lett. 2003, 90, 1–4. [Google Scholar] [CrossRef]
  31. Peng, C.-K.; Havlin, S.; Stanley, H.E.; Goldberger, A.L. Quantification of scaling exponents and crossover phenomena in nonstationary heartbeat time series. Chaos 1995, 5, 82–87. [Google Scholar] [CrossRef] [PubMed]
  32. Zeng, F.F.; Tseng, K.-K.; Zhao, M.; Pan, J.-S.; Huang, H.-N.; Hsu, C.-Y.; Chen, S.-T. Biometric electrocardiogram card for access control system. In Proceedings of the 5th International Conference on Genetic and Evolutionary Computing, Xiamen, China, 29 August–1 September 2011; pp. 373–376.
  33. Kumar, N.; Lolla, V.N.; Keogh, E.; Lonardi, S.; Ratanamahatana, C.A.; Wei, L. Time-series bitmaps: A practical visualization tool for working with large time series databases. Available online: http://www.cs.ucr.edu/~eamonn/time_series_bitmaps.pdf (accessed on 7 August 2015).
  34. Goldberger, A.L.; Amaral, L.A.N.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.-K.; Stanley, H.E. Physiobank, physiotoolkit and physionet components of a new research resource for complex physiologic signals. Circulation 2000, 101, e215–e220. [Google Scholar] [CrossRef] [PubMed]
  35. Moody, G.B.; Mark, R.G. The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 2001, 20, 45–50. [Google Scholar] [CrossRef] [PubMed]
  36. Database Homepage on Physionet. Available online: https://physionet.org/works/BiometricHumanIdentificationbasedonECG (accessed on 7 August 2015).

Share and Cite

MDPI and ACS Style

Tseng, K.-K.; Huang, H.-N.; Zeng, F.; Tu, S.-Y. ECG Sensor Card with Evolving RBP Algorithms for Human Verification. Sensors 2015, 15, 20730-20751. https://doi.org/10.3390/s150820730

AMA Style

Tseng K-K, Huang H-N, Zeng F, Tu S-Y. ECG Sensor Card with Evolving RBP Algorithms for Human Verification. Sensors. 2015; 15(8):20730-20751. https://doi.org/10.3390/s150820730

Chicago/Turabian Style

Tseng, Kuo-Kun, Huang-Nan Huang, Fufu Zeng, and Shu-Yi Tu. 2015. "ECG Sensor Card with Evolving RBP Algorithms for Human Verification" Sensors 15, no. 8: 20730-20751. https://doi.org/10.3390/s150820730

Article Metrics

Back to TopTop