Next Article in Journal
Lab-Based Evaluation of Device-Free Passive Localization Using Multipath Channel Information
Next Article in Special Issue
Noninvasive Measurement of Tongue Pressure and Its Correlation with Swallowing and Respiration
Previous Article in Journal
Compressive Sensing-Based Radar Imaging and Subcarrier Allocation for Joint MIMO OFDM Radar and Communication System
Previous Article in Special Issue
Prediction of Motion Intentions as a Novel Method of Upper Limb Rehabilitation Support
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Driving Stress Detection Using Multimodal Convolutional Neural Networks with Nonlinear Representation of Short-Term Physiological Signals

Bio-Intelligence & Data Mining Laboratory, School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Korea
*
Author to whom correspondence should be addressed.
Sensors 2021, 21(7), 2381; https://doi.org/10.3390/s21072381
Submission received: 9 February 2021 / Revised: 12 March 2021 / Accepted: 26 March 2021 / Published: 30 March 2021
(This article belongs to the Collection Sensing Technologies for Diagnosis, Therapy and Rehabilitation)

Abstract

:
Mental stress can lead to traffic accidents by reducing a driver’s concentration or increasing fatigue while driving. In recent years, demand for methods to detect drivers’ stress in advance to prevent dangerous situations increased. Thus, we propose a novel method for detecting driving stress using nonlinear representations of short-term (30 s or less) physiological signals for multimodal convolutional neural networks (CNNs). Specifically, from hand/foot galvanic skin response (HGSR, FGSR) and heart rate (HR) short-term input signals, first, we generate corresponding two-dimensional nonlinear representations called continuous recurrence plots (Cont-RPs). Second, from the Cont-RPs, we use multimodal CNNs to automatically extract FGSR, HGSR, and HR signal representative features that can effectively differentiate between stressed and relaxed states. Lastly, we concatenate the three extracted features into one integrated representation vector, which we feed to a fully connected layer to perform classification. For the evaluation, we use a public stress dataset collected from actual driving environments. Experimental results show that the proposed method demonstrates superior performance for 30-s signals, with an overall accuracy of 95.67%, an approximately 2.5–3% improvement compared with that of previous works. Additionally, for 10-s signals, the proposed method achieves 92.33% classification accuracy, which is similar to or better than the performance of other methods using long-term signals (over 100 s).

1. Introduction

Excessive mental stress can negatively affect people in numerous ways, such as by causing various diseases or reducing concentration and work efficiency [1,2,3,4]. Particularly, in driving situations, stress is closely related to driving safety. For example, stress can lead to traffic accidents by impairing driving performance or reducing a driver’s ability to make decisions to cope with dangerous situations [5,6]. Thus, the problem of recognizing stress early has been tackled in several studies to reduce the possibility of traffic accidents [7,8,9].
To detect drivers’ stress, a variety of measurements have been used which can be classified into three categories, namely, vehicle motion measurements, facial behavior measurements, and physiological measurements. Vehicle motion measurements mainly include drivers’ acceleration, braking, lane position, steering angle, and handle movement patterns [10,11,12]. Such measurements are easily obtainable but dependent on vehicle types, driving habits, or road conditions. Similarly, facial behavior measurements, such as eye gaze status, pupil dilation, blink rate, yawning, and head movement, can be acquired easily without interfering with the driver [13,14,15]. However, these measurements tend to be unstable under certain conditions, such as poor lighting, bad weather, at night, or when a driver is wearing eyeglasses.
Meanwhile, physiological measurements are not affected by external factors unrelated to stress, such as lighting conditions or driving manner. Moreover, physiological signals collected by equipment attached to the body can provide relevant information on a driver’s internal state, which are effective for stress recognition [16,17,18]. As stress response is related to autonomic nervous system activity, galvanic skin response (GSR) signals associated with sweat gland activity and heart rate (HR) associated with cardiac activity are often used as reliable stress indicators [19,20]. Thus, measuring and utilizing various physiological signals from low-cost and widely available equipment is highlighted in stress recognition problems [14,21,22].
In stress recognition studies, feature extraction has been performed mainly in the time or frequency domain of physiological signals. Time domain features were typically extracted from time series segments truncated by window sliding strategies [2,6,23,24,25,26,27,28], whereas frequency domain features were extracted from low- and/or high-frequency regions [6,25,26,27,29,30]. Based on these features, statistical measures such as mean, standard deviation, skewness and kurtosis were commonly calculated and used to differentiate between stressed and non-stressed conditions [24]. Meanwhile, in other studies [14,31,32], domain-dependent features defined according to experts’ domain knowledge on specific signal types or human mental stress were used. Although such features are effective, they are generally not robust to certain variations, such as noise and intrapersonal variability.
In addition, nonlinear features, such as recurrence quantification analysis (RQA) [33] measurements, were employed in some stress recognition studies [6,34,35]. RQA measurements are to quantify the structure of a recurrence plot (RP) [36] representing the recurrence properties of time series data presented in the phase space trajectory. Although used in several studies, RQA measurements are limited in their usefulness because they provide less information than RPs themselves. So, it seems to be a prospective approach to learn and extract features directly from the RP itself. Table 1 provides the types of features that were used primarily in stress recognition studies.
Furthermore, in driving situations, short-term monitoring is essential to driving safety. However, many previous studies on stress detection used relatively long-term physiological signals, typically several minutes long [30,37]. In some recent studies [14,26,38], short-term ECG signals were often used which have high sampling frequency and correlation with stress conditions. However, ECGs are inconvenient and suffer from noise vulnerability from unstable contacts. Nowadays the leveraging of short-term (i.e., tens of seconds long) GSR or HR signals is increasing, but still knowledge-based feature engineering accompanied with conventional machine learning classifiers requires substantial time and human efforts [2,6,14,28].
To address above issues, in this study, we investigate the stress detection problem using short-term GSR and HR signals easily obtainable from wearable devices. Unlike conventional studies that utilize statistical features or domain knowledge-based feature engineering, we explored the nonlinear features presented in continuous RPs of short-term foot GSR (FGSR), hand GSR (HGSR), and HR signals, along with multimodal CNNs.
The main contributions of this study can be summarized as follows:
  • Employing short-term (30-s or less) FGSR, HGSR, and HR signals, which have not been fully utilized in previous stress classification studies.
  • Investigating continuous RPs (Cont-RPs) obtained by converting one-dimensional time series into two-dimensional matrices for exploring features differentiating between stressed and relaxed states.
  • Proposing a multimodal CNN classifier based on Cont-RPs and validating its effectiveness in drivers’ stress classification.

2. Materials and Methods

The proposed method using multimodal CNNs for stress detection is summarized in Figure 1. The method consists of three CNNs for handling foot GSR (FGSR), hand GSR (HGSR), and HR signals. Each CNN receives a nonlinear representation of the input signals to train the network, in which the nonlinear representation is obtained by generating continuous RPs (Cont-RPs), which is explained in Section 2.3. Next, the three outputs of the last convolution block of each CNN are flattened and concatenated into one representation vector, which is fed to a fully connected layer for stress class prediction.

2.1. Driving Stress Dataset

The dataset used in this study was obtained from the Stress Recognition in Automobile Drivers (SRAD) dataset in PhysioNet [37]. The SRAD dataset was originally collected to determine drivers’ relative stress level based on physiological signals in real-world driving environments. The dataset contains multiple types of physiological signals related to different driving stress conditions obtained from nine healthy subjects in the context of driving a given route through open roads. The prescribed route has two rest periods before and after driving, two highway driving periods, and three city driving periods. According to [37], physiological signals from the rest and highway and city driving periods are related to low, medium, and high levels of stress, respectively.
A total of 17 recordings ranging from 60 to 90 min are available to the public. Each recording consists of six different physiological signals, including ECG, electromyogram (EMG), FGSR, HGSR, HR, and respiration (RESP). In addition, another time series called a “marker” is given for each recording to indicate the interval (i.e., start and end points) of each period and its corresponding stress level. Figure 2 presents an example of the three signals in one recording divided into seven segments according to the period-switching lines extracted from the marker data. The magnitude of the physiological signals clearly varied with the road conditions, which generated different stress levels.
In this study, only nine recordings (i.e., drive 06, 07, 08, 09, 10, 11, 12, 15, and 16) were used where the marker signals were clear, and all three signals were provided. The excluded eight recordings do not contain all three sensor signals, or their marker signals are not clear in terms of transient time between each period, as shown in Table 2.
The statistical characteristics of the nine recordings used in our experiment are given in Table 3. In this table, it is observed that the values of the three signals (FGSR, HGSR, and HR) are in very different ranges depending on subjects. Moreover, in some subjects, the range of signal values very overlaps between low and medium stress levels or between medium and high stress levels. To clarify the usefulness of Cont-RPs for the three signals, this study focused on the problem of distinguishing between low stress levels in rest and high stress levels during city driving.

2.2. Preprocessing

To analyze the three signals, namely, FGSR, HGSR, and HR, from the input recordings, we preprocessed them in the following way. First, we resampled each signal to equalize the sampling frequency and produced RPs with an appropriate size, that is, neither too large nor too small, to adequately represent time-varying patterns. Specifically, in the SRAD dataset, two types of GSR signals (i.e., FGSR and HGSR) are given at a sampling rate between 33 Hz and 35 Hz, and the HR signals have a sampling rate between 0.5 Hz and 1 Hz. Therefore, we downsampled the GSR signals and upsampled the HR signals to equally obtain 16 samples per second. By doing so, reducing computational complexity while maintaining key information on the signal changes caused by driving stress was possible.
Second, we applied a median filter to eliminate noise and artifacts that can be caused by poor electrode contact or undesirable body movements. Median filtering is efficient for reducing the impact of these spiky noises, and easier to implement than other complicated filtering techniques. Variations in the magnitude of physiological signals from different people or under different environments (external factors such as temperature and humidity) exist. Thus, to compare signals of different amplitudes, we normalized them to have zero mean and unit standard deviation.
Finally, we generated short-term (10 s and 30 s) samples by segmenting the signals into a fixed window size with a 50% overlap between adjacent segments and assigned the same label to each sample as the original. To formulate the stress prediction problem, we used only the samples labeled as one of two stress levels, in which low-level stress (i.e., relaxed) was assigned to the rest periods, and high-level stress (i.e., stressed) was assigned to the city driving periods. If some transitions are involved in one sample, from the relaxed state to the stressed state, or vice versa, the sample was not considered for further analyses.

2.3. Stress-Relevant Characteristics of Cont-RPs

To investigate nonlinear properties distinct in the stressed and relaxed states, we converted the one-dimensional time series into two-dimensional matrices representing Cont-RPs. A conventional RP [36] is represented as a binary matrix, indicating the recurring states of dynamic system in the phase space. Specifically, we marked each component in the RP matrix as 1 if the distance between the two states in the phase space is less than the predefined threshold value, and 0 otherwise. Existing RP-based studies are based mainly on these binarized RPs.
Unlike such studies, we attempted to utilize all the information contained in the matrix of distances between the two states in the phase space without binarization. Therefore, the RP matrix used in this study contained components with continuous values, indicating the degree of recurrence between the two states, rather than binary values, which are referred to as Cont-RPs.
Examples of Cont-RPs for the three short-term FGSR, HGSR, and HR signals are presented in Figure 3 (for 10 s) and Figure 4 (for 30 s). We set the embedding dimension to 3 and the time delay to 2 so that the resulting Cont-RPs can well classify between stressed and relaxed states. In the figures, the magnitude of the components in the Cont-RPs is the Euclidean distance between two embedded states in phase space, visualized as color intensities with low and high values shown as dark blue and yellow, respectively.
Overall, the Cont-RP patterns showed the difference between the stressed and relaxed states. One distinct trait was that complex and irregular patterns were observed in the stressed state, whereas monotonous and regular patterns were observed in the relaxed states. Specifically, clearer distinctions between the two states were observed in the examples of both GSR signals regardless of their length (both 30 s and 10 s). Netting or bubble-shaped patterns appeared irregularly in the stressed conditions, whereas smooth transitions were mostly observed in the relaxed state, with few drastic transitions in intensity changes. In addition, slight differences were observed between the FGSR and HGSR signals, despite both being GSRs. This outcome appeared presumably because the two GSR signals were measured on drivers’ different body parts doing different activities while driving. Thus, they are likely complementary when used together.
Meanwhile, the Cont-RPs of the HR signals had characteristics differing from those of the Cont-RPs of the GSR signals. Figure 3 and Figure 4 show checkerboard patterns with regularly arranged squares with a similar size in the relaxed state, thereby indicating that the HR signals changed regularly. Meanwhile, in the stressful situations, bright vertical and horizontal lines appeared irregularly in contrast to the relaxed state. In addition, given the short-length (10 s) of the signals, their Cont-RPs manifested more localized patterns that corresponded to the 1/9 region of the 30 s Cont-RPs, as shown in Figure 3 and Figure 4.
The Cont-RPs of the three types of signals exhibited different aspects of characteristics related to stress. Therefore, examining the three signals individually to extract stress-relevant features would be worthwhile. For this purpose, we employed multimodal CNN, which is detailed below.

2.4. Feature Learning and Classification Based on Cont-RPs

To automatically learn the stress-relevant features of the three types of physiological signals, we constructed a multimodal CNN model consisting of three CNNs for the three signals and one dense layer to generate the probabilities of each stress class (e.g., stressed or related) from their outputs. As inputs for each CNN for training, the Cont-RPs of each type of signal were given. That is, three CNNs were for learning the inherent features contained in the Cont-RPs of FGSR, HGSR, and HR signals. The entire procedure is depicted in Figure 5, which can be divided into three phases, that is, first, learning representative feature maps through the three CNNs from the Cont-RPs of the three types of signals; second, combining their flattened outputs into one integrated representation vector; and third, generating class probabilities based on the representation vectors by learning the relationships between them through a fully-connected layer with a sigmoid function.
To learn the representative feature maps of the Cont-RPs, we built three CNNs with the same architecture, as depicted in Figure 5. The architecture of our CNNs is the same as the front part of the VGG16 model [39], but all the parameters were trained from scratch based on our training dataset. That is, each CNN consisted of five convolution blocks, each of which contained a two-dimensional convolution, rectified linear unit (ReLU) activation, and max pooling layer. The first two convolution blocks (i.e., ConvBlocks 1 and 2) included two convolution layers, whereas the remaining blocks contained three convolution layers. In addition, 3 × 3 filters with a stride of 1 were used for all the convolution layers and 2 × 2 filters with a stride of 2 were used for the max pooling layer, similar to [39]. The convolution layers learn spatial information and extract features using kernels with sliding window strategy, while pooling layers reduce the spatial dimension of feature maps. After going through the fifth convolution block in each CNN, the resulting three feature maps were flattened (i.e., three vectors with dimension of 256) and combined into one integrated representation vector with dimension of 768. Subsequently, this vector was fed to the dense layer, and the sigmoid function was used as the activation function for the output layer to predict the probabilities of the two classes, that is, the stressed and relaxed states.

3. Results and Discussion

3.1. Experimental Setup

For the experiments, we implemented the proposed method in Python using Keras with Tensorflow backend and carried out with an NVIDIA Titan X and 12 GB of RAM. In the SRAD dataset, the number of samples for the relaxed state (minority class) was fewer than the number of samples for the stressed state (majority class). The classifier from the imbalanced data was prone to bias toward the majority class, whereas the minority class samples were not well learned. To handle the class imbalance problem, we performed random undersampling on the stressed samples to balance the class distribution between the stressed and relaxed states. Consequently, we obtained 1872 30-s samples and 5348 10-s samples for the model development.
To assess the effectiveness of the proposed method, we used leave-one recording-out (i.e., recording-wise) cross-validation. That is, for each round, we used one recording for testing and the remaining eight recordings for model training. We conducted this procedure to evaluate the generalization ability of the trained model for new recordings of completely unknown individuals not included in the training data. We performed a grid search to get optimal hyperparameters such as learning rate, batch size and the number of epochs. For example, we considered a suite of different batch sizes from 1 to 20, the number of epochs between 1 to 30, and learning rate from 0.1 to 0.00001. We then trained our multimodal CNN model for 15 epochs with a batch size of 4 and used the Stochastic Gradient Descent (SGD) optimizer with a learning rate of 0.001 for the weight update.
We measured classification performance using accuracy, precision (positive predictive values), recall (sensitivity), the F1-score, and area under the curve (AUC). Accuracy and the AUC describe the overall performance across all classes of samples, whereas precision, recall and the F1-score calculated for each class present a method’s ability to distinguish between certain classes. These indices can be calculated as follows:
Accuracy = (TP + TN)/(TP + TN + FP + FN) × 100%,
Precision = TP/(TP + FP) × 100%,
Recall = TP/(TP + FN) × 100%,
F1-score = (2 × Precision × Recall)/(Precision + Recall)
where TP is the number of correctly classified positive samples, TN is the number of correctly classified negative samples, FP is the number of negative samples incorrectly classified as positive and FN is the number of positive samples incorrectly classified as negative.

3.2. Performance Evaluation

We evaluated the performance of the proposed stress detection approach using two different lengths (i.e., 10-s and 30-s) of input signals. The overall classification results are summarized in Table 4. From this table, it is found that for the 30-s input signals, overall classification accuracy (and F1-score) was 95.67%, whereas that for the 10-s input signals was 92.33%. Moreover, the AUC difference between the 30-s and 10-s input signals was not exceedingly large, with the AUC as 0.9870 and 0.9619, respectively.
Figure 6 represents aggregated confusion matrices which include the aggregation of all classification results obtained from leave-one recording-out cross-validation. We can observe that only a small portion of stressed and relaxed samples (3.8% of stressed samples and 4.6% of relaxed samples for classification using 30-s signals; 7.4% of stressed samples and 8.5% of relaxed samples for classification using 10-s signals) were incorrectly classified into each other.
Interestingly, reducing the length of the input signals from 30 s to 10 s did not significantly degrade our proposed method’s ability to distinguish between the stressed and relaxed states. This finding showed that our proposed method performed relatively well in detecting the stressed state using only short-length 10-s inputs.
As shown in Figure 7, the proposed method’s classification performance in the individual recordings varied from 86% to 100% for the 30-s signals and from 85% to 99% for the 10-s signals. In addition, when using the 10-s signals, classification accuracy decreased slightly by 1 to 7% compared with when using the 30-s signals.
Next, we compared the performance of our multimodal CNN model with that of three unimodal CNNs for the FGSR, HGSR, and HR signals, and the results are presented in Table 5. This table shows that the combined use of the three physiological signals significantly improved classification performance by approximately 5% to 33% for the 30-s signals and by 2% to 36% for the 10-s signals. Moreover, among the three unimodal CNNs, the FGSR-CNN demonstrated a fairly satisfactory performance, whereas the HR-CNN exhibited a fairly poor performance. The accuracy of the HGSR-CNN decreased by approximately 10% to 14% depending on the length of the input signals compared with that of the FGSR-CNN.
Regarding the variations in model performance, there was not much difference between 30-s and 10-s signals in FGSR. That is, the performance of stress detection with FGSR was not significantly affected by the length of input signal. This indicates that the Cont-RPs of short-term FGSR signals can be effective for stress classification in that they will reflect the traits able to differentiate between stressed and relaxed states. Moreover, HGSR performs slightly better for 10-s signals than for 30-s signals, which means that the short-term HGSR signal may be a more reliable indicator for stress detection than the long-term HGSR signal.
Further, the change in performance among different recordings is not very significant in our multimodal CNN model for both 30-s and 10-s signals, compared to other unimodal CNNs (refer to Figure 8). Of the three unimodal CNNs, FGSR is the most stable for different recordings regardless of the input signal length. For HR, longer signals appear more stable when classifying stressed/relaxed states. That is, 30-s signals have less variations between different recordings than 10-s signals.
We also compared the classification performance of multimodal CNN with two baseline neural network models. All three models in Table 6 used CNN architecture to extract features from FGSR, HGSR, and HR signals, and perform stress classification via concatenation of three vectors and fully connected layers as proposed multimodal CNN. Specifically, the multimodal 1-D CNN is a classifier that takes three one-dimensional sequence of FGSR, HGSR, and HR, and outputs the probability of stressed and relaxed states. The three inputs are the preprocessed signals before generating Cont-RPs. The multimodal VGG16 model has the same structure as proposed multimodal CNN but uses default parameters of existing VGG16 without parameter learning.
Table 6 shows that the performance of two models (multimodal CNN and multimodal VGG16) with three Cont-RPs as input is approximately 1% to 12% and 1% to 9% higher than that of multimodal 1-D CNN model with one-dimensional sequence as input for 30-s and 10-s input signal, respectively. The results indicate that Cont-RPs have more useful features for stress classification than one-dimensional signals as itself which have been used in most previous studies. Even with the same VGG structure, the performance of multimodal CNN model which have learned stress-relevant features based on Cont-RP samples is about 8% to 10% higher than that of multimodal VGG16 model.

3.3. Comparison with Related Works

A number of related studies were conducted on the problem of classifying drivers’ stress based on physiological signals, including GSR and/or HR. Table 7 shows the classification performance of other two-class (stressed and relaxed) stress classification methods compared with that of the proposed method. Although some of the other methods were tested using different datasets, they were also included in this table for reference.
Most of the studies in this table used multisensor signals for detecting drivers’ stress, which is favorable in the creation of a reliable system. For example, Wang et al. [40] proposed a CNN-based driving stress detection scheme using GSR, HR, HRV and breathe rate, and obtained 92% accuracy in a two-class classification. It is worth noting that the authors analyzed their own collected dataset, which differs from the dataset (i.e., SRAD) used in our experiments. Jiménez-Limas et al. [30] detected two driver stress classes using logistic regression and 5-min FGSR and HR signals and respiration rate. As drivers’ conditions should be detected as quickly as possible to prevent potential accidents, long signals (over 100 s) are not very helpful in detecting drivers’ stress early in actual situations. Lopez-Martinez et al. [41] classified two stress levels using support vector machine (SVM) and achieved high accuracy. However, the achieved performance is 2.67% lower than that of our model based on the same length (30 s) of signals.
In addition to Table 7, where comparison was done with the performance of stress classification for two classes, there are other studies using the same SRAD dataset to perform stress level classification for three classes which is slightly different from our purpose. Chen et al. [26] extracted features from GSR, ECG and respiration data and achieved 89.7% accuracy in by kernel-based classifiers. Meanwhile, Wang et al. [42] employed Adaboost classifier for driving stress detection system by using only FGSR data and resulted in high accuracy (90.09%). Although a single sensor is convenient to use, unstable results can occur in that no additional sensor data compensate for lost information due to inaccurate and missing signals. Healey and Picard [37] classified 5 min intervals of data into three stress levels with linear discriminant analysis (LDA). They used ECG, EMG, respiration rate and two GSR data, achieving the highest accuracy in the listed studies. Singh et al. [43] utilized 10 s signals of HGSR and photoplethysmogram (PPG) sensor for three-class classification, resulted in 89.23% accuracy. It is worth noting that they analyzed their own collected dataset, which is different from the data we used (SRAD) in our experiments.
The significance of our proposed method is in employing short-term (30-s or less) signals of GSR and HR, which are acquired while real driving but not fully utilized in stress classification. As mentioned in Section 1, considering GSR and HR related to ANS activity, experimental results have shown that GSR and HR signals of short length (30-s or less) are useful as stress indicators. Although the proposed model achieved good performance for unseen recording in leave-one recording-out cross-validation, the performance of our model is still limited by the small number of samples used in the experiment due to the lack of public available dataset collected in the actual driving environment. In the future, we plan to measure more physiological signals in simulated driving environment and conduct additional experiments to obtain more reliable results for the proposed model.

3.4. Visualization of Learned Feature Distributions

As mentioned in Section 2.4, a set of convolution blocks (1~5) in each CNN extracted features from the Cont-RPs of each type of input signal and outputted a 256-dimensional vector. As we employed three types of physiological signals as input, the concatenation of the three outputs led to one 253 × 3 = 768 integrated representation vector, which was then fed to a fully connected layer for stress classification. To visually confirm that the representation vectors learned from the Cont-RPs of the FGSR, HGSR, and HR signals were discriminative between classes, we reduced the dimension of the representation vectors from 768 to 2 using the t-stochastic neighbor embedding (t-SNE) method [44] and visualized them in a reduced two-dimensional space, as shown in Figure 9. The t-SNE method is a nonlinear dimensionality reduction technique that can transform high-dimensional vectors into low-dimensional vectors by preserving the intrinsic structure of the data.
In Figure 9, the representation vectors are colored by class, with red and green denoting the stressed and relaxed classes, respectively. The two classes were well distinguished in terms of 30-s and 10-s signals, which illustrated that our model learned satisfactory features from the Cont-RPs of the FGSR, HGSR, and HR signals for distinguishing between classes.
Interestingly, Figure 9 shows that the distributions of the two classes were in the form of their own long bands. Given that we split the continuous long-time signals into multiple short-length segments, we can infer that the learned representation vectors not only distinguish between classes but also contain consecutive temporal features between them.

4. Conclusions

In this study, we proposed a new method for detecting drivers’ stress based on short-term physiological signals, namely, FGSR, HGSR, and HR, which can be easily obtained by wearable devices. Specifically, by constructing the two-dimensional nonlinear representation of the Cont-RPs of short-term (10 s and 30 s) input signals, we were able to learn their corresponding satisfactory representation vectors through multimodal CNNs that can well distinguish between stressed and relaxed states. Experimental results using the publicly available SRAD dataset showed that the proposed method demonstrated superior performance in detecting drivers’ stress, overall accuracy of 95.67% with 30-s signals and 92.33% with 10-s signals, and a performance improvement of approximately 2.5–3% compared with conventional studies using long-term (100 s or more) signals.
As demand to leverage various types of physiological signals acquired by wearable devices increases, the proposed method is expected to be widely used for detecting drivers’ stress in real driving scenarios. Moreover, the proposed method’s use of short-term signals may be highly attractive for real-time applications. In future research, we plan to determine whether the proposed method can be applied to other types of physiological signals.

Author Contributions

Conceptualization, J.L. and M.S.; methodology, J.L.; software, J.L. and H.L.; validation, J.L.; investigation, J.L. and H.L.; writing—original draft preparation, J.L.; writing—review and editing, J.L., H.L. and M.S.; visualization, J.L. and H.L.; supervision, M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2020-2016-0-00313) supervised by the IITP (Institute for Information & communications Technology Planning & Evaluation).

Acknowledgments

We would like to thank the anonymous reviewers for their time and constructive comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kivimäki, M.; Steptoe, A. Effects of stress on the development and progression of cardiovascular disease. Nat. Rev. Cardiol. 2018, 15, 215. [Google Scholar] [CrossRef]
  2. Rodríguez-Arce, J.; Lara-Flores, L.; Portillo-Rodríguez, O.; Martínez-Méndez, R. Towards an anxiety and stress recognition system for academic environments based on physiological features. Comput. Meth. Prog. Bio. 2020, 190, 105408. [Google Scholar] [CrossRef] [PubMed]
  3. Celka, P.; Charlton, P.H.; Farukh, B.; Chowienczyk, P.; Alastruey, J. Influence of mental stress on the pulse wave features of photoplethysmograms. Healthc. Technol. Lett. 2020, 7, 7–12. [Google Scholar] [CrossRef]
  4. Shah, P.; Khaleel, M.; Thuptimdang, W.; Sunwoo, J.; Veluswamy, S.; Chalacheva, P.; Kato, R.M.; Detterich, J.; Wood, J.C.; Zeltzer, L.; et al. Mental stress causes vasoconstriction in subjects with sickle cell disease and in normal controls. Haematologica 2020, 105, 83–90. [Google Scholar] [CrossRef] [PubMed]
  5. American Psychological Association. Stress in America: The State of Our Nation; Stress in America Survey; American Psychological Association: Washington, DC, USA, 2017. [Google Scholar]
  6. Khowaja, S.A.; Prabono, A.G.; Setiawan, F.; Yahya, B.N.; Lee, S.L. Toward soft real-time stress detection using wrist-worn devices for human workspaces. Soft Comput. 2020, 25, 2793–2820. [Google Scholar] [CrossRef]
  7. Rastgoo, M.N.; Nakisa, B.; Rakotonirainy, A.; Chandran, V.; Tjondronegoro, D. A critical review of proactive detection of driver stress levels based on multimodal measurements. ACM Comput. Surv. 2018, 51, 1–35. [Google Scholar] [CrossRef] [Green Version]
  8. Munla, N.; Khalil, M.; Shahin, A.; Mourad, A. Driver stress level detection using HRV analysis. In Proceedings of the 2015 International Conference on Advances in Biomedical Engineering (ICABME), Beirut, Lebanon, 16–18 September 2015; pp. 61–64. [Google Scholar] [CrossRef]
  9. Muñoz-Organero, M.; Corcoba-Magaña, V. Predicting Upcoming Values of Stress While Driving. IEEE T. Intell. Transp. Syst. 2017, 18, 1802–1811. [Google Scholar] [CrossRef]
  10. Lanatà, A.; Valenza, G.; Greco, A.; Gentili, C.; Bartolozzi, R.; Bucchi, F.; Frendo, F.; Scilingo, E.P. How the autonomic nervous system and driving style change with incremental stressing conditions during simulated driving. IEEE T. Intell. Transp. Syst. 2014, 16, 1505–1517. [Google Scholar] [CrossRef]
  11. Rastgoo, M.N.; Nakisa, B.; Maire, F.; Rakotonirainy, A.; Chandran, V. Automatic driver stress level classification using multimodal deep learning. Expert Syst. Appl. 2019, 138, 112793. [Google Scholar] [CrossRef]
  12. Lim, S.; Yang, J.H. Driver state estimation by convolutional neural network using multimodal sensor data. Electron. Lett. 2016, 52, 1495–1497. [Google Scholar] [CrossRef] [Green Version]
  13. Gao, H.; Yüce, A.; Thiran, J.P. Detecting emotional stress from facial expressions for driving safety. In Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France, 27–30 October 2014; pp. 5961–5965. [Google Scholar] [CrossRef] [Green Version]
  14. Zalabarria, U.; Irigoyen, E.; Martinez, R.; Larrea, M.; Salazar-Ramirez, A. A Low-Cost, Portable Solution for Stress and Relaxation Estimation Based on a Real-Time Fuzzy Algorithm. IEEE Access 2020, 8, 74118–74128. [Google Scholar] [CrossRef]
  15. Zhang, J.; Mei, X.; Liu, H.; Yuan, S.; Qian, T. Detecting Negative Emotional Stress Based on Facial Expression in Real Time. In Proceedings of the 2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP), Wuxi, China, 19–21 July 2019; pp. 430–434. [Google Scholar]
  16. Soman, K.; Alex, V.; Srinivas, C. Analysis of physiological signals in response to stress using ECG and respiratory signals of automobile drivers. In Proceedings of the 2013 International Multi-Conference on Automation, Computing, Communication, Control and Compressed Sensing (iMac4s), Kottayam, India, 22–23 March 2013; pp. 574–579. [Google Scholar] [CrossRef]
  17. Greene, S.; Thapliyal, H.; Caban-Holt, A. A survey of affective computing for stress detection: Evaluating technologies in stress detection for better health. IEEE Consum. Electr. Mag. 2016, 5, 44–56. [Google Scholar] [CrossRef]
  18. Elgendi, M.; Menon, C. Machine Learning Ranks ECG as an Optimal Wearable Biosignal for Assessing Driving Stress. IEEE Access 2020, 8, 34362–34374. [Google Scholar] [CrossRef]
  19. Christopoulos, G.I.; Uy, M.A.; Yap, W.J. The Body and the Brain: Measuring Skin Conductance Responses to Understand the Emotional Experience. Organ. Res. Methods 2016, 22, 1–27. [Google Scholar] [CrossRef]
  20. Aqajari, S.A.H.; Naeini, E.K.; Mehrabadi, M.A.; Labbaf, S.; Rahmani, A.M.; Dutt, N. GSR Analysis for Stress: Development and Validation of an Open Source Tool for Noisy Naturalistic GSR Data. arXiv 2020, arXiv:2005.01834. [Google Scholar]
  21. Mishra, V.; Pope, G.; Lord, S.; Lewia, S.; Lowens, B.; Caine, K.; Sen, S.; Halter, R.; Kotz, D. Continuous detection of physiological stress with commodity hardware. ACM Trans. Comput. Healthc. 2020, 1, 1–30. [Google Scholar] [CrossRef] [Green Version]
  22. Zontone, P.; Affanni, A.; Bernardini, R.; Piras, A.; Rinaldo, R. Stress detection through electrodermal activity (EDA) and electrocardiogram (ECG) analysis in car drivers. In Proceedings of the 2019 27th European Signal Processing Conference (EUSIPCO), A Coruna, Spain, 2–6 September 2019; pp. 1–5. [Google Scholar] [CrossRef]
  23. Guardiola, S.; Girbés, V.; Armesto, L.; Dols, J.; Tornero, J. Physiological Signal Analysis for Driver Stress Detection. Available online: https://www.researchgate.net/publication/320244776_PHYSIOLOGICAL_SIGNAL_ANALYSIS_FOR_DRDRIV_STRESS_DETECTION (accessed on 25 January 2021).
  24. Memar, M.; Mokaribolhassan, A. Stress level classification using statistical analysis of skin conductance signal while driving. SN Appl. Sci. 2021, 3, 1–9. [Google Scholar] [CrossRef]
  25. Bianco, S.; Napoletano, P.; Schettini, R. Multimodal car driver stress recognition. In Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 20–23 May 2019; pp. 302–307. [Google Scholar] [CrossRef]
  26. Chen, L.L.; Zhao, Y.; Ye, P.F.; Zhang, J.; Zou, J.Z. Detecting driving stress in physiological signals based on multimodal feature analysis and kernel classifiers. Expert Syst. Appl. 2017, 85, 279–291. [Google Scholar] [CrossRef]
  27. Seo, W.; Kim, N.; Kim, S.; Lee, C.; Park, S.M. Deep ECG-respiration network (DeepER net) for recognizing mental stress. Sensors 2019, 19, 3021. [Google Scholar] [CrossRef] [Green Version]
  28. Šalkevicius, J.; Damaševičius, R.; Maskeliunas, R.; Laukienė, I. Anxiety level recognition for virtual reality therapy system using physiological signals. Electronics 2019, 8, 1039. [Google Scholar] [CrossRef] [Green Version]
  29. Castaldo, R.; Melillo, P.; Bracale, U.; Caserta, M.; Triassi, M.; Pecchia, L. Acute mental stress assessment via short term HRV analysis in healthy adults: A systematic review with meta-analysis. Biomed. Signal Process. Control 2015, 18, 370–377. [Google Scholar] [CrossRef] [Green Version]
  30. Jiménez-Limas, M.A.; Ramírez-Fuentes, C.A.; Tovar-Corona, B.; Garay-Jiménez, L.I. Feature selection for stress level classification into a physiologycal signals set. In Proceedings of the 2018 15th International Conference on Electrical Engineering, Computing Science and Automatic Control (CCE), Mexico City, Mexico, 5–7 September 2018; pp. 1–5. [Google Scholar] [CrossRef]
  31. Wang, J.S.; Lin, C.W.; Yang, Y.T.C. A k-nearest-neighbor classifier with heart rate variability feature-based transformation algorithm for driving stress recognition. Neurocomputing 2013, 116, 136–143. [Google Scholar] [CrossRef]
  32. Ghaderi, A.; Frounchi, J.; Farnam, A. Machine learning-based signal processing using physiological signals for stress detection. In Proceedings of the 2015 22nd Iranian Conference on Biomedical Engineering (ICBME), Tehran, Iran, 25–27 November 2015; pp. 93–98. [Google Scholar] [CrossRef]
  33. Marwan, N.; Wessel, N.; Meyerfeldt, U.; Schirdewan, A.; Kurths, J. Recurrence-plot-based measures of complexity and their application to heart-rate-variability data. Phys. Rev. E 2002, 66, 026702. [Google Scholar] [CrossRef] [Green Version]
  34. Singh, V.; Gupta, A.; Sohal, J.S.; Singh, A. A unified non-linear approach based on recurrence quantification analysis and approximate entropy: Application to the classification of heart rate variability of age-stratified subjects. Med. Biol. Eng. Comput. 2019, 57, 741–755. [Google Scholar] [CrossRef] [PubMed]
  35. Dimitriev, D.; Saperova, E.V.; Dimitriev, A.; Karpenko, Y. Recurrence Quantification Analysis of Heart Rate during Mental Arithmetic Stress in Young Females. Front. Physiol. 2020, 11, 40. [Google Scholar] [CrossRef] [PubMed]
  36. Marwan, N.; Romano, M.C.; Thiel, M.; Kurths, J. Recurrence plots for the analysis of complex systems. Phys. Rep. 2007, 438, 237–329. [Google Scholar] [CrossRef]
  37. Healey, J.A.; Picard, R.W. Detecting stress during real-world driving tasks using physiological sensors. IEEE T. Intell. Transp. Syst. 2005, 6, 156–166. [Google Scholar] [CrossRef] [Green Version]
  38. Hwang, B.; You, J.; Vaessen, T.; Myin-Germeys, I.; Park, C.; Zhang, B.T. Deep ECGNet: An optimal deep learning framework for monitoring mental stress using ultra short-term ECG signals. Telemed. J. E-Health 2018, 24, 753–772. [Google Scholar] [CrossRef]
  39. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
  40. Wang, K.; Murphey, Y.L.; Zhou, Y.; Hu, X.; Zhang, X. Detection of driver stress in real-world driving environment using physiological signals. In Proceedings of the 2019 IEEE 17th International Conference on Industrial Informatics (INDIN), Helsinki, Finland, 22–25 July 2019; Volume 1, pp. 1807–1814. [Google Scholar] [CrossRef]
  41. Lopez-Martinez, D.; El-Haouij, N.; Picard, R. Detection of Real-world Driving-induced Affective State Using Physiological Signals and Multi-view Multi-task Machine Learning. In Proceedings of the 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW), Cambridge, UK, 3–6 September 2019; pp. 356–361. [Google Scholar] [CrossRef] [Green Version]
  42. Wang, K.; Guo, P. An Ensemble Classification Model with Unsupervised Representation Learning for Driving Stress Recognition Using Physiological Signals. IEEE T. Intell. Transp. Syst. 2020, 1–13. [Google Scholar] [CrossRef]
  43. Singh, R.R.; Conjeti, S.; Banerjee, R. A comparative evaluation of neural network classifiers for stress level analysis of automotive drivers using physiological signals. Biomed. Signal Process. Control 2013, 8, 740–754. [Google Scholar] [CrossRef]
  44. Maaten, L.V.D.; Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
Figure 1. Overview of proposed multimodal CNN approach using FGSR, HGSR, and HR signals for stress class prediction.
Figure 1. Overview of proposed multimodal CNN approach using FGSR, HGSR, and HR signals for stress class prediction.
Sensors 21 02381 g001
Figure 2. Example of multiple physiological signals within one recording segmented based on different road conditions.
Figure 2. Example of multiple physiological signals within one recording segmented based on different road conditions.
Sensors 21 02381 g002
Figure 3. Examples of Cont-RPs for short-term (30 s) FGSR, HGSR, and HR signals.
Figure 3. Examples of Cont-RPs for short-term (30 s) FGSR, HGSR, and HR signals.
Sensors 21 02381 g003
Figure 4. Examples of Cont-RPs for short-term (10 s) FGSR, HGSR, and HR signals.
Figure 4. Examples of Cont-RPs for short-term (10 s) FGSR, HGSR, and HR signals.
Sensors 21 02381 g004
Figure 5. Detailed configuration of proposed multimodal CNN model for feature learning and stress classification.
Figure 5. Detailed configuration of proposed multimodal CNN model for feature learning and stress classification.
Sensors 21 02381 g005
Figure 6. Aggregated confusion matrices for each input signal length (30 s and 10 s). The number in parentheses of each quadrants of the confusion matrices indicates the total number of samples classified as each case.
Figure 6. Aggregated confusion matrices for each input signal length (30 s and 10 s). The number in parentheses of each quadrants of the confusion matrices indicates the total number of samples classified as each case.
Sensors 21 02381 g006
Figure 7. Classification performance in individual recordings based on input signal length (30 s and 10 s).
Figure 7. Classification performance in individual recordings based on input signal length (30 s and 10 s).
Sensors 21 02381 g007
Figure 8. Comparison of the performance variations of our multimodal CNN model and three unimodal CNNs for FGSR, HGSR and HR signals depending on input signal length and sensor type.
Figure 8. Comparison of the performance variations of our multimodal CNN model and three unimodal CNNs for FGSR, HGSR and HR signals depending on input signal length and sensor type.
Sensors 21 02381 g008
Figure 9. Distribution of our learned representation vectors embedded in two-dimensional vector space learned from FGSR, HGSR, and HR signals (red: stressed, green: relaxed).
Figure 9. Distribution of our learned representation vectors embedded in two-dimensional vector space learned from FGSR, HGSR, and HR signals (red: stressed, green: relaxed).
Sensors 21 02381 g009
Table 1. Type of features often used in stress recognition studies using physiological signals.
Table 1. Type of features often used in stress recognition studies using physiological signals.
Feature Domain Physiological SignalsFeature ExamplesStudy
TimeGSR, ECG, HR,
ST, BR, SpO2,
BVP
Mean, Median, SD, RMS, Skewness, Kurtosis,
Maximum, Minimum, Interquartile range, Sum, Amplitude,
Rise time, Means of differences between adjacent elements,
Number of peaks
[2,6,23,24,25,26,27,28]
FrequencyGSR, ECG, RSPEntropy, Power spectrum density, Power sum, The average power, LF, HF, Ratio of LF/HF, Spectral peak features [6,25,26,27,29,30]
Domain-dependentGSR, ECG, RSP, EMGMean HP, Variation in HP, Variation in GSR, Differential area between GSR and its first-order interpolation, Product between RMS and SDCC, Trend-based feature generation [14,31,32]
NonlinearECGRP, RQA, Poincare plot [6,34,35]
GSR: galvanic skin response; ECG: electrocardiogram; HR: heart rate; ST: skin temperature; BR: breath-flow rate; SpO2: oximetry; BVP: blood volume pressure; RSP: respiration; EMG: electromyogram; SD: standard deviation; RMS: root mean squares; LF: low frequency; HF: high frequency; HP: heart period; SDCC: standard deviation of the frequencies; RP: recurrence plot.
Table 2. Excluded recordings in our paper and the reasons.
Table 2. Excluded recordings in our paper and the reasons.
Excluded RecordingReason
drive 01Marker signal is missing.
drive 02HGSR signal is missing.
drive 03Marker and HR signals are missing.
drive 04Marker signal is not clear.
drive 05HR signal is missing.
drive 13HGSR signal is missing.
drive 14HR signal is missing.
drive 17Marker signal is missing.
Table 3. Mean and standard deviation of the three physiological signals for 9 recordings used in experiment.
Table 3. Mean and standard deviation of the three physiological signals for 9 recordings used in experiment.
Sensor FGSR HGSR HR
Status
(Stress Level)
Rest
(Low)
Highway
Driving
(Medium)
City
Driving
(High)
Rest
(Low)
Highway
Driving
(Medium)
City
Driving
(High)
Rest
(Low)
Highway
Driving
(Medium)
City
Driving
(High)
drive 067.42 ± 1.807.25 ± 1.2210.29 ± 2.6418.36 ± 1.3216.19 ± 1.7719.36 ± 1.9180.24 ± 9.3588.31 ± 10.5099.75 ± 13.19
drive 079.21 ± 3.3612.76 ± 1.1612.81 ± 1.725.46 ± 1.716.76 ± 1.177.75 ± 1.2070.9 ± 8.4173.44 ± 5.5578.22 ± 7.60
drive 082.89 ± 0.936.44 ± 0.906.80 ± 1.193.21 ± 0.675.45 ± 0.976.03 ± 1.5463.65 ± 12.5366.49 ± 11.0474.87 ± 24.93
drive 093.55 ± 1.705.12 ± 0.995.27 ± 1.104.40 ± 2.395.66 ± 1.356.60 ± 1.6971.24 ± 15.3373.36 ± 18.2074.03 ± 15.36
drive 104.62 ± 3.236.96 ± 2.129.66 ± 2.236.98 ± 4.056.44 ± 1.759.32 ± 2.6075.35 ± 10.6077.66 ± 7.9283.73 ± 12.99
drive 113.24 ± 0.895.61 ± 0.866.23 ± 1.283.53 ± 1.217.32 ± 1.368.52 ± 1.9460.64 ± 9.5371.42 ± 21.0075.54 ± 23.85
drive 123.32 ± 2.994.07 ± 1.275.35 ± 3.407.67 ± 2.7015.44 ± 2.2115.53 ± 2.0078.72 ± 4.5787.59 ± 4.0688.44 ± 6.32
drive 154.35 ± 1.386.84 ± 0.807.69 ± 1.374.55 ± 1.016.67 ± 1.257.77 ± 1.8669.83 ± 24.9167.98 ± 11.0172.36 ± 14.48
drive 163.74 ± 0.915.71 ± 0.746.90 ± 1.3116.09 ± 1.8420.10 ± 1.0721.21 ± 2.1189.16 ± 10.30101.9 ± 12.65106.1 ± 17.57
Table 4. Classification performance of proposed method based on input signal length.
Table 4. Classification performance of proposed method based on input signal length.
Input LengthClassPrecision (PPV)Recall (Sensitivity)F1-ScoreOverall AccuracyAUC
30 sStressed95.7%96.0%95.8%
Relaxed95.9%95.8%95.7%
95.89%95.67%95.67%95.67%0.9870
10 sStressed91.7%92.8%92.3%
Relaxed92.4%91.7%91.9%
91.67%92.78%92.33%92.33%0.9619
Table 5. Classification performance of proposed method based on input signal length and sensor type.
Table 5. Classification performance of proposed method based on input signal length and sensor type.
Signal Stressed Relaxed Overall
LengthTypePrecisionRecallPrecisionRecallF1-ScoreAccuracyAUC
30 sFGSR92.67%87.50%89.67%92.50%90.62%90.83%0.9091
HGSR82.71%79.57%82.86%77.00%76.57%78.29%0.7825
HR67.25%59.75%64.25%66.00%61.00%62.50%0.6274
3 types95.67%96.00%95.89%95.78%95.67%95.67%0.9870
10 sFGSR92.88%88.50%89.63%92.38%90.50%90.38%0.9101
HGSR83.56%82.67%83.56%79.00%79.83%80.67%0.8141
HR63.86%61.86%55.57%57.43%56.71%59.57%0.5963
3 types91.7%92.8%92.4%91.7%92.33%92.33%0.9619
Table 6. Classification performance of proposed method compared with baseline CNN classifier.
Table 6. Classification performance of proposed method compared with baseline CNN classifier.
SignalInput ClassificationStressed Relaxed Overall
LengthTypeModelPrecisionRecallPrecisionRecallAccuracy
30 s1-D
sequence
Multimodal 1-D CNN82.56%86.78%86.89%80.22%83.44%
Cont-RPMultimodal VGG16 87.88%81.88%85.22%86.11%84.11%
Cont-RPMultimodal CNN95.67%96.00%95.89%95.78%95.67%
10 s1-D
sequence
Multimodal 1-D CNN83.11%84.33%86.33%82.44%83.33%
Cont-RPMultimodal VGG1684.55%81.33%84.55%86.44%84.00%
Cont-RPMultimodal CNN91.7%92.8%92.4%91.7%92.33%
Table 7. Comparison of other 2-class stress classification methods in real-time driving scenarios.
Table 7. Comparison of other 2-class stress classification methods in real-time driving scenarios.
MethodDatasetUsed SignalsInput LengthClassifierAccuracy
[30]SRADFGSR, HR, RESP5 minLogistic
Regression
81.39%
[40]Self-collectionHGSR, HR, HRV,
Breath Rate
100 sCNN92%
[41]SRADFGSR, HGSR, HR30 sSVM93%
ProposedSRADFGSR, HGSR, HR30 sMultimodal CNN95.67%
ProposedSRADFGSR, HGSR, HR10 sMultimodal CNN92.33%
HRV: heart rate variability; RESP: respiration.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lee, J.; Lee, H.; Shin, M. Driving Stress Detection Using Multimodal Convolutional Neural Networks with Nonlinear Representation of Short-Term Physiological Signals. Sensors 2021, 21, 2381. https://doi.org/10.3390/s21072381

AMA Style

Lee J, Lee H, Shin M. Driving Stress Detection Using Multimodal Convolutional Neural Networks with Nonlinear Representation of Short-Term Physiological Signals. Sensors. 2021; 21(7):2381. https://doi.org/10.3390/s21072381

Chicago/Turabian Style

Lee, Jaewon, Hyeonjeong Lee, and Miyoung Shin. 2021. "Driving Stress Detection Using Multimodal Convolutional Neural Networks with Nonlinear Representation of Short-Term Physiological Signals" Sensors 21, no. 7: 2381. https://doi.org/10.3390/s21072381

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop