Next Article in Journal
Exact Likelihood Inference for Parameter of Exponential Distribution under Combined Generalized Progressive Hybrid Censoring Scheme
Previous Article in Journal
Sampled-Data Control for a Class of Singular Takagi-Sugeno Fuzzy Systems with Application in Truck-Trailer System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Human Defecation Prediction Method Based on Multi-Domain Features and Improved Support Vector Machine

1
School of Mechanical and Automotive Engineering, South China University of Technology, Guangzhou 510641, China
2
China Rehabilitation Research Center, Beijing 100068, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Symmetry 2022, 14(9), 1763; https://doi.org/10.3390/sym14091763
Submission received: 15 July 2022 / Revised: 14 August 2022 / Accepted: 19 August 2022 / Published: 24 August 2022
(This article belongs to the Topic Artificial Intelligence in Healthcare - 2nd Volume)

Abstract

:
The difficulty of defecation seriously affects the quality of life of the bedridden elderly. To solve the problem that it is difficult to know the defecation time of the bedridden elderly, this paper proposed a human pre-defecation prediction method based on multi-domain features and improved support vector machine (SVM) using bowel sound as the original signal. The method includes three stages: multi-domain features extraction, feature optimization, and defecation prediction. In the stage of multi-domain features extraction, statistical analysis, fast Fourier transform (FFT), and wavelet packet transform are used to extract feature information in the time domain, frequency domain, and time-frequency domain. The symmetry of the bowel sound signal in the time domain, frequency domain, and time-frequency domain will change when the human has the urge to defecate. In the feature optimization stage, the Fisher Score (FS) algorithm is introduced to select meaningful and sensitive features according to the importance of each feature, aiming to remove redundant information and improve computational efficiency. In the stage of defecation prediction, SVM is optimized by the gray wolf optimization (GWO) algorithm to realize human defecation prediction. Finally, experimental analysis of the bowel sound data collected during the study is carried out. The experimental result shows that the proposed method could achieve an accuracy of 92.86% in defecation prediction, which proves the effectiveness of the proposed method.

1. Introduction

With the increasing number of elderly [1], defecation care for the long-term bedridden elderly has become a pressing social issue. The defecation care of the long-term bedridden elderly is often accompanied by “dirty, messy and smelly” problems [2], which not only make the caregivers miserable but also make it difficult to guarantee the privacy and dignity of the elderly.
The existing methods of defecation care mainly include post-defecation treatment and pre-defecation warning. At present, the most commonly used nursing methods are post-defecation treatment, such as caregivers nursing, using diapers, using anal bags and using intelligent nursing robots, etc., but this kind of method often causes the skin to come into contact with excrement [3], which easily causes skin damage or even skin diseases in the elderly, and is not comfortable. Although methods based on pre-defecation warning can effectively solve the above problems, the existing methods are rare and difficult to be used in long-term care. For example, Zan et al. [4] proposed a method for predicting bowel intention based on rectal pressure signals monitored by biological parameter telemetry capsules. However, one capsule can only be monitored for 24 h. Long-term monitoring requires the long-term use of telemetry capsules, which is potentially risky and costly. Therefore, there is an urgent need for a new, cost-effective, long-term, and simple method for predicting before defecation to improve the quality of life of the disabled elderly. In this paper, based on the study of physiological signals related to human defecation, a machine learning algorithm is used to identify the intention of defecation, and a defecation prediction method is proposed.
The physiological signal of the human body reflects the electrical activities of specific body parts [5], which is closely related to the health status and life process of the human body. Therefore, this information is a valuable data source for disease detection, rehabilitation, and treatment [6]. At present, physiological signals that have been widely studied in the field of healthcare include electromyography (EMG), electroencephalogram (EEG), electrocardiogram (ECG), and electrooculography (EOG). Among them, the EMG signal is a typical clinical recording method for diagnosing and monitoring neuromuscular behavior, which can be used to identify muscle injuries [7], neuroprosthetic control [8], and motor intention decoding [9], etc. EEG is the electrical activity of the brain and is widely used in sleep state recognition [10] epilepsy detection [11], and emotion recognition [12]. ECG, as the main means to detect the electrical activity of the heart, is an important and harmless means to predict and diagnose cardiovascular diseases [13,14]. EOG can detect eye movements, and can be used for ophthalmic diagnosis [15] and sleep status monitoring [16]. Inspired by these studies, we set out to investigate the physiological signals associated with defecation in humans.
A medical study shows that the human defecation intention is generated by the feedback loop formed by the pressure receptors distributed around the periphery of the rectum, nerve tissue, and brain [17]. Therefore, when the feces entering the rectum reaches a certain volume, the human body will have an obvious defecation intention. In addition, studies have shown that defecation is one of the important evaluation criteria reflecting gastrointestinal motility [18]. Based on the above studies, we can speculate that there is a high correlation between gastrointestinal status and human defecation activities, so it is feasible to identify defecation intention by monitoring human gastrointestinal status. Bowel sound (BS) is the sound produced when intestinal contractions push liquid and gas through different parts of the intestine during digestion [19]. It is a simple and effective physiological signal for evaluating the state of the gastrointestinal tract. Therefore, BS is used to monitor gastrointestinal status. In addition, the rectum is normally empty [20], but when feces stored in the colon are pushed into the rectum, the defecation center sends defecation signals through the efferent nerves. We hypothesized that the process of feces pushing into the rectum will be accompanied by more frequent BS, and the difference in the number of feces stored in the bowel may also lead to differences in the quality of BS.
However, manual interpretation of BS is complex, not only requires expertise acquired over many years of training, but is also time-consuming and labor-intensive. Therefore, computer-aided methods are needed to automatically identify and analyze BS, reduce human burden, and reduce errors caused by fatigue and internal variability [21]. The existing research techniques in BS can be divided into two categories: statistics-based methods and machine-learning-based methods. Iterative kurtosis-based detection (IKD) [22] is one of the classical statistics-based algorithms. It locates each BS event through point-by-point estimation of the kurtosis value of BS record points. The points with kurtosis values greater than the threshold are attributed to the existence of nearby BS events. However, IKD algorithms are acausal and cannot be implemented in real-time. On the other hand, with the development of artificial intelligence technology, some advanced machine learning technologies have been continuously used in BS clinical research, and have been widely used in the diagnosis and identification of intestinal diseases. Machine learning techniques conduct BS signal studies in a data-driven manner and are often more effective than statistical-based methods. In 2008, Dimoulas et al. [23] used wavelet feature extraction and multilayer perceptron (MLP) network classifier to analyze BS, which has high accuracy. In 2011, Kim et al. [24] used back propagation(BP) network to analyze BS, and their results showed that there is a good correlation between BS and colonic transit. In 2014, Ulusar et al. [25] proposed a method based on Naive Bayesian (NB) algorithm to confirm the recovery of bowel function in patients after major abdominal surgery by monitoring BS. In 2018, Liu et al. [26] proposed a BS monitoring method based on Mel frequency cepstrum coefficient (MFCC) features and long-short term memory (LSTM) neural networks. This method can accurately distinguish BS and noises, and can be used for long-term detection of gastrointestinal motility. In 2018, Yin et al. [27] improved the BS monitoring system and proposed a BS recognition system based on Support Vector Machine (SVM) to identify intestinal motility events. In 2019, Du et al. [28] proposed the feasibility of applying BS to the diagnosis of irritable bowel syndrome based on a logistic regression algorithm.
It can be seen that applying machine learning technology to BS signal analysis provides a promising and effective method for gastrointestinal status-related diagnosis. However, there is no relevant research on the application of BS to defecation prediction. In addition, in previous studies, the distinction between types is mostly based on typical acoustic features, including spectral centroid, sub-band normalized energy, value of envelope thoracic coefficient, etc. [29], which are all single-domain features. Since BS has the characteristics of asymmetry, weak signal, strong background noise, large individual differences, and strong randomness [25], feature extraction based on single domain tends to ignore the information of signals in other domains, and it is difficult to fully reflect the inherent characteristics of the signal [30,31]. Moreover, in practical applications, the accuracy and generalization ability of the method need to be guaranteed, but the high computational complexity will hinder the applicability of the method in real-time continuous processing. Most previous studies have failed to balance accuracy and computational complexity. To solve the above problems, this paper proposes a new defecation prediction method based on multi-domain features and gray wolf optimizer-based support vector machine (GWO-SVM). This method is different from all previous methods. First of all, we extract the multi-domain features of BS, and select the features that are sensitive to classification. As far as we know, this is the first time in BS research. In addition, in feature extraction in time-frequency domain, we use wavelet packet transform and entropy instead of discrete wavelet transform coefficients, which can retain important information in high-frequency components. Finally, when we use SVM as the prediction model, we use the gray wolf optimization (GWO) algorithm for optimization, which can improve the recognition accuracy and generalization ability of the model. The method proposed in this paper is an innovative application of machine learning in the field of health care, and provides a promising and effective method for defecation care of the bedridden elderly. Since the monitoring of BS is non-invasive in vitro, it can avoid pain and inconvenience when applied to the human body. The proposed method has high recognition accuracy, which can meet the detection function of actual use, assist the bedridden elderly to get timely defecation care, and reduce the nursing cost and burden.
The research framework of the proposed method is shown in Figure 1, and the details are as follows. We firstly filter and denoise the collected BS signals, and then use three methods (statistical analysis, fast Fourier transform (FFT), and wavelet packet transform) to extract the features from multi-domain aspects (time domain, frequency domain, and time-frequency domain) to construct high-dimensional datasets. We believe that the multi-domain characteristics of normal human BS are distributed in a centrosymmetric arrangement, and this symmetry will change after adding the human BS signals. Therefore, the Fisher Score (FS) algorithm is used for feature selection of multi-domain features, and sensitive features with large changes in symmetry and meaningful for classification are selected to improve computational efficiency. After that, SVM which has a strong generalization ability in small datasets and nonlinear data classification is used to establish the defecation prediction model. In addition, the GWO algorithm is used to optimize SVM to get better accuracy of defecation prediction. Finally, experimental analysis is carried out based on the BS data collected in the experiment to verify the feasibility of defecation prediction based on the BS signal. The rest of this paper is organized as follows. Section 2 introduces the BS collection system and dataset. Section 3 introduces the processing methods of the BS signals and specifically introduces the process of multi-domain features extraction. Section 4 introduces the implementation process of GWO algorithm to optimize SVM classifier. Section 5 verifies the effectiveness of the proposed method through an experimental case, and the conclusion is shown in Section 6.

2. Bowel Sound Collection System and Dataset

2.1. Bowel Sound Collection System

To realize the monitoring of BS data and other physiological parameter signals, we build a BS collection system, as shown in Figure 2. The system includes a signal acquisition module, a data receiving module, and a real-time display module. The signal acquisition module includes a 3M™Littmann® 3200 electronic stethoscope for BS acquisition and a Biosignalsplux physiological multipurpose recorder for EEG, EGG, ECG, and other physiological parameter signals acquisition. The data receiving module adopts the industrial computer KMDA-3921, connected with the signal acquisition module through Bluetooth. The real-time acquisition module can display the physiological signals collected in real-time. Among them, the Biosignalsplux physiological multipurpose recorder in the signal acquisition module will be used for subsequent research.

2.2. Bowel Sound Data Acquisition

The sampling frequency of the BS stethoscope is 4000 Hz, which is the built-in sampling frequency of the sensor. Since the typical BS frequency range is between 50 Hz and 1500 Hz, according to the Nyquist sampling theory, when the sampling frequency is greater than double the highest frequency in the signal, the digital signal after sampling can completely retain the information of original signal. Therefore, the sampling frequency of 4000 Hz can completely retain the information of the BS signal. Since BS is best collected in the membrane-type auscultation mode (amplifying the 20–2000 Hz sound and strengthening the 100–500 Hz sound), the auscultation mode of the stethoscope is set to the membrane-type.
Figure 3 shows a schematic diagram of BS data collection locations. It can be seen that the common collection locations of BS are the upper right, upper left, and lower left positions of the abdomen. The lower left part of the abdomen is the sigmoid colon, which is connected to the rectum. When the feces stored in the sigmoid colon are large or the feces are pushed to the rectum from the sigmoid colon, humans will have a more obvious defecation intention. Therefore, we believe that it is the most suitable to fix the stethoscope in the lower left quadrant of the abdomen to collect BS in this paper.
Before BS data collection, it is necessary to shave off the excess hair where the BS stethoscope is located and apply a scrub to remove dead skin. During the collection process, to reduce unnecessary human error, the stethoscope is fixed on the lower left quadrant of the volunteer’s abdomen with medical tape. The volunteers are asked to take a supine position, minimize body movement, try to breathe evenly, avoid speaking, and stay awake. The duration of each data point is 1 min. After data collection, the BS data within 10 min before defecation are manually marked as label 1 (intention to defecate), and the rest are marked as label 0 (no intention to defecate).

3. The Processing Methods of the Bowel Sound Signals

3.1. Filtering Method for Noise Reduction

BS is usually regarded as a non-stationary short-term signal with sudden characteristics, which is easily polluted by noise caused by activities such as heartbeat, breathing, and exercise. Therefore, the bowel sound signal needs to be filtered for noise reduction before extracting features. A recent study confirmed that the largest part of the power spectral density of BS is between 100 Hz and 500 Hz, and the power spectral density above 1000 Hz only accounts for a very small part [32]. Therefore, we mainly retain the information of the BS signal from 100 Hz to 1000 Hz in this paper.
By filtering the original BS signal with a second-order Butterworth-type high-pass filter with a cut-off frequency of 100 Hz, a low-pass filter with a cut-off frequency of 1000 Hz, and a notch filter of 50 Hz, the noise caused by breathing, heartbeat, electromagnetic interference, and other activities can be effectively reduced [25]. Figure 4 shows the time domain and frequency domain comparison diagram of the BS signal before and after filtering. It can be seen that the signal before filtering is noisy and contains more noise below 100 Hz. After filtering, the noise is effectively suppressed. In addition, as shown by the arrow in Figure 4, the location of BS can be more clearly seen in the filtered signal.
Figure 5a,b are the time domain diagrams of typical BS signals after filtering with and without defecation intention. From the figure, we can see that the frequency and amplitude of BS are lower when there is no urge to defecate, while the frequency and amplitude of BS will increase when there is an urge to defecate, especially when there is a strong urge to defecate. This paper speculates that this phenomenon may be due to the movement of feces from the colon to the rectum, accompanied by more pronounced intestinal peristalsis.

3.2. Multi-Domain Features Eextraction

Due to the asymmetry, strong randomness, and wide dynamic range of BS signals, it cannot be directly used for defecation intention prediction, and it is difficult to fully extract suitable feature information only by linear correspondence or single-domain feature vector. Therefore, it is necessary to extract multi-domain features of BS signals.
The BS collected in this paper can be expressed as x n = x 1 , x 2 , , x N , where N is the number of data points of the signal x n . Since the sampling frequency is 4000 HZ and the duration of each acquisition is 1 min, N is 240,000 here. In this paper, three widely used methods (statistical analysis, FFT, and wavelet packet transform) are used to extract the multi-domain features of the filtered BS signals.

3.2.1. Time Domain Features Extraction

Statistical analysis is a research method that uses statistical methods to analyze the research object from quantitative and qualitative. Using statistical analysis methods to extract the time domain features of the signal, we can obtain a distribution of the signal, which represents the waveform of the signal. When the human body has an obvious urge to defecate, the amplitude and distribution of the signal may be different from that when there is no urge to defecate. Therefore, this paper uses statistical methods to extract 16 time domain features of BS signals, and their expressions are listed in Table 1. Among them, there are 10 dimensional statistical parameters T 1 T 10 such as mean, standard deviation, square root amplitude, and six dimensionless statistical parameters T 11 T 16 such as waveform index, peak index, and pulse index [33].

3.2.2. Frequency Domain Features Extraction

FFT is a transform form that can transform a signal from the time domain to the frequency domain. It has a wide range of applications in the fields of acoustics and signal processing, because of its fast and efficient calculation algorithm, low computational cost, and can meet the premise that digital systems can process. Using FFT to extract the frequency domain features of the signal, the spectral information of the signal can be obtained, such as amplitude or phase, and it is more feasible to identify signal changes or patterns. The frequency domain analysis of the BS signal observes the characteristics of the signal according to the frequency, which makes the analysis of the signal more profound and convenient. When BS is frequent enough or occurs rarely, the collected signals will be relatively concentrated in the frequency spectrum, and due to the urge to defecate, there are more feces in the sigmoid colon, which may also affect the position of the main frequency band of the BS. The frequency domain feature expressions of the 13 BS signals F 1 F 13 are shown in Table 2, and the definitions of the involved parameters are shown in Table 3. Among them, the feature value F 1 reflects the vibration energy of the BS signal in the frequency domain, and the feature values F 2 F 4 , F 6 and F 10 F 13 reflect the concentration and dispersion of the BS signal in the frequency spectrum. The feature values F 5 and F 7 F 9 reflect the positional change of the BS signal in the main frequency band [33,34].

3.2.3. Time-Frequency Domain Features Extraction

Both the time domain features and the frequency domain features describe the state information of the entire BS signal. It is not possible to analyze the BS signal locally, and it is difficult to observe the information of the frequency of the non-stationary component in the signal changing with time, that is, the time-frequency resolution is not high. With the help of time-frequency domain analysis, the frequency information of the BS signal can be observed in a small range, and the time-frequency resolution of the BS signal can be improved. Wavelet packet transform [35] is developed based on wavelet transform, and it is a modern time-frequency analysis and processing method that can effectively process all kinds of non-stationary random signals. It overcomes the shortcomings of wavelet transform with low resolution in the high-frequency part, and has better time-frequency resolution. It has been widely used in language, image, seismic, mechanical vibration and other fields.
In this paper, wavelet packet transform is used to extract the time-frequency domain features, and the collected BS signal can be decomposed into multiple two-dimensional parameters (time and frequency) to realize the feature decomposition in different frequency bands and different times. The time-frequency domain features include wavelet energy ratio P 1 P 16 , wavelet energy entropy E E , wavelet feature scale entropy P E 1 P E 16 and wavelet singularity entropy S E . The specific steps are as follows:
(1)
BS signal x n is decomposed by a four-layer wavelet packet transform, and 16 sub-bands are obtained. The j-th layer wavelet packet decomposition of the signal x n can be written as:
x n = i = 1 2 j x j i ( n ) ,
where x j i ( n ) is the i-th sub-band signal decomposed by j-layer wavelet packet transform, and the sub-band signal is reconstructed to be the same length as the signal x n .
(2)
Extracting wavelet energy ratio P 1 P 16 and wavelet energy entropy E E . After decomposition by wavelet packet, the total energy E x of the signal x n can be written as:
E x = i = 1 2 j E j i ,
where E j i = + x j i ( n ) 2 d n is the node energy of the i-th sub-band, so the wavelet energy ratio and wavelet energy entropy P 1 P 16 can be defined as [36]:
P i = E j i E x ,
E E = i = 1 2 j P i log P i ,
(3)
According to the wavelet packet coefficient sequence at each scale, extracting wavelet feature scale entropy P E 1 P E 16 . After the wavelet packet transform is performed on the signal x n , the wavelet packet coefficient sequence at each scale can be obtained as: C j = c j ( 1 ) , c j ( 2 ) , , c j ( N ) j = 1 , 2 , 3 , 4 , where N is the length of the sub-band signal and c j can be regarded as a division of the signal x n . The measure of this division is defined as:
P j k = c F ( j ) ( k ) k = 1 N c F ( j ) ( k ) ,
where c F ( j ) ( k ) is the FFT of c j ( k ) , so the wavelet feature scale entropy of the j-th scale of the signal x n can be defined as [37]:
P E j = k = 1 N P j k log P j k ,
(4)
Extracting the wavelet singular entropy S E . Wavelet singular entropy [38] makes full use of the advantages of wavelet packet transform for adaptive time-frequency localization, the extraction function of singular value decomposition for time-frequency spatial feature patterns, and the statistical properties of information for signal uncertainty and complexity. It can be used to effectively identify BS signals in different states. The wavelet packet decomposition tree after the j-th layer wavelet packet decomposition is performed on the signal x n is shown in Figure 6. The bottom p nodes of wavelet decomposition coefficients of length q can form a time-frequency distribution matrix W p × q , which reflects the time-frequency space energy distribution characteristics of the signal x n . According to the singular value decomposition theory, W p × q can be decomposed as:
W p × q = U p × k Λ k × k V k × q ,
where p = 2 j , q = N / 2 j , and Λ = d i a g ( λ 1 , λ 2 , , λ k ) is the singular value diagonal matrix, which satisfies the descending order: λ 1 λ 2 λ k . So the wavelet singular entropy is defined as:
S E = i = 1 k Q i ,
Q i = λ i i = 1 k λ i ln λ i i = 1 k λ i ,
Ultimately, by integrating the multi-domain features D data i of each data sample, high-dimensional feature sets H data can be constructed:
H data = D data 1 D data p D data q = T 1 × 16 F 1 × 13 P 1 × 16 E E 1 P E 1 × 16 S E 1 T p × 16 F p × 13 P p × 16 E E p P E p × 16 S E p T q × 16 F q × 13 P q × 16 E E q P E q × 16 S E q
where q represents the total number of data samples, and D data p represents the multi-domain features of the p-th sample. The dimension of the matrix H data is 63, including 16-dimensional time domain features, 13-dimensional frequency domain features, 16-dimensional wavelet energy ratio features, one-dimensional wavelet energy entropy feature, 16-dimensional wavelet feature scale entropy features, and one-dimensional wavelet singular spectrum entropy feature.

3.3. Fisher Score Algorithm

We believe that the multi-domain features of normal human BS signals are distributed in centrosymmetric, which will change after adding the BS signals of having an urge to defecate. The multi-domain features of BS signals constitute a high-dimensional dataset H data , which can reveal the state information and intrinsic properties of BS more broadly, but also brings some redundant and negative information. So before the model training, it is necessary to reduce the dimensionality of the high-dimensional dataset, which can select sensitive features with large changes in symmetry and meaningful for classification, and improve computational efficiency. FS algorithm is a typical filtering feature selection method [39]. It selects features that are effective for classification according to the score of candidate features, especially the features with the most distinguishing ability as candidate features. In the data space spanned by the selected features, the distance between data points in different classes is as large as possible, and the distance between data points in the same class is as small as possible. It can be considered that the higher the score of the feature, the greater the change in symmetry, and the more meaningful it is for classification. For a given high-dimensional dataset H data , a widely used heuristic strategy is to calculate the score of each feature independently according to the F criterion, then the FS value of the j-th feature is calculated as follows:
F ( x j ) = k = 1 c n k ( μ k j μ j ) 2 ( σ j ) 2 ,
where ( σ j ) 2 = k = 1 c n k ( σ k j ) 2 , c is the number of classes, n k is the number of the sample of the k-th class, μ k j and σ k j are the mean and standard deviation of the j-th feature in the data of the k-th class, μ j and σ j are the mean and standard deviation of the entire dataset of the j-th feature. After calculating the FS value of each feature in the multi-domain features of BS, according to the experimental test result, the top m features with the highest scores are selected as sensitive features, and a new low-dimensional feature set is established.

4. Support Vector Machine Optimized by Gray Wolf Optimization Algorithm

4.1. Support Vector Machine

There is a nonlinear mapping between the peristaltic mechanism of the intestine and the feature vectors of BS signal, and in general, the frequency of defecation in healthy people is about 1.2 times per 24 h [40]. Affected by this physiological characteristic, the number of BS data samples that can be collected during the experiment is limited, and it is necessary to consider the prediction of defecation intention based on small samples. SVM [41] is a machine learning method based on the principle of structural risk minimization, and its wide and successful application in the engineering and biomedical field in recent years has proved that it is an excellent classification model with strong generalization ability [42,43,44]. Therefore, this paper selects the SVM model to predict defecation intention.
Given a training set M = x i , y i x i R N , y i 1 , 1 , i = 1 , 2 , , n , where x i are the sample data and y i is the sample category, the basic principle of SVM is to find the optimal separation hyperplane, as shown in Figure 7. The solution of its optimal hyperplane can be transformed into the following constrained minimization problem:
min 1 2 ω 2 s . t . y i ω x i + b 1 ,
where ω R N is the coefficient or weight vector, and b is the bias term.
To improve the generalization ability of SVM, a slack variable θ i 0 , i = 1 , 2 , , n is introduced, and Equation (12) is rewritten as: …
min 1 2 ω 2 + c i = 1 n θ i θ i 0 s . t . y i ω x i + b 1 θ i c 0 i = 1 , 2 , , n ,
where c represents the penalty factor, and its value can weigh the empirical risk and the structural risk. By using the Lagrange multiplier method on the above equation, the Lagrange function of the optimization problem can be obtained:
L ω , b , α , θ , μ = 1 2 ω 2 + c i = 1 n θ i + i = 1 n α i 1 θ i y i ω T x i + b i = 1 n μ i θ i ,
where α = α 1 ; α 2 ; ; α n and μ = μ 1 ; μ 2 ; ; μ n are Lagrange multipliers, among them, α i 0 , μ i 0 . Then the kernel function G x i , x j = φ T x i φ x i is introduced, where · represents the inner product operation. According to the KKT condition, Equation (14) can be described as the following dual optimization problem:
max L ( α ) = i = 1 n α i 1 2 i = 1 n j = 1 n α i α j y i y j G x i , x j s . t . i = 1 n α i y i 0 α i c ; i = 1 , 2 , , n ,
Common kernel functions in SVM include linear kernel function, polynomial kernel function, Gaussian kernel function, and radial basis function (RBF). Choosing different kernel functions, the classification accuracy may be very different. To better approximate any nonlinear function, the RBF is selected as the kernel function in this paper, which is defined as:
G x i , x j = exp x i x j 2 2 g 2 ,
where g is the kernel parameter, which reflects the distribution complexity of the data samples in the high-dimensional space. Finally, the classification decision function of SVM can be defined as:
f ( x ) = s i g n i = 1 n α i y i G x i , x j + b ,

4.2. Gray Wolf Optimization Algorithm

In the study, the choice of penalty factor c and kernel parameter g has a great influence on the classification accuracy of SVM, therefore, it is necessary to study the setting of SVM parameters [45]. Because GWO algorithm has a convergence factor that can be adaptively adjusted and an information feedback mechanism, it can achieve a balance between local optimization and global search. Besides, GWO has strong convergence and is easy to implement, so in this paper, GWO algorithm is used to optimize the penalty factor c and the kernel parameter g of the RBF kernel function of SVM.
The GWO algorithm was proposed by Mirjalili et al. [46] in 2014, which was inspired by the predation behavior of gray wolves and simulated the population structure and hunting behavior of wolves. The highest-ranking leader in the group becomes wolf A, who is mainly responsible for various decision-making matters in the group. Wolf B ranks second only to wolf A in the wolf pack and is mainly responsible for assisting wolf A in decision-making. When wolf A’s position becomes vacant, wolf B will become the new leader. Wolf C is the third level of the wolf pack and mainly follows the decisions of wolf A and wolf B, and wolf A and wolf B with poor fitness will also be downgraded to wolf C. Wolf Z is the lowest rank of wolves and is mainly responsible for the balance of relationships within the population. The GWO algorithm regards wolf A, wolf B and wolf C in the wolf pack as the three optimal solutions of the algorithm, and the rest of the wolves, including Z, revolve around wolf A, wolf B or wolf C to update the position.
The hunting process of gray wolves includes three parts: stalking and approaching prey, chasing and surrounding prey, and attacking prey. The behavior of gray wolves to update distance and position during stalking and approaching prey is defined as follows:
D = C × X p t X t ,
X t + 1 = X p t + A × D ,
where D is the distance between the gray wolf and the prey, t is the number of iterations, X p is the position of the prey, X is the position of the gray wolf, and its initial position coordinate is defined as c , g . A and C represent coefficients, and the calculation formula is as follows:
A = 2 a × r 1 a ,
C = 2 r 2 ,
where a is the convergence factor, which decreases linearly from 2 to 0 as the number of iterations increases. r 1 and r 2 are random values of 0 , 1 respectively. As shown in Figure 8, when A > 1 , it represents a global search, that is, the gray wolves expand the search range to find better prey, when A 1 , it represents a local search, the gray wolves will narrow the surrounding area to search for nearby prey.
When the gray wolves recognize the location of the prey, wolf B and wolf C surround the prey under the leadership of wolf A. Because wolves A, B, and C are closest to the prey, the positions of these three are used to determine the location of the prey. At the same time, the candidate wolves (including wolf Z) update their positions according to the positions of the three, and gradually approach the prey. The update mechanism of the individuals in the wolf group is shown in Figure 9.
Among them, the distance between wolf A, wolf B and wolf C and other individuals is expressed as:
D a = C 1 × X a ( t ) X ( t ) D b = C 2 × X b ( t ) X ( t ) D c = C 3 × X c ( t ) X ( t ) ,
where X a , X b and X c represent the current positions of wolf A, wolf B and wolf C respectively, C 1 , C 2 and C 3 are random variables, and X ( t ) represents the current position of the gray wolf. The step size and direction that wolf Z advances toward wolves A, B, and C are defined by Formula (23), and the final position of wolf Z is defined by Formula (24):
X 1 = X a A 1 × D a X 2 = X b A 2 × D b X 3 = X c A 3 × D c ,
X t + 1 = X 1 + X 2 + X 3 3 ,
Then, the position of the gray wolf is updated again until there is the best optimal solution. At that time, the prey stops moving, and the gray wolf completes the hunting process by attacking. The position coordinate value corresponding to the best optimal solution is defined as best c , best g . The calculation flowchart of GWO-SVM is shown in Figure 10. In the optimization process, this paper sets the population size M = 20, the maximum number of iterations t max = 30 , and defines the fitness function as:
f = y f y t + y f ,
where y f and y t represent the number of misclassified and correctly classified samples after 10-fold cross-validation, respectively. The smaller the fitness value, the higher the classification accuracy, that is to say, the parameter optimization process of SVM can be described as the solution to the fitness function minimization problem. GWO algorithm is used to optimize the penalty factor c and kernel function parameters g of SVM, which can effectively improve classification accuracy.

5. Validation of the Proposed Method

All the data processing algorithms in this experiment are compiled in MATLAB R2019b, and the machine learning algorithm is implemented based on Scikit-Learn machine learning library. The operating environment is as follows: Intel(R) Xeon(R) CPU, 96 GB RAM and Windows 10 system.

5.1. Explanation of the Experimental Data

A total of 16 volunteers are recruited for the experiment. They have normal gastrointestinal function and did not take any medication that affected bowel motility in the recent period before data collection. And all the volunteers read the relevant precautions and signed the informed consent. According to the method in Section 2.2, we establish a dataset, and a total of 232 groups of BS data are collected. The duration of each group of BS data is 1 min. Among them, 117 groups of the data are marked as having the intention to defecate, and 115 groups of the data are marked as not having the intention to defecate. The data were collected from Beijing Bo’ai Hospital and South China University of Technology during November 2020 to February 2022. Among them, the data of bedridden elderly are all from Beijing Bo’ai Hospital, which is affiliated with the China Rehabilitation Research Center. The volunteers here are between 40 and 75 years old, both male and female, and each volunteer’s condition is somewhat different. For example, there are those who have had a stroke, those who could not speak, those who have a broken bone, and those who are not able to take care of themselves. In addition, in order to obtain more experimental data and increase the generalizability of the experimental data, we also conducted data collection at South China University of Technology. The volunteers here are between 22 and 35 years old. Therefore, the experimental data contains data from different age groups, different genders, and different physical conditions, which ensures the generalizability of the experimental data. Although the size of the BS dataset is limited by time and recruitment conditions, similar dataset sizes have been successfully used in proof-of-concept studies in the past [28,47]. Furthermore, we also plan to collect more BS data.

5.2. Experimental Results and Analysis

According to the method proposed in Section 3.2, multi-domain features extraction is performed on the filtered BS data. The 63 features extracted from each sample are combined to form a high-dimensional feature set with a size of 232 × 63, and then normalized, and the normalized interval is [0, 1]. After normalization, FS algorithm is used to select the candidate multi-domain features. According to the importance of different features, the multi-domain features are reordered as follows:
F S 31 > F S 32 > F S 8 > F S 10 > F S 36 > F S 33 > F S 48 > F S 25 > F S 14 > F S 13 > F S 50 > F S 21 > F S 12 > F S 44 > F S 6 > F S 23 > F S 16 > F S 53 > F S 37 > F S 11 > F S 61 > F S 40 > F S 45 > F S 57 > F S 19 > F S 18 > F S 20 > F S 43 > F S 47 > F S 30 > F S 62 > F S 2 > F S 17 > F S 49 > F S 22 > F S 27 > F S 7 > F S 29 > F S 9 > F S 35 > F S 15 > F S 54 > F S 5 > F S 3 > F S 58 > F S 56 > F S 55 > F S 34 > F S 4 > F S 38 > F S 42 > F S 51 > F S 60 > F S 24 > F S 59 > F S 28 > F S 63 > F S 46 > F S 39 > F S 41 > F S 52 > F S 26 > F S 1
To study the contribution of FS algorithm, Figure 11 plots the classification accuracy of the proposed method under different feature dimensions selected by the FS algorithm. It can be seen from the figure that when the 19-dimensional feature set is selected by the FS algorithm, the highest classification accuracy rate is 92.86%. After continuing to increase the feature dimension, the classification accuracy rate does not increase but decreases. In addition, the higher the input feature dimensions, the longer the algorithm calculation time. Therefore, this paper adopts FS algorithm for feature selection, which can also balance the calculation efficiency and accuracy, and verifies the effectiveness of the FS algorithm.
The first 19 features with the highest scores after sorting by FS algorithm are used as sensitive features, and a new low-dimensional feature set with a size of 232 × 19 is constructed. Finally, the resulting low-dimensional feature set is fed into GWO-SVM classifier, and Figure 12 plots the relative curve between the number of iterations and the test accuracy. It can be seen from the figure that when the penalty factor c and the kernel parameter g are optimized to 306.6857 and 0.049902, respectively, the proposed method can achieve the best accuracy of 92.86% in predicting human defecation intention, which verifies the effectiveness of the proposed method.
To study the influence of parameters ( c , g ) on the results of the proposed method, the parameters c and g of SVM are adjusted according to the literature and some empirical values [34,48], and the corresponding test results are shown in Table 4. As can be seen from Table 4, using GWO algorithm to optimize the parameters (c = 306.6857 and g = 0.049902) achieves higher accuracy than using the six combined parameters listed in Table 3, that is to say, an inappropriate combination parameters ( c , g ) will reduce the classification performance of SVM, which illustrates the necessity and effectiveness of using GWO algorithm to optimize SVM.

5.3. Comparison between Different Methods

In this subsection, a research comparison of different methods is carried out to verify the effectiveness and superiority of the proposed method. In this comparison, the proposed method is compared with methods in similar studies. These methods are KNN, NB, SVM, and LR. KNN is used by Saini et al. [14] to identify arrhythmia ECG signals, and it can ensure the simplicity and effectiveness of information in the classification process. NB is used by Ulusar [25] to evaluate the recovery of intestinal function of patients after surgery, and has a high accuracy rate. SVM is one of the classic machine learning methods with excellent generalization ability, and Yin et al. [27] used this model when recognizing BS. LR is the model used by Du et al. [28] to verify the feasibility of BS signals in the diagnosis of irritable bowel syndrome.
We take the extracted 16 time domain features, 13 frequency domain features, 34 time-frequency domain features, and 19 multi-domain features selected by FS algorithm as four different feature sets. The different input features are then combined with five classifiers (GWO-SVM, SVM, NB, KNN and LR). Table 5 shows the classification accuracy of different classifiers with different domain features. Table 6 shows the classification accuracy of different classifiers with different domain feature combinations. As can be seen from Table 5 and Table 6, the method proposed in this paper (combination of multi-domain features and GWO-SVM) has the highest testing accuracy (92.86%) of all the combinations, which proves the superiority of the proposed method. It can also be seen in Table 5 and Table 6 that combining GWO-SVM classifier with different types of features achieve an average test accuracy of 85.71% and 90.36%, respectively, which is higher than other classifiers. The effectiveness and superiority of GWO-SVM classifier is proved. In addition, we can also see in Table 5 and Table 6 that the combination of multi-domain features and different classifiers can achieve a higher average test accuracy (86.57%) than other input features. This shows that the extraction of multi-domain features can improve the classification accuracy, that is, multi-domain features have obvious advantages in classification. Further, we can see that the average test accuracy of the other input features in Table 6 is higher than that of the other input features in Table 5. This can be explained by the fact that richer feature information can be obtained with additional domain features extracted, which is helpful to obtain features sensitive to classification. Finally, we can also see from Table 6 that the average test accuracy decreases after removing any single-domain features from multi-domain features. This shows that the features of each domain are meaningful in multi-domain feature extraction. Further, we can see that the feature combination of time and frequency domains achieved an average test accuracy of 82.00%, which is lower than the combination of frequency and time-frequency domains (82.29%), and also lower than the combination of time and time-frequency domains (83.71%). This shows that among multi-domain features, time-frequency domain features provide more classification-sensitive information, which indicates the effectiveness of using wavelet packet transform to extract time-frequency features. This result is also consistent with the ranking result of the importance of different features by the FS algorithm in Section 5.2. Among the 19 sensitive features selected by us, there are 10 features in time-frequency domain, which are more than those in time domain (six features) and frequency domain (four features).
To demonstrate the generality of the experimental results, we input the multi-domain features into different classifiers for five different tests, and the results are shown in Figure 13. As can be seen from the figure, GWO-SVM used in this paper has higher classification accuracy than other classifiers (SVM, NB, KNN and LR) in five tests, which verifies the generalization and superiority of the proposed method. There are several reasons for this phenomenon. On the one hand, compared with standard SVM, GWO-SVM optimizes the parameters c and g, so the classification ability is better. On the other hand, compared with other classifiers, SVM can make more reliable decisions on a small number of data samples and has stronger generalization ability. It should be noted that the parameters of other comparison classifiers such as SVM, KNN, NB, and LR are chosen empirically, which may lead to a decrease in accuracy.
During the above experiment, we divided all data into training set and test set in a ratio of 7:3. Only the data in the training set is used for model training. After the model training is completed, the model is tested and verified on an unseen test set. Therefore, the obtained results are generalizable and can be applied to general situations. And compared with other methods, the accuracy of the proposed method reaches 92.86%, which is better than other methods.Therefore, we can conclude that the method proposed in this paper is more effective than traditional classification models. Since there is no related method proposed in the field of defecation prediction, this paper does not compare with other existing methods.

6. Conclusions, Limitations, and Future Research

This paper proposes a new defecation prediction method based on multi-domain features and GWO-SVM, so that nursing staff can know the defecation needs of the bedridden elderly in advance, better provide defecation care, reduce the dirty, messy, and smelly phenomenon in the defecation nursing process, and relieve the pain of the bedridden elderly in their later years.
To improve the quality of life of the bedridden elderly in their later years, a human defecation prediction method based on multi-domain features and GWO-SVM is proposed, so that nursing staff could know the defecation needs of the bedridden elderly in advance and provide better defecation care. In this method, the BS is used as the original signal, and three methods of statistical analysis, FFT and wavelet packet transform are applied to the filtered BS to extract the time domain, frequency domain and time-frequency domain features for building a high-dimensional feature set. In addition, feature selection is performed on high-dimensional feature sets, and FS algorithm is used to select meaningful and sensitive features according to the importance of each feature which can remove redundant information and construct low-dimensional feature sets. In this paper, the detailed process of defecation intention prediction based on GWO-SVM is designed, and the actual BS signals collected during the study are used as experimental data for experimental analysis, which verified the defecation intention prediction method proposed in this paper. The result shows that the proposed method can achieve a defecation prediction accuracy of 92.86%. Compared with the traditional classification method, the proposed method achieves better recognition results, and has more stable performance and higher reliability.
The innovations and contributions of this paper mainly include:
(1)
The possibility of defecation prediction based on BS signals is proposed, and the correlation between BS signals and defecation intention is verified through experiments, which provide a new idea of defecation prediction;
(2)
A BS monitoring system is established, and data were collected in Beijing Bo’ai Hospital affiliated to the China Rehabilitation Research Center, and a BS dataset for defecation prediction is established;
(3)
Based on multi-domain features and GMO-SVM, we propose a new, cost-effective, and non-invasive method for human defecation prediction, which is an innovative application of machine learning in the field of healthcare.
Although the study in this paper has achieved good classification results, there are some aspects that need to be improved. On the one hand, the number of volunteers and the sample size are not sufficient, while more and higher quality data are necessary for future research. On the other hand, due to the noise in the data collection process, a further research of the noise reduction algorithm is needed.
To solve the above problems, the next research can start with the acquisition of more available BS data and data augmentation. At the same time, optimizing the noise reduction algorithm for the BS signals is also needed. Currently, research based on data augmentation is a popular direction.

Author Contributions

Conceptualization, L.L.; methodology, L.L., Y.K. and T.Z.; software, L.L. and Y.K.; validation, L.L., Y.K. and T.Z.; formal analysis, L.L., Y.K. and T.Z.; investigation, Y.K., J.Z. and Z.H.; resources, L.L. and T.Z.; data curation, T.Z., Z.H., J.Z., L.L. and Y.K.; writing—original draft preparation, L.L., T.Z. and J.Z.; writing—review and editing, L.L., Y.K. and Z.H.; supervision, T.Z. and L.L.; project administration, T.Z. and L.L.; funding acquisition, T.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Major Science and Technology Project of China [grant numbers 2020YFC2007600].

Institutional Review Board Statement

The study was approved by the Medical Ethics Committee of China Rehabilitation Research Center (protocol code 2020-085-1 and date of approval 2020.06) for studies involving humans.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

We created a dataset for this study. Since further research is in progress, we cannot publish the dataset right now.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cudjoe, T.K.M.; Roth, D.L.; Szanton, S.L.; Wolff, J.L.; Thorpe, R.J. The Epidemiology of Social Isolation: National Health & Aging Trends Study. J. Gerontol. Ser. B 2020, 75, 107–113. [Google Scholar]
  2. Musa, M.K.; Saga, S.; Blekken, L.E.; Harris, R.; Norton, C. The Prevalence, Incidence, and Correlates of Fecal Incontinence Among Older People Residing in Care Homes: A Systematic Review. J. Am. Med. Dir. Assoc. 2019, 20, 956–962. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Mugita, Y.; Koudounas, S.; Nakagami, G.; Weller, C.; Sanada, H. Assessing absorbent products’ effectiveness for the prevention and management of incontinence-associated dermatitis caused by urinary, faecal or double adult incontinence: A systematic review. J. Tissue Viability 2021, 30, 599–607. [Google Scholar] [CrossRef] [PubMed]
  4. Zan, P.; Zhao, J.; Yang, L. Research on biomechanical compatibility for the artificial anal sphincter based on rectal perception function reconstruction. IET Sci. Meas. Technol. 2015, 9, 921–927. [Google Scholar] [CrossRef]
  5. Devasahayam, S.R. Signals and Systems in Biomedical Engineering. Topics in Biomedical Engineering International Book; Springer: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  6. Faust, O.; Bairy, M.G. Nonlinear analysis of physiological signals: A review. J. Mech. Med. Biol. 2012, 12, 1240015. [Google Scholar] [CrossRef]
  7. Merletti, R.; Botter, A.; Cescon, C.; Minetto, M.A.; Vieira, T. Advances in surface EMG: Recent progress in clinical research applications. Crit. Rev. Biomed. Eng. 2010, 38, 347–379. [Google Scholar] [CrossRef]
  8. Zhai, X.; Beth, J.; Chan, R.; Chung, T. Self-Recalibrating Surface EMG Pattern Recognition for Neuroprosthesis Control Based on Convolutional Neural Network. Front. Neurosci. 2017, 11, 379. [Google Scholar] [CrossRef] [Green Version]
  9. Park, K.H.; Lee, S.W. Movement intention decoding based on deep learning for multiuser myoelectric interfaces. In Proceedings of the International Winter Conference on Brain-Computer Interface, Gangwon, Korea, 22–24 February 2016; pp. 1–2. [Google Scholar]
  10. Fraiwan, L.; Lweesy, K. Neonatal sleep state identification using deep learning autoencoders. In Proceedings of the 2017 IEEE 13th International Colloquium on Signal Processing & Its Applications (CSPA), Penang, Malaysia, 10–12 March 2017. [Google Scholar]
  11. Jang, S.W.; Lee, S.H. Detection of Epileptic Seizures Using Wavelet Transform, Peak Extraction and PSR from EEG Signals. Symmetry 2020, 12, 1239. [Google Scholar] [CrossRef]
  12. Suwicha, J.; Setha, P.N.; Pasin, I. EEG-Based Emotion Recognition Using Deep Learning Network with Principal Component Based Covariate Shift Adaptation. Sci. World J. 2014, 2014, 627892. [Google Scholar]
  13. Li, D.; Wu, H.; Zhao, J.; Tao, Y.; Fu, J. Automatic Classification System of Arrhythmias Using 12-Lead ECGs with a Deep Neural Network Based on an Attention Mechanism. Symmetry 2020, 12, 1827. [Google Scholar] [CrossRef]
  14. Saini, I.; Singh, D.; Khosla, A. QRS detection using K-Nearest Neighbor algorithm (KNN) and evaluation on standard ECG databases. J. Adv. Res. 2013, 4, 331–344. [Google Scholar] [CrossRef] [Green Version]
  15. Barea, R.; Boquete, L.; Mazo, M.; Lopez, E. System for assisted mobility using eye movements based on electrooculography. IEEE Trans. Nural Syst. Rehabil. Eng. 2002, 10, 209–218. [Google Scholar] [CrossRef]
  16. Xia, B.; Li, Q.; Jie, J.; Wang, J.; Chaudhary, U.; Ramos-Murguialday, A.; Birbaumer, N. Electrooculogram based sleep stage classification using deep belief network. In Proceedings of the International Joint Conference on Neural Networks, Killarney, Ireland, 12–16 July 2015. [Google Scholar]
  17. Furness, J.B.; Callaghan, B.P.; Rivera, L.R.; Cho, H.J. The Enteric Nervous System and Gastrointestinal Innervatloe: Integrated Local, and Central Control. Adv. Exp. Med. Biol. 2014, 817, 39–71. [Google Scholar]
  18. Madsen, D.; Sebolt, T.; Cullen, L.; Folkedahl, B.; Mueller, T.; Richardson, C.; Titler, M. Listening to bowel sounds: An evidence-based practice project: Nurses find that a traditional practice isn’t the best indicator of returning gastrointestinal motility in patients who’ve undergone abdominal surgery. AJN Am. J. Nurs. 2005, 105, 40–49. [Google Scholar] [CrossRef]
  19. Baid, H. A critical review of auscultating bowel sounds. Br. J. Nurs. 2009, 18, 1125–1129. [Google Scholar] [CrossRef]
  20. Yang, P.J.; LaMarca, M.; Kaminski, C.; Chu, D.I.; Hu, D.L. Hydrodynamics of defecation. Soft Matter 2017, 13, 4960–4970. [Google Scholar] [CrossRef]
  21. Acharya, U.R.; Faust, O.; Sree, S.V.; Ghista, D.N.; Dua, S.; Joseph, P.; Ahamed, V.I.T.; Janarthanan, N.; Tamura, T. An integrated diabetic index using heart rate variability signal features for diagnosis of diabetes. Comput. Methods Biomech. Biomed. Eng. 2013, 16, 222–234. [Google Scholar] [CrossRef]
  22. Rekanos, I.; Hadjileontiadis, L. An iterative kurtosis-based technique for the detection of nonstationary bioacoustic signals. Signal Process. 2006, 86, 3787–3795. [Google Scholar] [CrossRef]
  23. Dimoulas, C.; Kalliris, G.; Papanikolaou, G.; Petridis, V.; Kalampakas, A. Bowel-sound pattern analysis using wavelets and neural networks with application to long-term, unsupervised, gastrointestinal motility monitoring. Expert Syst. Appl. 2008, 34, 26–41. [Google Scholar] [CrossRef]
  24. Kim, K.S.; Seo, J.H.; Song, C.G. Non-invasive algorithm for bowel motility estimation using a back-propagation neural network model of bowel sounds. Biomed. Eng. Online 2011, 10, 69. [Google Scholar] [CrossRef] [Green Version]
  25. Ulusar, U.D. Recovery of gastrointestinal tract motility detection using Naive Bayesian and minimum statistics. Comput. Biol. Med. 2014, 51, 223–228. [Google Scholar] [CrossRef]
  26. Liu, J.; Yue, Y.; Jiang, H.; Kan, H.; Wang, Z. Bowel Sound Detection Based on MFCC Feature and LSTM Neural Network. In Proceedings of the 2018 IEEE Biomedical Circuits and Systems Conference (BioCAS), Cleveland, OH, USA, 17–19 October 2018. [Google Scholar]
  27. Yin, Y.; Jiang, H.; Feng, S.; Liu, J.; Chen, P.; Zhu, B.; Wang, Z. Bowel sound recognition using SVM classification in a wearable health monitoring system. Sci. China Inf. Sci. 2018, 61, 1–3. [Google Scholar] [CrossRef]
  28. Du, X.; Allwood, G.; Webberley, K.M.; Inderjeeth, A.J.; Osseiran, A.; Marshall, B.J. Noninvasive Diagnosis of Irritable Bowel Syndrome via Bowel Sound Features: Proof of Concept. Clin. Transl. Gastroenterol. 2019, 10, e00017. [Google Scholar] [CrossRef]
  29. Du, X.; Allwood, G.; Webberley, K.M.; Osseiran, A.; Wan, W.; Volikova, A.; Marshall, B.J. A mathematical model of bowel sound generation. J. Acoust. Soc. Am. 2018, 144, EL485–EL491. [Google Scholar] [CrossRef]
  30. Zhao, C.; Feng, Z. Application of multi-domain sparse features for fault identification of planetary gearbox. Measurement 2017, 104, 169–179. [Google Scholar] [CrossRef]
  31. Li, C.; Sanchez, R.V.; Zurita, G.; Cerrada, M.; Vásquez, R. Multimodal deep support vector classification with homologous features and its application to gearbox fault diagnosis. Neurocomputing 2015, 168, 119–127. [Google Scholar] [CrossRef]
  32. Klle, K.; Aftab, M.F.; Andersson, L.E.; Fougner, A.L.; Stavdahl, Y. Data driven filtering of bowel sounds using multivariate empirical mode decomposition. BioMed. Eng. OnLine 2019, 18, 28. [Google Scholar] [CrossRef] [Green Version]
  33. Lu, O.U. Rolling Bearing Fault Diagnosis Based on Supervised Laplaian Score and Principal Component Analysis. J. Mech. Eng. 2014, 50, 88. [Google Scholar]
  34. Liu, Z.; Cao, H.; Chen, X.; He, Z.; Shen, Z. Multi-fault classification based on wavelet SVM with PSO algorithm to analyze vibration signals from rolling element bearings. Neurocomputing 2013, 99, 399–410. [Google Scholar] [CrossRef]
  35. Goksu, H. BCI oriented EEG analysis using log energy entropy of wavelet packets. Biomed. Signal Process. Control 2018, 44, 101–109. [Google Scholar] [CrossRef]
  36. Cao, Y.; Sun, Y.; Xie, G.; Wen, T. Fault Diagnosis of Train Plug Door Based on a Hybrid Criterion for IMFs Selection and Fractional Wavelet Package Energy Entropy. IEEE Trans. Veh. Technol. 2019, 68, 7544–7551. [Google Scholar] [CrossRef]
  37. Rodriguez, N.; Alvarez, P.; Barba, L.; Cabrera-Guerrero, G. Combining Multi-Scale Wavelet Entropy and Kernelized Classification for Bearing Multi-Fault Diagnosis. Entropy 2019, 21, 152. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Rodriguez, N.; Cabrera, G.; Lagos, C.; Cabrera, E. Stationary Wavelet Singular Entropy and Kernel Extreme Learning for Bearing Multi-Fault Diagnosis. Entropy 2017, 19, 541. [Google Scholar] [CrossRef] [Green Version]
  39. Bishop, C.M. Neural Networks for Pattern Recognition. Agric. Eng. Int. Cigr J. Sci. Res. Dev. Manuscr. Pm 1995, 12, 1235–1242. [Google Scholar]
  40. Rose, C.; Parker, A.; Jefferson, B.; Cartmell, E. The Characterization of Feces and Urine: A Review of the Literature to Inform Advanced Treatment Technology. Crit. Rev. Environ. Sci. Technol. 2015, 45, 1827–1879. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Burges, C. A Tutorial on Support Vector Machines for Pattern Recognition. Data Min. Knowl. Discov. 1998, 2, 121–167. [Google Scholar] [CrossRef]
  42. Cai, J.; Chen, W.; Yin, Z. Multiple Transferable Recursive Feature Elimination Technique for Emotion Recognition Based on EEG Signals. Symmetry 2019, 11, 683. [Google Scholar] [CrossRef] [Green Version]
  43. Wang, L.; Liu, Y.; Li, T.; Xie, X.; Chang, C. The Short-Term Forecasting of Asymmetry Photovoltaic Power Based on the Feature Extraction of PV Power and SVM Algorithm. Symmetry 2020, 12, 1777. [Google Scholar] [CrossRef]
  44. Dong, Z.; Zheng, J.; Huang, S.; Pan, H.; Liu, Q. Time-Shift Multi-scale Weighted Permutation Entropy and GWO-SVM Based Fault Diagnosis Approach for Rolling Bearing. Entropy 2019, 21, 621. [Google Scholar] [CrossRef] [Green Version]
  45. Bruzzone, L.; Chi, M.; Marconcini, M. A Novel Transductive SVM for Semisupervised Classification of Remote-Sensing Images. IEEE Trans. Geosci. Remote Sens. 2006, 44, 3363–3373. [Google Scholar] [CrossRef] [Green Version]
  46. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef] [Green Version]
  47. Markaki, M.; Germanakis, I.; Stylianou, Y. Automatic classification of systolic heart murmurs. In Proceedings of the IEEE International Conference on Acoustics, Vancouver, BC, Canada, 26–31 May 2013. [Google Scholar]
  48. Wong, P.K.; Yang, Z.; Chi, M.V.; Zhong, J. Real-time fault diagnosis for gas turbine generator systems using extreme learning machine. Neurocomputing 2014, 128, 249–257. [Google Scholar] [CrossRef]
Figure 1. Research framework.
Figure 1. Research framework.
Symmetry 14 01763 g001
Figure 2. BS collection system.
Figure 2. BS collection system.
Symmetry 14 01763 g002
Figure 3. Schematic diagram of BS data collection.
Figure 3. Schematic diagram of BS data collection.
Symmetry 14 01763 g003
Figure 4. Comparison of before and after BS filtering.
Figure 4. Comparison of before and after BS filtering.
Symmetry 14 01763 g004
Figure 5. (a) BS signal without defecation intention. (b) BS signal with defecation intention.
Figure 5. (a) BS signal without defecation intention. (b) BS signal with defecation intention.
Symmetry 14 01763 g005
Figure 6. The j-layer wavelet packet decomposition tree.
Figure 6. The j-layer wavelet packet decomposition tree.
Symmetry 14 01763 g006
Figure 7. The optimal hyperplane for binary classification.
Figure 7. The optimal hyperplane for binary classification.
Symmetry 14 01763 g007
Figure 8. Gray wolf looking for and attacking prey.
Figure 8. Gray wolf looking for and attacking prey.
Symmetry 14 01763 g008
Figure 9. Gray wolf location update diagram.
Figure 9. Gray wolf location update diagram.
Symmetry 14 01763 g009
Figure 10. The flow chart of GWO-SVM.
Figure 10. The flow chart of GWO-SVM.
Symmetry 14 01763 g010
Figure 11. Classification accuracy under different feature dimensions.
Figure 11. Classification accuracy under different feature dimensions.
Symmetry 14 01763 g011
Figure 12. The optimization curve of GWO algorithm.
Figure 12. The optimization curve of GWO algorithm.
Symmetry 14 01763 g012
Figure 13. Five test results of using multi-domain features for different classifiers.
Figure 13. Five test results of using multi-domain features for different classifiers.
Symmetry 14 01763 g013
Table 1. Time domain feature expressions.
Table 1. Time domain feature expressions.
Feature NameFeature ExpressionFeature NameFeature Expression
mean value T 1 = 1 N n = 1 N x n Minimum Value T 9 = min x n
standard deviation T 2 = 1 N 1 n = 1 N [ x n T 1 ] 2 peak-to-peak value T 10 = T 8 T 9
square root amplitude T 3 = ( 1 N n = 1 N x n ) 2 waveform index T 11 = T 2 T 4
absolute mean value T 4 = 1 N n = 1 N x n peak index T 12 = T 8 T 2
skewness T 5 = 1 N n = 1 N x n 3 pulse index T 13 = T 8 T 4
kurtosis T 6 = 1 N n = 1 N x n 4 margin index T 14 = T 8 T 3
variance T 7 = 1 N n = 1 N x n 2 skewness index T 15 = T 5 ( T 7 ) 3
maximum value T 8 = max x n kurtosis index T 16 = T 6 ( T 7 ) 2
Table 2. Frequency domain feature expressions.
Table 2. Frequency domain feature expressions.
NumberFeature ExpressionNumberFeature Expression
1 F 1 = k = 1 K y k K 8 F 8 = k = 1 K ( f k 4 y k ) k = 1 K ( f k 2 y k )
2 F 2 = k = 1 K [ y k F 1 ] 2 K 1 9 F 9 = k = 1 K ( f k 2 y k ) [ k = 1 K ( f k 4 y k ) ] [ k = 1 K y k ]
3 F 3 = k = 1 K [ y k F 1 ] 3 K ( F 2 ) 3 10 F 10 = F 6 F 5
4 F 4 = k = 1 K [ y k F 1 ] 4 K ( F 2 ) 2 11 F 11 = k = 1 K [ ( f k F 5 ) 3 y k ] K ( F 6 ) 3
5 F 5 = k = 1 K ( f k y k ) k = 1 K y k 12 F 12 = k = 1 K [ ( f k F 5 ) 4 y k ] K ( F 6 ) 4
6 F 6 = k = 1 K [ ( f k F 5 ) 2 y k ] K 13 F 13 = k = 1 K [ f k F 5 y k ] K F 6
7 F 7 = k = 1 K ( f k 2 y k ) k = 1 K y k
Table 3. The definitions of the involved parameters in Table 2.
Table 3. The definitions of the involved parameters in Table 2.
Parameter NameDefine
KThe number of spectral lines
y k Frequency spectrum obtained by using FFT
f k The frequency value of the k-th spectral line
Table 4. Classification accuracy of SVM with different parameters.
Table 4. Classification accuracy of SVM with different parameters.
Parameter ValueAccuracy
c = 10/g = 1087.14%
c = 10/g = 490.00%
c= 42.1/g = 1.2487.14%
c = 53.2/g = 2.682.86%
c = 28.6/g = 0.8890.00%
c = 100/g = 0.0191.43%
c = 306.7/g = 0.0592.86%
Table 5. Classification accuracy of different classifiers with different domain features.
Table 5. Classification accuracy of different classifiers with different domain features.
Different
Classifiers
The Testing Accuracy Obtained Using
Classification Method with Different Features (%)
Average
Accuracy (%)
Multi-Domain
Features
Time Domain
Features
Frequency
Domain Features
Time-Frequency
Domain Features
GWO-SVM92.86%82.86%81.43%85.71%85.72%
SVM87.14%75.71%72.86%80.00%78.93%
NB82.86%72.86%68.57%75.71%75.00%
KNN84.28%71.43%70.00%72.86%74.64%
LR85.71%72.86%74.29%78.57%77.86%
Average
accuracy (%)
86.57%75.14%73.43%78.57%
Table 6. Classification accuracy of different classifiers with different domain feature combinations.
Table 6. Classification accuracy of different classifiers with different domain feature combinations.
Different
Classifiers
The Testing Accuracy Obtained Using
Classification Method with Different Features Combinations (%)
Average
Accuracy (%)
Multi-Domain
Features
Time and Frequency
Domain Features
Time and
Time-Frequency
Domain Features
Frequency and
Time-Frequency
Domain Features
GWO-SVM92.86%91.43%87.14%90.00%90.36%
SVM87.14%88.57%85.71%82.86%86.07%
NB82.86%75.71%80.00%81.43%80.00%
KNN84.28%77.14%81.42%74.29%79.28%
LR85.71%77.14%84.28%82.86%82.50%
Average
accuracy(%)
86.57%82.00%83.71%82.29%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, L.; Ke, Y.; Zhang, T.; Zhao, J.; Huang, Z. A Human Defecation Prediction Method Based on Multi-Domain Features and Improved Support Vector Machine. Symmetry 2022, 14, 1763. https://doi.org/10.3390/sym14091763

AMA Style

Li L, Ke Y, Zhang T, Zhao J, Huang Z. A Human Defecation Prediction Method Based on Multi-Domain Features and Improved Support Vector Machine. Symmetry. 2022; 14(9):1763. https://doi.org/10.3390/sym14091763

Chicago/Turabian Style

Li, Lin, Yuwei Ke, Tie Zhang, Jun Zhao, and Zequan Huang. 2022. "A Human Defecation Prediction Method Based on Multi-Domain Features and Improved Support Vector Machine" Symmetry 14, no. 9: 1763. https://doi.org/10.3390/sym14091763

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop