Article

A Novel Data-Driven Fault Detection Method Based on Stable Kernel Representation for Dynamic Systems

1
Department of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
2
Changchun Changguang Yuanchen Microelectronic Technology Co., Ltd., Changchun 130000, China
3
Department of Aeronautics and Astronautics, Stanford University, Stanford, CA 94305, USA
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(13), 5891; https://doi.org/10.3390/s23135891
Submission received: 28 May 2023 / Revised: 20 June 2023 / Accepted: 23 June 2023 / Published: 25 June 2023

Abstract

With the steady improvement of advanced manufacturing processes and big data technologies, modern industrial systems have become large-scale. To enhance the sensitivity of fault detection (FD) and overcome the drawbacks of the centralized FD framework, a new data-driven FD method based on the Hellinger distance and subspace techniques is proposed for dynamic systems. Specifically, the proposed approach uses only system input/output data collected via sensor networks, and the distributed residual signals are generated directly through the stable kernel representation of the process. Based on this, each sensor node obtains an identical residual signal and test statistic through the average consensus algorithm. In addition, the Hellinger distance is integrated into the residual signal analysis to improve the FD performance. Finally, the effectiveness and accuracy of the proposed method are verified on a real multiphase flow facility.

1. Introduction

As a result of intelligence and informatization, modern industrial processes have evolved to become more complicated. Any abnormal behavior of equipment components may affect productivity or even cause accidents. To guarantee the reliability and stability of industrial systems, fault detection (FD) plays a fundamental role and has received intensive attention from scholars and engineers [1,2,3].
Currently, FD methods are commonly classified as model-based techniques and data-driven techniques [4]. In the framework of model-based FD, it is necessary to obtain a precise mathematical representation of the system, and the accuracy of the FD results depends on the accuracy of the modeling. According to the way the signals are generated, existing model-based FD methods can be divided into three distinct groups, i.e., parameter estimation techniques, observer-based techniques, and the subspace-based strategy [5]. In practical applications, such methods can successfully implement FD schemes when accurate mathematical models are available. Unfortunately, with the increasing size of modern engineering systems, modeling systems from first principles becomes increasingly challenging.
With the accelerated advancement of sensor networks and data processing methods, data-driven FD strategies have naturally become a critical research topic and are well developed [6,7,8,9,10]. Data-driven FD techniques are typically sorted into multivariate statistics, neural networks, and subspace-technique-aided schemes, among others [11]. Specifically, traditional multivariate statistical methods [12,13,14], e.g., canonical correlation analysis (CCA), have been extensively studied and applied to modern industrial systems. The core of multivariate statistical methods is to analyze the correlation among process variables and then construct appropriate test statistics for FD tasks. This group of methods can solve the FD problem well in static processes. However, such traditional FD strategies do not usually consider dynamic changes in the systems, so they are usually unable to perform FD tasks in dynamic systems. In the past few years, with increasing attention to statistical learning, neural network-based FD methods have been rapidly developed [15,16,17,18]. This group of methods usually uses historical data generated by the production process to train the neural network model. The model then classifies the test data to determine whether faults have occurred. Due to the excellent fitting ability of neural networks, neural network-based methods perform well in dealing with FD problems for nonlinear systems. However, the training phase of neural networks requires abundant labeled data, which limits their use in practical applications [19]. In recent years, the subspace-technique-aided FD method has been widely studied because of its simple design and lower computational effort [20,21,22,23,24,25]. The core idea behind it is to identify the system parameters from the collected data. Because it takes into account the dynamic behavior of the systems, the subspace-aided FD method performs very well in dealing with process dynamics.
In the subspace technique framework, Reference [23] proposes a stable kernel representation (SKR) of the systems. Remarkably, the proposed SKR scheme can directly construct residual generators using process data without identifying complex system models. Based on this, many data-driven FD algorithms are designed under the SKR framework.
At present, the majority of data-driven FD methods have been designed in a centralized framework, which involves collecting all process data in a central location to carry out the necessary computations and FD tasks. With the growth of industrial process size, the centralized design becomes increasingly demanding in terms of memory and computing power, resulting in poor flexibility and high cost. Therefore, there is growing research interest in distributed data-driven FD [26]. For example, ref. [27] evaluated the effectiveness of multiblock multivariate statistics-based approaches, such as PCA and PLS, for decentralized monitoring and assessed the individual contributions of each block. Although the above approaches can implement distributed FD for each sub-block, they do not take into account the connection among the sub-blocks. On this basis, ref. [28] designed a multiblock PCA strategy in which information interaction among sub-blocks is considered. Similarly, considering the connection among neighboring nodes, ref. [29] designed a distributed CCA algorithm to achieve plant-wide process monitoring. The core idea behind it was to reduce uncertainty through information interaction among neighboring nodes. However, when there are few relevant variables in the historical data, the distributed FD results obtained with this method are usually unreliable. In order to address this defect, ref. [30] designed a distributed, regularized CCA-based process monitoring algorithm. First, the traditional CCA algorithm is executed between the local and the neighboring nodes. In order to eliminate uncorrelated variables from the monitoring data, a GA-based regularization algorithm is then embedded in the traditional CCA technique. Finally, according to the local monitoring results, the corresponding residual signals and test statistics can be generated at the local node.
In terms of technical systems, the multiphase flow facility used in this study is a device to achieve the separation of water, oil, and air at a given flow rate. The test zone, consisting of splitters and supply lines, can provide a mixture of water, oil, and air. The technical system is typically used to verify the effectiveness and accuracy of process monitoring and FD algorithms, such as the latent-variable-analysis-based FD method [31], the Kalman-filter-based FD method [32], and the multivariate-statistics-based process monitoring method [33]. In addition, the technical system used is described in detail in [34,35].
In general, the data-driven distributed FD methods above usually ignore the dynamic changes in the process. Therefore, these data-driven distributed methods have some limitations in dealing with FD problems in dynamic processes. In addition, there are abundant process data in dynamic systems, and the relationship among process variables is complicated. The abundant process data and strong coupling among variables bring new challenges to existing distributed data-driven FD solutions.
Motivated by the aforementioned points, a new data-driven distributed FD strategy is developed for dynamic systems. Compared with previous research, the key contributions of the developed FD solution are as follows:
  • Compared with traditional SKR-based FD approaches, the proposed method is more sensitive to fault information by introducing the Hellinger distance (HD) in the residual signal.
  • The consensus algorithm is embedded in the information interaction among sensor blocks. Therefore, each sensor block can obtain FD results without global fusion operations, thus remarkably improving FD efficiency.
  • It has superior flexibility in the design of FD framework, particularly when the system models are not accurately obtained.
The remainder of this paper is organized as follows. Section 2 provides information about the system descriptions, the Hellinger distance, and the average consensus algorithm. In Section 3, a new distributed FD scheme for dynamic systems is presented. The effectiveness of the proposed FD algorithm is then demonstrated on a multiphase flow facility in Section 4. Finally, Section 5 presents the conclusion and prospects for future work.

2. Preliminaries

2.1. System Descriptions

Given an LTI system $H(z)$ with input vector $u \in \mathbb{R}^{u}$ and output vector $y \in \mathbb{R}^{y}$, the input–output (I/O) behavior is characterized as
$$y(z) = H(z)\,u(z) \tag{1}$$
where $z$ denotes the complex variable of the z-transform. In order to analyze the relationship among variables in dynamic systems, the state space model is used in this study. It not only reflects dynamic behavior in the process data but also provides a concise description, which is usually expressed in a standard form, as follows:
$$x(k+1) = \begin{bmatrix} A & B \end{bmatrix} \begin{bmatrix} x(k) \\ u(k) \end{bmatrix} \tag{2}$$
$$y(k) = \begin{bmatrix} C & D \end{bmatrix} \begin{bmatrix} x(k) \\ u(k) \end{bmatrix} \tag{3}$$
where $A$, $B$, $C$, and $D$ are the system matrices; $u(k) \in \mathbb{R}^{u}$, $y(k) \in \mathbb{R}^{y}$, and $x(k) \in \mathbb{R}^{x}$ refer to the system input, output, and state variables, respectively.
A sensor network consisting of $k_t$ blocks has been integrated into the considered system, as shown in Figure 1. In the sensor network, the network topology $\mathcal{G}$ can be represented using the “node” set $\mathcal{J}$ and the “edge” set $\mathcal{K}$ as
$$\mathcal{G} = (\mathcal{J}, \mathcal{K}) \tag{4}$$
where $\mathcal{J} = \{1, \ldots, k_t\}$ depicts the set of sensor blocks; $\mathcal{K} \subseteq \mathcal{J} \times \mathcal{J}$ depicts the set of edges.
In order to obtain the set of adjacent blocks, $\mathcal{J}$ in (4) is further sketched as
$$\mathcal{J}_i = \{\, j \in \mathcal{J} \mid (i, j) \in \mathcal{K} \,\} \tag{5}$$
where $\mathcal{J}_i$ depicts all adjacent blocks of the $i$-th block.
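The neighbor-set definition above can be illustrated with a few lines of code. This is a minimal sketch, not part of the original method: the function name `neighbor_sets`, the assumption that edges are undirected, and the four-block line topology are all hypothetical choices made for illustration.

```python
def neighbor_sets(num_blocks, edges):
    """Return {i: set of blocks adjacent to block i}.

    Assumption: edges are undirected, so (i, j) also makes i a neighbor of j.
    Blocks are numbered 1..num_blocks, as in the text.
    """
    neighbors = {i: set() for i in range(1, num_blocks + 1)}
    for i, j in edges:
        if i == j:
            continue  # a block is not listed as its own neighbor
        neighbors[i].add(j)
        neighbors[j].add(i)
    return neighbors


# Hypothetical line topology with four sensor blocks: 1 - 2 - 3 - 4
edges = [(1, 2), (2, 3), (3, 4)]
J = neighbor_sets(4, edges)
```

Here `J[2]` holds the blocks that exchange data with block 2, which is the only information each block needs in the consensus step.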

2.2. Hellinger Distance

The Hellinger distance (HD), which is closely related to the Bhattacharyya coefficient, is a type of f-divergence [36]. An f-divergence is a function that measures the difference between two probability distributions, and the HD evaluates how closely two distributions resemble each other. Supposing that $n(x)$ and $m(x)$ denote two probability density functions (PDFs) whose underlying distributions are unknown in the general definition, the HD between $n(x)$ and $m(x)$ is defined as
$$H(n, m) = \sqrt{\frac{1}{2}\int_{-\infty}^{+\infty}\left(\sqrt{n(x)} - \sqrt{m(x)}\right)^{2} dx} \tag{6}$$
which can also be expressed in terms of the Euclidean norm
$$H(n, m) = \frac{1}{\sqrt{2}}\left\| \sqrt{n} - \sqrt{m} \right\|_{2} \tag{7}$$
Based on the Cauchy inequality, HD is a symmetric bounded metric which satisfies $0 \le H(n, m) \le 1$ and $H(n, m) = H(m, n)$. In addition, according to the Lebesgue theorem, the squared form of HD in (6) is characterized as
$$H^{2}(n, m) = 1 - \int_{-\infty}^{+\infty}\sqrt{n(x)\,m(x)}\,dx \tag{8}$$
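The step from the definition of the HD to its squared form can be made explicit. The short derivation below is a sketch that assumes only that both PDFs integrate to one:

```latex
\begin{aligned}
H^{2}(n, m)
  &= \frac{1}{2}\int_{-\infty}^{+\infty}\left(\sqrt{n(x)} - \sqrt{m(x)}\right)^{2} dx \\
  &= \frac{1}{2}\left(\int_{-\infty}^{+\infty} n(x)\,dx + \int_{-\infty}^{+\infty} m(x)\,dx
     - 2\int_{-\infty}^{+\infty}\sqrt{n(x)\,m(x)}\,dx\right) \\
  &= 1 - \int_{-\infty}^{+\infty}\sqrt{n(x)\,m(x)}\,dx
\end{aligned}
```

The remaining integral is the Bhattacharyya coefficient. It lies between 0 and 1, which is why the HD is bounded by the same interval.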
In order to perform the FD task, Lemma 1 gives a concise representation of (8), which serves as the basis of the proposed approach.
Lemma 1.
Consider two PDFs $n(x) \sim \mathcal{N}(\mu_n, \sigma_n^{2})$ and $m(x) \sim \mathcal{N}(\mu_m, \sigma_m^{2})$. Then $H^{2}(n, m)$ is further represented as follows:
$$H^{2}(n, m) = 1 - \sqrt{\frac{2\sigma_n \sigma_m}{\sigma_n^{2} + \sigma_m^{2}}}\,\exp\left(-\frac{(\mu_n - \mu_m)^{2}}{4(\sigma_n^{2} + \sigma_m^{2})}\right) \tag{9}$$
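The closed form in Lemma 1 can be checked numerically. The sketch below compares it with a direct midpoint-rule evaluation of the integral form for two univariate Gaussians. The function names, the integration range, and the step count are illustrative assumptions.

```python
import math


def hellinger_sq_gauss(mu_n, sigma_n, mu_m, sigma_m):
    """Squared Hellinger distance between two univariate Gaussians (closed form)."""
    var_sum = sigma_n ** 2 + sigma_m ** 2
    scale = math.sqrt(2.0 * sigma_n * sigma_m / var_sum)
    return 1.0 - scale * math.exp(-((mu_n - mu_m) ** 2) / (4.0 * var_sum))


def hellinger_sq_numeric(mu_n, sigma_n, mu_m, sigma_m, lo=-30.0, hi=30.0, steps=60000):
    """1 - integral of sqrt(n(x) m(x)) dx, evaluated with the midpoint rule."""
    def pdf(x, mu, sigma):
        return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

    dx = (hi - lo) / steps
    overlap = 0.0
    for k in range(steps):
        x = lo + (k + 0.5) * dx
        overlap += math.sqrt(pdf(x, mu_n, sigma_n) * pdf(x, mu_m, sigma_m)) * dx
    return 1.0 - overlap


closed = hellinger_sq_gauss(0.0, 1.0, 1.0, 2.0)
numeric = hellinger_sq_numeric(0.0, 1.0, 1.0, 2.0)
```

Identical distributions give a distance of zero, and the distance grows as either the means or the variances move apart.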

2.3. Average Consensus Algorithm

Given a communication network consisting of $k_t$ nodes, the consensus algorithm is an iterative technique that drives the local values at all nodes to a common value. For a data vector $\mu_i$ at the $i$-th sensor block, the consensus technique can be executed as
$$\mu_i(s+1) = v_{i,i}\,\mu_i(s) + \sum_{j \in \mathcal{J}_i} v_{i,j}\,\mu_j(s), \quad i = 1, \ldots, k_t \tag{10}$$
where $\mu_i(s)$ refers to the calculated value of $\mu_i$ during the $s$-th iteration, and $v_{i,j}$ are the weighting coefficients. Many studies [37,38,39] have designed algorithms to solve the weighting problem. Among them, the Metropolis–Hastings technique not only speeds up the convergence of the iterative algorithm but also enables computation in a distributed manner. Therefore, the Metropolis–Hastings technique is used to construct the weighting factors in this paper. The weighting factors are assembled as follows:
$$v_{i,j} = \begin{cases} \dfrac{1}{\max\{j_i, j_j\} + 1}, & j \in \mathcal{J}_i \\ 0, & j \notin \mathcal{J}_i \ \text{and} \ j \neq i \\ 1 - \sum_{k \neq i} v_{i,k}, & i = j \end{cases} \tag{11}$$
where $\mathcal{J}_i$ represents all adjacent blocks of the $i$-th block, and $j_i = |\mathcal{J}_i|$ is the cardinality of $\mathcal{J}_i$. Let $V$ be
$$\begin{bmatrix} v_{1,1} & \cdots & v_{1,k_t} \\ \vdots & \ddots & \vdots \\ v_{k_t,1} & \cdots & v_{k_t,k_t} \end{bmatrix} \in \mathbb{R}^{k_t \times k_t}. \tag{12}$$
The final consensus results can be presented as
$$\lim_{s \to \infty} \begin{bmatrix} \mu_1(s) \\ \vdots \\ \mu_{k_t}(s) \end{bmatrix} = \mathbf{1}\left(\frac{1}{k_t}\sum_{i=1}^{k_t} \mu_i(0)\right) \tag{13}$$
which indicates that the consensus value of each block will converge to the average value of all sensor blocks.
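The weight construction and the consensus iteration above can be sketched as follows. This is an illustrative example under stated assumptions: scalar node values are used for brevity (vectors would be averaged element by element), the four-node line topology is hypothetical, and the function names and the fixed iteration count are not taken from the paper.

```python
def metropolis_weights(neighbors):
    """Build Metropolis-Hastings weights from {i: set of adjacent nodes}.

    Returns {i: {j: v_ij}}, including the self-weight v_ii.
    """
    weights = {}
    for i, adjacent in neighbors.items():
        row = {j: 1.0 / (max(len(adjacent), len(neighbors[j])) + 1.0) for j in adjacent}
        row[i] = 1.0 - sum(row.values())  # self-weight makes each row sum to one
        weights[i] = row
    return weights


def average_consensus(values, weights, iterations=200):
    """Run the linear consensus iteration; each node only uses its neighbors' values."""
    current = dict(values)
    for _ in range(iterations):
        current = {
            i: sum(w * current[j] for j, w in weights[i].items())
            for i in weights
        }
    return current


# Hypothetical line topology 1 - 2 - 3 - 4 with arbitrary initial values
neighbors = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
initial = {1: 1.0, 2: 2.0, 3: 3.0, 4: 6.0}
V = metropolis_weights(neighbors)
result = average_consensus(initial, V)
```

Each node ends up with the network-wide average of the initial values without any node ever collecting all the data.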

3. Methodology

In this section, the SKR of the residual generator is first introduced. Considering the probability distribution of fault information, a novel FD strategy is then presented and applied to the FD problem.

3.1. SKR

Considering the process model (1)–(3) above, the left coprime factorization (LCF) of $H(z)$ is given as follows:
$$H(z) = D + C(zI - A)^{-1}B = P^{-1}(z)\,Q(z) \tag{14}$$
where $(P(z), Q(z))$ is called the left coprime pair. A key feature of the LCF under noise-free and fault-free conditions is
$$\begin{bmatrix} -Q(z) & P(z) \end{bmatrix} \begin{bmatrix} u(z) \\ y(z) \end{bmatrix} = 0 \tag{15}$$
where $\begin{bmatrix} -Q(z) & P(z) \end{bmatrix}$ is denoted as the SKR of (2) and (3) [23]. Therefore, all LTI residual generators can be parameterized as
$$r(z) = \begin{bmatrix} -Q(z) & P(z) \end{bmatrix} \begin{bmatrix} u(z) \\ y(z) \end{bmatrix} \tag{16}$$
which is zero under noise-free and fault-free conditions.

3.2. Data-Driven Distributed Fault Detection

Given that a sensor network is integrated into the dynamic system, the state-space model of the system and the noisy sensor measurements are given by
$$x(k+1) = A x(k) + B u(k) + \sigma(k) \tag{17}$$
$$y_i(k) = \begin{cases} L_i x(k) + \tau_i(k), & \text{fault free} \\ L_i x(k) + f(k) + \tau_i(k), & \text{faulty} \end{cases} \tag{18}$$
where $x(k) \in \mathbb{R}^{k_l}$ and $u(k) \in \mathbb{R}^{k_m}$ denote the system state and the process input; $y_i(k) \in \mathbb{R}^{k_n}$ is the output vector at the $i$-th sensor sub-block; $f(k) \in \mathbb{R}^{k_f}$ denotes the unknown faults; $\sigma(k) \sim \mathcal{N}(0, \Sigma_{\sigma})$ and $\tau_i(k) \sim \mathcal{N}(0, \Sigma_{\tau_i})$ represent the process and measurement noise, respectively, both of which are assumed to be Gaussian.
Considering that the measurement data $y_i$, $i = 1, \ldots, k_t$, can be sent to a sensor block, a global model is then constructed as follows:
$$y(k) = \begin{cases} L x(k) + \tau(k), & \text{fault free} \\ L x(k) + f(k) + \tau(k), & \text{faulty} \end{cases} \tag{19}$$
where
$$y(k) = \begin{bmatrix} y_1(k) \\ \vdots \\ y_{k_t}(k) \end{bmatrix}, \quad L = \begin{bmatrix} L_1 \\ \vdots \\ L_{k_t} \end{bmatrix}, \quad \tau(k) = \begin{bmatrix} \tau_1(k) \\ \vdots \\ \tau_{k_t}(k) \end{bmatrix}.$$
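To complete the algorithm implementation of SKR, data models are indispensable in the design process [40]. For a data vector $\kappa(k) \in \mathbb{R}^{k_\kappa}$, the following data structures are defined:
$$\kappa_s(k) = \begin{bmatrix} \kappa^{T}(k) & \cdots & \kappa^{T}(k+s) \end{bmatrix}^{T} \in \mathbb{R}^{(s+1)k_\kappa} \tag{20}$$
$$\Upsilon_k = \begin{bmatrix} \kappa(k) & \cdots & \kappa(k+N-1) \end{bmatrix} \in \mathbb{R}^{k_\kappa \times N} \tag{21}$$
$$\Upsilon_{k,s} = \begin{bmatrix} \kappa_s(k) & \cdots & \kappa_s(k+N-1) \end{bmatrix} \in \mathbb{R}^{(s+1)k_\kappa \times N} \tag{22}$$
where $k$ denotes the sampling instant, and $s$ and $N$ are integers.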
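The stacked data structures above are straightforward to build from a recorded signal. The sketch below assumes the samples are stored row by row in a NumPy array of shape `(T, dim)` and uses zero-based sampling indices. The helper names `stack_window` and `block_hankel` are hypothetical.

```python
import numpy as np


def stack_window(data, k, s):
    """Stack samples k, k+1, ..., k+s of a (T, dim) array into one long vector."""
    return data[k:k + s + 1].reshape(-1)


def block_hankel(data, k, s, N):
    """Matrix whose N columns are the stacked windows starting at k, ..., k+N-1."""
    return np.column_stack([stack_window(data, j, s) for j in range(k, k + N)])


# Toy signal with 6 samples of dimension 2: rows are [0, 1], [2, 3], ..., [10, 11]
data = np.arange(12.0).reshape(6, 2)
Hk = block_hankel(data, k=0, s=2, N=3)   # shape ((s + 1) * dim, N) = (6, 3)
```

Consecutive columns overlap by all but one sample, which gives the matrix its block-Hankel structure.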
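According to the extended models of $H(z)$ in (17) and (18), a data model is derived by iterative computation at each node:
$$Y_{s,i} = F_{s,i} X_k + G_{s,i} U_{k,s} + H_{s,i} \Theta_{k,s} + \Xi_{s,i} \in \mathbb{R}^{(s+1)k_n \times N} \tag{23}$$
where
$$F_{s,i} = \begin{bmatrix} L_i \\ L_i A \\ \vdots \\ L_i A^{s} \end{bmatrix},$$
$$G_{s,i} = \begin{bmatrix} 0 & 0 & \cdots & 0 \\ L_i B & 0 & \ddots & \vdots \\ \vdots & \ddots & \ddots & 0 \\ L_i A^{s-1} B & \cdots & L_i B & 0 \end{bmatrix},$$
$$\Xi_{s,i} = \begin{bmatrix} \tau_i(k) & \cdots & \tau_i(k+N-1) \\ \vdots & \ddots & \vdots \\ \tau_i(k+s) & \cdots & \tau_i(k+s+N-1) \end{bmatrix}.$$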
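In order to remove the unobservable variable $X_k$, (23) is re-modeled as
$$\begin{bmatrix} U_{k,s} \\ Y_{s,i} \end{bmatrix} = K_{s,i} \begin{bmatrix} U_{k,s} \\ X_k \end{bmatrix} + \begin{bmatrix} 0 \\ H_{s,i} \Theta_{k,s} + \Xi_{s,i} \end{bmatrix} \tag{24}$$
where $K_{s,i} = \begin{bmatrix} I & 0 \\ G_{s,i} & F_{s,i} \end{bmatrix} \in \mathbb{R}^{(s+1)(k_n + k_m) \times (k_l + (s+1)k_m)}$. When $s$ is large enough, $K_{s,i}$ has a nontrivial left null space $K_{s,i}^{\perp}$ such that
$$K_{s,i}^{\perp} K_{s,i} = 0, \quad K_{s,i}^{\perp} \in \mathbb{R}^{((s+1)k_n - k_l) \times (s+1)(k_n + k_m)}$$
$$K_{s,i}^{\perp} \begin{bmatrix} U_{k,s} \\ Y_{s,i} \end{bmatrix} = \begin{bmatrix} -K_{s,i,u}^{\perp} & K_{s,i,y}^{\perp} \end{bmatrix} \begin{bmatrix} U_{k,s} \\ Y_{s,i} \end{bmatrix} = K_{s,i,y}^{\perp}\left(H_{s,i} \Theta_{k,s} + \Xi_{s,i}\right). \tag{25}$$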
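$K_{s,i}^{\perp}$ is called the SKR of the system. Owing to the reliability and numerical robustness of the QR algorithm, the data-driven implementation of $K_{s,i}^{\perp}$ can be carried out via QR decomposition and SVD:
$$\begin{bmatrix} T_{k-s_p,\,s_p-1,\,i} \\ U_{k,s} \\ Y_{s,i} \end{bmatrix} = \begin{bmatrix} R_{1,1} & 0 & 0 \\ R_{2,1} & R_{2,2} & 0 \\ R_{3,1} & R_{3,2} & R_{3,3} \end{bmatrix} \begin{bmatrix} Q_{1,1} \\ Q_{2,1} \\ Q_{3,1} \end{bmatrix} \tag{26}$$
$$\begin{bmatrix} R_{2,1} & R_{2,2} \\ R_{3,1} & R_{3,2} \end{bmatrix} = \begin{bmatrix} V_1 & V_2 \end{bmatrix} \begin{bmatrix} \Psi_{1,1} & 0 \\ 0 & \Psi_{2,2} \end{bmatrix} \begin{bmatrix} D_1^{T} \\ D_2^{T} \end{bmatrix} \tag{27}$$
where $s_p$ represents the number of past moments; $T_{k-s_p,\,s_p-1,\,i} = \begin{bmatrix} U_{k-s_p,\,s_p-1} \\ Y_{k-s_p,\,s_p-1,\,i} \end{bmatrix}$ is an I/O dataset of the past $s_p$ moments.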
In addition, the noise terms can be identified by the proof in [20]:
$$R_{3,3} Q_{3,1} = H_{s,i} \Theta_{k,s} + \Xi_{s,i} \tag{28}$$
Observe that $\begin{bmatrix} R_{2,1} & R_{2,2} \\ R_{3,1} & R_{3,2} \end{bmatrix}$ and $K_{s,i}$ have the same left null space. It thus holds that
$$\Psi_{2,2} \approx 0, \quad K_{s,i}^{\perp} = V_2^{T} \in \mathbb{R}^{((s+1)k_n - k_l) \times (s+1)(k_n + k_m)}. \tag{29}$$
In order to identify the residual signal, $K_{s,i,y}^{\perp}$ in (25) needs to be obtained in a data-driven manner. It has been demonstrated [40] that
$$K_{s,i,y}^{\perp} F_{s,i} = 0, \quad K_{s,i,u}^{\perp} = K_{s,i,y}^{\perp} G_{s,i} \tag{30}$$
where $K_{s,i,y}^{\perp} = F_{s,i}^{\perp}$. Therefore, the residual generator can be obtained as follows:
$$K_{s,i,y}^{\perp}\left(H_{s,i} \Theta_{k,s} + \Xi_{s,i}\right) = K_{s,i,y}^{\perp} R_{3,3} Q_{3,1} \tag{31}$$
$$r_i(k) = K_{s,i,y}^{\perp}\, y_s(k) - K_{s,i,u}^{\perp}\, u_s(k). \tag{32}$$
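The idea of a data-driven kernel representation can be illustrated on a small simulated system. The sketch below is a deliberate simplification, not the authors' procedure: it replaces the QR step with past I/O data by a plain SVD of the stacked I/O matrix, which is only adequate for noise-free data. The system matrices, the horizon `s = 3`, and all function names are hypothetical, and the state dimension is assumed to be known.

```python
import numpy as np


def simulate(A, B, C, u, x0):
    """Simulate x(k+1) = A x(k) + B u(k), y(k) = C x(k) without noise (D = 0)."""
    x, outputs = x0.copy(), []
    for uk in u:
        outputs.append(C @ x)
        x = A @ x + B @ uk
    return np.array(outputs)


def block_hankel(data, s, N):
    """Columns are the stacked windows [d(j); ...; d(j+s)] for j = 0, ..., N-1."""
    return np.column_stack([data[j:j + s + 1].reshape(-1) for j in range(N)])


def identify_skr(u, y, s, n_states):
    """Left null space of [U; Y] via SVD, split as K_perp = [-K_u, K_y]."""
    N = len(u) - s
    U, Y = block_hankel(u, s, N), block_hankel(y, s, N)
    left, _, _ = np.linalg.svd(np.vstack([U, Y]))
    null_dim = Y.shape[0] - n_states          # (s + 1) * k_n - k_l rows
    K_perp = left[:, -null_dim:].T            # directions with (near) zero singular values
    return -K_perp[:, :U.shape[0]], K_perp[:, U.shape[0]:]


def residual(K_u, K_y, u_s, y_s):
    """Residual r = K_y y_s - K_u u_s for one stacked I/O window."""
    return K_y @ y_s - K_u @ u_s


rng = np.random.default_rng(0)
A = np.array([[0.8, 0.2], [0.0, 0.5]])        # hypothetical stable system
B = np.array([[1.0], [0.5]])
C = np.array([[1.0, 0.0]])
s = 3

u_train = rng.standard_normal((200, 1))
y_train = simulate(A, B, C, u_train, np.zeros(2))
K_u, K_y = identify_skr(u_train, y_train, s, n_states=2)

# Fresh window with a different initial state, then the same window with a sensor offset
u_test = rng.standard_normal((s + 1, 1))
y_test = simulate(A, B, C, u_test, np.array([1.0, -1.0]))
r_ok = residual(K_u, K_y, u_test.reshape(-1), y_test.reshape(-1))
r_fault = residual(K_u, K_y, u_test.reshape(-1), y_test.reshape(-1) + 1.0)
```

In this noise-free setting the fault-free residual vanishes up to rounding error, while an additive sensor offset produces a clearly nonzero residual. With noisy data the projection onto past I/O data described in the text is needed before the SVD.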
Although the residual signal r i ( k ) generated by SKR has the advantages of having a simple design and a low computational effort, the robustness of its FD results often becomes weak under actual operating conditions. In order to improve the robustness of the SKR framework for FD applications, the probability distribution of the residual signal deserves further investigation. Based on the idea of HD, an approach to evaluate the similarity between two PDFs is introduced into the SKR framework.
Considering that the process noise and measurement noise above obey Gaussian distributions, the residual signal satisfies $r_i \sim \mathcal{N}(0, \Lambda_i^{2})$ in the normal (fault-free) historical dataset. In addition, for the actual fault dataset, the residual signal satisfies $\hat{r}_i \sim \mathcal{N}(\hat{\vartheta}_i, \hat{\Lambda}_i^{2})$.
Therefore, an HD metric for the residual signal $r_i$ at the $i$-th sensor block can be represented as follows:
$$h_i^{2} = 1 - \int_{-\infty}^{+\infty}\sqrt{f(r_i)\, f(\hat{r}_i)}\; dr_i \tag{33}$$
where $f(r_i)$ and $f(\hat{r}_i)$ denote the PDFs of $r_i$ and $\hat{r}_i$, respectively.
According to Lemma 1, (33) can be rewritten as
$$h_i^{2} = 1 - \left(\frac{2\Lambda_i \hat{\Lambda}_i}{\Lambda_i^{2} + \hat{\Lambda}_i^{2}}\right)^{1/2} \exp\left(-\frac{(\vartheta_i - \hat{\vartheta}_i)^{2}}{4(\Lambda_i^{2} + \hat{\Lambda}_i^{2})}\right) \tag{34}$$
where $\vartheta_i = 0$ for the fault-free residual.
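One possible way to apply the closed-form HD to a residual sequence is to estimate the mean and standard deviation over short windows and compare them with the fault-free reference. The sketch below is illustrative only: the non-overlapping window of 100 samples, the synthetic Gaussian residuals, and the mean shift used as a fault are assumptions, since the text does not specify how the sample statistics are formed.

```python
import math
import random
import statistics


def hellinger_sq(mu_a, sd_a, mu_b, sd_b):
    """Closed-form squared Hellinger distance between two univariate Gaussians."""
    var_sum = sd_a ** 2 + sd_b ** 2
    scale = math.sqrt(2.0 * sd_a * sd_b / var_sum)
    return 1.0 - scale * math.exp(-((mu_a - mu_b) ** 2) / (4.0 * var_sum))


def windowed_hd(residual, mu_ref, sd_ref, window):
    """Squared HD between the reference and each non-overlapping residual window."""
    values = []
    for start in range(0, len(residual) - window + 1, window):
        chunk = residual[start:start + window]
        values.append(
            hellinger_sq(mu_ref, sd_ref, statistics.fmean(chunk), statistics.stdev(chunk))
        )
    return values


rng = random.Random(0)
reference = [rng.gauss(0.0, 1.0) for _ in range(1000)]   # synthetic fault-free residual
mu_ref, sd_ref = statistics.fmean(reference), statistics.stdev(reference)

# Synthetic test residual: fault-free for 500 samples, then a mean shift of 1.0
test = [rng.gauss(0.0, 1.0) for _ in range(500)] + [rng.gauss(1.0, 1.0) for _ in range(500)]
hd = windowed_hd(test, mu_ref, sd_ref, window=100)
hd_before, hd_after = hd[:5], hd[5:]
```

The windows after the shift yield markedly larger distances than the fault-free windows, which is the effect the HD-based statistic relies on.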
Based on the proposed FD algorithm, the $T^{2}$ statistic at each sensor block can be expressed as
$$T_i^{2} = \left(\hat{h}_i^{2} - \widetilde{h_i^{2}}\right) \Psi^{-1} \left(\hat{h}_i^{2} - \widetilde{h_i^{2}}\right)^{T} \tag{35}$$
where $\hat{h}_i^{2} \in \mathbb{R}^{k_n}$ can be obtained by (34) from the actual fault dataset; $\widetilde{h_i^{2}} \in \mathbb{R}^{k_n}$ denotes the mean of $h_i^{2}$ over the normal historical dataset; $\Psi$ is the covariance matrix of $\hat{h}_i^{2} - \widetilde{h_i^{2}}$.
In order to implement distributed FD, each sensor node needs to evaluate an identical $T^{2}$ test statistic. For this purpose, the average consensus technique is introduced in this framework. The consensus algorithm for $\varpi_i = \hat{h}_i^{2} - \widetilde{h_i^{2}}$ is
$$\varpi_i(s+1) = v_{i,i}\,\varpi_i(s) + \sum_{j \in \mathcal{J}_i} v_{i,j}\,\varpi_j(s), \quad i = 1, \ldots, k_t \tag{36}$$
where $s$ denotes the iteration number. The initial value is $\varpi_i(0) = \varpi_i$. Furthermore, it holds that [5]
$$\lim_{s \to \infty} \varpi_i(s+1) = \bar{\varpi} = \frac{1}{k_t}\sum_{j=1}^{k_t} \varpi_j = \frac{1}{k_t}\sum_{j=1}^{k_t}\left(\hat{h}_j^{2} - \widetilde{h_j^{2}}\right) \tag{37}$$
Once the algorithm has run until convergence, the consensus result is obtained at each block:
$$\varpi_i = \frac{1}{k_t}\sum_{j=1}^{k_t}\left(\hat{h}_j^{2} - \widetilde{h_j^{2}}\right) \tag{38}$$
Based on the consensus technique above, an identical $\varpi_i$ is obtained at each node. Therefore, (35) can be rewritten as
$$T_i^{2} = \varpi_i\, \phi^{-1} \varpi_i^{T} \tag{39}$$
where $\phi$ denotes the covariance matrix of $\varpi_i$.
As a result, the $T^{2}$ statistic can be evaluated in parallel at each block. When the amount of data is sufficient, the $T^{2}$ statistic follows a chi-square distribution ($T^{2} \sim \chi_{\beta}^{2}(k_n)$). Specifically, $\chi_{\beta}$ can be determined from a $\chi^{2}$ distribution with $k_n$ degrees of freedom as follows:
$$\mathrm{prob}\{\chi > \chi_{\beta}\} = \beta, \quad \mathrm{prob}\{\chi \le \chi_{\beta}\} = 1 - \beta$$
Based on this, the fault detection threshold can be calculated as
$$J_{th} = \chi_{\beta}^{2}(k_n)$$
where $k_n$ is the dimension of the residual data, and $\beta$ is the significance level (acceptable false alarm rate).
In addition, the FD logic for each node is represented as follows:
$$\begin{cases} T_i^{2} < J_{th} \ \Longrightarrow \ \text{fault free}, \\ T_i^{2} \ge J_{th} \ \Longrightarrow \ \text{faulty}. \end{cases} \tag{41}$$
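The test statistic, the threshold, and the decision logic can be sketched together. This is a minimal illustration under stated assumptions: the covariance matrix is taken as known, the chi-square quantile is computed with a small series-plus-bisection routine so that only the standard library and NumPy are needed, and all function names are hypothetical.

```python
import math

import numpy as np


def chi2_cdf(x, dof):
    """Chi-square CDF via the series for the regularized lower incomplete gamma."""
    if x <= 0.0:
        return 0.0
    a, z = dof / 2.0, x / 2.0
    term = 1.0 / math.gamma(a + 1.0)
    total = term
    for n in range(1, 2000):
        term *= z / (a + n)
        total += term
        if term < 1e-16 * total:
            break
    return math.exp(a * math.log(z) - z) * total


def chi2_quantile(p, dof):
    """Value x with CDF(x) = p, found by bisection."""
    lo, hi = 0.0, float(dof) + 1.0
    while chi2_cdf(hi, dof) < p:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if chi2_cdf(mid, dof) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def t2_statistic(w, cov):
    """Quadratic form w cov^{-1} w^T for a consensus vector w."""
    w = np.atleast_1d(np.asarray(w, dtype=float))
    cov = np.atleast_2d(np.asarray(cov, dtype=float))
    return float(w @ np.linalg.solve(cov, w))


def detect(w, cov, beta=0.05):
    """Return (fault flag, T^2, threshold); beta is the accepted false alarm rate."""
    threshold = chi2_quantile(1.0 - beta, len(np.atleast_1d(w)))
    t2 = t2_statistic(w, cov)
    return t2 >= threshold, t2, threshold


quiet = detect([0.1, 0.1], np.eye(2))
loud = detect([3.0, 3.0], np.eye(2))
```

Where SciPy is available, `scipy.stats.chi2.ppf(1 - beta, k_n)` gives the same quantile and would normally be preferred over the hand-written routine.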
In summary, with the help of the SKR framework, Hellinger distance, and average consensus algorithm, the distributed FD scheme is summarized in Algorithms 1 and 2. In addition, the flow chart of the proposed FD algorithm is shown in Figure 2.
Algorithm 1: Off-Line Phase.
S1: Load the normal (fault-free) data;
S2: Set two indices i , j where i = 1 , , k t and j J i J ¯ i ;
S3: while $i \le k_t$ do
S4: Construct the I/O data model at each node via (23);
S5: Perform QR decomposition (26) and SVD (27);
S6: Identify the SKR $K_{s,i}^{\perp}$ via (29) and (30);
S7: Obtain the residual signals $r_i$ at each sensor node;
S8: Calculate the Hellinger distance $h_i^{2}$ for each residual signal via (34);
S9: end while
S10: Construct the weight matrix $V$ via (11).
Algorithm 2: Online Phase.
S1: Load the actual test data;
S2: while $i \le k_t$ do
S3: Obtain the residual signals $\hat{r}_i$ under the actual fault case;
S4: Calculate the Hellinger distance $\hat{h}_i^{2}$ for each residual signal via (34);
S5: end while
S6: Calculate $\bar{\varpi} = \frac{1}{k_t}\sum_{i=1}^{k_t}\left(\hat{h}_i^{2} - \widetilde{h_i^{2}}\right)$ using the average consensus algorithm (36);
S7: Obtain the identical $T^{2}$ test statistic via (39);
S8: Decide whether a fault has occurred based on the FD logic (41).
Figure 2. The flow chart of the proposed fault detection algorithm.

4. Case Study

4.1. Facility Description

In this study, a dataset obtained from a multiphase flow plant [35] is used to validate the proposed FD algorithm. The multiphase flow plant can achieve gas–liquid separation at a given flow rate. The plant covers various working conditions during operation, so it can generate abundant process data from different operating conditions. In addition, the generated process data contain dynamic behavior, which is introduced by changing the set point of the flow rate. The facility is depicted in Figure 3, and its schematic diagram is presented in Figure 4. Specifically, the device comprises geometrically designed pipes and a 1.2 m high liquid–gas splitter. It is capable of providing separate air, oil, and water, as well as mixtures of these fluids. During the operation of the plant, the mixtures are split in a horizontal splitter. The air is returned to the environment, while the water and oil are returned to their respective tanks (T100 and T200). The water coalescers ensure complete separation of oil and water before they return to their respective tanks. The flow conditions of air, water, and oil can be regulated by control valves, which can be operated continuously between fully closed and fully open. In terms of sensor distribution, there are sensors measuring pressure at the air delivery line (PT417) and inside the three-phase splitter (PT501). Other sensors are located at the water delivery line (FIC101, FT102), at the bottom of the two-phase splitter (FT406), at the top of the two-phase splitter (FT404), at the top of the three-phase splitter (PIC501), at the air delivery line (FIC302, FT302), at the top of the water tank (LI101), at the bottom of the three-phase splitter (LI502), and at the top of the water coalescer (LI503).

4.2. Fault Injection and Distributed Fault Detection

In order to gather the necessary historical data for this experiment, a SCADA platform was used at a sample rate of 1 Hz. The data parameters utilized in this validation are outlined in Table 1. The communication topology of the sensor network is represented in Figure 5.
To evaluate the effectiveness of the proposed method, the off-line part is first executed. The input flow rate of the training dataset is shown in Figure 6. Then, two typical faults are used to verify FD performance. The first fault scenario involves an incipient fault that arises due to the obstruction of the top separator input, leading to the shutdown of VC404 between 1136 s and 8352 s. The input situation of the faulty dataset is depicted in Figure 7. The residual signal of fault 1 and the HD-based $h^{2}$ statistic at node 1 are shown in Figure 8. The residual signal $r_1(k)$ in Figure 8 indicates the overall trend of the system. However, when the fault amplitude is low, the residual signal is often not effective in capturing abnormal cases: the blue curve in Figure 8 does not change significantly after the fault occurs.
To address this problem, an HD-based metric is incorporated into the proposed distributed SKR framework to enhance the sensitivity of the residual signal. Even when the fault intensity is minor, the probability distribution of the residual still changes remarkably. As a result, the sensitivity of FD is significantly increased by measuring the HD of the residual signal. The green curve in Figure 8 indicates that the HD-based $h^{2}$ statistic changes significantly after the fault occurs. The performance evaluation focuses on two key metrics: the Missed Alarm Rate (MAR) and the False Alarm Rate (FAR). In this study, the consensus algorithm is embedded in the information interaction among sensor blocks. Each node in the sensor network executes the FD algorithm through the average consensus technique. As a consensus result, each sensor node obtains an identical FD performance. The distributed FD diagrams for fault 1 are displayed in Figure 9 and Figure 10. J(2) and J(8) in Figure 9 and Figure 10 represent the distributed FD results at node 2 and node 8, respectively. In terms of performance metrics, the MAR is 0.0323 and the FAR is 0.0391 at node 2, and the same values (MAR of 0.0323 and FAR of 0.0391) are obtained at node 8.
The second fault scenario is an intermittent fault, also known as segment plugging in practical engineering terms. This type of fault commonly occurs in the riser of multiphase flow when the flow rate of liquid and gas is low. The fault was introduced by deliberately reducing the air and water flow rates to levels that induce plugging. In this dataset, two plugging conditions were introduced and eliminated during the experiment, from 686 s to 1172 s and from 1772 s to 2253 s. Specifically, the plugging fault was first formed at 686 s by continuously reducing the flow rate of air and water. The plugging fault was then removed at 1172 s when the air flow rate was gradually increased. Additionally, the plugging fault was introduced again from 1772 s to 2253 s by changing the input flow rate. The input situation and detection results for fault 2 are shown in Figure 11, Figure 12 and Figure 13, respectively. At both node 1 and node 5, the calculated MAR and FAR are 0.0610 and 0.0454, respectively.
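The two evaluation metrics can be computed from a sequence of alarm decisions and the known fault intervals. The sketch below assumes the commonly used definitions, namely that the FAR is the fraction of fault-free samples that raise an alarm and the MAR is the fraction of faulty samples that raise none. The function name and the toy data are hypothetical and unrelated to the reported results.

```python
def alarm_rates(alarms, fault_labels):
    """Return (FAR, MAR) for boolean alarm decisions and ground-truth fault labels.

    Assumed definitions:
      FAR = alarms raised on fault-free samples / number of fault-free samples
      MAR = faulty samples without an alarm     / number of faulty samples
    """
    if len(alarms) != len(fault_labels):
        raise ValueError("alarms and fault_labels must have the same length")
    n_fault = sum(1 for f in fault_labels if f)
    n_normal = len(fault_labels) - n_fault
    if n_fault == 0 or n_normal == 0:
        raise ValueError("both fault-free and faulty samples are required")
    false_alarms = sum(1 for a, f in zip(alarms, fault_labels) if a and not f)
    missed = sum(1 for a, f in zip(alarms, fault_labels) if f and not a)
    return false_alarms / n_normal, missed / n_fault


# Toy example: 4 fault-free samples followed by 4 faulty samples
alarms = [False, True, False, False, True, True, False, True]
labels = [False, False, False, False, True, True, True, True]
far, mar = alarm_rates(alarms, labels)
```

In this toy sequence one of the four fault-free samples raises an alarm and one of the four faulty samples is missed, so both rates equal 0.25.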

4.3. Comparison Results

In order to show the enhanced FD performance, Table 2 provides four sets of evaluations using MARs and FARs as assessment indicators. In Table 2, traditional SKR and dynamic principal component analysis are centralized designs, whereas the distributed CCA and the proposed scheme are distributed designs.
According to the performance indicators in Table 2, both FAR and MAR of the proposed algorithm are significantly lower than other FD algorithms. This indicates that the accuracy and effectiveness of the proposed algorithm are better than the traditional SKR algorithm and other comparison algorithms. The excellent FD performance is mainly the result of the introduction of the Hellinger distance in the SKR framework. Specifically, the Hellinger distance is first introduced into the traditional SKR framework to further analyze the fault features of the residual signal. Since the Hellinger distance can accurately measure the difference between two probability distributions, the proposed algorithm is more sensitive to the fault information of the residual signal. As a result, the proposed algorithm can capture the fault information in the residual signal more effectively, which further improves the reliability and accuracy of the FD algorithm.

5. Conclusions

This study proposes a novel distributed FD method by introducing the Hellinger distance and the average consensus algorithm into the SKR framework. The proposed algorithm has the following three main advantages over existing FD methods. First, this study introduces the Hellinger distance into the traditional SKR framework to further analyze the fault features of the residual signal. Since the Hellinger distance can accurately measure the difference between two probability distributions, the proposed algorithm is more sensitive to the fault information of the residual signal. Second, the consensus algorithm is embedded in the information interaction among sensor blocks. Based on this idea, each block can obtain FD results without performing global fusion operations. Finally, the proposed algorithm can identify noise terms and the residual signals directly from the process data. It has superior flexibility in the design of the detection framework, particularly when the system models are not accurately obtained. The accuracy and validity of the proposed FD algorithm have been verified on a multiphase flow facility. In addition, fault-tolerant control based on data-driven SKR is an open problem that can avoid the complex design of control systems. Based on this study, distributed FD with external disturbances and fault-tolerant control based on data-driven SKR will be explored in our future work.

Author Contributions

Conceptualization, Q.W. and C.C.; experiment and analysis, Q.W., B.P. and P.X.; writing–original draft preparation, Q.W.; writing–review and editing, Q.W.; visualization, C.C.; funding acquisition, C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant from the Key Programs of the Changchun City Science and Technology Bureau (No. 21GD05), and the Key Program of Education Department of Jilin Province (No. 20210201113GX).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data generated by this work are available upon reasonable request to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhao, X.P.; Shao, F.; Zhang, Y.H. A Novel Joint Adversarial Domain Adaptation Method for Rotary Machine Fault Diagnosis under Different Working Conditions. Sensors 2022, 22, 9007.
  2. Leite, D.; Martins, A.; Rativa, D. An Automated Machine Learning Approach for Real-Time Fault Detection and Diagnosis. Sensors 2022, 22, 6138.
  3. Schmidt, S.; Oberrath, J.; Mercorelli, P. A Sensor Fault Detection Scheme as a Functional Safety Feature for DC-DC Converters. Sensors 2021, 21, 6516.
  4. Chen, H.T.; Jiang, B. A Review of Fault Detection and Diagnosis for the Traction System in High-Speed Trains. IEEE Trans. Intell. Transp. Syst. 2019, 21, 450–465.
  5. Petrone, R.; Zheng, Z.; Hissel, D. A review on model-based diagnosis methodologies for PEMFCs. Int. J. Hydrogen Energy 2013, 38, 7077–7709.
  6. Chen, H.T.; Jiang, B.; Mao, Z.H. Deep PCA Based Real-time Incipient Fault Detection and Diagnosis Methodology for Electrical Drive in High-Speed Trains. IEEE Trans. Veh. Technol. 2018, 67, 4819–4830.
  7. Tariq, M.F.; Khan, A.Q.; Abid, M. Data-Driven Robust Fault Detection and Isolation of Three-Phase Induction Motor. IEEE Trans. Ind. Electron. 2019, 66, 4707–4715.
  8. Dong, Y.Y.; Qin, S.J. A novel dynamic PCA algorithm for dynamic data modeling and process monitoring. J. Process Control 2017, 67, 1–11.
  9. Luo, H.; Yin, S. A Data-Driven Realization of the Control-Performance-Oriented Process Monitoring System. IEEE Trans. Ind. Electron. 2019, 67, 521–530.
  10. Dong, Y.Y.; Qin, S.J. Dynamic-Inner Partial Least Squares for Dynamic Data Modeling. IFAC-PapersOnLine 2015, 48, 117–122.
  11. Freeman, P.; Pandita, R.; Srivastava, N. Model-Based and Data-Driven Fault Detection Performance for a Small UAV. IEEE/ASME Trans. Mechatron. 2013, 18, 1300–1309.
  12. Taouali, O.J. A new online fault detection method based on PCA technique. IMA J. Math. Control Inform. 2014, 31, 487–499.
  13. Zhang, Y.; Yang, Z. Fault detection of non-Gaussian processes based on modified independent component analysis. Chem. Eng. Sci. 2010, 65, 4630–4639.
  14. Chen, Z.Z.; Steven, D. Canonical correlation analysis-based fault detection methods with application to alumina evaporation process. Control Eng. Pract. 2016, 46, 51–58.
  15. Chen, H.T.; Chen, Z.; Chai, Z. A Single-Side Neural Network-Aided Canonical Correlation Analysis with Applications to Fault Diagnosis. IEEE Trans. Cybern. 2021, 52, 9454–9466.
  16. Chen, H.T.; Yi, H.; Jiang, B. Data-Driven Detection of Hot Spots in Photovoltaic Energy Systems. IEEE Trans. Syst. 2019, 49, 1731–1738.
  17. Wang, C.C.; Too, G. Rotating machine fault detection based on HOS and artificial neural networks. J. Intell. Manuf. 2002, 13, 283–293.
  18. Liu, X.Q.; Qiu, Y.; Zhang, H.Y. Fault detection and diagnosis of Aero-Starter-Generator based on spectrum analysis and neural network method. Acta Agron. Sin. 2004, 30, 483–487.
  19. Huo, M.M.; Luo, H.; Wang, H. A Distributed Closed-loop Monitoring Approach for Interconnected Industrial System. IEEE Trans. Ind. Electron. 2023, 70, 7362–7372.
  20. Qin, S.J. An overview of subspace identification. Comput. Chem. Eng. 2006, 30, 1502–1513.
  21. Yu, C.P.; Verhaegen, M. Subspace Identification of Distributed Homogeneous Systems. IEEE Trans. Autom. Control 2017, 62, 463–468.
  22. Steven, X.D. A characterization of parity space and its application to robust fault detection. IEEE Trans. Autom. Control 1999, 44, 337–343.
  23. Steven, X.D. Data-driven design of monitoring and diagnosis systems for dynamic processes: A review of subspace technique based schemes and some recent results. J. Process Control 2014, 24, 431–449.
  24. Wang, J.; Qin, S.J. A new subspace identification approach based on principal component analysis. J. Process Control 2002, 42, 841–855.
  25. Li, Y.; Yuan, M.; Chadli, M.; Wang, Z.; Zhao, D. Unknown input functional observer design for discrete time interval type-2 Takagi-Sugeno fuzzy systems. IEEE Trans. Fuzzy Syst. 2022, 30, 4690–4701.
  26. Chen, H.T.; Dogru, O.; Huang, B. Distributed Process Monitoring for Multi-Agent Systems Through Cognitive Learning. IEEE Trans. Cognit. Dev. Syst. 2022, 1, 1–12.
  27. Choi, S.W.; Lee, I.B. Multiblock PLS-based localized process diagnosis. J. Process Control 2005, 15, 295–306.
  28. Jiang, Q.C.; Yan, X.F. Plant-wide process monitoring based on mutual information–multiblock principal component analysis. ISA Trans. 2014, 53, 1516–1527.
  29. Chen, Z.W.; Cao, Y.; Zhang, K. A Distributed Canonical Correlation Analysis-based Fault Detection Method for Plant-wide Process Monitoring. IEEE Trans. Ind. Inform. 2019, 15, 2710–2720.
  30. Jiang, Q.C.; Steven, X.D. Data-Driven Distributed Local Fault Detection for Large-Scale Processes Based on the GA-Regularized Canonical Correlation Analysis. IEEE Trans. Ind. Electron. 2017, 64, 8148–8157.
  31. Tao, Y.; Shi, H.B.; Tan, S. Hierarchical Latent Variable Extraction and Multisegment Probability Density Analysis Method for Incipient Fault Detection. IEEE Trans. Ind. Inform. 2022, 18, 2244–2254.
  32. Cong, T.; Tan, R.M. Anomaly Detection and Mode Identification in Multimode Processes Using the Field Kalman Filter. IEEE Control Syst. 2021, 29, 2192–2205.
  33. Tan, R.M.; Ottewill, J.R.; Thornhill, N.F. Nonstationary Discrete Convolution Kernel for Multimodal Process Monitoring. IEEE Trans. Neural. Netw. Learn. Syst. 2020, 31, 3670–3681.
  34. Ruiz, C.; Cao, Y.; Samuel, T. Statistical process monitoring of a multiphase flow facility. Control Eng. Pract. 2015, 42, 74–88.
  35. Stief, A.; Tan, R.M. A heterogeneous benchmark dataset for data analytics: Multiphase flow facility case study. J. Process Control 2019, 79, 41–55.
  36. Chen, H.T.; Jiang, B. A Newly Robust Fault Detection and Diagnosis Method for High-Speed Trains. IEEE Trans. Intell. Transp. Syst. 2018, 20, 2198–2208.
  37. Xiao, L.; Boyd, S.; Kim, S.J. Distributed average consensus with least-mean-square deviation. J. Parallel Distrib. Comput. 2007, 67, 33–46.
  38. Xiao, L.; Boyd, S. Fast linear iterations for distributed averaging. In Proceedings of the 42nd IEEE International Conference on Decision and Control, Maui, HI, USA, 9–12 December 2003.
  39. Olshevsky, A.; Tsitsiklis, J.N. Convergence Speed in Distributed Consensus and Averaging. SIAM Rev. 2011, 53, 747–772.
  40. Steven, X.D. Data-driven design of fault diagnosis system in dynamic process. In Data-Driven Design of Fault Diagnosis and Fault-Tolerant Control Systems; Springer: London, UK, 2014; pp. 107–139.
Figure 1. A dynamic system equipped with a sensor network.
Figure 3. An overview of the multi-phase flow facility.
Figure 4. A schematic of the multiphase flow facility [35] (see Table 1 for tag descriptions).
Figure 5. A connectivity graph of the sensor network.
Figure 6. Input stream sequences for fault-free data set.
Figure 7. Input stream sequences for fault scenario 1.
Figure 8. Residual signal r and h 2 at node 1.
Figure 9. Detection results for fault 1 at node 2.
Figure 10. Detection results for fault 1 at node 8.
Figure 11. Input stream sequences for fault 2.
Figure 12. Detection results for fault 2 at node 1.
Figure 13. Detection results for fault 2 at node 5.
Table 1. Parameters used in this experiment.

| Sensor Location | Description | Units |
|---|---|---|
| PIC501 | Placement of PIC501 valve | % |
| PT417 | Pressure in the blending area | barg |
| FIC302 | Placement of FIC302 valve | % |
| FIC101 | Placement of FIC101 valve | % |
| FT404 | Air flow rate from 2-phase splitter | m³/h |
| FT406 | Water flow rate from 2-phase splitter | kg/s |
| PT501 | Pressure in 3-phase splitter | barg |
| LI101 | Level of water tank | m |
| LI502 | Level of 3-phase splitter | % |
| LI503 | Level of water coalescer | % |
| FT302 | Air intake velocity | Sm³/h |
| FT102 | Water intake velocity | kg/s |
Table 2. Performance comparisons for the multiphase flow plant. Fault $f_1$: blockage in the input block separator; fault $f_2$: slugging situation.

| Detection Framework | FD Strategy | $f_1$ MAR | $f_1$ FAR | $f_2$ MAR | $f_2$ FAR | Where the FD Result Is Obtained |
|---|---|---|---|---|---|---|
| Centralized | Traditional SKR [40] | 0.0452 | 0.0729 | 0.5036 | 0.1693 | Central node |
| Centralized | Dynamic principal component analysis [36] | 0.6417 | 0.3461 | 0.5826 | 0.2754 | Central node |
| Distributed | The developed HSKR | 0.0323 | 0.0391 | 0.0610 | 0.0454 | Any node |
| Distributed | Distributed CCA [29] | 0.3881 | 0.2668 | 0.2076 | 0.3045 | Any node |
Share and Cite

MDPI and ACS Style

Wang, Q.; Peng, B.; Xie, P.; Cheng, C. A Novel Data-Driven Fault Detection Method Based on Stable Kernel Representation for Dynamic Systems. Sensors 2023, 23, 5891. https://doi.org/10.3390/s23135891