Next Article in Journal
Experimental Characterization of Large Turbomachinery Tilting Pad Journal Bearings
Next Article in Special Issue
Intelligent Fault Diagnosis Method for Blade Damage of Quad-Rotor UAV Based on Stacked Pruning Sparse Denoising Autoencoder and Convolutional Neural Network
Previous Article in Journal
A Comparison of the Probes with a Cantilever Beam and a Double-Sided Beam in the Tool Edge Profiler for On-Machine Measurement of a Precision Cutting Tool
Previous Article in Special Issue
Auxiliary Model-Based Multi-Innovation Fractional Stochastic Gradient Algorithm for Hammerstein Output-Error Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Disturbance Detection of a Power Transmission System Based on the Enhanced Canonical Variate Analysis Method

1
College of Control Science and Engineering, China University of Petroleum, Qingdao 266580, China
2
Qingdao Topscomm Communication Co., Ltd., Qingdao 266109, China
*
Author to whom correspondence should be addressed.
Machines 2021, 9(11), 272; https://doi.org/10.3390/machines9110272
Submission received: 11 October 2021 / Revised: 3 November 2021 / Accepted: 4 November 2021 / Published: 6 November 2021
(This article belongs to the Special Issue Deep Learning-Based Machinery Fault Diagnostics)

Abstract

:
Aiming at the characteristics of dynamic correlation, periodic oscillation, and weak disturbance symptom of power transmission system data, this paper proposes an enhanced canonical variate analysis (CVA) method, called SLCVAkNN, for monitoring the disturbances of power transmission systems. In the proposed method, CVA is first used to extract the dynamic features by analyzing the data correlation and establish a statistical model with two monitoring statistics T 2 and Q. Then, in order to handling the periodic oscillation of power data, the two statistics are reconstructed in phase space, and the k-nearest neighbor (kNN) technique is applied to design the statistics nearest neighbor distance D T 2 and D Q as the enhanced monitoring indices. Further considering the detection difficulty of weak disturbances with the insignificant symptoms, statistical local analysis (SLA) is integrated to construct the primary and improved residual vectors of the CVA dynamic features, which are capable to prompt the disturbance detection sensitivity. The verification results on the real industrial data show that the SLCVAkNN method can detect the occurrence of power system disturbance more effectively than the traditional data-driven monitoring methods.

1. Introduction

With the increasing demand on the power energy in the modern industry, power transmission systems are becoming more and more large-scale and complicated [1,2]. Due to the system complexity, anomalies and disturbances are often unavoidable in real power systems. If these unexpected events are not handled timely, they may cause huge accident risks and even the widespread power outages, which are companied by the huge economic loss and severe life inconvenience. Therefore, it is of great value to detect the abnormal events quickly and maintain the safe running of power systems [3]. In recent years, the wide area measurement system (WAMS) based on synchronous phaser technology has been successfully applied in the power industry. The phasor measurement units in WAMS provide the basic data support for the real-time dynamic monitoring of the power system [4]. Accordingly, safety monitoring and disturbance detection of power systems based on the measurement data analysis has been a hot topic in academic and engineering fields [5,6,7].
Aiming at the power system disturbance detection task, researchers have conducted a lot of studies, which can be roughly divided into two categories: time/frequency domain analysis and multivariate statistical analysis. The time/frequency domain analysis investigates the power system changes from the perspective of the signal processing, which involves the time domain, frequency domain, or time-frequency domain. In consideration of the good time-frequency localization property, Huang et al. [8] discussed the application of the Morelet wavelets method in power system disturbance detection. The Hilbert Huang Transform is another time-frequency signal analysis tool. Manglik et al. [9] applied it to the disturbance detection for the electric power system. Ghaderi et al. [10] proposed the time-frequency analysis method assisted by current waveform energy and normalized joint time-frequency moment and demonstrated its performance in the high-impedance ground fault detection. Salehi et al. [11] designed a morphological edge detection filter to obtain the transient features of fault signals. Liu et al. [12] used the wavelet packet Tsallis singularity entropy algorithm for disturbance detection. In general, the time/frequency domain analysis methods mainly analyze the single signal and fail to fully consider the correlation between different parameters. In response to this shortcoming, some scholars started their work by applying multivariate statistical analysis. Multivariate statistical analysis (MSA) methods can realize the simultaneous detection of multiple parameter changes and have outstanding advantages in the complex industrial systems. However, most of the present MSA studies focus on the system modeling and disturbance detection in the chemical process, steel industry, and high-train system [13,14,15,16,17], but MSA’s application to power system monitoring is very rare. Barocio et al. [18] first introduced the principal component analysis (PCA) method into the field of power system monitoring and discussed the detection and visualization of power system disturbances based on PCA. Guo et al. [19] built a transmission line fault detection method by combining PCA and support vector machine. Considering the masking influence caused by the oscillation trend and strong noise of power system data, Cai et al. [20] further proposed a PCAkNN method, which is superior to the basic PCA method in the numerical model testing and New England power system model data. These research articles point out that the multivariate statistical analysis has great application potential in the field of power system monitoring.
Although PCA and PCAkNN methods have achieved significant success in the power system monitoring field, they have some shortcomings deserving further studies. On the one hand, these methods do not take into account the dynamic characteristics of power system data, which easily leads to a high missing detection rate. Different from the other industrial process data with the steady operation mode, the power system data, such as the voltage and current, are with obvious dynamic trends. On the other hand, the present methods do not consider how to enhance the detection of weak disturbances. In real applications, some disturbances may be with small amplitudes, slow changes, unclear disturbance characteristics, and are easy to be covered by noises [21,22]. How to enhance the detection capability on these weak disturbances is one challenging task.
Aiming at the aforementioned problems, this paper proposes a SLCVAkNN-based disturbance detection method for power transmission system monitoring by combining canonical variate analysis (CVA), kNN, and statistical local analysis (SLA). Compared with the traditional PCA-based power system monitoring methods, CVA has a stronger dynamic feature extraction ability [23,24,25], which provides a new and powerful tool for power system data analysis. Referring to the present PCAkNN method, the CVAkNN statistical model is developed to deal with the dynamic periodic oscillation signals. Furthermore, in order to enhance the detection of weak faults, SLA is integrated for SLCVAkNN modeling, which mines the local statistical information for better weak disturbance monitoring.
The rest of the paper’s content is arranged as follows. The principle of the proposed SLCVAkNN methodology is given in the Section 2, while the corresponding disturbance detection procedure is detailed in Section 3. One case study on the actual industrial data is used to verify the effectiveness of the proposed method.

2. The Proposed Methodology

This section clarifies the proposed SLCVAkNN-based power system disturbance detection method. The whole methodology involves three parts: dynamic system modeling using canonical variate analysis, monitoring index construction based on kNN, and weak disturbance detection by statistical local analysis.

2.1. CVA Monitoring Model

A power system is a classical dynamic process [26,27], where the measurement data demonstrate the clear trend along the sampling time. The measured three-phase electric field and current waveform change with time, and the current data point has a certain correlation with the historical samples. Therefore, it is more reasonable to apply the dynamic data analysis tool to extract the process features.
Canonical variate analysis (CVA) is an effective dynamic data analysis tool, which has been applied to the model identification and control in the multivariate dynamic system [28,29]. This paper introduces it to deal with power system data. For a certain power transmission line, the data points under normal system operation have a fixed correlation along the time dimension. When a disturbance occurs, this correlation may be destroyed. By monitoring the correlation among the time series data, CVA can find the system disturbance effectively. When CVA is applied to data modeling, the training data are firstly divided into the historical data set and the future data set, and the CVA optimization problem is designed to find the maximum correlation between these two data sets for describing the data dynamic features. The algorithm details are clarified as follows.
For the power system measurement data vector x h R m at the h-th sampling instant, its corresponding historical data vector p h and future data vector f h are constructed as
p h = [ x h T , x h 1 T , , x h l + 1 T ] T R M
f h = [ x h + 1 T , x h + 2 T , , x h + l T ] T R M
where M = m × l , and l represents the time lag order.
Given the projection vectors a and b , they are used to transform the historical and future vectors into their respective projections d = a T p h and v = b T f h . CVA is to optimize the vector pair a and b so that the correlation between d and v is maximized, which are also called canonical variates. This can be described by the mathematical expression as
max a , b ρ ( d , v ) = a T Σ p f b s . t . v a r ( d ) = a T Σ p p a = 1 v a r ( v ) = b T Σ f f b = 1
where Σ p f represents the cross-covariance matrix of the historical and future data vectors, and Σ p p , Σ f f denote the covariance matrix of the historical and future data vectors, respectively.
Suppose that the training data set includes n samples as X = [ x 1 T , x 2 T , , x n T ] T R n × m , then the historical and future data matrix can be expressed by
P = [ p l T , p l + 1 T , , p n l T ] T R N × M
F = [ f l T , f l + 1 T , , f n l T ] T R N × M
where N = n 2 l + 1 is the sample number of the historical and future data matrix. Then the covariance matrices defined in Equation (3) can be calculated by
Σ p f = 1 N 1 P T F
Σ p p = 1 N 1 P T P
Σ f f = 1 N 1 F T F
Solving the optimization problem described by Equation (3) leads to a singular value decomposition on the matrix Ξ = Σ p p 1 / 2 Σ p f Σ f f 1 / 2 , which is expressed by
Ξ = U Λ V T
The solution of Equation (9) is further used to build the a series of the projection vectors a i and b i ( 1 i M ) , which are computed by
a i = Σ p p 1 / 2 U ( : , i ) , b i = Σ f f 1 / 2 V ( : , i ) ,
where ( : , i ) represents the i-th column of the matrix. The vectors a i and b i are ordered by the corresponding correlation degree, which is given in the diagonal elements of matrix Λ , also meaning the correlation coefficients. The first s pairs of projection vectors { a i , b i , 1 i s } describe the stronger correlation and indicate the close relationship between the historical data and the future data. Therefore, a projection matrix A s = [ a 1 a 2 a s ] is defined to extract the canonical variate vector d h as
d h = A s T p h .
which describes the main dynamic features of process data. Here, s is determined so that the corresponding canonical variate vectors describe a cumulative percentage of 90% of correlation coefficients.
As A s only involves the first s projection directions, it cannot cover all the data information. The rest information in the CVA model can be described by the CVA residual vector e h as
e h = ( I A s A s T ) p h
Based on the canonical variate vector and CVA residual vector, two monitoring statistics T 2 and Q are often used to judge the process state. The T 2 statistic describes the changes of principal dynamic states, while the Q statistic monitors the changes of the residual information. For the h-th sample, the statistics are written by
T h 2 = d h T d h
Q h = e h T e h
In the normal operation, these two statistics should satisfy T h 2 T h , l i m 2 and Q h Q h , l i m , where T h , l i m 2 and Q h , l i m are the corresponding confidence limits. In some literature, the confidence limits of these two statistics can be obtained by assuming the prior distribution [30]. However, these distribution assumptions are often difficult to satisfy. Therefore, this paper applies the data-driven kernel density estimation to determine the confidence limit [31,32].

2.2. CVAkNN Model Based on kNN Monitoring Index

As the measurement data of power transmission systems have the periodic fluctuation characteristic, the traditional CVA monitoring statistics T 2 and Q behave unsteadily with the periodic changes. In this case, disturbance detection by directly monitoring the amplitudes of monitoring statistics cannot discover the disturbance signals effectively and may lead to a high disturbance missing rate.
In order to overcome this defect, this paper introduces the k-nearest neighbor analysis (kNN) to enhance the basic monitoring statistics. kNN is one effective multimodal data analysis tool and does not depend on the amplitude changes before and after the disturbance. In the literature [33,34], kNN was introduced and adapted for real-time detection of system disturbances. By combining the CVA model and the kNN-based monitoring statistics, the improved method, which is called CVAkNN, has a stronger capability of dealing with the periodic oscillation data property. The main idea of CVAkNN is to first reconstruct the monitoring statistic in the phase space and then build the monitoring index based on the distance between the reconstructed statistic vector and its k-nearest neighbor.
Phase space reconstruction is a good method to deal with time series analysis. This method regards one-dimensional time series as the result of nonlinear dynamic system motion and constructs the phase vectors by re-arranging the time series. This theory has been successfully applied in the fields of chaotic time series prediction and equipment failure data analysis [35,36]. Here it is introduced to deal with the CVA monitoring statistics for the further kNN analysis.
For the training data set with n samples x 1 , x 2 , , x n , the corresponding statistics vectors are obtained by the CVA modeling as
T 2 = [ T l 2 T l + 1 2 T n 2 ]
Q = [ Q l Q l + 1 Q n ]
Further, the phase reconstruction statistics matrix can be formulated as follows:
MT 2 = T l 2 T l + L 2 2 T l + L 1 2 T l + 1 2 T l + L 1 2 T l + L 2 T n L + 1 2 T n 1 2 T n 2
MQ = Q l Q l + L 2 Q l + L 1 Q l + 1 Q l + L 1 Q l + L Q n L + 1 Q n 1 Q n
where L is the embedding dimension defining the length of the reconstructed phase vector. Based on the results of the phase space reconstruction, the dynamic behavior of the statistics can be better described, which is conducive to the detection of power system disturbances.
In the online monitoring stage, a new testing sample x t is collected at the t-th sampling instant. Then the monitoring statistics can be computed by applying Equations (13) and (14), and the reconstructed phase vectors are described as
NT t 2 = T t L + 1 2 T t 1 2 T t 2
NQ t = Q t L + 1 Q t 1 Q t
To determine whether the test data x t is normal, it is necessary to compare the similarity between NT t 2 , NQ t 2 , and the reconstructed statistics matrix in Equations (17) and (18). If the reconstructed statistics NT t 2 , NQ t 2 are strongly similar to one column of the training statistics vectors in Equations (17) and (18), then the test data x t describe the normal working condition. Otherwise, it means that some faults occur in the power transmission system. Therefore, the key is how to perform this similarity comparison. This paper introduces the k-nearest neighbor (kNN) analysis to construct a kNN-based distance measurement indicator: statistical nearest neighbor distance (SNND).
The idea of SNND is to find the first k-th nearest neighbors of the test vector in the given matrix data and compute the distance between the test vector and the k-th nearest neighbors as a disturbance detection criterion. The SNND index for NT t 2 is defined as
D T t 2 = NT t 2 MT 2 j k , : ,
where MT 2 ( j k , : ) represents the j k -th row in the MT 2 matrix, which corresponds to the k-th nearest neighbor of NT t 2 , and . represents the L2 norm calculation. By analogy, the SNND indicator of N Q t can be established as
D Q t = NQ t MQ j k , : .
Under normal operating conditions, the above two indicators should fluctuate within a relatively small range. That means D T t 2 D T l i m 2 and D Q t D Q l i m for the normal running status. Once the threshold is exceeded, it means that there is a system disturbance. The threshold can be obtained by the kernel density estimation method.

2.3. SLCVAkNN Model Assisted by Statistical Local Analysis

In the power transmission system, some weak disturbances are often difficult to detect, such as the high-impendence single-phase ground fault. When this kind of disturbance occurs, the changes reflected by the measure voltage and current variables are very small. Further, considering the influence of modeling error and process noise, this kind of disturbance may be concealed and viewed as the normal process changes. Therefore, enhacning the weak disturbance detection is of great value to ensure the safety of power transmission systems. In this paper, we integrate the statistical local analysis (SLA) with CVAkNN and propose an improved SLCVAkNN monitoring model for better weak disturbance monitoring performance.
SLA was originally proposed by Basseville [37] for inspecting the process parameter changes. In recent years, some researchers have introduced it into the chemical process fault detection and demonstrated its effectiveness [38,39,40]. In this paper, we will perform the statistical local analysis on the CVA model. To look back into the CVA monitoring statistics in Equations (13) and (14), it is found that the monitoring statistics used to indicate the process status are composed of the canonical variate vector d h and the CVA residual vector e h . Therefore, if we attempt to improve the weak disturbance monitoring of CVA statistics, the vectors d h and e h must be improved with stronger disturbance sensitivity.
According to the statistical local analysis theory, given the system observation z j and the system parameter ϑ , a primary residual vector φ ( z j , ϑ ) can be defined for disturbance detection if it meets the following conditions: [37,38]
  • E { φ ( z j , ϑ ) } = 0 , if ϑ = ϑ 0 ;
  • E { φ ( z j , ϑ ) } 0 , if ϑ is in the neighborhood of ϑ 0 , but ϑ ϑ 0 ;
  • φ ( z j , ϑ ) is differentiable with ϑ ;
  • φ ( z j , ϑ ) exists for ϑ in the neighborhood of ϑ 0 .
Here ϑ 0 represents the parameters under the normal condition.
By investigating the i-th element in the vector d h , which is denoted as d h , i , it is easily derived by Equation (11) that d h , i = a i T p h . Naturally, the variance of d h , i can be computed as
E { d h , i 2 } = a i T E { p h p h T } a i
For the statistical samples, E { p h p h T } is factually equal to the covariance matrix Σ p p . Further combining the first constraint on the vector a in Equation (3), it is known that a i T E { p h p h T } a i = 1 . Therefore, we build the SLA primary residual of the canonical variate as
φ d h , i = d h , i 2 1 .
which meets the condition E { φ d h , i } = 0 in the normal condition.
Similarly, we analyze the variance of e h , i to obtain
E { e h , i 2 } = A r ( i , : ) E { p h p h T } A r ( i , : ) T
As A r can be obtained in the model training procedure, the above expression must be equal to a fixed value, which is denoted as σ i = A r ( i , : ) E { p h p h T } A r ( i , : ) T . Therefore, the SLA primary residual of the CVA residual can be built as
φ e h , i = e h , i 2 σ i .
which meets the condition E { φ e h , i } = 0 for the normal data.
For a more sensitive disturbance detection, the SLA improved residual is applied in a moving window with the width of w, which is expressed by
ψ d h , i = 1 h j = 1 h φ d j , i , h < w 1 w j = h w + 1 h φ d j , i , h w
ψ e h , i = 1 h j = 1 h φ e j , i , h < w 1 w j = h w + 1 h φ e j , i , h w
Up to now, we can obtain the SLA improved residual vectors ψ d , h = [ ψ d h , 1 ψ d h , 2 ψ d h , s ] T and ψ e , h = [ ψ e h , 1 ψ e h , 2 ψ e h , M ] T . These residual vectors are used to replace the original CVA features d h and e h so that the monitoring model is modified to the SLCVAkNN model.
With the SLA improved residual vectors, the monitoring statistics are constructed as follows:
T h 2 = ψ d , h T ψ d , h
Q h = ψ e , h T ψ e , h

3. Disturbance Detection Procedure Based on SLCVAkNN

Power transmission system disturbance detection based on SLCVAkNN method is divided into two stages: offline modeling stage and online detection stage. The corresponding flowchart is shown in Figure 1.
  • Stage 1: offline modeling stage
    • Acquire the normal condition data to constitute the training data set X = [ x 1 T , x 2 T , , x n T ] T R n × m and perform data normalization processing. Here, the mentioned normal condition data mean the data from a section of transmission line between two adjacent nodes. For different lines, the corresponding modelings are needed separately.
    • Construct historical data sets P and future data sets F according to Equations (4) and (5), calculate the covariance matrices by Equations (6)–(8), and solve the CVA optimization by the SVD as Equation (9).
    • Extract the canonical variate vector d h and the CVA residual vector e h , as shown in Equations (11) and (12).
    • Perform Equations (24) and (26) to obtain the SLA primary residual vectors and further calculate the SLA improved residual vectors by Equations (27) and (28).
    • Compute the monitoring statistics T h 2 and Q h for all the training samples according to Equations (29) and (30).
    • Construct the statistics matrix in the phase space according to Equations (17) and (18).
    • Calculate the SNND monitoring indices D T 2 and D Q for all the training samples and determine the 95% confidence limits D T l i m 2 and D Q l i m by kernel density estimation.
  • Stage 2: online detection stage
    • Obtain online new data x t and normalize it with the training data.
    • Construct the corresponding historical vector p t and project the p t to the CVA model and obtain the canonical variate vector d t and e t according to Equations (11) and (12).
    • Apply Equations (24), (26)–(28) to compute the SLA primary residual vector and the improved residual vector orderly.
    • Compute the monitoring statistics T t 2 and Q t for the online new sample x t according to Equations (29) and (30).
    • Construct the phase space statistics vector NT t 2 and NQ t 2 , and calculate the SNND index D T t 2 and D Q t 2 by Equations (21) and (22).
    • Compare the SNND indices with the corresponding confidence limits D T l i m 2 and D Q l i m . If any one exceeds the confidence limit, a disturbance sample is indicated.
Here, it is pointed out that the local neighborhood standardization (LNS) [41] may be used to enhance the traditional z-score standardization. Compared with the traditional z-score method, LNS has better capability to deal with the non-steady data with the periodic oscillations.

4. Case Analysis

In order to verify the advantages of the SLCVAkNN method in the power transmission system disturbance detection, this section gives the case study on the real industrial data collected from the actual power transmission system. For method comparison, four methods, including the proposed SLCVAkNN method and three other methods of PCA, PCAkNN, and CVAkNN, are all applied to build the monitoring models for disturbance detection. The PCA method has two monitoring statistics T 2 and Q, while the other three methods are with the kNN-based statistics D T 2 and D Q . When these methods are used, they indicate the system status by the monitoring charts, where the monitoring indices of normal and faulty samples are given by black and blue solid lines, respectively, while the detection threshold, that is the 95% confidence limit of the monitoring index, is plotted by the red dashed line. One evaluation index, called the disturbance detection rate (DDR), is used to evaluate the different monitoring methods. DDR is the percentage of the abnormal samples exceeding the detection threshold over all the abnormal samples.
The used real industrial data are collected from the seven transmission lines in a power supply station in August 2018. These lines are radially connected. Their data are collected because all of them involve the ground fault. The data acquisition units, designed by Qingdao Topscomm Communication CO. LTD, are used to collect the electric field intensity and current. Here, the real line voltage is up to 110 KV so that the existing equipment can not directly measure it. Therefore, the electric field intensity is applied to reflect the voltage trend. For each transmission line, one corresponding data set is recorded that involves the normal state and the abnormal state. The data set has the length of about 1300 samples, where the disturbance starting time (DST) is different in different transmission lines. The detailed information about the acquired data sets are listed in Table 1, where DST data record the sample number corresponding to the disturbance starting time. A demonstration of the collected data for the DATA-A is given in Figure 2, where six measured variables, including the electric field intensities of phase A, B, and C, and the currents of phase A, B, and C, are involved. Due to the existence of the harmonic load, the current sine wave distortion can be seen in these curves.
Taking the data set DATA-2 as one example, it is collected from the pole 116-3 of the line 906. This data set includes 1312 samples. To investigate it with the help of on-site engineers, it is known that the disturbance occurs from the 456th sample. Although engineers can find this disturbance by careful analysis, this manual way is very time-consuming and inefficient, so it is difficult to implement in large-scale transmission system monitoring. Therefore, building an automatic multivariate data analysis tool is very necessary. In this section, we apply four MSA methods, which are PCA, PCAkNN, CVAkNN, and SLCVAkNN, to perform the automatic fault detection. When the statistical models are developed, the model parameters are set as follows: k = 3 , L = 10 , l = 2 , w = 20 . For the data set DATA-2, the first 320 sampling point are considered to be in a normal operating state, they can be utilized as the training data set for model development, while monitoring charts of PCA, PCAkNN, CVAkNN, and SLCVAkNN are demonstrated in the Figure 3, Figure 4, Figure 5 and Figure 6, respectively. By the PCA monitoring results shown in Figure 3, it can be seen that the disturbance cannot be detected very effectively. The DDR of PCA T 2 is 4.43%, while the Q is a little better with the DDR of 29.52%. When PCAkNN is used, the D T 2 has a similarly poor detection rate, but the D Q statistic achieves clear improvement with the DDR of 57.76%. These results demonstrate that the PCAkNN method proposed by Cai et al. [20] can deal with the power system data with oscillation characteristic effectively. However, from these figures, the monitoring statistics do not exceed the confidence limits significantly. This may lead to the uncertain judgement on the occurrence of disturbance. When the CVAkNN is applied in Figure 5, the D Q statistic performs a little better with the DDR of 49.71%. However, its D T 2 indicator clearly improves the DDR to 92.51%, which means a significant detection rate improvement of about 70% in contrast with the PCAkNN’s D Q index. The best monitoring results on this data set is provided by SLCVAkNN, which are shown in Figure 6. By this figure, it is observed that the disturbance is detected very clearly with the DDRs of 97.25% and 96.80% for D T 2 and D Q , respectively. This case gives a comprehensive comparison on the four methods of PCA, PCAkNN, CVAkNN, and SLCVAkNN. The applications show that PCAkNN does better than PCA due to the use of kNN, while SLCVAkNN further prompts the disturbance detection performance with the integration of CVA and SLA.
Another example on the data set DATA-6 is illustrated, which corresponds to the line 906 exit. The modeling procedure is similar to the above case. Here we only give the monitoring charts of CVAkNN and SLCVAkNN, as shown in the Figure 7 and Figure 8. With the consideration of system dynamics, the CVAkNN D T 2 monitoring chart gives a higher DDR of 88.51%. Compared with the CVAkNN method, which has only one effective monitoring statistic, SLCVAkNN has two well-behaved monitoring statistics. The D T 2 and D Q have the DDRs of 97.37% and 97.25%, respectively. The testing results on DATA-6 further verify the advantage of the proposed method over the CVAkNN method.
The summary of disturbance detection rates for all seven data sets are shown in Table 2. From this table, it is shown that the faults in DATA-2 and DATA-4 are difficult to detect by PCA, whose DDRs are all lower than 30%. By the use of PCAkNN, these two faults are detected with higher DDRs, which are 57.76% and 26.78%, respectively. By contrast, CVAkNN does better on the two faults. In particular, its D T 2 statistic gives the DDR higher than 90%. When SLCVAkNN is used, its two monitoring statistics have the higher DDRs than 95%. For the sets of DATA-1, DATA-3, DATA-6, and DATA-7, PCA can detect these faults with about 70-80% DDR on one statistic. That means PCA can alarm these faults, but the alarm degree is not very sufficient. The PCAkNN and CVAkNN improve the DDR to about 90%. Further combining the SLA technique, SLCVAkNN achieves higher DDR than CVAkNN on these four sets. As to DATA-5, all these four methods give a similarly good performance with the DDRs higher than 95%. Considering all seven of these data sets, we observe that the average detection rates of CVAkNN outperforms the PCA and PCAkNN method, while the ones of SLCVAkNN statistics can reach 97.46% and 96.29%, which are the highest among these four methods.
To sum up, the applications on real industrial data verify the effectiveness of the proposed SLCVAkNN in the power transmission system monitoring. All the tested faults are about the ground faults. Although this paper does not provide the results on the other disturbances such as 1,3-phase short circuits, overvoltages, the presented algorithm is also suitable for these cases because they similarly lead to the changes of voltage and current. However, one related issue should be noted. In this article, this method detects all the occurred disturbances, including normal disturbances such as load power variations. To judge whether the disturbance is a fault or a normal disturbance is a further job. In fact, as to this issue, one solution is to enrich the modeling data with different normal changes. As the kNN used in this method can deal with the multimodal data case, the trained model can distinguish the faults and normal disturbances effectively when the normal changing data are considered in the model training procedure.

5. Conclusions

This paper proposes a power transmission system disturbance detection method based on SLCVAkNN. The real industrial data collected from the field transformer station are applied to verify the proposed method. By investigating the application results, we can draw the following conclusions.
  • CVA-based monitoring method can provide better dynamic information mining. The dynamic data analysis tool CVA is introduced to deal with the power transmission system data. By observing the application results, we find that CVAkNN has a higher detection rate than PCAkNN.
  • The statistical local analysis can further enhance the disturbance monitoring. Considering that many high-impendence ground faults in the real power systems are with insignificant symptoms, the weak disturbance detection methods are very important in improving the disturbance detection sensitivity. By focusing on the statistical local information of CVA features, the proposed SLCVAkNN method outperforms the CVAkNN method.

Author Contributions

Conceptualization, S.W. and X.D.; methodology, S.W. and Y.T.; software, Y.T. and X.D.; validation, Y.T. and L.W.; formal analysis, S.W. and Y.T.; investigation, Y.T. and P.S.; resources, L.W. and Q.C.; data curation, Y.T., L.W., and Q.C.; writing—original draft preparation, Y.T. and P.S.; writing—review and editing, S.W. and X.D.; visualization, L.W. and Q.C.; supervision, X.D.; project administration, X.D.; funding acquisition, X.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the Shandong Provincial Natural Science Foundation (Grant No. ZR2020MF093), the Major Scientific and Technological Projects of CNPC (Grant No. ZD2019-183-003), and the Fundamental Research Funds for the Central Universities (that is, the Opening Fund of National Engineering Laboratory of Offshore Geophysical and Exploration Equipment, Grant No. 20CX02310A).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

CVAcanonical variate analysis
MSAmultivariate statistical analysis
SLAstatistical local analysis
WAMSwide area measurement system
A s projection matrix
d h canonical variate vector
D Q t SNND monitoring index
f h future data vector at the h-th sample instant
MT 2 phase reconstruction statistics matrix
NT t 2 reconstructed statistics vector
p h historical data vector at the h-th sample instant
Q h monitoring statistic at the h-th sample instant
x h data vector at the h-th sample instant
ϕ d h , i SLA primary residual of canonical variate
ψ d h , i SLA improved residual of canonical variate
Σ Covariance matrix
kNNk-nearest neighbor
PCAprincipal component analysis
SNNDstatistical nearest neighbor distance
a projection vector
b projection vector
D T t 2 SNND monitoring index
e h residual vector at the h-th sample instant
F future data matrix
MQ phase reconstruction statistics matrix
NQ t reconstructed statistics vector
P historical data matrix
T h 2 monitoring statistic at the h-th sample instant
X data matrix
ϕ e h , i SLA primary residual of CVA residual
ψ e h , i SLA improved residual of CVA residual

References

  1. Samuelsson, O.; Hemmingsson, M.; Nielsen, A.H.; Pedersen, K.O.H.; Rasmussen, J. Monitoring of power system events at transmission and distribution level. IEEE Trans. Power Syst. 2006, 21, 1007–1008. [Google Scholar] [CrossRef]
  2. Patel, B.; Bera, P. Fast fault detection during power swing on a hybrid transmission line using WPT. IET Gener. Transm. Distrib. 2019, 13, 1811–1820. [Google Scholar] [CrossRef]
  3. Musa, M.H.H.; He, Z.; Fu, L.; Deng, Y. Linear regression index-based method for fault detection and classification in power transmission line. IEEJ Trans. Electr. Electron. Eng. 2018, 13, 979–987. [Google Scholar] [CrossRef]
  4. Chang, H.-H.; Linh, N.V.; Lee, W.-J. A novel nonintrusive fault identification for power transmission networks using power-spectrum-based hyperbolic s-transform-part i: Fault classification. IEEE Trans. Ind. Appl. 2018, 54, 5700–5710. [Google Scholar] [CrossRef]
  5. Costa, F.B. Fault-induced transient detection based on real-time analysis of the wavelet coefficient energy. IEEE Trans. Power Deliv. 2013, 29, 140–153. [Google Scholar] [CrossRef]
  6. Math, H.J.B.; Das, R.; Djokic, S.; Ciufo, P.; Meyer, J.; Ronnberg, S.K.; Zavoda, F. Power quality concerns in implementing smart distribution-grid applications. IEEE Trans. Smart Grid 2016, 8, 391–399. [Google Scholar]
  7. Cheng, C.; Wang, W.; Chen, H.; Zhang, B.; Shao, J.; Teng, W. Enhanced fault diagnosis usign broad learning for traction systems in high-speed trains. IEEE Trans. Power Electron. 2021, 36, 7461–7469. [Google Scholar] [CrossRef]
  8. Huang, S.; Hsieh, C.; Huang, C. Application of morlet wavelets to supervise power system disturbances. IEEE Trans. Power Deliv. 1999, 14, 235–243. [Google Scholar] [CrossRef]
  9. Manglik, A.; Li, W.; Ahmad, S.U. Fault Detection in power system using the Hilbert-Huang transform. In Proceedings of the 2016 IEEE Canadian Conference On Electrical And Computer Engineering (CCECE), Vancouver, BC, Canada, 14–18 May 2016. [Google Scholar]
  10. Ghaderi, A.; Mohammadpour, H.A.; Ginn, H.L.; Shin, Y.-J. High-impedance fault detection in the distribution network using the time-frequency-based algorithm. IEEE Trans. Power Deliv. 2015, 30, 1260–1268. [Google Scholar] [CrossRef]
  11. Salehi, M.; Namdari, F. Fault classification and faulted phase selection for transmission line using morphological edge detection filter. IET Gener. Transm. Distrib. 2018, 12, 1595–1605. [Google Scholar] [CrossRef] [Green Version]
  12. Liu, Z.; Hu, Q.; Cui, Y.; Zhang, Q. A new detection approach of transient disturbances combining wavelet packet and tsallis entropy. Neurocomputing 2014, 142, 393–407. [Google Scholar] [CrossRef]
  13. Zhang, X.; Kano, M.; Song, Z. Optimal weighting distance-based similarity for locally weighted PLS modeling. Ind. Eng. Chem. Res. 2020, 59, 11552–11558. [Google Scholar] [CrossRef]
  14. Deng, X.; Du, K. Efficient batch process monitoring based on random nonlinear feature analysis. Canadian J. Chem. Eng. 2021, in press, 1–12. [Google Scholar] [CrossRef]
  15. Zhang, X.; Kano, M.; Matsuzaki, S. A comparative study of deep and shallow predictive techniques for hot metal temperature prediction in blast furnace ironmaking. Comput. Chem. Eng. 2019, 130, 106575. [Google Scholar] [CrossRef]
  16. Chen, H.; Jiang, B.; Ding, S.X.; Huang, B. Data-driven fault diagnosis for traction systems in high-speed trains: A survey, challenges, and perspectives. IEEE Trans. Intell. Transp. Syst. 2020, in press, 1–17. [Google Scholar] [CrossRef]
  17. Chen, H.; Jiang, B. A review of fault detection and diagnosis for the traction system in high-speed trains. IEEE Trans. Intell. Transp. Syst. 2020, 21, 450–465. [Google Scholar] [CrossRef]
  18. Barocio, E.; Pal, B.C.; Fabozzi, D.; Thornhill, N.F. Detection and visualization of power system disturbances using principal component analysis. In Proceedings of the 2013 IREP Symposium Bulk Power System Dynamics and Control-IX Optimization, Security and Control of the Emerging Power Grid, Rethymnon, Greece, 25–30 August 2013. [Google Scholar]
  19. Guo, Y.; Li, K.; Liu, X. Fault diagnosis for power system transmission line based on PCA and SVMs. In Proceedings of the 2nd International Conference on Intelligent Computing for Sustainable Energy and Environment (ICSEE), Shanghai, China, 12–13 September 2012. [Google Scholar]
  20. Cai, L.; Thornhill, N.F.; Kuenzel, S.; Pal, B.C. Wide-area monitoring of power systems using principal component analysis and k-nearest neighbor analysis. IEEE Trans. Power Syst. 2018, 33, 4913–4923. [Google Scholar] [CrossRef] [Green Version]
  21. Chen, H.; Wu, J.; Jiang, B.; Chen, W. A modified neighborhood preserving embedding-based incipient fault detection with applications to small-scale cyber-physical systems. ISA Trans. 2020, 104, 175–183. [Google Scholar] [CrossRef] [PubMed]
  22. Chen, H.; Jiang, B.; Zhang, T.; Lu, N. Data-driven and deep learning-based detection and diagnosis of incipient faults with application to electrical traction systems. Neurocomputing 2020, 396, 429–437. [Google Scholar] [CrossRef]
  23. Jiang, B.; Braatz, R.D. Fault detection of process correlation structure using canonical variate analysis-based correlation features. J. Process. Control 2017, 58, 131–138. [Google Scholar] [CrossRef]
  24. Li, X.; Yang, Y.; Bennett, I.; Mba, D. Condition monitoring of rotating machines under time-varying conditions based on adaptive canonical variate analysis. Mech. Syst. Signal Process. 2019, 131, 348–363. [Google Scholar] [CrossRef]
  25. Chen, Z.; Yang, C.; Peng, T.; Dan, H.; Li, C.; Gui, W. A cumulative canonical correlation analysis-based sensor precision degradation detection method. IEEE Trans. Ind. Electron. 2019, 66, 6321–6330. [Google Scholar] [CrossRef]
  26. Han, S.; Xu, Z.; Sun, B.; He, L. Dynamic characteristic analysis of power system interarea oscillations using HHT. Int. J. Electr. Power Energy Syst. 2010, 32, 1085–1090. [Google Scholar] [CrossRef]
  27. Hu, Z. Method considering the dynamic coupling characteristic in power system for stability assessment. IET Gener. Transm. Distrib. 2017, 11, 2534–2539. [Google Scholar] [CrossRef]
  28. Zhang, S.; Zhao, C.; Huang, B. Simultaneous static and dynamic analysis for fine-scale identification of process operation statuses. IEEE Trans. Ind. Inform. 2019, 15, 5320–5329. [Google Scholar] [CrossRef]
  29. Li, X.; Duan, F.; Bennett, I.; Mba, D. Canonical variate analysis, probability approach and support vector regression for fault identification and failure time prediction. J. Intell. Fuzzy Syst. 2018, 34, 3771–3783. [Google Scholar] [CrossRef] [Green Version]
  30. Chiang, L.H.; Russell, E.L.; Braatz, R.D. Fault Detection and Diagnosis in Industrial Systems; Springer: London, UK, 2001. [Google Scholar]
  31. Odiowei, P.E.P.; Cao, Y. Nonlinear dynamic process monitoring using canonical variate analysis and kernel density estimations. IEEE Trans. Ind. Inform. 2010, 6, 36–45. [Google Scholar] [CrossRef] [Green Version]
  32. Zhang, Z.; Deng, X. Anomaly detection using improved deep SVDD model with data structure preservation. Pattern Recognit. Lett. 2021, 148, 1–6. [Google Scholar] [CrossRef]
  33. Zhang, X.; Li, Y. Multiway principal polynomial analysis for semiconductor manufacturing process fault detection. Chemom. Intell. Lab. Syst. 2018, 181, 29–35. [Google Scholar] [CrossRef]
  34. Cai, L.; Thornhill, N.F.; Kuenzel, S.; Pal, B.C. Real-time detection of power system disturbances based on k-nearest neighbor analysis. IEEE Access 2017, 5, 5631–5639. [Google Scholar] [CrossRef]
  35. Zhang, A.; Xu, Z. Chaotic time series prediction using phase space reconstruction based conceptor network. Cogn. Neurodynamics 2020, 14, 849–857. [Google Scholar] [CrossRef]
  36. Yu, P.; Yan, X. Stock price prediction based on deep neural networks. Neural Comput. Appl. 2020, 32, 1609–1628. [Google Scholar] [CrossRef]
  37. Basseville, M. On-board component fault detection and isolation using the statistical local approach. Automatica 1998, 34, 1391–1415. [Google Scholar] [CrossRef] [Green Version]
  38. Kruger, U.; Kumar, S.; Littler, T. Improved principal component monitoring using the local approach. Automatica 2007, 43, 1532–1542. [Google Scholar] [CrossRef]
  39. Ge, Z.; Yang, C.; Song, Z. Improved kernel PCA-based monitoring approach for nonlinear processes. Chem. Eng. Sci. 2009, 64, 2245–2255. [Google Scholar] [CrossRef]
  40. Deng, X.; Cai, P.; Deng, J.; Cao, Y.; Song, Z. Primary-auxiliary statistical local kernel principal component analysis and its application to incipient fault detection of nonlinear industrial processes. IEEE Access 2019, 7, 122192–122204. [Google Scholar] [CrossRef]
  41. Ma, H.; Hu, Y.; Shi, H. A novel local neighborhood standardization strategy and its application in fault detection of multimode processes. Chemom. Intell. Lab. Syst. 2012, 118, 287–300. [Google Scholar] [CrossRef]
Figure 1. Flow chart of disturbance detection by SLCVAkNN.
Figure 1. Flow chart of disturbance detection by SLCVAkNN.
Machines 09 00272 g001
Figure 2. Data waveform collected from 904 line exit. (a) Three-phase electric field intensity; (b) Three-phase current.
Figure 2. Data waveform collected from 904 line exit. (a) Three-phase electric field intensity; (b) Three-phase current.
Machines 09 00272 g002
Figure 3. PCA monitoring results on the DATA-2 case.
Figure 3. PCA monitoring results on the DATA-2 case.
Machines 09 00272 g003
Figure 4. PCAkNN monitoring results on the DATA-2 case.
Figure 4. PCAkNN monitoring results on the DATA-2 case.
Machines 09 00272 g004
Figure 5. CVAkNN monitoring results on the DATA-2 case.
Figure 5. CVAkNN monitoring results on the DATA-2 case.
Machines 09 00272 g005
Figure 6. SLCVAkNN monitoring results on the DATA-2 case.
Figure 6. SLCVAkNN monitoring results on the DATA-2 case.
Machines 09 00272 g006
Figure 7. CVAkNN monitoring results on the DATA-6 case.
Figure 7. CVAkNN monitoring results on the DATA-6 case.
Machines 09 00272 g007
Figure 8. SLCVAkNN monitoring results on the DATA-6 case.
Figure 8. SLCVAkNN monitoring results on the DATA-6 case.
Machines 09 00272 g008
Table 1. The collected industrial data sets.
Table 1. The collected industrial data sets.
No.DescriptionDST
DATA-1Data set collected from line 904 exit446
DATA-2Data set collected from line 906, pole 116-3456
DATA-3Data set collected from line 906, pole 90-2445
DATA-4Data set collected from line 906, pole 151-5458
DATA-5Data set collected from line 906, pole 97-1452
DATA-6Data set collected from line 906 exit493
DATA-7Data set collected from line 907 exit420
Table 2. The disturbance detection rate of PCA, PCAkNN, CVAkNN, and SLCVAkNN for the tested data sets.
Table 2. The disturbance detection rate of PCA, PCAkNN, CVAkNN, and SLCVAkNN for the tested data sets.
NO.PCAPCAkNNCVAkNNSLCVAkNN
T 2 Q DT 2 DQ DT 2 DQ DT 2 DQ
DATA-17.50%70.47%26.18%90.89%96.76%64.39%96.83%89.48%
DATA-24.43%29.52%3.85%57.76%92.51%49.71%97.25%96.80%
DATA-35.99%83.06%19.93%97.81%100.00%100.00%97.85%97.97%
DATA-45.15%10.18%8.54%26.78%96.60%51.93%97.36%96.79%
DATA-548.78%96.05%82.81%99.54%100.00%100.00%98.06%98.29%
DATA-64.39%77.44%5.49%92.20%88.51%41.20%97.37%97.25%
DATA-77.95%84.99%11.09%95.97%93.49%55.33%97.47%97.47%
Average12.03%64.53%22.56%80.13%95.41%66.08%97.46%96.29%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, S.; Tian, Y.; Deng, X.; Cao, Q.; Wang, L.; Sun, P. Disturbance Detection of a Power Transmission System Based on the Enhanced Canonical Variate Analysis Method. Machines 2021, 9, 272. https://doi.org/10.3390/machines9110272

AMA Style

Wang S, Tian Y, Deng X, Cao Q, Wang L, Sun P. Disturbance Detection of a Power Transmission System Based on the Enhanced Canonical Variate Analysis Method. Machines. 2021; 9(11):272. https://doi.org/10.3390/machines9110272

Chicago/Turabian Style

Wang, Shubin, Yukun Tian, Xiaogang Deng, Qianlei Cao, Lei Wang, and Pengxiang Sun. 2021. "Disturbance Detection of a Power Transmission System Based on the Enhanced Canonical Variate Analysis Method" Machines 9, no. 11: 272. https://doi.org/10.3390/machines9110272

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop