Static Voltage Stability Assessment Using a Random UnderSampling Bagging BP Method

Zhu, Zhujun; Zhang, Pei; Liu, Zhao; Wang, Jian

doi:10.3390/pr10101938

Open AccessArticle

Static Voltage Stability Assessment Using a Random UnderSampling Bagging BP Method

by

Zhujun Zhu

¹,

Pei Zhang

¹,

Zhao Liu

^1,*

and

Jian Wang

²

¹

School of Electrical Engineering, Beijing Jiaotong University, Beijing 100044, China

²

College of Energy and Electrical Engineering, Hohai University, Nanjing 211100, China

^*

Author to whom correspondence should be addressed.

Processes 2022, 10(10), 1938; https://doi.org/10.3390/pr10101938

Submission received: 22 August 2022 / Revised: 17 September 2022 / Accepted: 19 September 2022 / Published: 26 September 2022

(This article belongs to the Special Issue Modeling, Analysis and Control Processes of New Energy Power Systems)

Download

Browse Figures

Versions Notes

Abstract

:

The increase in demand and generator reaching reactive power limits may operate the power system in stressed conditions leading to voltage instability. Thus, the voltage stability assessment is essential for estimating the loadability margin of the power system. The grid operators urgently need a voltage stability assessment (VSA) method with high accuracy, fast response speed, and good scalability. The static VSA problem is defined as a regression problem. Moreover, an artificial neural network is constructed for online assessment of the regression problem. Firstly, the training sample set is obtained through scene simulation, power flow calculation, and local voltage stability index calculation; then, the class imbalance problem of the training samples is solved by the random under-sampling bagging (RUSBagging) method. Then, the mapping relationship between each feature and voltage stability is obtained by an artificial neural network. Finally, taking the modified IEEE39 node system as an example, by setting up four groups of methods for comparison, it is verified that the proposed method has a relatively ideal modeling speed and high accuracy, and can meet the requirements of power system voltage stability assessment.

Keywords:

static voltage stability; machine learning; class imbalance problem; random under-sampling; bagging; artificial neural network

1. Introduction

Voltage stability [1] is the main limiting factor for the safe and reliable operation of power systems. With continued load growth and the penetration of new energy sources, modern power systems have been pushed to operate closer to their voltage stability limits. Over the past few decades, great efforts have been devoted to investigating the mechanisms of voltage instability and developing effective voltage stability assessment (VSA) methods [2].

Generally, voltage profiles show no anomalies before undergoing a voltage collapse due to load changes. Voltage stability margin (VSM) is a static voltage stability index that quantifies how “close” a particular operating point is to the point of voltage collapse [3]. Therefore, the VSM can be used to estimate the steady-state voltage stability limit of a power system. Knowing voltage stability margins is critical for utilities to operate their systems safely and with reliability. The system operator must provide an accurate and fast method to predict the voltage stability margin to initiate the necessary control actions [4].

That proposed a static voltage stability prediction method based on gradient boosting, which has better prediction accuracy [5]. However, its training set data are obtained through the calculation of the cumulative probability function (CPF), which is only applicable to a fixed load power factor case. Ghiocel et al. [6] proposed a new method to directly eliminate the singularity by reformulating the power flow problem. The central idea is to introduce an AQ bus in which the bus angle and the reactive power consumption of a load bus are specified. However, the computation burden is still heavy, and the solution speed cannot meet the requirement of real-time assessment.

With the boom of wide-area measurement systems in smart grids [7,8,9], the availability of large amounts of data acquired by phasor measurement units (PMUs) presents a huge opportunity for data-driven stability assessments. Great efforts have been made to perform such tasks through machine learning techniques.

In [10], a static stability assessment method for a power system based on a decision tree algorithm is proposed, which improves the assessment speed. However, there is no countermeasure for the decision tree over-fitting problem. Lai et al. [11] proposed a transient voltage stability assessment model based on convolutional neural networks, which improves the assessment speed by using statistical analysis for data dimensionality reduction. However, relying only on statistical analysis for data dimensionality reduction, it is easy to ignore individual features. Liang et al. proposed a random forest model for static voltage stability assessment, which makes up for the assessment defects of a single decision tree [12]. However, the selection of features is based on subjective judgment. The voltage stability assessment problem is treated as a classification problem of machine learning, making it difficult to accurately know the degree of voltage stability.

A common feature of these machine learning-based efforts is that they assume that the learning dataset can be generated by system simulations in the desired quantity [13]. Since accurate simulation and modeling, especially load modeling, are considered a great challenge in power systems, errors are inevitably introduced into the learning dataset. Preferably, the learning dataset can be obtained from PMU records, which will significantly improve the quality and reliability of the knowledge base. However, learning machines are likely to suffer from severe class imbalance problems. The system remains stable after most disturbances and becomes unstable only in a few cases. If not handled properly, this imbalance can greatly deteriorate the performance of the learning machine, and the minority class will be ignored and thus leading to misjudging. The class imbalance problem exists not only in the field of power systems, but also widely in other academic and industrial contexts, such as credit fraud detection, biomedical diagnosis, equipment fault diagnosis, and Internet intrusion [14].

Faced with class imbalance problems [15], considerable efforts have been made by machine learning researchers to deal with them [16,17]. Synthetic sampling is the most commonly used method for rebalancing class distributions. However, it cannot be directly applied to voltage stability assessment. Because datasets created by naive replication or linear interpolation may not exist in practice. Besides sampling-related techniques, some cost-sensitive tricks are proposed to build cost-sensitive classifiers. By attaching costs to different classes, these techniques manage to enhance minority learning by drawing more attention to minority classes [18].

To meet the requirements of voltage stability assessment and solve the problem of class imbalance and poor model generalization in machine learning, an online assessment method of static voltage stability using the RUSBagging method is proposed. The method differs from other methods in that:

The problem of VSA is defined as a machine learning regression problem, which is helpful for grid operators to observe the voltage stability state of the power system.

The bagging method of the ensemble framework is used to build the model to improve the generalization ability of the model.

The random under-sampling method is added to bagging, which solves the class imbalance problem to a certain extent and improves the assessment accuracy on minority class samples.

2. Local Voltage Stability Index

Commonly used static voltage stability indexes are [19,20]: the Jacobi singular value index, voltage sensitivity index, load margin index, VCPI index, and local voltage stability index. Compared to other voltage stability indices, the local voltage stability index (L index), which can give normalized index values for different systems, and which is not limited by the randomness of the direction of load growth, are highly applicable and highly accurate.

By the KCL law (Kirchhoff’s current law) there is

Y V = I

, where Y stands for node admittance, V stands for node voltage, and I stands for node current. In addition, according to the value of the node injection current, the network nodes are divided into generator nodes, load nodes, and contact nodes, and the equations of the node network after the division are as follows.

[\begin{array}{l} I_{G} \\ I_{L} \\ 0 \end{array}] = [\begin{matrix} Y_{G G}^{'} & Y_{G L}^{'} & Y_{G K}^{'} \\ Y_{L G}^{'} & Y_{L L}^{'} & Y_{L K}^{'} \\ Y_{K G}^{'} & Y_{K L}^{'} & Y_{K K}^{'} \end{matrix}] [\begin{array}{l} V_{G} \\ V_{L} \\ V_{K} \end{array}]

(1)

where

V_{G}

and

I_{G}

are the voltage and current vectors at the generator node,

V_{L}

and

I_{L}

are the voltage and current vectors at the load node, and

V_{K}

is the voltage vector at the contact node.

By eliminating the contact nodes, the remaining nodes in the network are divided into the set of generator nodes (

α_{G}

) and the set of load nodes (

α_{L}

), and Equation (1) can be transformed as:

[\begin{array}{l} I_{G} \\ I_{L} \end{array}] = [\begin{matrix} Y_{G G} & Y_{G L} \\ Y_{L G} & Y_{L L} \end{matrix}] [\begin{array}{l} V_{G} \\ V_{L} \end{array}]

(2)

where

Y_{G G} = Y_{G G}^{'} - Y_{G K}^{'} Y_{K K}^{' - 1} Y_{K G}^{'}

,

Y_{G L} = Y_{G L}^{'} - Y_{G K}^{'} Y_{K K}^{' - 1} Y_{K L}^{'}

,

Y_{L G} = Y_{L G}^{'} - Y_{L K}^{'} Y_{K K}^{' - 1} Y_{K G}^{'}

,

Y_{L L} = Y_{L L}^{'} - Y_{L K}^{'} Y_{K K}^{' - 1} Y_{K L}^{'}

.

Substituting

Z_{L L} = Y_{L L}^{- 1}

into Equation (2) converts to:

[\begin{array}{l} I_{G} \\ V_{L} \end{array}] = [\begin{matrix} Y_{G G} - Y_{G L} Z_{L L} Y_{L G} & Y_{G L} Z_{L L} \\ - Z_{L L} Y_{L G} & Z_{L L} \end{matrix}] [\begin{array}{l} V_{G} \\ I_{L} \end{array}]

(3)

Reference [15] gives the local voltage stability index

L_{j}

for load node j:

L_{j} = \frac{| \sum_{i \in α_{L}} | \frac{Z_{j i}^{*} {\tilde{S}}_{i}}{Z_{j j}^{*} {\dot{V}}_{i}} | {\dot{V}}_{j} |}{V_{j}^{2} Y_{j j}} = \frac{| \sum_{i \in α_{L}} \frac{Z_{j i}^{*} {\tilde{S}}_{i}}{{\dot{V}}_{i}} |}{V_{j}}

(4)

where

{\dot{V}}_{i}

,

{\dot{V}}_{j}

are the voltage phases of nodes i, j respectively.

{\tilde{S}}_{i}

is the equivalent load of node i.

Z_{j i}^{*}

is the mutual impedance conjugate between loads j,i of the equivalent load impedance matrix

Z_{L L}

.

Y_{j j}

is the self-conductance of the j^th node of the equivalent load conductance matrix

Y_{L L}

.

The local voltage stability index for all load nodes in the network forms the overall system stability index vector

L = [L_{1}, L_{2}, \dots L_{n}]

,

n \in α_{L}

and the maximum index value for the load is selected to define the voltage stability index for the system.

L = {‖ L ‖}_{\infty}

(5)

The relationship between local voltage stability index and system voltage stability is [21]:

L < 1, system voltage stability.

L = 1, system voltage critical stability.

L > 1, system voltage instability.

3. Random Under-Sampling Bagging BP Method for VSA

3.1. BP Neural Network for Regression of VSA

Since the VSA problem is defined as a machine learning regression problem in this article, a model is built to implement the regression. The back-propagation (BP) neural network model is easy to build, has a wide range of adaptability, and the algorithm is easy to implement. Hence, BP neural network is selected to solve this regression problem.

3.1.1. Model of BP Neural Network for Regression of VSA

Neural networks have an adaptive character, changing the weight values during the training process to suit different requirements [22,23]. The internal neural network can be divided into input, hidden and output layers according to the different functional layers.

For regression of the VSA problem, the input variables are the operating states of the power system, such as nodal voltage, branch power flow, and load demand. The output of the model is the L index, which is the result of the voltage stability assessment.

Where X is the input column vector;

x_{i}

is the element in row i. W is the weight matrix; specifically, an element of W can be represented by

w_{f, i j}

. The subscript f indicates the corresponding layer, and the subscript ij indicates the connection between node i in this layer and node j in the next layer. Y is the output column vector;

y_{i}

is the element in row i.

Σ

is the summation symbol, which sums multiple input signals;

φ_{i}

is the activation function of the i-th neuron in the hidden layer and

ϕ_{i}

is the i-th neuron activation function in the output layer;

θ_{i}

is the i-th neuron threshold in the hidden layer and

b_{i}

is the i-th neuron threshold in the output layer.

Neural networks use a large number of hidden layer neurons for data flow processing and network training. Without loss of generality, the neural network data stream processing process is briefly described using a single neuron in Figure 1 as an example.

Assuming that the output of the i-th neuron in the hidden layer is, the output of the i^th neuron can be represented by Equation (6) from Figure 1.

o_{i} = φ_{i} (\sum_{j = 1}^{n} w_{1, i j} \cdot x_{j} + θ_{i})

(6)

Similarly, it can be deduced that the output of the i^th neuron in the output layer is:

y_{i} = ϕ_{i} (\sum_{j = 1}^{m} w_{2, i j} \cdot o_{j} + b_{i})

(7)

3.1.2. Algorithm of BP Neural Network for Regression of VSA

The learning process consists of two processes: forward propagation of the signal and backward propagation of the error. The forward propagation process is shown in Equations (6) and (7). If the actual output of the output layer is not equal to the label value, then it is transferred to the error backpropagation process. The core of the BP neural network is the error back propagation process.

The error back propagation is to backpropagate the output error in a certain form through the hidden layer, and the error is apportioned to all neurons in each layer according to certain rules. To obtain the error signal of the neurons in each layer, and use it as the basis for correcting the weights of each neuron, the specific correction method is shown in Equations (8) to (11). The process of weight correction is also the learning process of the network. In general, this process continues until the network output error is within the set range or until a predetermined learning time or iterations.

Δ w_{2, i j} = η \sum_{p = 1}^{P} \sum_{k = 1}^{K} (T_{k}^{p} - o_{k}^{p}) {ϕ^{'}}_{i} o_{j}

(8)

Δ b_{i} = η \sum_{p = 1}^{P} \sum_{k = 1}^{K} (T_{k}^{p} - o_{k}^{p}) {ϕ^{'}}_{i}

(9)

Δ w_{1, i j} = η \sum_{p = 1}^{P} \sum_{k = 1}^{K} (T_{k}^{p} - o_{k}^{p}) {φ^{'}}_{i} w_{2, i j} {ϕ^{'}}_{j} x_{j}

(10)

Δ θ_{i} = η \sum_{p = 1}^{P} \sum_{k = 1}^{K} (T_{k}^{p} - o_{k}^{p}) {φ^{'}}_{i} w_{2, i j} {ϕ^{'}}_{j}

(11)

where

Δ w_{2, i j}

is the weight correction between the i-th neuron in the hidden layer to the j-th neuron in the output layer.

Δ b_{i}

is the threshold correction for the i-th neuron in the output layer.

Δ w_{1, i j}

is the weight correction between the i-th neuron in the input layer to the j-th neuron in the hidden layer.

Δ θ_{i}

is the threshold correction for the i-th neuron in the hidden layer. p is the sample index and P is the total number of training samples.

η

is the weight correction learning rate.

T_{k}^{p}

is the expected output value of the k^th output neuron for the p^th sample data.

3.2. Random Under-Sampling Bagging for Improving Model Accuracy of VSA

In the actual operation of the power system, the system is in a stable state in most cases. Stable data is far more than unstable data or critically stable data, which is a typical data imbalance problem. In fact, for the stable operation of the power system, the value of unstable or critically stable sample data is higher than that of stable sample data. Therefore, solving the data imbalance problem in the voltage stability assessment of the power system has a positive effect on improving the accuracy of the model in the critical stable operating state.

3.2.1. Random Under-Sampling Method for Solving Class Imbalance Problem in VSA

Data sampling is a type of data preprocessing method, which can solve the learner bias problem caused by data imbalance to a certain extent. Data sampling is generally divided into two categories: under-sampling and over-sampling. Over-sampling achieves the balance of original skewed data by introducing a new minority of instances, while under-sampling does the opposite. However, the over-sampling method will generate the wrong samples, which damages the learning result of minority samples [23]. Therefore, an under-sampling method is selected in this article. The schematic diagram of random under-sampling is shown as Figure 2.

To generate a balanced data set for training, the under-sampling method is used to resample the original set. Assuming that the size of the resampled data sets is S, where

S \leq N_{P} \times 2

,

N_{P}

is the size of the majority set P. Randomly sample the number of instances from both the majority set P and the minority set N without replacement and put them into the new training data set

D

[24].

To make sure that every subset

D_{i}

for training is relatively independent and as many instances of the original sets as possible are covered, a concept of overlap rate was proposed in [25]. The overlap rate of two data sets is defined as follows:

Given two data sets

D_{1}

,

D_{2}

with the size of

M

,

M_{S}

is the number of the same samples in

D_{1}

and

D_{2}

, so the overlap rate

R_{0}

of

D_{1}

and

D_{2}

is:

R_{0} (D_{1}, D_{2}) = \frac{M_{S}}{M}

(12)

The threshold value

R_{Threshold}

is set to limit the subset

D_{t}

obtained from the t under-sampling by:

R_{0} (D_{t}, D_{i}) < R_{Threshold}, i = 1, 2, \dots, t - 1

(13)

3.2.2. Bagging with Random Under-Sampling for Improving Model Accuracy of VSA

Bagging is the abbreviation of bootstrap AGGregatING, the representative of the parallel ensemble learning method. Multiple different training sets are constructed by the method of bootstrap sampling (re-sampling). Then the corresponding weak learners are trained in each training set. Finally, the final model after the aggregation of the weak learners is obtained [26].

For the bagging method, each weak learner uses the same model. It is necessary to distinguish the training data sets of each weak learner. If the training data set is directly divided and different weak learners are trained on each subset, the weak learner will miss the key information in the original training set, which limits the performance of the weak learner.

The proposal of bootstrap sampling solves the above problems well. This sampling method ensures the independence of different training subsets as much as possible while using more samples. Specifically, the method of repeatable sampling is adopted. There are repeated samples in these samples, so they are the true subset of the original data set. Assuming that the probability of each sample being sampled during the sampling process is equal, it is not difficult to calculate that when the number of samples n is large, about 63.2% of the original samples will be drawn in one bootstrap. Use bootstrap to sample M times to obtain M sample sets, and build a basic learner on each sample set to obtain M different learners.

Another step in bagging is model aggregation. For classification problems, the one with the largest proportion of the results of M weak learners is selected as the final classification result by voting; for regression problems, the outputs of M weak learners are averaged. Table 1 is a brief flow of the bagging method, which can be used for both classification and regression.

However, the original bagging does not take into account the imbalance problem. The data set of every training group is still imbalanced, and integration does not contribute to solving the imbalance problem. After the bootstrap sampling method, the random under-sampling method is added to improve the applicability of bagging to imbalanced data.

Figure 3 shows a schematic diagram of the framework of the RUSBagging method. First, the training set is randomly sampled to form multiple training subsets with differences in data characteristics. Then, using random under-sampling to obtain balanced data subsets. The weak learner is trained based on each balanced subset. Finally, the results are synthesized to obtain the final comprehensive result. After training, the test set is used to test the training effect of the model.

4. Modeling of Static Voltage Stability Assessment Based on Machine Learning

Based on power flow calculation and local voltage stability index calculation, the static voltage stability assessment problem of the power system is treated as a supervised machine learning problem. With the help of the machine learning method, the mapping relationship between the operating state and voltage stability is mined. The idea frame diagram is shown in Figure 4.

In the framework shown in Figure 4, there are mainly four parts: scene generation, sample generation, model building, and model training. The scene simulation is carried out considering the characteristics of the actual operation scene. In addition, the power flow calculation is performed on the simulated scene. The power flow calculation result is a feature variable of the sample corresponding to the scene. Based on the power flow calculation result, the local voltage stability index is calculated too. The index value is the corresponding label value (true value) of the sample in the scene.

The scenario simulation mainly considers the following factors: load demand, generator status, and new energy output power. Specifically, for the load demand, there are heavy load demand and light load demand. For the generator status, the generator does not reach the limit of reactive power and the reactive power of some units reaches the limit. There are two reasons for considering this factor: First, when the generator node transforms into a PQ node, the voltage stability state of the system will change abruptly. Second, the calculation of the L index needs to determine the type of system nodes in advance. When the type of node changes, the L index calculation model needs to be updated. For the output of new energy, the node where the unit is located is regarded as the PQ node. The load side also considers the batch connection to the grid and withdraws from the grid of electric vehicles.

The above factors only consider typical scenarios, so the number of scenarios is limited. Therefore, in the simulation, a mixed simulation of various factors is adopted to expand the number of scenarios. After the scenario simulation is completed, the power flow calculation is carried out for each scenario to obtain the voltage amplitude and phase angle of each node, the active and reactive power output of the generator, and the line power flow. These power flow calculation results and the load demand together constitute the features of samples. The L index corresponds to the features of samples packaged into complete training data. Since the L index can be directly calculated based on the power flow state, it does not require continuous power flow calculation like PV analysis, so the sample collection speed is very fast. The training set, validation set, and test set are randomly selected according to the ratio of 90%, 5%, and 5%.

BP network belongs to supervised learning [27]. In the process of neural network modeling, the selection of activation function, loss function, and the optimization algorithm is required. In the design of hyperparameters, such as the number of hidden layers, the number of neurons in the hidden layer, the number of parallel BP networks, etc., it needs to be set according to specific problems, and these hyperparameters rely more on empirical values.

(1): Activation function

The activation function is the key to the nonlinear mapping function of the neural network. Common activation functions include Sigmoid, ReLU classes (ReLU, LReLU, RReLU), Tanh and Softmax. Through the nonlinearization of the input data by the above activation function, combined with the deep superposition of the neural network, the fitting of the nonlinear function is realized. In this paper, the LReLU activation function is selected as the activation function of the BP network, because the dead zone of LReLU has a small range. At the same time, LReLU can effectively avoid the problem of gradient disappearance, and also alleviate the problem of neuron death of ReLU, which is beneficial to the neural network. The curve of LReLU is shown in Figure 5. The expression of the LReLU activation function is:

f (x) = \max {0.1 x, x}

(14)

The domain of the LReLU function is negative infinity to positive infinity. LReLU alleviates the problem of ReLU neuron death and solves the problem that some neurons cannot be activated.

(2): Loss function

The loss function is used to measure the difference between the output value of the model and the true value of the sample. Through the back-propagation process, the loss function is minimized, the weight of the network is corrected, and the gap between the output value of the model and the true value of the sample is continuously narrowed to achieve network convergence.

For different learning models, such as regression models and classification models, the type of loss function needs to be selected. For the classification model, the cross-entropy loss function is generally used. For the regression model, the mean square error loss function is generally used. In this paper, the problem of voltage stability assessment is defined as a regression problem, so the mean square error function is chosen as the loss function.

E = \frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}

(15)

where

E

represents the output value of the loss function, N represents the number of samples used in a parameter update process, and

y_{i}

and

{\hat{y}}_{i}

represent the true value and predicted value of the i^th sample label, respectively.

(3): Optimization algorithm

After completing the construction of the loss function, it is necessary to implement parameter correction through the optimization algorithm. The deep learning optimization algorithm mainly includes the basic optimization algorithm and the adaptive parameter optimization algorithm. The representative algorithm of the basic optimization algorithm is the stochastic gradient descent method, which keeps the learning rate unchanged during the training process. It cannot dynamically adapt to the training requirements. In addition, it is easy to fall into the local optimum point. The representative algorithm of the adaptive parameter optimization algorithm is Adam. The learning rate is gradually attenuated to better adapt to the training requirements as the learning progresses, shorten the training time, and improve the training effect. In this paper, the Adam algorithm [28] is used as the optimization algorithm for network training.

{\begin{cases} m_{t} = μ \cdot m_{t - 1} + (1 - μ) \cdot g_{t} \\ n_{t} = v \cdot n_{t - 1} + (1 - v) \cdot g_{t}^{2} \\ {\hat{m}}_{t} = m_{t} / (1 - μ) \\ {\hat{n}}_{t} = n_{t} / (1 - v) \\ Δ θ_{t} = - {\hat{m}}_{t} \cdot η \cdot g_{t} / \sqrt{n_{t} + ε} \end{cases}

(16)

In the formula,

g_{t}

represents the gradient,

m_{t}

represents the first-order moment estimation of the gradient,

n_{t}

represents the second-order moment estimation of the gradient,

{\hat{m}}_{t}

and

{\hat{n}}_{t}

represent the corrected values of

m_{t}

and

n_{t}

, respectively.

μ

and

v

represent the first-order momentum and the second-order momentum, respectively. Momentum coefficient,

ε

means avoiding smoothing terms with 0 denominators,

η

means learning rate. The standard settings for

μ

and

v

are 0.9 and 0.999, respectively, and the default value for

η

is 0.001. A satisfactory training effect can be obtained by applying this set of hyperparameters during the training process, and no special adjustment is generally required.

5. Results

To verify the effectiveness of the proposed model, the modified IEEE39 system case is taken as an example, as shown in Figure 6. The IEEE39 system [29] has 39 nodes, 19 load nodes, 10 thermal power units, and 46 branches (including transformers) with the following modifications: replacing the thermal power generator on bus-39 with wind turbines with a capacity of 650 MVA and removing the load of bus-39.

The neural network model is built based on the Pytorch framework, and the power system simulation is performed based on the PSSE simulation platform. An analysis is developed from the perspectives of model training time, mean square error (MSE), and mean absolute percentage error (MAPE).

In machine learning, MSE is generally used as the error of model training, and it is used as the objective function to update the parameters. The expression of MSE is shown in Formula (17):

MSE = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i}^{'} - y_{i})}^{2}

(17)

where

y_{i}^{'}

is the predicted value of the i^th sample, and

y_{i}

is the true value of the i^th sample. The advantage of MSE is to amplify extreme errors and avoid huge deviations in the model. The disadvantage is that it is not intuitive and it is difficult to explain its meaning after squaring.

In order to intuitively reflect the difference between the actual value and the predicted value, there is MAPE, which is expressed as Formula (18):

MAPE = \frac{1}{n} \sum_{i = 1}^{n} \frac{| y_{i}^{'} - y_{i} |}{y_{i}}

(18)

The value of MAPE is intuitive and has a clear meaning, but when the actual value is very small, it is easy to produce misleading information. Therefore, MAPE is generally not used for the loss function of regression problems with small real values, but it can be used as a more intuitive method to measure the model error.

In summary, MSE is used to evaluate the overall performance of the model, and MAPE is used to evaluate the performance of the model on batch instances.

The training time of different methods and the MSE and MAPE error of each method on the test set are shown in Table 2.

SVR stands for support vector regression. The test set is divided into multiple batches of data, and each batch of data is calculated to obtain the batch MAPE index and plot the results, as shown in Figure 7.

As shown in Figure 7, the errors of the four methods on the test set are all small, and the advantage of method 1 is not obvious. This is because the test set is also a class-imbalanced data set, so the advantage of the under-sampling method is not prominent on the whole test set.

To further illustrate the applicability of the proposed method to the class imbalance problem, the minority class samples in the test set are screened out, and then four methods are used for comparison based on the minority test set. The results are shown in Table 3.

As shown in Figure 8, on the screened test set, the proposed method has obvious advantages over other methods. It has a lower error on minority class samples. Compared with methods 3 and 4, method 2 also shows the adaptability to the class imbalance problem to a certain extent. This should be credited to the bagging framework.

6. Conclusions

The comparative analysis proves that the proposed static voltage stability assessment method not only has high accuracy, strong adaptability, and short modeling time but also has the following advantages:

(1): In the sample preparation stage, various operating scenarios such as load demand, new energy output power, and EV status were considered, which greatly improved the scenario applicability of the model.
(2): Defining the static voltage stability assessment problem as a regression problem of machine learning, which essentially improves the model assessment accuracy, provides voltage stability information with a quantitative index, and helps grid operators to better observe the grid state.
(3): Using the random under-sampling bagging framework provides a method to solve the problem of imbalanced data in the field of power system operating, which comprehensively improves the accuracy of the model.

Author Contributions

Conceptualization, P.Z. and J.W.; methodology, P.Z. and Z.L.; software, Z.Z. and Z.L.; validation, Z.Z. and J.W.; formal analysis, P.Z. and Z.L.; investigation, J.W. and Z.L.; resources, P.Z.; data curation, Z.Z. and J.W.; writing—original draft preparation, Z.Z.; writing—review and editing, P.Z. and J.W.; visualization, Z.Z.; supervision, P.Z.; project administration, P.Z.; funding acquisition, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Nature Science Foundation of China, grant number 52107068.

Data Availability Statement

Not applicable.

Acknowledgments

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kundur, P. Power System Stability and Control; McGraw-Hill: New York, NY, USA, 1993. [Google Scholar]
Van Cutsem, T.; Vournas, C. Voltage Stability of Electric Power Systems; Kluwer: Cambridge, UK, 1998. [Google Scholar]
Srivastava, L.; Singh, S.N.; Sharma, J. Estimation of loadability margin using parallel self-organizing hierarchical neural network. Comput. Electr. Eng. 2000, 26, 151–167. [Google Scholar] [CrossRef]
Suganyadevi, M.V.; Babulal, C.K. Online Voltage Stability Assessment of Power System by Comparing Voltage Stability Indices and Extreme Learning Machine; Springer: Berlin, Germany, 2013. [Google Scholar]
Qiang, W.; Hao, C.; Lian, L. Static Voltage Stability Margin Prediction Based on Natural Gradient Boosting and Its Influencing Factors Analysis. In Proceedings of the CSU-EPSA, Prague, Czech Republic, 23–25 June 2022; pp. 1–9. [Google Scholar]
Ghiocel, S.G.; Chow, J.H. A Power Flow Method Using a New Bus Type for Computing Steady-State Voltage Stability Margins. IEEE Trans. Power Syst. 2014, 29, 958–965. [Google Scholar] [CrossRef]
Luo, F.; Dong, Z.Y.; Chen, G.; Xu, Y.; Meng, K.; Chen, Y.Y. Advanced pattern discovery-based fuzzy classification method for power system dynamic security assessment. IEEE Trans. Ind. Informat. 2015, 11, 416–426. [Google Scholar] [CrossRef]
Gholami, M.; Sanjari, M.J.; Safari, M.; Akbari, M.; Kamali, M.R. Static security assessment of power systems: A review. Int. Trans. Electr. Energy Syst. 2020, 30, e12432. [Google Scholar] [CrossRef]
Liu, S.; Liu, L.; Yang, N.; Mao, D.; Shi, R. A data-driven approach for online dynamic security assessment with spatial-temporal dynamic visualization using random bits forest. Int. J. Electr. Power Energy Syst. 2021, 124, 106316. [Google Scholar] [CrossRef]
Ding, C.; Zhang, P.; Meng, X.; Li, W.; Wang, Y. Online Evaluation on Static Voltage Stability Margin Based on Classification and Regression Tree Algorithm. Proc. CSU-EPSA 2020, 32, 93–100. [Google Scholar]
Wenqing, L.; Qingwu, G.; Chunhui, G.; Liu, H.; Liu, W.; Liuchuang, W.U. Transient voltage stability evaluation based on feature and convolutional neural network. Eng. J. Wuhan Univ. 2019, 52, 815–823. [Google Scholar]
Xiurui, L.; Daowei, L.; Hongying, Y. Data-Driven Situation Assessment of Power System Static Voltage Stability. Electr. Power Constr. 2020, 41, 126–132. [Google Scholar]
Zhu, L.; Lu, C.; Dong, Z.Y. Imbalance learning machine-based power system short-term voltage stability assessment. IEEE Trans. Ind. Inform. 2017, 13, 2533–2543. [Google Scholar] [CrossRef]
Johnson, J.M.; Khoshgoftaar, T.M. Survey on deep learning with class imbalance. J. Big Data 2019, 6, 1–54. [Google Scholar] [CrossRef]
Li, Y.; Zhang, M.; Chen, C. A Deep-Learning intelligent system incorporating data augmentation for Short-Term voltage stability assessment of power systems. Appl. Energy 2022, 308, 118347. [Google Scholar] [CrossRef]
Haixiang, G.; Yijing, L.; Shang, J.; Mingyun, G.; Yuanyue, H.; Bing, G. Learning from class-imbalanced data: Review of methods and applications. Expert Syst. Appl. 2017, 73, 220–239. [Google Scholar] [CrossRef]
Malave, N.; Nimkar, A.V. A survey on effects of class imbalance in data pre-processing stage of the classification problem. Int. J. Comput. Syst. Eng. 2020, 6, 63–75. [Google Scholar] [CrossRef]
Khan, S.H.; Hayat, M.; Bennamoun, M.; Sohel, F.; Togneri, R. Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 3573–3587. [Google Scholar] [PubMed]
Rao, N.A.; Vijaya, P.; Kowsalya, M. Voltage stability indices for stability assessment: A review. Int. J. Ambient. Energy 2021, 42, 829–845. [Google Scholar]
Salama, H.S.; Vokony, I. Voltage stability indices–A comparison and a review. Comput. Electr. Eng. 2022, 98, 107743. [Google Scholar] [CrossRef]
Kessel, P.; Glavitsch, H. Estimating the Voltage Stability of a Power System. IEEE Trans. Power Deliv. 2007, 1, 346–354. [Google Scholar] [CrossRef]
Wei, W.; Xu, Y. Deterministic convergence of an online gradient method for neural networks. J. Comput. Appl. Math. 2002, 144, 335–347. [Google Scholar]
Zhang, L.; Wang, F.; Sun, T.; Xu, B. A constrained optimization method based on BP neural network. Neural Comput. Appl. 2018, 29, 413–421. [Google Scholar] [CrossRef]
Shi, X.; Xu, G.; Shen, F. Solving the data imbalance problem of P300 detection via random under-sampling bagging SVMs. In Proceedings of the 2015 International Joint Conference on Neural Networks, Killarney, Ireland, 12–17 July 2015; pp. 1–5. [Google Scholar]
Błaszczyński, J.; Stefanowski, J. Actively balanced bagging for imbalanced data. In Proceedings of the International Symposium on Methodologies for Intelligent Systems, Graz, Austria, 23–25 September 2017; pp. 271–281. [Google Scholar]
Bühlmann, P. Bagging, boosting and ensemble methods. In Handbook of Computational Statistics; Springer: Berlin, Germany, 2012; pp. 985–1022. [Google Scholar]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning. MIT Press: Cambridge, UK, 2016. [Google Scholar]
Guan, N.; Lei, S.; Yang, C.; Xu, W.; Zhang, M. Delay Compensated Asynchronous Adam Algorithm for Deep Neural Networks. In Proceedings of the 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), Guangzhou, China, 12–15 December 2017; IEEE: Piscataway, NJ, USA, 2017. [Google Scholar]
Sriyanyong, P.; Song, Y.H. Unit commitment using particle swarm optimization combined with Lagrange relaxation. In Proceedings of the Power Engineering Society General Meeting, San Francisco, CA, USA, 16 June 2005. [Google Scholar]

Figure 1. Structure diagram of typical BP neural network.

Figure 2. Schematic diagram of random under-sampling.

Figure 3. Schematic diagram of the random under-sampling bagging method.

Figure 4. The framework for static VSA of power system based on RUSBagging.

Figure 5. The curve of LReLU.

Figure 6. Modified IEEE 39 bus system.

Figure 7. MAPE results of each method on the whole test set.

Figure 8. MAPE results of each method on the minority test set.

Table 1. Algorithm of the random under-sampling bagging method.

Input: dataset

D = {(x_{1}, y_{1}), (x_{2}, y_{2}), \dots, (x_{n}, y_{n})}

, weak learner algorithm, number of weak learner M
For i = 1, 2, …, M:
Using the bootstrap sampling method on the dataset

D

to generate a subsampled set

D_{i}

Using the random under-sampling method on the dataset

D_{i}

to generate a balanced subsample set

D_{i}^{'}

Training the ith weak learner

G_{i} (x)

with a balanced subsample set

D_{i}^{'}

End
Output: the final model

Table 2. Comparison of the performance of different methods on the whole test set.

Num	Methods	Train-Time/s	MSE	MAPE
1	BP-RUSBagging	312.44	6.0252 × 10⁻⁷	0.0011
2	BP-Bagging	290.35	4.7632 × 10⁻⁷	0.0014
3	BP	213.54	2.2391 × 10⁻⁶	0.0018
4	SVR	564.21	9.3797 × 10⁻⁶	0.0022

Table 3. Comparison of the performance of different methods on the minority test set.

Num	Methods	Train-Time/s	MSE	MAPE
1	BP-RUSBagging	312.44	1.7763 × 10⁻⁵	0.0167
2	BP-Bagging	290.35	8.1022 × 10⁻⁵	0.0284
3	BP	213.54	2.2391 × 10⁻⁴	0.0411
4	SVR	564.21	1.4397 × 10⁻³	0.0622

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhu, Z.; Zhang, P.; Liu, Z.; Wang, J. Static Voltage Stability Assessment Using a Random UnderSampling Bagging BP Method. Processes 2022, 10, 1938. https://doi.org/10.3390/pr10101938

AMA Style

Zhu Z, Zhang P, Liu Z, Wang J. Static Voltage Stability Assessment Using a Random UnderSampling Bagging BP Method. Processes. 2022; 10(10):1938. https://doi.org/10.3390/pr10101938

Chicago/Turabian Style

Zhu, Zhujun, Pei Zhang, Zhao Liu, and Jian Wang. 2022. "Static Voltage Stability Assessment Using a Random UnderSampling Bagging BP Method" Processes 10, no. 10: 1938. https://doi.org/10.3390/pr10101938

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Static Voltage Stability Assessment Using a Random UnderSampling Bagging BP Method

Abstract

1. Introduction

2. Local Voltage Stability Index

3. Random Under-Sampling Bagging BP Method for VSA

3.1. BP Neural Network for Regression of VSA

3.1.1. Model of BP Neural Network for Regression of VSA

3.1.2. Algorithm of BP Neural Network for Regression of VSA

3.2. Random Under-Sampling Bagging for Improving Model Accuracy of VSA

3.2.1. Random Under-Sampling Method for Solving Class Imbalance Problem in VSA

3.2.2. Bagging with Random Under-Sampling for Improving Model Accuracy of VSA

4. Modeling of Static Voltage Stability Assessment Based on Machine Learning

5. Results

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI