1. Introduction
Energy conversion is increasingly drawing attention due to the goals of the Kyoto Protocol and Paris Climate Agreement. Renewable energy sources are steadily being incorporated into distribution systems, and the systems are changing accordingly. However, with this transition, distribution systems can suffer problems such as voltage instability, thermal overload, and monitor distribution systems to ensure stable operation. However, meters installed in power distribution systems are often limited to substations or low-voltage networks [
1] due to the high cost of installing phasor measurement unit (PMU) necessary to make the most of a limited number of meters.
Traditionally, supervisory control and data acquisition (SCADA) systems have been used to observe the state of the system. A common task SCADA performs is state estimation that relies on unsynchronized and slow measurements [
2]. Measurements based on SCADA have some limitations and errors, such as slow duty cycle and measurement delay due to communication bias. To solve these problems and limitations, wide-area monitoring protection and control (WAMPAC) has been proposed [
3,
4]. The PMU is a key component of WAMPAC, facilitating real-time calculations and synchronized phasor measurements of voltages and currents in the power grid. PMUs can utilize global positioning systems (GPS) to achieve precision and accuracy, so their use is rapidly increasing to improve monitoring of power grids. However, since PMUs are expensive equipment, installing them in all locations would require an astronomical cost. In order to solve this problem, studies have been conducted to optimize the number and installation location of PMUs to secure system observability at the lowest cost.
There are numerous studies of optimal meter placement using heuristic algorithms in power systems. The authors of [
5] proposed heuristic search-based solutions to minimize the cost of installing meters by accepting a trade-off between estimation accuracy and meter installation costs. In [
6], a heuristic meter placement for load estimation is presented; this method can handle large distribution systems with only moderate computation requirements, making it easier to handle the actual constraints of meter failure backups, space availability, existing meters, and unbalanced distribution systems. Additionally, their approach [
6] provides a reasonable compromise in terms of the complexity of meter deployment for distribution planning and operation. In [
7], an ordinal optimization technique was introduced in an attempt to improve the estimation accuracy. According to the author, the simplicity and computational efficiency of this algorithm are much improved over other discovery-based search algorithms, making it suitable for practical purposes. In [
8], genetic algorithm (GA)-based solutions are provided to improve estimation accuracy. A number of studies have been conducted on the optimal placement (OP) of meters, with the intent of securing the observability of the system and thus increasing the estimation accuracy. In [
9], a GA technique is used to determine optimal meter placement, in which the optimal meter layout was identified for maintaining the system’s observability. In [
10], a GA was used for optimal meter placement, for static estimation of harmonic sources in the power system. GA techniques tend to show similar results to complete enumeration techniques. One of the conclusions of this study was that the superiority of GA techniques over complete enumeration increases as the number of meters or the system size used for placement increases. In [
11], binary particle swarm optimization (BPSO) techniques were used to optimally position PMUs. BPSO-based approaches have been used to investigate PMU placement with two main objectives: minimizing the number of PMUs required to maintain full observability of the system and maximize measurement redundancy for all buses in the system.
Numerous studies have been conducted on the placement of meters to secure observability or improve state estimation accuracy. However, the optimal meter placement for this purpose has constraints that require minimal meter guarantees. If it is impossible to achieve observability of the power system with only the installed meters, pseudo-measurements can be used. The basic method of generating pseudo-measurements is to use historical data from feeders and load profiles (LPs) [
12]. However, pseudo-measurements generated in this way tend to have poor accuracy and high variance in state estimations [
13]. Numerous studies have obtained more accurate pseudo-measurements using artificial intelligence (AI) techniques [
13,
14,
15,
16,
17,
18,
19,
20]. In [
14], optimal meter placement was achieved by estimating the voltage data via AI techniques; with their method, it is possible to quickly select the optimal meter position based on simple rules, with only the voltage magnitude estimates taken into account. The authors in [
13] proposed an artificial neural network (ANN)-based pseudo-measurement technique to estimate the active/reactive power of the line for state estimation in the distribution system. The proposed technique generated two ANN models that trained LPs and offline load flows, or historical data, to determine the active/reactive injection power.
In [
15], linear regression (LR), support vector machine, and feed-forward neural network techniques were used to estimate the voltage at the measured point, and the voltage and phase angle at the unmeasured point, for comparison of the results. Modeling with AI models tends to be more accurate than that based on LPs. It also enables better state estimation with fewer meters than is required to achieve observability or improve estimation accuracy. However, the basis for selecting the input bus was not considered. In [
16], pseudo-measurements were modeled using ANN and a Gaussian mixture model (GMM) in a low-voltage distribution system; the GMM was used to determine the variance of the pseudo-measurements generated by the ANN. In [
17], the author estimated pseudo-measurements using gradient-boosting tree techniques; this approach requires extensive training time, but can be performed offline as it uses only historical data. The trained model provides fast predictions. The main advantage is that the model is learned based on user-level measurement data, unlike the LP-based model, and generates more accurate pseudo-measurements. In addition, the model is built step-by-step, so it can be applied to distribution systems with frequent topology changes. Depending on the topology, a subset can be used to generate pseudo-measurements. In [
18], ANN and Fourier decomposition were used to model the pseudo-measurements’ typical active power/reactive power and the standard deviation profile. In [
19], an extreme learning machine-based pseudo-measurement modeling technique is proposed that uses injection power measured by the supervisory control and data acquisition as input, and real and imaginary parts of the bus voltage as output. This technique not only improves the accuracy of the state estimation, but also significantly reduces the computational time. In [
20], a convolutional neural network-based method for pseudo-measurement modeling in a distribution system state estimation was studied. This method improves the efficiency of the calculations. In many cases, pseudo-measurement modeling using AI techniques requires model generation through learning data. The meter’s position influences AI-based pseudo-measurement modeling, as data acquisition in the power system is primarily done by meters. However, as the number of nodes in the target system increases, or as the number of meters installed reaches half of the total number of nodes, the number of nodes required for exhaustive exploration also increases. Therefore, the study is needed to derive optimal meter placement through efficient exploration, for AI-based pseudo-measurement modeling.
In order to calculate voltage magnitude and phase angle of the buses by the power flow calculation, at least two parameters among active/reactive power, voltage magnitude, and phase angle of all buses are required. As obtaining sufficient parameter information needed for power flow calculation of all buses in a system lacking meters is impossible, power flow calculation cannot be used to calculate the parameters of unmeasured points. Even if the same unmeasured point is targeted, a different LR model is generated as the measured point is changed. It is difficult to determine which model is generated according to the measurement point because there are insufficient data to explain the relationship between the measurement point and the unmeasured point. Accordingly, approaches based on metaheuristics can find good solutions in less computational time, even for uncertain and highly complex problems like PMU placement for supervised learning-based pseudo-measurement modeling [
21].
Generating supervised learning models requires sufficient data to be used for learning and validation. In this study, the active/reactive power injected into each node was randomly generated with Gaussian noise (
= 25%), and power flow calculations were used to obtain voltage and phase angle data. Power flow calculations were implemented by pandapower, a Python open source library.
where
,
,
,
,
, and
are the active and reactive power of the
ith bus of the network, the elements of the conductance matrix and susceptance matrix, and the voltages of the
ith bus and
kth bus, respectively. Equations (
1) and (
2) were used to generate data for supervised learning model training. The details of this process can be found in [
15]. In this paper, two metaheuristic algorithms, GA and PSO, which have the most related papers among metaheuristic techniques on Google Scholar, were used [
22]. Optimal meter placement results were compared with random search (RS) [
23] and brute force (BF) [
24] techniques. The estimated pseudo-measurements based on OP were compared to pseudo-measurements based on LPs, in which each pseudo-measurement was used as a state variable for state estimation. The criterion for the optimal position is the total vector error (TVE) of all unmeasured points estimated by LR techniques. The main contributions of the proposed approach are: (i) Provides an approach for optimal PMU placement for supervised learning-based pseudo-measurement modeling techniques, which previous studies have not focused on. (ii) Even in highly complex systems, excellent placement can be found in a short time through effective search strategies of metaheuristic techniques. (iii) Various metaheuristic techniques can be applied using a proposed framework that is not limited to specific metaheuristic techniques. (iv) In [
15], only the performance of the estimated pseudo-measurement value was evaluated, but in this paper, the weighted least squares state estimation was performed considering the uncertainty of the pseudo-measurement value and compared with the load profile-based method. The rest of this paper consists of four sections. The pseudo-measurement modeling technique based on LR is described in
Section 2. Problem formulation and the application of metaheuristic techniques for optimal meter placement is introduced in
Section 3. Simulation results for comparison of optimal meter placement and the performance of the pseudo-measurements generated at that location are provided in
Section 4, and a summary and conclusion are provided in
Section 5.
3. Problem Formulation
In this section, the problem of PMU placement through metaheuristic techniques is formalized. Metaheuristic techniques explore PMU placement based on the TVE of unmeasured buses estimated by the LR model. Different LR models are generated depending on PMU placement, and the generated LR model estimates the parameters of the unmeasured point based on the parameters of the measured point. Estimating the model that will be generated is difficult because it requires a formula that clearly identifies the relationship between the parameters of the measured and unmeasured points. Metaheuristic methods can provide solutions to these problems through their own search strategies, whereas mathematical optimization techniques, such as linear programming and quadratic programming, are difficult to apply.
The criteria for assessing the accuracy of measurements of a PMU are defined in [
25]. The phasor value of a sine wave is represented by two values: the amplitude and the phase. TVE can be utilized as an indicator of a PMU’s performance by considering these two values together. TVE is calculated by comparing theoretical values with phasors of measurements. The expression is as follows (Equation (
4)):
where
,
are real and imaginary part value of phasor measured by PMU;
,
are real and imaginary part value of theoretical phasor calculated by power flow calculation. If the phasor of the unmeasured bus No.
n estimated by LR is inputted, the TVE of the bus is outputted.
The placement information and number of PMUs determine the data used to learn LR models. The values estimated by the learned LR model consist of the voltage magnitude and angle. Both are combined by TVE, making them one value. In other words, optimal PMU placement requires a solution to the nonlinear optimization problem that minimizes the value of the objective function, which depends on the placement and number of PMUs. The mathematical expression is the following (Equations (
5)–(
8)):
using Equation (
4), TVEs of each unmeasured buses can be obtained from Equation (
6). The metaheuristic algorithm adjusts
to reduce
.
where
is a vector consists of TVEs of each unmeasured buses, and
is placement of PMUs, the vector consists of locations where PMUs are installed.
is the number of buses PMU installed,
is the total number of particles (populations), and
is the number of unmeasured buses. In other words, a metaheuristic algorithm is used to find the best placement
that minimizes average TVE. Metaheuristic algorithms are performed with the goal of finding combinations with lower average TVE values as the number of iterations (generations) increases.
Figure 1 shows the overall conceptual diagram of optimal PMU placement framework using metaheuristic technique. Voltage magnitude and phase angle data obtained from placement derived by metaheuristic techniques are used as input data to LR model training. Subsequently, test sets are used to evaluate the TVE. This process is repeated until the termination condition is satisfied. In this paper, first, a vector with 100 random PMU placement combinations is generated. The value of the vector can be any bus number except for the slack bus, because it is assumed that the PMU is installed on the bus. After that, each vector goes to the supervised learning model creation process to generate a linear regression model that estimates the voltage and phase angle of a bus that is not installed based on the voltage and phase angle of the bus where the PMU is installed. This process is repeated until all 100 vectors have been performed. Then, the value of each vector is updated to find a vector with a lower average TVE value through the Eigen strategy of the metaheuristic algorithm. This corresponds to the crossover and mutation process in the case of GA, and update velocity in the case of PSO. This process is performed 20 times. After all 20 runs are completed, the optimal location is selected as the location corresponding to the vector with the lowest average TVE value among the vectors found while 100 vectors are updated 20 times. The linear regression model created based on this optimal position generates pseudo-measured values for unmeasured points, and is used as a state variable for state estimation along with the measured values at the point where the PMU is installed.
In
Figure 1, the box corresponding to the ’Metaheuristic algorithm’, any algorithm that outputs a vector containing the arrangement information of the PMU according to iterative execution can be used even if it is not GA or PSO. The box corresponding to the ’Supervised Learning Model’ can be applied not only to linear regression techniques but to any regression technique that can estimate the parameters of the unmeasured point with the measurement value of the PMU.
3.1. Genetic Algorithms
GA has been used in science and engineering as a computational model for adaptive algorithms and natural evolution systems to solve real-world problems [
26]. By simulating the survival-of-the-deficit evolution strategy of chromosomes, the optimal combination of PMU placement is derived by crossover and mutation processes.
GA has a crossover process among multiple objects, this process enables efficient exploration compared to simple random exploration. Furthermore, the mutation process can improve the phenomenon in which the solution falls into local optima, and is widely applied to solving nonlinear or computationally complex problems.
Figure 2 shows the overall process of GA. To determine optimal PMU placement, GA was applied as follows:
- 1.
Generate population: The combinations are chosen at random from a set of , where N is the total number of buses and is the number of PMU-installed buses. Selected buses are used as input buses for LR, which estimates pseudo-measurement.
- 2.
Evaluate average TVE: Using
voltage magnitude and phase angle data and active power and reactive power of the slack bus as inputs,
voltage magnitude and phase angle of unmeasured points are estimated, where
is a number of unmeasured buses. After that, convert the estimated voltage magnitude and phase angle for unmeasured points to TVE through Equation (
4). The evaluation of that combination is based on the average TVE of
unmeasured buses.
- 3.
Selection: The combinations with the lowest average TVE value based on fitness values and the combinations chosen at random as lucky survivors are chosen to be the parents of the next generation.
- 4.
Crossover: A total of combinations surviving from the selection process are grouped into to create five combinations for each pair through the crossover process. A total of 20 pairs produce 100 combinations, resulting in 100 new combinations. The crossover process was carried out by combining the PMU placement information of each pair and then choosing the number of random extraction.
- 5.
Mutation: of the generated combinations are selected as mutation and one of the information (bus position that PMU is installed) is changed to a random value. After completing this process, a new generation consisting of 100 combinations is born.
- 6.
Return to the “Evaluate average TVE” process. These processes are repeated until the th generation. The information (PMU position) of the gene that had the best value since the th generation is selected as the optimal PMU.
The parameters of GA are considered as shown in
Table 1.
3.2. Particle Swarm Optimization
Particle swarm optimization (PSO) is an optimization technique implemented by mimicking the collective behavioral characteristics of organisms such as birds and fish. The value of the candidate solution expressed as the position of the particle is found through iterative computation to find the optimal solution of the objective function, and the position of the candidate solution is updated through a simple equation as shown below (Equations (
9) and (
10)).
where
is the position vector of the
ith particle, and the new position
is updated reflecting the velocity at the existing position. In this paper,
consists of a combination of buses that placed PMUs.
is the velocity associated with the renewal of particles and is updated with reference to
Table 2, where
p is known as “personal best (
pbest)”. It means the coordinates of the best solution obtained so far by that specific individual, while
g is the “global best (
gbest)”, the overall best solution obtained by the swarm. The best placements of each particle are
p, and the best placement of all particles are
g in this problem. Values of cognitive factors (
) and social factors (
) determine weight of
p and
g, respectively. Inertia weight (w) determines the contribution rate of a particle’s previous velocity to its velocity at the current time step [
27]. Parameters about these variables refer to [
28]. In Equation (
10),
and
are uniformly distributed random variable in the range [0,1], i.e., random numbers generated from a uniform distribution in [0,1], so that both the social and the cognitive components have a stochastic influence on the velocity update rule in Equation (
10).
PSO is characterized by several particles within the exploration space, used to derive optimal solutions and update their locations by considering the information of individual particles and the entire group at the same time. A detailed description of this technique can be found in [
27,
29].
Figure 3 shows the PSO process. To determine the optimal PMU placement using PSO, a PSO algorithm was applied as follows:
- 1.
Initialization: combinations of PMU placement were generated. The combinations were generated at random from the combination set , where N is the system size (total bus number) and is the number of PMU-installed buses. The combinations that occurred were applied as the PMU installation point in the simulation.
- 2.
Evaluate the fitness value: The average TVE of
unmeasured buses becomes the evaluation of that combination. The calculation of fitness values was carried out with Equation (
4), such as the approach used with the GA described earlier.
- 3.
Update and : Based on the fitness value, the position with the lowest average TVE value for an individual particle and the position with the lowest TVE value for all particles are updated.
- 4.
If the number of iterations is reaches , the process moves on to step 6, otherwise, it moves on to step 5.
- 5.
Velocity and position were updated: the velocity equation reflecting inertial, social, and cognitive factors was used to update the position of each particle. The parameter information is shown in
Table 2. The position of each updated particle moves back to step 2, and the process is repeated.
- 6.
After all iterations are terminated, the coordinate of is selected as the optimal placement.
The parameters of PSO are considered as shown in
Table 2.