Clustering Electrical Customers with Source Power and Aggregation Constraints: A Reliability-Based Approach in Power Distribution Systems

Gomes, Thiago Eliandro de Oliveira; Borniatti, André Ross; Garcia, Vinícius Jacques; Santos, Laura Lisiane Callai dos; Knak Neto, Nelson; Garcia, Rui Anderson Ferrarezi

doi:10.3390/en16052485

Open AccessArticle

Clustering Electrical Customers with Source Power and Aggregation Constraints: A Reliability-Based Approach in Power Distribution Systems

by

Thiago Eliandro de Oliveira Gomes

^1,*

,

André Ross Borniatti

¹,

Vinícius Jacques Garcia

¹

,

Laura Lisiane Callai dos Santos

²,

Nelson Knak Neto

²

and

Rui Anderson Ferrarezi Garcia

³

¹

Graduate Program in Production Engineering, Federal University of Santa Maria, Santa Maria 97105-900, Brazil

²

Academic Coordination, Campus Cachoeira do Sul, Federal University of Santa Maria, Santa Maria 97105-900, Brazil

³

State Electric Power Company (CEEE), Equatorial Energy Group, Porto Alegre 91410-400, Brazil

^*

Author to whom correspondence should be addressed.

Energies 2023, 16(5), 2485; https://doi.org/10.3390/en16052485

Submission received: 6 February 2023 / Revised: 1 March 2023 / Accepted: 3 March 2023 / Published: 5 March 2023

(This article belongs to the Special Issue Power Quality in Smart Grids: Advanced Technology for System Regulation and Analysis)

Download

Browse Figures

Versions Notes

Abstract

:

Reliability is an important issue in electricity distribution systems, with strict regulatory policies and investments needed to improve it. This paper presents a mixed integer linear programming (MILP) model for clustering electrical customers, maximizing system reliability and minimizing outage costs. However, the evaluation of reliability and its corresponding nonlinear function represent a significant challenge, making the use of mathematical programming models difficult. The proposed heuristic procedure overcomes this challenge by using a linear formulation of reliability indicators and incorporating them into the MILP model for clustering electrical customers. The model is mainly defined on a density-based heuristic that constrains the set of possible medians, thus dealing with the combinatorial complexity associated with the problem of empowered p-medians. The proposed model proved to be effective in improving the reliability of real electrical distribution systems and reducing compensation costs. Three substation cluster scenarios were explored, in which the total utility compensations were reduced by approximately USD 86,000 (1.80%), USD 67,400 (1.41%), and USD 64,000 (1.3%). The solutions suggest a direct relationship between the reduction in the compensation costs and the system reliability. In addition, the alternative modeling approach to the problem served to match the performance between the distribution system reliability indicators.

Keywords:

clustering electrical customers; reliability in power distribution systems; nonlinear; financial compensation

1. Introduction

Continuity in the supply of electrical energy is one of the success criteria for an energy system. However, the occurrence of interruptions in the supply of energy to end users, sometimes due to possible failures, significantly compromises the operational and economic conditions of the electrical utilities [1,2].

In the energy market, the performance of distribution systems is evaluated according to the level of the continuity of delivery and is regulated by entities that standardize the technical activities relevant to their operation, in order to guarantee that each customer receives energy with an acceptable quality level [3,4].

For the electrical distribution utilities, the quality of service is a central problem, and its impacts are evaluated through the reliability aspects of the service provided during the energy supply [5,6]. As for the quality of service, utilities monitor metrics of frequency and duration of outages, designed to regulate the performance of these companies in relation to the desired goals and requirements [1,7,8].

In Brazil, the dimensions of the quality of service provided by the distributors are regulated by the National Electric Energy Agency (ANEEL), responsible for overseeing the distributors [4,6]. Among the parameters used to evaluate the service are the collective and individual interruption indices. The collective indicators refer to the calculation of the duration and frequency of the continuity of service for customers, respectively, the SAIDI and SAIFI, which are used to quantify for how long and how often customers experience interruptions [4,7,8,9]. Individual interruption, on the other hand, is portrayed by the CAIDI and CAIFI indicators. The first is used to determine the duration, while the second determines the frequency of the interruption in the energy supply to the customer [2,8,9].

Although the dimensions of the quality of the service of the energy supply consider the continuity indices, for instance, the SAIFI and SAIDI, other indicators are also used for its quantification; for example, the cost of interruption caused to the customer is used to estimate the losses in monetary in damages paid to customers due to power interruption events [7,10,11]. In the Brazilian context, service interruptions that are considered outside the established targets impose the obligation on concessionaires to grant automatic financial compensation to all impacted customers, which, consequently, cause a burden on their annual budget [3,4].

One of the ways of assessing the performance of utilities recommended by ANEEL refers to the composition of sets of customers, gathered according to technical, geographic, environmental, and socioeconomic characteristics and listed according to their respective significance in relation to the reliability indicators. It is up to the companies to work on the formation of these sets, respecting the guidelines established by the regulatory agency, in an attempt to guarantee the quality of the energy supply and the reduction in the financial penalties due to the violation of the limits of the indicators [4,12].

Power utilities are constantly looking for solutions to create more reliable, cost-effective, and interactive power distribution systems [7]. Several studies have examined the costs of energy utilities and how the costs impact their decisions.

There has been significant research related to the dependence and improvement of the reliability of the energy distribution system [13,14,15,16,17], but little research has related these indicators to costs and clustering.

The introduction of smart meters benefits the planning and operation of the distribution networks and demand management in addition to reducing costs [18]. Different models and approaches have been proposed to minimize costs related to [19] the reliability and [14,20] the switch installation.

There are several reasons for researching substation clustering models in electrical power distribution systems. First, reliability is a crucial issue in power systems, and clustering substations can significantly improve system reliability [8]. Second, bundling can lead to reductions in compensation costs, which can be beneficial for both the utility and consumers [17].

In practice, power utilities can apply the economic considerations of clustering substations in a variety of ways. However, the substation clustering context lacks models that include both reliability and cost reduction aspects.

In addition, economic considerations must be balanced with other technical and operational factors that may influence the utility’s decision-making policies in relation to government regulations.

Despite the importance of clustering substations, there are gaps in the literature, such as the lack of models that simultaneously consider the compensation costs and the system reliability. More research is needed to fill these gaps and contribute to optimizing substation clustering in electrical power distribution systems.

The hypothesis assumed in the modeling for the optimization of the clustering of consumers belonging to physically adjacent substations is that the use of Mixed Integer Linear Programming (MILP) will result in compact and contiguous clustering in relation to the center defined by the weighted median. This process will provide a shorter time in determining the settings and analyzing their contributions to the quality of the service.

The second hypothesis assumed is that the clustering of substations will promote the minimization of the financial compensation associated with the violation of the limits of the reliability indicators. This will allow the reallocation of the financial resources of the concessionaires in the area.

Given these research gaps, this article proposes an approach for clustering a set of geographically connected customers in the electrical distribution network according to economic criteria and regulatory standards [11].

The combinatorial nature of the reclustering problem [21] suggests that the associated complexity can be adequately addressed by defining the problem based on the number of customers and their related clusters [22,23,24].

In the context of the problem of clustering electricity customers, it is possible to form groups of customers in different ways; however, restrictions, such as geographical restrictions on the distribution network or control of the combinatorial nature of the problem, limit the possibilities.

In this investigation, the challenges associated with the clustering of electricity customers are addressed, with a view to analyzing the impacts on the costs of energy discontinuity, when customers are submitted to new clustering configurations [25,26,27].

This article aims to propose a model to solve the customer clustering problem, formulated based on the MILP for substation clustering. As a decision criterion adopted, the maximization of the reliability of the distribution system was assumed, while seeking to minimize the costs associated with compensation.

This approach improved the applicability of the solution in the electrical distribution sector, resulting in a MILP model that can be used in distribution networks of realistic size and complexity. The main contributions of the work are as follows:

1.: The definition of a heuristic approach to clustering customers with a reference point (center) using p-medians;
2.: A MILP model based on the linear formulation of reliability indicators;
3.: MILP complexity control for application in real and large distribution systems;
4.: The possibility of identifying the impact on the reliability performance, with different consumer set configurations;
5.: The analysis of the reduction in the compensation credited to consumers arising from violations of the indicator limits;
6.: The reallocation of investments and optimization of the use of materials, teams, and network, enabling an increase in the quality of the electricity supply service;
7.: In the context of the consumer clustering problem based on the MILP model to group substations, there is a lack of research that addresses the influence of clustering on the costs of financial compensation.

The remainder of this paper is structured as follows: Section 2 presents the literature review, while the definition of the problem is presented in Section 3; the clustering approach is presented in Section 4; and finally, the results and the discussion and conclusions are presented in Section 5 and Section 6, respectively.

2. Clustering Electrical Customers

The last stage of energy supply, the distribution system, has the purpose of transporting electricity from the transmission system and delivering it to customers [28,29]. Therefore, assuming an approach that considers the set of customers of an electrical distribution network becomes very appealing from the point of view of computational complexity [28].

One of the main points is the ability of distributors to group customers, considering their electrical characteristics, which consequently makes it possible to avoid unnecessary costs. The clustering of customers in the distribution network addressed in this paper is defined according to the division of the service area into several substations [30], and the formation of customer sets is based on solving the clustering problem equivalent.

Although practical applications of cluster analysis are found in different areas [21], particularly on data clustering in the electrical area, the clustering context is applied to the behavior of different types of electrical customers, from which parameters are used regarding the size, economic activity, and energy consumption [31,32], and the clustering assigns similar electrical customers to the same class [32].

Although clustering methods are often used to model the energy consumption [33,34,35,36,37], customer clustering can be approached from different aspects. As for energy quality, the researchers in [38] directed their work toward improving energy quality assessments, and another work aimed to detect abnormal energy uses that could compromise the reliability of the electrical network [34].

In Benitez et al. [39], the static K-means algorithm was applied to identify energy consumption patterns in Spanish homes and also to assess their general consumption trends more quickly depending on the applied technique. Rhodes et al. [40] employed K-means together with Probit regression to determine so-called seasonal groups, based on variations in energy use to determine the daily profiles of residential customers. Another use of the clustering algorithm K-means, with the addition of Fuzzy C-means, was proposed by Sharma and Singh [34] with the aim of defining customer load profiles, similar to Rhodes et al. [40].

Biscarri et al. [31] investigated the applicability and performance of different clustering techniques under a framework that tested different numbers of clusters and different validation measures. The clustering problem was addressed with the development of a set of rules based on the identification of the load profiles of similar customers, selected based on their consumption and other economic criteria, to then classify new customers automatically, in such a way as to provide different tariffs established by the electrical utility.

Also exploring load profile studies, Panapakidis and Moschakis [41] developed a successful application of the K-means algorithm for cluster-based analysis of the daily load profile. However, unlike the other clustering algorithms available in the literature, the work of Falabretti and Sabbatini [30] proposed a clustering algorithm in which the MILP served as the basis for electrical losses along the lines, for the layout of the existing distribution network, and for the system reliability.

For analysis and obtaining relevant information for decision-making regarding substation clustering, Jiang, Wu, and Zhan [42], Corigliano et al. [43], and Huang et al. [44] employed different approaches.

Jiang, Wu, and Zhan [42] presented the clustering problem according to an analysis of the characteristics of composite substations through a multi-objective model and a clustering algorithm for the adequate choice of substations, aiming to improve the efficiency and safety of the electric power system.

Corigliano et al. [43] proposed the use of k-means data clustering techniques to identify patterns and group locations that had similar characteristics for the problem of locating electrical power distribution substations, considering restrictions such as the need for safe distances between the stations and installation costs.

Huang et al. [44] modeled the clustering of electrical power substation load data, using the spectral clustering technique to group load data into homogeneous groups and improve the accuracy of substation load forecasting, aiming to optimize the operation and electrical power system planning.

Regardless of the approach, there was a recognized success in the application of clustering algorithms as techniques to extract and form datasets from similarities between elements. Especially in the energy sector, reports pointed to valuable strategies precisely because of the adequacy and speed with which responses were provided [31,41,45,46].

3. The Problem Definition

The problem considered in this work refers to the substation clustering in an electrical distribution network. Ultimately, the problem refers to the clustering of customers belonging to the substations, with some similarities to the problem addressed by Moreno et al. [47] and Assis et al. [48] in territorial planning.

The differentiation is related to the dimension of the customers’ connection to predetermined physical sets: the electrical circuits of the substations that, later, will constitute functional sets for the purpose of calculating the performance indicators of the quality of the electricity distribution service [49]. Even if the customer sets electrically linked to a substation are assumed, the clustering of these substations must have a reference in order to identify the proximity of one substation to another, therefore evaluating the similarity and dissimilarity between the considered sets. For this reason, the empowered p-median problem [50,51,52] is the formulation that best approximates the intended approach.

Initially, the proposed formulation had the data, parameters, and variables defined in Table 1.

The objective function of Equation (1) minimizes the distance between each customer and the reference point of the considered substation.

M i n i m i z e \sum_{i \in T} \sum_{j \in T} D_{i j} . x_{i j},

(1)

subject to:

\sum_{j \in T} x_{i j} = 1, \forall i \in T

(2)

x_{i j} \leq y_{j}, \forall i, j \in T

(3)

\sum_{j \in T} y_{j} \leq | S |

(4)

x_{i j} = x_{i j}, \forall i \in T_{s}, \forall s \in S, j \in T

(5)

x_{i j}, y_{j} \in {0, 1}, \forall i, j \in T .

(6)

In this problem, customers are all assigned to substations with their corresponding locations. The objective function of Equation (1) minimizes the sum of the distances

D_{i j}

between each customer i and the reference point j linked to a substation. The constraints of Equation (2) guarantee that all customers are allocated to one and only one substation. The constraints defined in Equation (3) establish that every customer i must be assigned to a location j whenever there is a substation installed in this, i.e.,

y_{j} = 1

. The number of substations assumed to be installed at every reference point j must be limited to the cardinality of the set S, according to Equation (4). The constraints given in Equation (5) ensure that all points originally linked to existing substations are jointly assigned to a new substation located at the reference point

j \in T

. Finally, the domain of the decision variables x and y are defined in Equation (6).

The problem defined in Equations (1)–(6) solves the allocation of customers to the new substations considered from the determination of the points j that are linked to the variables y. It should be noted that this model does not include reliability, either as a criterion or even as a restriction. Therefore, the reliability criterion adopted in this work includes the group indicators that make reference to SAIDI and SAIFI [9], herein represented respectively by

d e c_{j}

and

f e c_{j}

, as well as other variables derived from these indicators. With regard to the customer indicators CAIDI and CAIFI, they are represented by

D I C_{i}

and

F I C_{i}

, respectively.

The calculations of

d e c_{j}

and

f e c_{j}

are given in Equations (7) and (8), respectively.

d e c_{j} \geq \frac{\sum_{i \in T} D I C_{i} . x_{i j}}{\sum_{i \in T} x_{i j}}, \forall j \in T_{n}

(7)

f e c_{j} \geq \frac{\sum_{i \in T} F I C_{i} . x_{i j}}{\sum_{i \in T} x_{i j}}, \forall j \in T_{n} .

(8)

Assuming a possible regulatory restriction regarding the limits for aggregating customers, Equation (9) defines that any connection of a customer to a reference point of a new substation is limited to a maximum distance

D_{m a x}

.

D_{i j} . x_{i j} \leq D_{m a x}, \forall i \in T, \forall j \in T_{s} .

(9)

Finally, the objective function originally defined in Equation (1) now also includes the minimization of the continuity indicators, according to Equation (10).

M i n i m i z e α_{1} . \sum_{i \in T} \sum_{j \in T} D_{i j} . x_{i j} + α_{2} . max_{j \in T_{s}} {d e c_{j}} + α_{3} . max_{j \in T_{s}} {f e c_{j}} .

(10)

Hence, the problem considered in this work assumes the objective function of Equation (10) and the constraints of Equations (2)–(9).

4. The Clustering Approach for Electrical Customers

The customer clustering problem is defined as the search for similarities between the elements, using techniques based on clustering analysis [53,54]. The main purpose is to find a smaller number of entities that best represent all the original elements, thus better assisting some analysis or planning/operation action in the electrical network [28].

Customer reclustering assumes the geographic location and the possible aggregation of contiguous substations in new areas, thus forming new sets of customers [48]. However, the application of this approach has challenges that require subsidies for its resolution, such as:

(a): The combinatorial complexity as the number of power distribution substations grows, requiring a high computational load for its resolution;
(b): The nonlinear nature of the power flow that includes integer and continuous variables.

In substation clustering, first the number of sets (T) is calculated based on the problem defined in Section 4.2. From the practical problem perspective, the reclustering task is restricted to an area contiguity matrix (Figure 1). This matrix is associated with the parameter

D_{i j}

, relating the medians of each substation to the others and indicating whether the areas are contiguous (

D_{i j} < < M

) or non-contiguous (

D_{i j} = M

), where M is defined according to Table 1.

When the new sets defined in the customer reclustering problem are considered, the assignment of these customers is based on a reference point of each substation. This reference point is generally assumed as the centroid, used to define the partitioning of customers [21]. Herein in this work, nevertheless, the reference point of each substation was assumed to be any one of the customers assigned to the median of the set (substation) [51].

Even though distance is the commonly used criterion to determine the median of each set [55], in this work, a density measure was considered that weights the distance of each customer to the respective median with the load of this customer. This measure allows clusters whose formation are also influenced by load density.

The proposed method for substation clustering was based on the reclustering strategy, affecting the existing customer sets

T_{n}

and assuming that T is an arbitrary integer that can be greater, equal to, or less than

T_{n}

. Mainly inspired by the substation reliability indices, the steps linked to the proposed method are illustrated in Figure 2.

From the input data, a mathematical model based on the MILP is proposed to define a restricted set of possible medians for the reclustering problem, which involves the construction of new customer sets in an attempt to obtain better reliability indices and to reduce the amount of financial compensation, according to Equations (11) and (21).

4.1. The Proposed Algorithm

In order to reduce the computational complexity, a heuristic based on Ahmadi and Osman’s constructive procedure [56] was developed to define a set of candidate medians as a parameter in the MILP model.

The median concept is linked to the median of each customer set to be created with the mathematical model described in Section 4.2, with the variables

x_{i j}

and

y_{j}

and from the definition of set

T_{n}

.

Algorithm 1 describes the procedure that systematically creates the set of candidate medians

T_{n}

and then solves the MILP model, defined in Section 4.2. The input data comprised the customer set (T), the indicators (

D I C

and

F I C

), the substations (S) and sets of customers linked to each one of them (

T S

), the normalization factors

α

, the value M, the matrix distance D, the limit of iterations

L I M_{i t}

, and the

ϵ

value that controls the acceptable margin of error to characterize the algorithm convergence. As output, Algorithm 1 furnished the

s o l

solution that defined the sets and pertinence of each existing substation to the newly created sets.

Algorithm 1: The clustering algorithm for electrical customers.

Steps 1–3 corresponded to the initialization of the variables

s o l

,

d i f

, and

i t

. The outermost loop between the steps 4–18 corresponded to the systematic determination of the set

T_{n}

(steps 5–11) and of solving the mathematical model of Section 4.2 in step 13. The comparison between the value of the solution

s o l

with the previous solution

s o l_{p}

was conducted in step 15. The construction of the candidate points for the median [56] made between the steps 5–11 had two stages: (i) the density of the points of the substation s was defined in the step 8, considering the distances of the matrix D only for those points that were not already selected in

T_{n}

; and (ii) the selection of one of the points in step 9, assuming a weighted roulette-based selection [57,58].

4.2. The Proposed Mathematical Model

The problem defined by Equations (2)–(10) presented some characteristics that made its resolution difficult, namely:

(a): By considering each customer individually, the corresponding problem presented a high level of complexity, even when scenarios with a number of substations less than a dozen were considered;
(b): The median of each substation could be each one of the thousands of customers linked to it;
(c): The objective function (Equation (10)) was nonlinear due to the reliability indicators involved.

In order to reduce the complexity of the model defined by Equations (2)–(10), the following approaches were assumed:

(d): The set of candidate medians was defined by a heuristic procedure, thus redefining the objective function originally built in Equation (10) to the new form given by Equation (11);
(e): The reliability indicators were approximated to make Equations (7) and (8) linear, replacing them with Equations (18) and (19), together with the definition of Equations (16) and (17).

The modified model based on the approaches mentioned in (d) and (e) is defined in Equations (11)–(21).

M i n i m i z e α_{1} . \sum_{i \in T} \sum_{j \in T_{n}} D_{i j} . x_{i j} + α_{2} . max_{j \in T_{n}} {d e c_{j}} + α_{3} . max_{j \in T_{n}} {f e c_{j}},

(11)

subject to:

\sum_{j \in T_{n}} x_{i j} = 1, \forall i \in T

(12)

x_{i j} \leq y_{j}, \forall i \in T, \forall j \in T_{n}

(13)

\sum_{j \in T_{n}} y_{j} \leq | S |

(14)

x_{i j} = x_{i j}, \forall i \in T_{s}, \forall s \in S, j \in T_{n}

(15)

d i c r_{i} \geq \frac{\frac{D I C_{i}}{\sum_{j \in T_{s}, i \neq j} D I C_{j}}}{\frac{1}{| T_{s} | - 1}}, \forall i \in T_{s}, \forall s \in S

(16)

f i c r_{i} \geq \frac{\frac{F I C_{i}}{\sum_{j \in T_{s}, i \neq j} F I C_{j}}}{\frac{1}{| T_{s} | - 1}}, \forall i \in T_{s}, \forall s \in S

(17)

d e c_{j} \geq \sum_{i \in T} d i c r_{i} . x_{i j}, \forall j \in T_{n}

(18)

f e c_{j} \geq \sum_{i \in T} f i c r_{i} . x_{i j}, \forall j \in T_{n}

(19)

d i c r_{i}, f i c r_{i}, d e c_{j}, f e c_{j} \geq 0, \forall i \in T, j \in T_{n}

(20)

x_{i j}, y_{j} \in {0, 1}, \forall i \in T, j \in T_{n} .

(21)

5. Results and Discussion

Once the mathematical formulation adopted for the MILP problem of clustering customers of the energy distribution system based on reclustering substations was developed, its effectiveness was evaluated with a real network extracted from a utility in the southern region of Brazil. The utility has more than 1.7 million customers, distributed in 72 cities in Southern Brazil, covering more than 73 thousand km². Figure 3 presents an overview of the regions served by the utility.

The utility service area involves 61 (sixty-one) customer sets (red area in Figure 3), four of them were selected (the yellow area in Figure 3) as a case study, corresponding to almost 7% of all the sets: Set 1 to 4, entitled

T_{s_{1}}

,

T_{s_{2}}

,

T_{s_{3}}

, and

T_{s_{4}}

. However, the sets were served by five substations, named in this study as:

s_{1}

,

s_{2}

,

s_{3}

,

s_{4}

, and

s_{5}

.

For validation, the degree of the precision of the calculations was checked, obtaining partial results of the penalties for each of the existing substations, which served as references for comparisons with the results made available by the utility, referring to the sets of the region under study.

The compensation values were considered for a monthly period, over the years 2017–2019, to customers of the four sets (

T_{s_{1}}

:

T_{s_{4}}

), whose amounts are presented in Table 2.

The difference between the real values and those obtained by the proposed model is explained by the treatment of the data, where some occurrences of customer units were eliminated due to data inconsistency.

5.1. Results Analysis

Once the calculations were validated, the clustering model was run and compared to the initial scenario. Thus, with each new configuration, new sets were tested, where the substations were grouped forming a set of substations

T_{n} = {j | y_{j} = 1}

, while the remaining substations remained ungrouped, moving through combinations conditioned to the set of substations S.

The decision of the best configuration was determined considering the difference in the total values obtained for each new configuration in relation to the original configuration (zero configuration). The figures present the values of each analyzed year, and the representation of the financial compensations is presented by its total (blue line). A negative change indicated a decrease in financial rewards compared to the initial setting, while a positive percentage change indicated an increase.

Multiple scenarios of combinations were tested. The first scenario consisted of two clusters of substations. This scenario presented 13 valid configurations, and their results are shown in Figure 4.

Of these configurations, Configuration 11, composed by

T_{n} = {{s_{1}, s_{2}, s_{4}, s_{5}}, {s_{3}}}

, was the most predominant, providing the largest reduction in the financial compensation value of approximately USD 86,000 (1.80%), followed by Configuration 13, with a reduction of approximately USD 81,600.

The annual values of the highlighted configuration varied between USD 9000 and USD 34,000, respectively, in 2020 and 2019.

In the second presented scenario, three clusters of substations were formed. The values displayed in Figure 5 highlight Configuration 2, which considered the clustering of

T_{n} = {{s_{1}, s_{2}, s_{4}}, {s_{3}}, {s_{5}}}

, resulting in a reduced financial compensation value of approximately USD 67.4 thousand (1.41%), while Configuration 16 obtained the second-highest reduction in total costs in the same period, resulting in USD 64,010.47.

On the other hand, the worst result was achieved by Configuration 17, with an increase of 1.32% compared to the original value. Regarding the annual values, the year 2019 benefited the most, presenting a reduction of USD 24,000 with Configuration 2.

For the third scenario, four clusters of substations were formed. The values presented in Figure 6 show the results obtained after solving the clustering problem, resulting in seven possible configurations.

In the new configuration, two substations were grouped together to form a new set of consumers, while the remaining three substations remained ungrouped. Therefore,

T_{n} = {{s_{1}, s_{4}}, {s_{2}}, {s_{3}}, {s_{5}}}

.

The comparison carried out, as shown in Figure 6, showed that for configurations 1 and 3, their corresponding compensation values remained unchanged. Configuration 2 resulted in the highest total reduction among the values presented, corresponding to a total of USD 64,010.46, an approximately 1.3% reduction in financial compensation by the utility under analysis. In contrast, the two worst configurations were in Configuration 6, with an increase of USD 63,274.59 and Configuration 4, with an increase of approximately USD 40,500.00.

Figure 7 illustrates the results referring to the annual financial compensation of the best solutions (bars), compared to the initial configuration values (blue line).

The analysis of the chart of the annual financial compensation values indicated a clear reduction in the values after the implementation of the substation clustering model. This drop in values is a strong indication that the proposed model was efficient in its solutions and contributed to significant savings in costs related to the distribution of electricity.

It is important to point out that this reduction in the amounts of financial compensation did not compromise the reliability of the system, as evidenced by the analysis of the performance indicators presented in the results section. Therefore, it can be concluded that the application of the proposed substation clustering model had a positive impact on the electrical system and resulted in significant improvements in terms of cost and reliability.

However, this article presented an approach to optimize substation clustering in order to minimize compensation costs and maximize distribution system reliability. After implementing the substation clustering model, three optimal configurations were obtained, which presented satisfactory results in relation to the value of the initial configuration, and the decision regarding the ideal configuration was made in favor of the scenario composed of two substation clusters, with a 1.8% reduction in the amount of compensation.

Although the proposed approach initially sought to reduce the compensation costs, it was essential to evaluate the impact of this strategy on the system reliability. In this sense, the results obtained were analyzed in order to verify the evolution of the reliability conditions in each configuration.

Figure 8 illustrates the variations of the SAIDI (blue line) and SAIFI (red line) indicators from the initial configuration for the period from 2017 to 2020, together with the average variation of these indicators over this period. It is important to highlight that these results were obtained from the best configurations for the main explored scenarios.

After analyzing the figure, initially the results indicated that scenario 3, composed of four clusters of substations, presented the best results for the SAIDI and SAIFI indicators, with values of 33.2% and 8.5%, respectively. However, it is important to note that the lowest values do not necessarily indicate the best option.

Regarding the SAIDI indicator, there was an increase of 2.9% in 2017 compared to the initial value; however, there were reduced results ranging between 8% and 31% in the following years, resulting in an average of 14% below the configuration over the four years analyzed. The same pattern applied to the analysis of the SAIFI indicator. There was an approximate increase of 6%, 2%, and 5% in 2019, 2018, and 2017, respectively, from the baseline. However, this indicator was influenced by a sharp drop of 8% in 2020, resulting in an average variation of 1.1% over the four years.

Additionally, Table 3 lists the summary of the indicators evaluated in the model, considering the results obtained in the model and evaluating them in comparison with the initial (original) configuration.

In the end, it was observed that in scenario 1, there was a reduction in the individual values of the SAIDI from 24.72 to 21.18, corresponding to a decrease of 14%. However, the SAIFI indicator showed an increase of 0.27 compared to the initial value. These results were modest when compared to those presented by the best performance scenario (scenario 3).

However, the configurations modeled for the indices were appropriate for substation clustering when considering the financial compensations as the decision variable for the ideal configuration. This was noted, presenting in Figure 9 an overview of the respective evolutions in the reliability conditions over the analyzed period, when the configurations were compared to each other.

The graphical analysis allowed comparing the evolution of the indicators over time. When examining the three main scenarios explored, illustrated in Figure 9, similarities were noted in the way the indicators evolved over the period from 2017 to 2020. This means that, regardless of the configuration adopted, it was possible to obtain an average reduction in the SAIDI and SAIFI indicators, which suggests that the indicators were associated, although they did not necessarily cause changes in each other.

The indicators obtained in the three scenarios showed optimal configurations with similar evolutions in relation to reliability, corroborating the decision for scenario 1.

Two other scenarios were tested. One involved clustering the five substations considered in the study, resulting in a reduction of USD 4183.90, representing approximately 0.1%. The fifth scenario considered each of the five substations as a set, but it did not show any improvement in the compensation values when compared to the value generated by the original configuration.

5.2. Discussion

As reported in Section 2, the analyzed literature addressed different substation clustering techniques. Jiang, Wu, and Zhan [42] reported that there were two main approaches to clustering substations, depending on the differences in their load patterns or the types of electricity customers associated with them. However, clustering substations with unusual load characteristics can decrease the clustering accuracy.

Jiang, Wu, and Zhan [42], Corigliano et al. [43], and Huang et al. [44] highlighted the importance of clustering substations to obtain relevant information for decision-making and to present the different approaches used to choose substation installation sites, analyze the characteristics, and forecast the substation loads.

In this study, a heuristic approach was proposed for substation clustering based on a reference point, using the p-median problem. This approach was applied to a multi-objective MILP model based on the linear formulation of reliability indicators, seeking to control the complexity of the problem. This strategy proved to be particularly relevant, considering the potentiated p-median problem, mentioned by Gnägi and Baumann [59], which requires efficient solutions for large instances.

According to Gnägi and Baumann [59], in practical clustering applications, the size of clusters is usually limited, leading to the need for an extension of the p-median problem, called the capacitated p-median problem. The authors also mentioned that there are several classic heuristics for formulating and solving this problem, including binary linear programming, but these formulations are more suitable for small-scale instances.

Similar to Yan et al. [60], Xu, Yu, and Yang [61], Li et al. [62], and Liu [63], the spatial complexity of clustering was addressed based on the adjacency matrix (D), describing only the adjacency relationship between possible substations to group in order to achieve a small storage space.

From a practical standpoint, the task of reclustering was linked to the contiguity matrix of areas (Figure 1). Based on this, new set configurations were proposed, exhausting all possible combinations while considering the union of two substations and maintaining a total of four sets, to allow for comparison with the original configuration.

Despite not demonstrating a significant number of clusters formed, the clustering model for substations still achieved its objective of forming clusters, providing a viable path and reducing the number of possible combinations by 13.3%, 20%, and 30% for scenarios 1, 2, and 3 respectively.

The model enabled a reduction in the number of combinations to be analyzed by identifying those that were less relevant to the problem at hand. In this case, the application of the model was efficient and could be extremely useful in reducing the number of combinations to be analyzed, making the analysis more feasible. With this information, it is possible to direct efforts towards the most relevant variables and increase the chances of success in solving the problem at hand, to make more informed and assertive decisions.

When analyzing Figure 6, it is observed that configurations 1 and 3 presented values equal to the original values. This was because the three configurations were within the same range of the limits of the annual SAIDI and SAIFI indicators.

In Rodrigues, Araújo, and Penido [14] and Anteneh et al. [16], the studies discussed approaches involving the optimization of the distribution network through reconfiguration to improve the network reliability by reducing the SAIDI and SAIFI indices.

Rodrigues, Araújo, and Penido [14] proposed a multiobjective approach for the allocation of disconnect switches on feeders using genetic algorithms, which resulted in a significant reduction of 43.77% in the SAIDI index. On the other hand, Anteneh et al. [16] used genetic algorithms to determine the best network configuration for optimal placement of switches, which resulted in reductions of 77.33% in the SAIFI compared to the average value of the system reliability index in base years, and of 80% in the SAIDI.

However, the results obtained do not diminish the importance of this research. It is important to emphasize that the values presented are an average of a four-year period, and it is fundamental to analyze each year individually to better understand the context and the factors that affected the SAIFI.

The research presented significant results and demonstrated that most configurations brought values below their limits, as evidenced in Figure 8. In addition, even if some results surpassed the initial value in some years, it was possible to obtain a general average reduction in the SAIDI and SAIFI indicators over the period of analysis.

The goal of reducing financial tradeoffs and maximizing network reliability has been achieved. However, finding the best configuration for the entire electrical system, considering the various possible combinations between substation configurations, remains a challenge.

The implementation of the best configuration depends on the decision makers, who must identify the areas that need attention and direct their efforts towards specific improvements, according to their objectives. For example, this may involve reducing the costs of financial compensation or improving the quality of the electricity distribution service, such as reducing the frequency of interruptions in the electricity supply.

The utilities must take into account the financial and cost impact of any decision that affects the performance and efficiency of the electricity distribution system. When deciding on the implementation of substation clustering, they need to consider investment costs in infrastructure and technology, maintenance, allocation of financial resources, among others, in their decision-making policies.

Usually, decisions are influenced by broader policies, such as government regulations and standards related to the safety, quality, and efficiency of energy supply. Therefore, utilities must ensure that their decisions are aligned with the costs and benefits of their actions to maintain a balance between economic considerations and the need to improve system reliability.

The results of the study, presented in Figure 9, which compared different scenarios for improving the quality of electrical service, are highly significant. The findings indicate that the evaluated configurations can be considered viable options for decision-making regarding the improvement in the performance indicators. Additionally, the similarity of the results across the three scenarios suggests that the choice of a specific configuration will depend on the particular characteristics and needs of each region or situation.

Utilities should have models that allow them to quickly perform cost–benefit analyses to compare the cost of implementing any model, its potential benefits, and reductions.

In this way, the results obtained from the analysis of the scenarios can be used to aid decision-making regarding the implementation of improvements in the electrical service, considering the particularities and needs of each case. It is important to emphasize that the detailed analysis of the results and the consideration of other relevant factors must be carried out before making the final decision.

6. Conclusions

In this paper, a heuristic solution based on Mixed Integer Linear Programming was proposed to solve the customer clustering problem in electrical distribution systems, focusing on maximizing the system reliability and on minimizing the compensation costs.

The solution approach was applied to a utility of Southern Brazil, assuming the historical customer data of four years (2017–2020). Starting from the four original customer sets over five substations, an exhaustive approach corroborated the solution reached by the proposed heuristic when the customer clustering problem was solved.

Regarding the contributions of this work, we note the introduction of linear reliability indicators to allow the use of the MILP model and the further development of a heuristic that aimed to reduce the MILP’s computational complexity when restricting the median definition.

The present study emphasized the importance of maximizing the reliability of the distribution system, in addition to minimizing compensation costs. The results obtained demonstrate that the three optimal configurations showed a favorable evolution in relation to reliability, reinforcing the importance of considering this aspect when making decisions about the clustering of substations.

The substation clustering of customers approach resulted in significant reductions in the total utility compensation costs of USD 86,382.41 (1.80%) for Scenario 1, comprised of two clusters of substations, about USD 67,400 (1.41%) for Scenario 2, consisting of three clusters of substations, and approximately USD 64,000 (1.3%) for Scenario 3, consisting of four clusters of substations, but the solution with two clusters, composed by (

T_{n} = {{s_{1}, s_{2}, s_{4}, s_{5}}, {s_{3}}}

), was the most viable among all tested cluster configurations.

The solutions suggest that it is possible to reduce compensation costs by maximizing the reliability of the distribution system, without the need for new investments in solutions. In addition, the evolution of the indicators indicates that the evaluated configurations are viable options to improve the performance of the quality of the electrical service. The decision on the best configuration will depend on the specific characteristics and needs of each region or situation.

It is important to point out that the different ways of measuring are not mutually exclusive; on the contrary, they are reconcilable and complementary. They are different perspectives but with the same purpose and converging to the same point.

For future work, it is suggested to explore the clustering approach by considering topics such as using the criterion of the number of customers to form sets along with the criterion of the contiguity of areas, applying the dynamic method to calculate the limits of the collective indicators for the newly formed sets, expanding the number of sets, substations, and customers analyzed, as well as considering other indicators and the financial impact with the costs of investment in new equipment, maintenance costs, and others.

Author Contributions

Writing—original draft preparation, T.E.d.O.G. and A.R.B.; Conceptualization, T.E.d.O.G., A.R.B. and V.J.G.; Data curation, A.R.B.; Formal analysis, T.E.d.O.G., A.R.B. and V.J.G.; Investigation, T.E.d.O.G., A.R.B. and V.J.G.; Methodology, T.E.d.O.G., A.R.B. and V.J.G.; Funding acquisition, L.L.C.d.S. and R.A.F.G.; Project administration, L.L.C.d.S. and R.A.F.G.; Resources, L.L.C.d.S. and R.A.F.G.; Software, V.J.G.; Supervision, V.J.G.; Validation, A.R.B. and V.J.G.; Writing—review and editing, T.E.d.O.G., A.R.B., V.J.G., L.L.C.d.S. and N.K.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was carried out with the financial support of the Research and Technological Development Program of the Electric Sector CEEE-D, regulated by ANEEL and executed by CEEE Grupo Equatorial Energia, through Public Call No. CEEE-D 001/2018.

Data Availability Statement

The data from which the results presented in this article were derived are available upon request to the corresponding author. The data are not publicly available, as they contain information about the customers of the utility under study.

Acknowledgments

The authors express sincere thanks to CEEE Grupo Equatorial Energia and the Federal University of Santa Maria for the technical and financial support provided during the research.

Conflicts of Interest

The authors declare no conflict of competing financial interests or personal relationships that could have appeared to influence this paper.

Abbreviations

The following abbreviations are used in this manuscript:

ANEEL	Agência Nacional de Energia Elétrica
CAIDI	Customer Average Interruption Duration Index
CAIFI	Customer Average Interruption Frequency Index
DEC	Duração Equivalente de Interrupção por Unidade Consumidora
DIC	Duração de Interrupção Individual por Unidade Consumidora ou por Ponto de Conexão
$d i c r_{i}$	Approximate DIC indicator
FEC	Frequência Equivalente de Interrupção por Unidade Consumidora
FIC	Frequência de Interrupção Individual por Unidade Consumidora ou por Ponto de Conexão
$f i c r_{i}$	Approximate FIC indicator
MILP	Mixed Integer Linear Programming
SAIDI	System Average Interruption Duration Index
SAIFI	System Average Interruption Frequency Index

References

Peyghami, S.; Palensky, P.; Blaabjerg, F. An Overview on the Reliability of Modern Power Electronic Based Power Systems. IEEE Open J. Power Electron. 2020, 1, 34–50. [Google Scholar] [CrossRef] [Green Version]
Parol, M.; Wasilewski, J.; Wojtowicz, T.; Arendarski, B.; Komarnicki, P. Reliability Analysis of MV Electric Distribution Networks Including Distributed Generation and ICT Infrastructure. Energies 2022, 15, 5311. [Google Scholar] [CrossRef]
Dětřich, V.; Skala, P.; Matonoha, K.; Špaček, Z.; Göhler, M.; Blažek, V. Modeling of supply interruptions in MV cable distribution networks for a more accurate estimation of the cost of penalty payments. IEEE Trans. Power Syst. 2006, 21, 605–610. [Google Scholar] [CrossRef]
Agência Nacional de Energia Elétrica. Procedimentos de Distribuição de Energia Elétrica No Sistema Elétrico Nacional—PRODIST Módulo 8—Qualidade da Energia Elétrica, Revisão 12; Technical Report; Agência Nacional de Energia Elétrica (ANEEL): Brasília, Brazil, 2020. [Google Scholar]
Küfeoğlu, S.; Lehtonen, M. Interruption costs of service sector electricity customers, a hybrid approach. Int. J. Electr. Power Energy Syst. 2015, 64, 588–595. [Google Scholar] [CrossRef]
Barbosa, A.d.S.; Shayani, R.A.; de Oliveira, M.A.G. A multi-criteria decision analysis method for regulatory evaluation of electricity distribution service quality. Util. Policy 2018, 53, 38–48. [Google Scholar] [CrossRef]
Wang, B.; Camacho, J.A.; Pulliam, G.M.; Etemadi, A.H.; Dehghanian, P. New reward and penalty scheme for electric distribution utilities employing load-based reliability indices. IET Gener. Transm. Distrib. 2018, 12, 3647–3654. [Google Scholar] [CrossRef] [Green Version]
Tur, M.R. Reliability assessment of distribution power system when considering energy storage configuration technique. IEEE Access 2020, 8, 77962–77971. [Google Scholar] [CrossRef]
IEEE Std 1366-2012 (Revision of IEEE Std 1366-2003); IEEE Guide for Electric Power Distribution Reliability Indices. IEEE: New York, NY, USA, 2012; pp. 1–43. [CrossRef]
Tragoonthai, S.; Chaitusaney, S. Optimal budget allocation for preventive maintenance of distribution system considering customer outage cost and reliability indices. In Proceedings of the ECTI-CON 2017—2017 14th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, Phuket, Thailand, 27–30 June 2017; pp. 600–603. [Google Scholar] [CrossRef]
Majid, S.N.A.A.; Salim, N.A.; Mohamad, H.; Yasin, Z.M. Assessment of expected customer interruption cost due to power system contingency by sensitivity analysis. In Proceedings of the PECon 2020—2020 IEEE International Conference on Power and Energy, Penang, Malaysia, 7–8 December 2020; pp. 171–175. [Google Scholar] [CrossRef]
Agência Nacional de Energia Elétrica. Nota Técnica No 0102, de 3 de Dezembro de 2014. Revisão da Metodologia de Definição de Limites para os Indicadores de Continuidade DEC e FEC das Distribuidoras; Technical Report; Agência Nacional de Energia Elétrica (ANEEL): Brasília, Brazil, 2014. [Google Scholar]
Gangwar, P.; Singh, S.N.; Chakrabarti, S. Network reconfiguration for the DG-integrated unbalanced distribution system. IET Gener. Transm. Distrib. 2019, 13, 3896–3909. [Google Scholar] [CrossRef]
Rodrigues, F.M.; Araújo, L.R.d.; Penido, D.R.R. Optimization of Reliability Through Switch Reconfiguration in Distribution Systems. IEEE Lat. Am. Trans. 2019, 17, 972–982. [Google Scholar] [CrossRef]
Poudel, S.; Dubey, A.; Bose, A. Risk-Based Probabilistic Quantification of Power Distribution System Operational Resilience. IEEE Syst. J. 2020, 14, 3506–3517. [Google Scholar] [CrossRef] [Green Version]
Anteneh, D.; Khan, B.; Mahela, O.P.; Alhelou, H.H.; Guerrero, J.M. Distribution network reliability enhancement and power loss reduction by optimal network reconfiguration. Comput. Electr. Eng. 2021, 96, 107518. [Google Scholar] [CrossRef]
Banerjee, A.; Chattopadhyay, S.; Gavrilas, M.; Grigoras, G. Optimization and estimation of reliability indices and cost of Power Distribution System of an urban area by a noble fuzzy-hybrid algorithm. Appl. Soft Comput. 2021, 102, 107078. [Google Scholar] [CrossRef]
Al-Wakeel, A.; Wu, J.; Jenkins, N. K-means based load estimation of domestic smart meter measurements. Appl. Energy 2017, 194, 333–342. [Google Scholar] [CrossRef] [Green Version]
Salyani, P.; Nourollahi, R.; Zare, K.; Razzaghi, R. A new MILP model of switch placement in distribution networks with consideration of substation overloading during load transfer. Sustain. Energy Grids Netw. 2022, 32, 1–12. [Google Scholar] [CrossRef]
Esmaeili, S.; Anvari-Moghaddam, A.; Jadid, S.; Guerrero, J.M. Optimal simultaneous day-ahead scheduling and hourly reconfiguration of distribution systems considering responsive loads. Int. J. Electr. Power Energy Syst. 2019, 104, 537–548. [Google Scholar] [CrossRef]
Queiroga, E.; Subramanian, A.; dos Anjos, F.; Cabral, L. Continuous greedy randomized adaptive search procedure for data clustering. Appl. Soft Comput. 2018, 72, 43–55. [Google Scholar] [CrossRef]
Araújo, R.J.P.; Strauch, M.T.; Kagan, N. Optimization of Distribution Systems Continuity Indicators using Immunological Artificial Algorithms. In Proceedings of the 2010 IEEE/PES Transmission and Distribution Conference and Exposition: Latin America (T&D-LA), São Paulo, Brazil, 8–10 November 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 875–882. [Google Scholar] [CrossRef]
Bernardon, D.P.; Garcia, V.J.; Ferreira, A.S.Q.; Canha, L.N.; Abaide, A.d.R. Distribution Network Reconfiguration Starting from Fuzzy Multicriteria Decision Making Algorithms. In Proceedings of the 2009 Electronics, Robotics and Automotive Mechanics Conference (CERMA), Cuernavaca, Mexico, 22–25 September 2009; pp. 440–445. [Google Scholar] [CrossRef]
Chen, X.; Chen, Y.; Wu, Z.; Yi, Y.; Rong, H. Flexible Distribution System Reconfiguration Using Graph Theory and Topology Identification Technology. In Proceedings of the 2018 International Conference on Power System Technology (POWERCON), Guangzhou, China, 6–8 November 2018; pp. 2008–2014. [Google Scholar] [CrossRef]
Bernardon, D.P.; Pfistcher, L.L.; Canha, L.N.; de Mello, A.P.C.; Abaide, A.d.R.; Sperandio, M.; Garcia, V.J.; Ramos, M.J.S. Sistemas de Distribuição no Contexto das Redes Elétricas Inteligentes: Uma Abordagem para Reconfiguração de redes, 1st ed.; AGEPOC: Santa Maria, CA, USA, 2015; p. 163. [Google Scholar]
Jafari, A.; Ganjeh Ganjehlou, H.; Baghal Darbandi, F.; Mohammdi-Ivatloo, B.; Abapour, M. Dynamic and multi-objective reconfiguration of distribution network using a novel hybrid algorithm with parallel processing capability. Appl. Soft Comput. 2020, 90, 1–20. [Google Scholar] [CrossRef]
Guimarães, I.G.; Bernardon, D.P.; Garcia, V.J.; Schmitz, M.; Pfitscher, L.L. A decomposition heuristic algorithm for dynamic reconfiguration after contingency situations in distribution systems considering island operations. Electr. Power Syst. Res. 2021, 192, 106969. [Google Scholar] [CrossRef]
Bichels, A. Sistemas Elétricos de Potência: Métodos de análise e Solução; EDUTFPR: Curitiba, Brazil, 2018; p. 466. [Google Scholar]
Nguyen, T.T.; Truong, A.V.; Phung, T.A. A novel method based on adaptive cuckoo search for optimal network reconfiguration and distributed generation allocation in distribution network. Int. J. Electr. Power Energy Syst. 2016, 78, 801–815. [Google Scholar] [CrossRef]
Falabretti, D.; Sabbatini, G. A new clustering method for the optimization of distribution networks layout considering energy efficiency and continuity of service. Sustain. Energy Grids Netw. 2022, 30, 100654. [Google Scholar] [CrossRef]
Biscarri, F.; Monedero, I.; García, A.; Guerrero, J.I.; León, C. Electricity clustering framework for automatic classification of customer loads. Expert Syst. Appl. 2017, 86, 54–63. [Google Scholar] [CrossRef]
Piao, M.; Ryu, K.H. Local characterization-based load shape factor definition for electricity customer classification. IEEJ Trans. Electr. Electron. Eng. 2017, 12, S110–S116. [Google Scholar] [CrossRef]
Granell, R.; Axon, C.J.; Wallom, D.C. Impacts of Raw Data Temporal Resolution Using Selected Clustering Methods on Residential Electricity Load Profiles. IEEE Trans. Power Syst. 2015, 30, 3217–3224. [Google Scholar] [CrossRef] [Green Version]
Sharma, D.D.; Singh, S.N. Aberration detection in electricity consumption using clustering technique. Int. J. Energy Sect. Manag. 2015, 9, 451–470. [Google Scholar] [CrossRef]
Hsu, D. Comparison of integrated clustering methods for accurate and stable prediction of building energy consumption data. Appl. Energy 2015, 160, 153–163. [Google Scholar] [CrossRef] [Green Version]
Tureczek, A.; Nielsen, P.S.; Madsen, H. Electricity consumption clustering using smart meter data. Energies 2018, 11, 859. [Google Scholar] [CrossRef] [Green Version]
Cai, H.; Shen, S.; Lin, Q.; Li, X.; Xiao, H. Predicting the Energy Consumption of Residential Buildings for Regional Electricity Supply-Side and Demand-Side Management. IEEE Access 2019, 7, 30386–30397. [Google Scholar] [CrossRef]
Jasiński, M.; Sikorski, T.; Borkowski, K. Clustering as a tool to support the assessment of power quality in electrical power networks with distributed generation in the mining industry. Electr. Power Syst. Res. 2019, 166, 52–60. [Google Scholar] [CrossRef]
Benítez, I.; Quijano, A.; Díez, J.L.; Delgado, I. Dynamic clustering segmentation applied to load profiles of energy consumption from Spanish customers. Int. J. Electr. Power Energy Syst. 2014, 55, 437–448. [Google Scholar] [CrossRef]
Rhodes, J.D.; Cole, W.J.; Upshaw, C.R.; Edgar, T.F.; Webber, M.E. Clustering analysis of residential electricity demand profiles. Appl. Energy 2014, 135, 461–471. [Google Scholar] [CrossRef] [Green Version]
Panapakidis, I.P.; Moschakis, M.N. Consumer Load Profile Determination with Entropy-Based K-Means Algorithm. Int. J. Electr. Electron. Commun. Sci. 2019, 13, 144–149. [Google Scholar] [CrossRef]
Jiang, Z.; Wu, H.; Zhan, Z. Compound substation characteristics analysis based on multi-objective model and cluster-correct algorithm. Electr. Power Syst. Res. 2019, 175, 105880. [Google Scholar] [CrossRef]
Corigliano, S.; Rosato, F.; Ortiz Dominguez, C.; Merlo, M. Clustering Techniques for Secondary Substations Siting. Energies 2021, 14, 28. [Google Scholar] [CrossRef]
Huang, M.; Zheng, X.; Liao, Z.; Huang, X. Modeling and Analysis for Power Substation Load Data based on Spectral Clustering. In Proceedings of the 2021 IEEE 4th International Electrical and Energy Conference (CIEEC), Wuhan, China, 28–30 May 2021; pp. 1–4. [Google Scholar] [CrossRef]
Cembranel, S.S.; Lezama, F.; Soares, J.; Ramos, S.; Gomes, A.; Vale, Z. A Short Review on Data Mining Techniques for Electricity Customers Characterization. In Proceedings of the 2019 IEEE PES GTD Grand International Conference and Exposition Asia, (GTD Asia 2019), Bangkok, Thailand, 19–23 March 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 194–199. [Google Scholar] [CrossRef]
Yang, J.; Zhao, J.; Wen, F.; Dong, Z. A Model of Customizing Electricity Retail Prices Based on Load Profile Clustering Analysis. IEEE Trans. Smart Grid 2019, 10, 3374–3386. [Google Scholar] [CrossRef]
Moreno, S.; Pereira, J.; Yushimito, W. A hybrid K-means and integer programming method for commercial territory design: A case study in meat distribution. Ann. Oper. Res. 2020, 286, 87–117. [Google Scholar] [CrossRef]
Assis, L.S.d.; Franca, P.M.; Usberti, F.L. A redistricting problem applied to meter reading in power distribution networks. Comput. Oper. Res. 2014, 41, 65–75. [Google Scholar] [CrossRef] [Green Version]
Rajabi, A.; Eskandari, M.; Ghadi, M.J.; Li, L.; Zhang, J.; Siano, P. A comparative study of clustering techniques for electrical load pattern segmentation. Renew. Sustain. Energy Rev. 2020, 120. [Google Scholar] [CrossRef]
Lorena, L.A.N.; Senne, E.L.F. Local Search Heuristics for Capacitated p-Median Problems. Netw. Spat. Econ. 2003, 3, 407–419. [Google Scholar] [CrossRef]
Ríos-Mercado, R.Z.; Álvarez-Socarrás, A.M.; Castrillón, A.; López-Locés, M.C. A location-allocation-improvement heuristic for districting with multiple-activity balancing constraints and p-median-based dispersion minimization. Comput. Oper. Res. 2021, 126, 105106. [Google Scholar] [CrossRef]
Oksuz, M.K.; Buyukozkan, K.; Bal, A.; Satoglu, S.I. A genetic algorithm integrated with the initial solution procedure and parameter tuning for capacitated P-median problem. Neural Comput. Appl. 2022, 35, 6313–6330. [Google Scholar] [CrossRef]
Wang, Y.; Wu, Z.; Li, Q.; Zhu, Y. A model of telecommunication network performance anomaly detection based on service features clustering. IEEE Access 2017, 5, 17589–17596. [Google Scholar] [CrossRef]
Lin, S.C.; Chen, C.J.; Lee, T.J. A Multi-Label Classification with Hybrid Label-Based Meta-Learning Method in Internet of Things. IEEE Access 2020, 8, 42261–42269. [Google Scholar] [CrossRef]
Teichgraeber, H.; Brandt, A.R. Clustering methods to find representative periods for the optimization of energy systems: An initial framework and comparison. Appl. Energy 2019, 239, 1283–1293. [Google Scholar] [CrossRef]
Ahmadi, S.; Osman, I.H. Greedy random adaptive memory programming search for the capacitated clustering problem. Eur. J. Oper. Res. 2005, 162, 30–44. [Google Scholar] [CrossRef]
Bäck, T. Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms; Oxford University Press: New York, NY, USA, 1996; p. 328. [Google Scholar] [CrossRef]
Zhu, Y.P.; Yang, Q.; Gao, X.D.; Lu, Z.Y. A Ranking Weight Based Roulette Wheel Selection Method for Comprehensive Learning Particle Swarm optimization. In Proceedings of the 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Prague, Czech Republic, 9–12 October 2022; pp. 1–7. [Google Scholar] [CrossRef]
Gnägi, M.; Baumann, P. A matheuristic for large-scale capacitated clustering. Comput. Oper. Res. 2021, 132, 1–15. [Google Scholar] [CrossRef]
Yan, Y.; Ma, M.; Bao, W.; Liu, C.; Lin, H.; Peng, L.; Cui, C. Load Balancing Distribution Network Reconfiguration Based on Constraint Satisfaction Problem Model. In Proceedings of the 2018 China International Conference on Electricity Distribution (CICED), Tianjin, China, 17–19 September 2018; pp. 2515–2519. [Google Scholar] [CrossRef]
Xu, Y.; Yu, T.; Yang, B. Reliability assessment of distribution networks through graph theory, topology similarity and statistical analysis. IET Gener. Transm. Distrib. 2019, 13, 37–45. [Google Scholar] [CrossRef]
Li, H.; Zhu, L.; Hou, K.; Jia, H. Application of Adjacency Matrix in Probabilistic Energy Flow Calculation Method considering Coupling Failure. In Proceedings of the 2021 IEEE 5th Conference on Energy Internet and Energy System Integration (EI2), Taiyuan, China, 22–25 October 2021; pp. 1596–1601. [Google Scholar] [CrossRef]
Liu, Z.; Barahona, M. Graph-based data clustering via multiscale community detection. Appl. Netw. Sci. 2020, 5, 1–20. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Area contiguity matrix.

Figure 2. Overview of the proposed algorithm.

Figure 3. Area served by the distributor under study.

Figure 4. Differences in the amounts of financial compensation for two sets of substations.

Figure 5. Differences in the amounts of financial compensation for three sets of substations.

Figure 6. Differences in the amounts of financial compensation for four sets of substations.

Figure 7. Annual financial compensation.

Figure 8. Annual and average reliability indicators of the best configurations.

Figure 9. (Left): Evolution of the SAIDI Indicator for the multiple scenarios presented. (Right): Evolution of the SAIFI Indicator for the multiple configuration scenarios presented. Each of the presented scenarios represents an obtained configuration.

Table 1. Sets, input data, and variables of the p-median model.

Set	Description
T	The set of customers;
$D I C_{i}$	The outage time for customer i;
$F I C_{i}$	The outage frequency for customer i;
S	The set of substations;
$T_{s}$	The set of customers in the substation s, $T_{s} \in P (T S)$ ;
$T S$	The set of sets of customers for all substations: $T S = {T_{s} \| s \in S}$ ;
$T_{n}$	The set of new substations: $T_{n} = {j \| y_{j} = 1}$ ;
$D_{m a x}$	The maximum distance between each customer and the reference point of the substation to which the customer is linked;
$α_{k}$	The normalization factor for the magnitude associated with the objective function: $k = 1$ for distance; $k = 2$ for $d e c$ ; and $k = 3$ for $f e c$ ;
$\| G \|$	The cardinality of a hypothetical set G;
Input data	Description
M	A large number, typically $10^{9}$ ;
$D_{i j}$	The distance from customer i to the substation that has its center at point j;
Variable	Domain / Description
$x_{i j}$	1 if the customer i is assigned to a substation whose reference point is located at point j; 0 otherwise;
$y_{j}$	1 when the point j is used as the center of a substation, and 0 otherwise.

Table 2. Monthly compensation amounts for 2017, 2018, and 2019.

Year	Utility Values (USD)	Values of the Proposed Methodology (USD)
2017	1,028,948.02	988,206.40
2018	1,104,963.62	1,002,307.18
2019	1,332,717.73	1,083,477.35
Total	3,466,629.37	3,073,990.93

Table 3. Annual SAIDI and SAIFI values for the original configuration and their average values.

Indicator	2017	2018	2019	2020	Average
SAIDI Initial	20.07	18.67	26.39	33.74	24.72
SAIDI Final	17.09	19.47	22.75	25.42	21.18
SAIFI Initial	11.43	12.91	13.52	15.92	13.44
SAIFI Final	11.39	14.84	13.91	14.70	13.71

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gomes, T.E.d.O.; Borniatti, A.R.; Garcia, V.J.; Santos, L.L.C.d.; Knak Neto, N.; Garcia, R.A.F. Clustering Electrical Customers with Source Power and Aggregation Constraints: A Reliability-Based Approach in Power Distribution Systems. Energies 2023, 16, 2485. https://doi.org/10.3390/en16052485

AMA Style

Gomes TEdO, Borniatti AR, Garcia VJ, Santos LLCd, Knak Neto N, Garcia RAF. Clustering Electrical Customers with Source Power and Aggregation Constraints: A Reliability-Based Approach in Power Distribution Systems. Energies. 2023; 16(5):2485. https://doi.org/10.3390/en16052485

Chicago/Turabian Style

Gomes, Thiago Eliandro de Oliveira, André Ross Borniatti, Vinícius Jacques Garcia, Laura Lisiane Callai dos Santos, Nelson Knak Neto, and Rui Anderson Ferrarezi Garcia. 2023. "Clustering Electrical Customers with Source Power and Aggregation Constraints: A Reliability-Based Approach in Power Distribution Systems" Energies 16, no. 5: 2485. https://doi.org/10.3390/en16052485

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Clustering Electrical Customers with Source Power and Aggregation Constraints: A Reliability-Based Approach in Power Distribution Systems

Abstract

1. Introduction

2. Clustering Electrical Customers

3. The Problem Definition

4. The Clustering Approach for Electrical Customers

4.1. The Proposed Algorithm

4.2. The Proposed Mathematical Model

5. Results and Discussion

5.1. Results Analysis

5.2. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI