Next Article in Journal
Wind Power Interval Prediction with Adaptive Rolling Error Correction Based on PSR-BLS-QR
Previous Article in Journal
Simulation of a Pilot Scale Power-to-Liquid Plant Producing Synthetic Fuel and Wax by Combining Fischer–Tropsch Synthesis and SOEC
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Generators’ Economic Withholding Behavior Based on a SCAD-Logit Model in Electricity Spot Market

College of Economics and Management, Shanghai University of Electric Power, Shanghai 200090, China
*
Author to whom correspondence should be addressed.
Energies 2022, 15(11), 4135; https://doi.org/10.3390/en15114135
Submission received: 19 April 2022 / Revised: 23 May 2022 / Accepted: 31 May 2022 / Published: 4 June 2022

Abstract

:
The effective identification of the economic withholding behavior of the generators can help ensure the fair operation of the electricity market. A SCAD-logit model is proposed to improve the performance of the logit model for the massive data of electricity market. First, a social network analysis method is used to construct an equity relationship graph of the generators to obtain a set of key monitoring generators. An indicator system for identifying the economic withholding behavior of the generators is constructed based on structure conduct performance (SCP) theory. The indicators are screened by the smoothed clipped absolute deviation (SCAD) penalty regression method to reduce the collinearity and improve identification efficiency. Then, a SCAD-logit model is established to identify the economic withholding of key monitoring generators, so that the boundary contributions of each indicator to the economic withholding behavior are obtained. The confusion matrix, ROC curve, and AUC values are used to evaluate the model’s performance. Finally, the model is applied to the electricity spot market, and the method can identify the generators that exercise economic withholding behavior with a correct rate of 96.83%. Indicators such as market share, quotation fluctuation degree, high quotation index, and volume price index can be used as important indicators for identifying the economic withholding behavior.

1. Introduction

At present, China’s electricity spot market has entered into trial operation, which is a critical period for the development of the electricity market. During trial operations, imperfect trading rules lead to market power abuse [1,2], and economic withholding is a prominent form [3]. Economic withholding refers to the behavior of generators wherein they declare a high price so that part of their power generation capacity does not win the bid, thereby raising the market-clearing price and making generators or their affiliates ally to obtain high profits. Economic withholding can seriously affect the safety, stability, and healthy development of the electricity spot market. Around the world, electricity market prices in California and Spain have soared due to economic withholding [4,5]. Therefore, it is urgent to quickly and effectively identify the economic withholding behavior of generators.
The abuse of market power includes many forms; however, current research does not distinguish them. Ref. [6] created the local market power index that measures market power abuse based on market concentration, transmission constraints, and demand–supply ratio. Ref. [7] reviewed the measurement indicators of market power abuse. Ref. [8] established an indicator system to evaluate the multilevel market power abuse. These studies measure and assess the market’s abuse of market power by constructing indicators; however, there are many forms of market power abuse, such as economic withholding, physical withholding, and collusion bidding, and each form has a unique way of being exercised. Therefore, as market participants become more proficient in market rules, it is necessary to provide different identification methods for different forms of market power abuses, and ensure the fair operation of the market.
In research on the economic withholding behavior of generators, Ref. [9] analyzed the internal mechanism of economic withholding behavior from the perspective of the relationship between electricity spot prices and generation failures. Ref. [10] analyzed the incentive effect of a unified marginal clearing price mechanism on economic withholding based on the Cournot game model of generators. Ref. [11] studied the economic withholding behavior that would lead to more generators being more profitable in the market. Ref. [12] discussed the conditions under which economic withholding occurs and the resulting market impact by determining whether the supply will decrease during periods of high demand. Ref. [13] used a two-stage market liquidation method to alleviate economic withholding behavior. The aforementioned papers analyzed the exercise conditions and consequences of economic withholding behavior, focusing on a theoretical analysis, and did not monitor and identify economic withholding. Refs. [14,15] proposed a withholding capability return index and a distortion–withheld index to monitor and identify the economic withholding behavior of generators, respectively. Ref. [16] used the distortion–withheld index to explore the potential ability for economic withholding in the market. Ref. [17] used two dynamic withholding indicators to identify the possibility of generators exercising economic withholding. Ref. [18] proposed a capacity collusion index to monitor economic withholding behavior in power trading from a preventive perspective. These papers monitored and identified economic withholdings; however, the proposed indicators required the individual identification of all generators.
This paper uses Social network analysis (SNA) to narrow the scope of identification. SNA is a structural analysis method developed based on graph theory that studies the political, social, or economic structure between objects from the perspective of the relationship. Ref. [19] used economy, distance, carbon emissions, and other factors to build a spatial relationship network and analyzed the synergy of carbon emission networks between cities. In [20], the SNA method was shown to be an effective tool for monitoring market power in the electricity market. This paper constructs a graph about the equity relationship between generators, and based on the SNA method, the key monitoring generators are obtained, and the scope of identification is narrowed.
The logit model can identify market power abuse, such as economic withholding, in multiple fields. Ref. [21] constructed an identification model of listed financial risks of the companies based on a logit model. Ref. [22] constructed a multivariate binary logit Early Warning Model for systemic banking crises. Compared with other identification methods, the logit model can provide model explanations and obtain the impact size of various indicators [23]. The transaction data of the electricity spot market are complex and high-dimensional. This study combines variable selection methods to improve the performance of the logit model. In existing research, variable selection methods primarily include the shrinkage method, dimensionality reduction method, and smoothly clipped absolute deviation (SCAD) penalty regression method [24]. This paper adopts the SCAD penalty regression method, which can remove variables with similar characteristics, thereby eliminating the collinearity between indicators and reducing the data dimension [25].
In this paper, a method to identify the economic withholding behaviors of generators is proposed. First, the key monitoring generators are obtained through the SNA method, and the structure conduct performance (SCP) theory is used to construct an economic withholding indicator system. Then, the indicators are screened by the SCAD penalty regression method, and a SCAD-logit model is established to identify the economic withholding of generators. Finally, the confusion matrix, receiver operating characteristic (ROC) curve, and area under curve (AUC) values are used to evaluate the performance of the model.

2. Key Monitoring Generators

In some regions of China, there are a few power generation groups that have large market shares and hold multiple power plants [26]. Several power generation groups often form an interest alliance, and generators belonging to interest alliances are more likely to carry out economic withholding behavior. Moreover, it is more difficult for generators outside the interest alliance to exercise economic withholding behavior. Therefore, the focus is on the market behavior of the generators within the interest alliance.
First, an equity relationship graph G ( V , N ) of the generators is constructed. V is the vertices set representing the generators; N is the edges set representing equity relationship between the generators. The weight m i h of the edge between generator i and generator h is defined as
m i h = ( β i + β h ) a = 1 A z a g i a g h a
where 1 < i h < Z , Z is the total number of generators; β i and β h are the installed capacity of generator i and generator h, respectively; z a is the market share of power generation group a in the power generation market, 1 a A , where A is the total number of power generation groups; and g i a , g h a are the shareholding ratios of generator i and generator h, respectively, in power generation group a. The weight of the equity relationship among generation groups is the product of the sum of the total shareholding ratios of all power generation groups in the market and the sum of installed capacity.
Second, cohesive subgroups analysis is used in G ( V , N ) [27], and the G ( V , N ) is divided into multiple subgraphs, and V is divided into subsets V f ( f = 1 , 2 , , F ) . In V f ( f = 1 , 2 , , F ) , there is a connected edge between all generators.
Third, the intermediate central potential φ f of the generator subset V f is calculated, that is, the closeness degree of the generator subset V f [28]. The calculation formula for φ f is as follows:
φ f = i f ( δ f , max δ f , i ) τ f 1
where δ f , i is the betweenness centrality of generator i in the generator subset V f , which can measure the closeness of generator i with other generators in the set; δ f , max is the maximum betweenness centrality of the generators in the set; τ f is the number of generators in the set; and δ f , i can be obtained using the following equation:
δ f , i = 2 h f v f η h v ( i ) η h v τ f 2 3 τ f + 1
where η h v is the number of edges directly connected between generator h and generator v, and η h v ( i ) is the edges connected by the generator i between generator h and generator v.
Finally, according to the magnitude of the intermediate central potential φ f , the generator subset V f can be obtained. The generator subsets whose intermediate central potential exceeds the threshold are selected. The key monitoring generator V * is the union of the obtained generator subsets.

3. Indicator System for Identifying Generators’ Economic Withholding Behavior

Based on the SCP theory in industrial economics [29], an indicator system for identifying economic withholding behavior of the generators is constructed. In SCP theory, there is a causal relationship between market structure, behavior, and performance. To obtain the ideal market performance, it is necessary to adopt active government control, improve market structure, and regulate market behavior.
Therefore, based on the SCP theory, an indicator system for identifying economic withholding behavior of the generators is divided into three aspects: market structure, bidding strategy, and clearing result. As an established factor, the market structure reflects the characteristics of the region, determines the bidding strategy of market participants, and ultimately affects the result of market clearing. The indicator system is illustrated in Figure 1.

3.1. Market Structure Indicators

Market structure indicators primarily reflect the market position of market entities. Two indicators are selected: market share and residual supply index.
(1)
Market share
This indicator refers to the declared capacity of generator divided by the total declared capacity. The calculation formula is as follows:
S i , t = q i , t q t × 100 %
where q i , t is the declared capacity, and q t is the total declared capacity.
This indicator reflects the generator size. Regardless of grid congestion, as the market share of a generator increases, its conditions for using economic withholding to raise the clearing price improve [30].
(2)
Residual supply index
This indicator refers to the percentage of the difference between the total declared capacity on the market and the declared capacity of individual generators in the total market demand. The calculation is as follows:
I i , t R S = q t q i , t D t × 100 %
where D t is the total market demand at time t.
This indicator measures the position of the generator in the market. When the remaining supply index of a generator is less than 100%, it indicates that the generator is in a critical position [31]. In the case of inelastic demand prices, to meet market demand, the generator will be used for output, even if it declares a high price.

3.2. Bidding Strategy Indicators

The bidding strategy indicator reflects the bidding strategy formed by the generator during the quotation period, based on the maximization of revenue. The selected indicators are weighted average quotation, quotation fluctuation degree, quotation and cost deviation range, high quotation index, and capacity price index.
(1)
Weighted average quotation
The weighted average quotation refers to the product of the price and the capacity in the declaration curve of a generator divided by the total declared capacity [32]. The formula is as follows:
P ¯ i , t = w = 1 d i , t P i , t , w Q i , t , w w = 1 d i , t Q i , t , w
where P i , t , w is the w-segment price quoted; Q i , t , w is the declared volume of the w-segment quoted price quoted, which corresponds to P i , t , w ; and d i , t is the total number of quotations.
The weighted average quotation reflects the quotation of the generator. If the generator exercises economic withholding behavior, the quotation sequence of this generator will generally be much higher than the market average. This indicator can be used to determine whether the generator has abnormal quotations and analyze whether the market is controlled by economic withholding.
(2)
Quotation fluctuation degree
The quotation fluctuation degree is the ratio of the weighted standard deviation of the quotation to the weighted average quotation [33]. The formula is as follows:
Q ˜ i , t = w = 1 d Q i , t , w ( P i , t , w P ¯ i , t ) 2 P ¯ i , t
The degree of quotation fluctuation can reflect changes in the generator quotes. The economic withholding behavior of generators will avoid supervision by the regulatory authorities as much as possible, so that only part of the capacity is quoted at a high price. By contrast, the other capacity quotes are normal or even lower than the market average. At this time, the average quotation of the generator may be normal, but the volatility of the quotation will increase accordingly. As the quotation volatility of generators increases, the quotation strategies adopted in different quotation segments will become more variable, and the possibility of raising the price through economic withholding will increase.
(3)
Quotation and cost deviation range
The quotation and cost deviation range is the difference between the highest quotation and marginal cost. The formula is as follows:
R i , t = P ˜ i , t C i , t
where in Equation (8), P ˜ i , t is the highest quotation; and C i , t is the marginal cost.
This indicator reflects the difference between the highest quotation and marginal cost, which can measure the possibility of deliberately raising the quotation to exercise economic withholding behavior.
(4)
High quotation index
The high price index refers to the sum of the product of the price and the capacity in the high-price-declaration segment of the generator declaration sequence. The formula is as follows:
W i , t = w = w _ w ¯ P i , t , w Q i , t , w
P i , t , w _ - 1 C < P i , t , w _
w = 1 w ¯ 1 Q i , t , w < 90 % w = 1 d Q i , t , w w = 1 w ¯ Q i , t , w
where C is the average power generation cost, which is usually replaced by the average on-grid price of the previous year. In fact, taking a value near the average power generation cost will not significantly impact the high quotation index. Equations (10) and (11) are the constraint conditions of w _ and w ¯ , respectively. At time t, we take the declaration section w _ for generator i in the market. The declared electricity price of the section w _ is greater than the average power generation cost, and the sum of the declared capacity of the section w _ and its previous sections is less than 90% of the available capacity.
The normal operation reserve of a power system is approximately 90%. Therefore, when the load exceeds the first 90% of the declared capacity and enters the last 10% of the remaining capacity range, it indicates that the load supply is tight. At this time, the electricity spot market begins to fall short of demand and transforms into an imperfectly competitive market. Risky psychology will drive the generator to report a high price in the final declared capacity. Therefore, even if the generator reports a high price in the final remaining capacity range, it can be considered reasonable [34]. A diagram of the price declaration interval for generators is shown in Figure 2.
In Figure 2, P* is the upper limit of the market price and θ is the value of the intersection of the quotation curve and the average power generation cost mapped to the horizontal axis.
If the high price declared in the last 10% of the remaining capacity range is not considered an economic withholding behavior, then
90 % w = 1 d Q i , t , w < w = 1 w ¯ Q i , t , w
In this case, the declared capacity with respect to w ¯ needs to be revised as follows:
90 % w = 1 d Q i , t , w w = 1 w ¯ 1 Q i , t , w
As the high quotation index increases, so does the possibility that the generator will exercise economic withholding behavior to raise the market-clearing price during the bidding period. However, this indicator ignores the high price of the last 10% of the capacity, which may give some generators a chance to take advantage, i.e., take the risk of reporting a high price in the last segment of the capacity and reporting a high price or not reporting a high price in the other quotation segments. Therefore, the volume price index is chosen as a supplement.
(5)
Capacity price index
This indicator is the weighted sum of the segment capacity in the generator quote data [35]. The calculation formula is as follows:
E i , t = w = 1 d [ ( P i , t , w C ) 3 ( Q i , t , w Q i , t , A v a i l ) × 100 ]
where Q i , t , A v a i l is the available capacity.
Suppose that the capacity price index of the generator is large. In this case, the quotation is mostly high range. There is suspicion of raising the clearing price through economic withholding behavior.

3.3. Clearance Result Indicators

The clearance result indicators reflect the results of generator bidding and are an important basis for measuring the high profitability of generators. Three indicators are selected: high bid-winning rate, out-of-sequence capacity index, and average return rate.
(1)
High bid-winning rate
This indicator reflects the ratio of the high price quantity that has won the bid for the declared quantity of electricity. The calculation formula is as follows:
B i , t = Q h i , t Q c i , t
where Q h i , t is the electricity that generator i declared as high price and won the bid with at time t, where a price higher than the market average is still selected as the high price; and Q c i , t is the declared electricity.
The high bid-winning rate reflects the fact that generators have raised their prices of declared capacity. Regarding the identification of generator economic withholding behavior, as this indicator decreases, the behavior of part of the generator’s capacity to quote high prices and raise the clearing price becomes more obvious, and the generator is more likely to exercise economic withholding behavior.
(2)
Out-of-sequence capacity index
This indicator is the ratio of the out-of-sequence capacity of the generator to the actual declared capacity divided by the ratio of the unsuccessful capacity of the system to the capacity of participating in the quotation [36]. The calculation formula is as follows:
U i , t = Q i , t OC q i , t × q t q t ( D t Q t nbid ) × 100
where Q i , t OC is the unsuccessful power; and Q t nbid is the sum of the fixed output of generators that have not participated in the quotation at time t.
In an ideal electricity spot market, the out-of-sequence capacity index should be 100. Suppose that the out-of-sequence capacity index of a generator in a certain period of time is high, especially when it is higher than 100. In this case, the quotation of the generator may be high. This generator should attract the attention of the market supervision department.
(3)
Average rate of return
The average rate of return is the ratio of the overall revenue to the number of generators. The calculation formula is as follows:
H i , t = i = 1 L B ^ i , t Q i , t L i
where B ^ i , t is the bid-winning rate of generator, Q i , t is the bid-winning capacity, and L i is the number of generators in the generator set where generator i is located.
The average rate of return is used to measure the overall profitability of the generator subset. As the average rate of return grows larger, the more the generator subset as a whole becomes more profitable, and the possibility of the generators in the subset exercising economic withholding behavior increases.

3.4. Indicator System Architecture

The aforementioned indicators are combined to obtain a system of identifying indicators for the economic withholding of generators. An indicator system for identifying economic withholding behavior of the generators is listed in Table 1.

4. Variable Selection and Model Construction

Multiple indicators are prone to dimensionality disasters, excessive calculations, and multi-collinearity during the identification process. Therefore, reducing the data dimension and removing collinearity can enhance the performance of the model.
First, the logit discrete choice model is constructed, and the SCAD penalty regression method is used to filter the indicators. Variable selection can reduce the impact of data dimensions and the collinearity [37]. Second, the SCAD-logit model is used to identify the economic withholding behavior of generators. Finally, the confusion matrix, ROC curve, and AUC values are used to evaluate the model’s performance.

4.1. Logit Discrete Choice Model

Regarding the identification results of generators, there are two behaviors: exercise and non-exercise of economic withholding. A logit discrete choice model is used to make the binary selection for generators.
A binary logit discrete choice model is established that introduces a latent variable y i and satisfies the following relationship:
y i = β 0 + j = 1 J x i j β j + ε i = β 0 + β x i + ε i
where i = 1 , 2 , , n ; j = 1 , 2 , , J ; x i j is explanatory variables; β 0 and β j are regression coefficients; ε i are random error terms; and the distribution function [38] is P ( ε i z ) = e z / ( 1 + e z ) . To simplify the subsequent formula expression, we assume x i to be the indicator vector of the generator i, and β to be the coefficient vector.
y i is a categorical variable that represents the exercise and non-exercise of economic withholding. The formula is as follows:
y i = { 0    y i < 0 1    y i 0
The probabilities of the generator i being judged as two types of results are as follows:
P ( y i = 1 ) = P ( y i 0 ) = P ( ε i β 0 β x i ) = e ( β 0 + β x i ) 1 + e ( β 0 + β x i )
P ( y i = 0 ) = 1 P ( y i = 1 ) = 1 1 + e ( β 0 + β x i )
The binary logit discrete choice model [39] is constructed as follows:
L [ e ( β 0 + β x i ) ] = ln P ( y i = 1 ) P ( y i = 0 ) = β 0 + β x i

4.2. SCAD-Logit Model

The SCAD penalty regression method [40] solves the multi-collinearity problem based on the idea of penalty. It can compress a smaller regression coefficient to 0 and is an approximate unbiased estimate method. It compensates for the lack of a Lasso variable selection method, which always has biased estimates and unstable results [41].
First, the log-likelihood function of the discrete choice model of (22) is as follows:
log L ( β 0 , β ) = log i = 1 n p i = i = 1 n log [ P ( y i = 0 ) 1 y i P ( y i = 1 ) y i ] = i = 1 n [ y i ( β 0 + β x i ) log ( 1 + e β 0 + β x i ) ]
where p i is the categorical variable probability of generator i.
Next, we combine Equation (23) and the SCAD penalty regression method to eliminate multi-collinearity and perform variable selection. According to ref [42], the aforementioned formula is considered negative, and the penalty function p λ ( β j ) is added to transform it into a Lagrange’s equation:
β ^ = arg min β [ log L ( β 0 , β ) + j = 1 J p λ ( β j ) ]
where λ is the adjustment parameter, which aims to weigh the model fitting effects and the number of variables used. A suitable λ prevents the model from falling into overfitting and enables variable selection. Generally, as the value of λ decreases, the punishment is weaker and more variables tend to be used; as the value of λ increases, the punishment is stronger and fewer variables are used. The penalty function p λ ( β j ) is defined as follows:
p λ ( β j ) = { λ | β j | , | β j | λ | β j | 2 2 a λ | β j | + λ 2 2 ( a 1 ) , λ < | β j | a λ ( a + 1 ) λ 2 2 ,   | β j | > a λ
where a is a constant and a > 2 . The derivative form of the aforementioned equation is as follows:
p λ ( β j ) = λ [ I ( | β j | λ ) + ( a λ | β j | ) + ( a 1 ) λ I ( | β j | > λ ) ] sgn ( β j )
where I ( ) is a characteristic function, sgn ( ) is a signum function, and ( a λ | β j | ) + = max ( 0 , a λ | β j | ) .
It can be seen from Equation (25) that when | β j | λ , | p λ ( β j ) | = λ , the SCAD penalty function penalizes the coefficient; when λ < | β j | a λ , | p λ ( β j ) | = λ ( 0 + ( a λ | β j | ) + ( a 1 ) λ ) = ( a λ | β j | ) + a 1 < λ , which indicates that the penalty of the SCAD penalty function on the coefficient is weakened; and when | β j | > a λ , p λ ( β j ) = 0 , the SCAD penalty function cancels the penalty for the coefficient.
The choice of the penalty parameter λ is an important part of the SCAD penalty regression method. In this paper, the K-fold cross-validation method is selected to determine λ .
The candidate set of λ is determined, N different values of λ are considered, and the candidate set Γ = ( λ 1 , λ 2 , , λ N ) is obtained. The indicator data are divided into K subsets. Each time, the K−1 subsets are used as the training set, and the remaining subsets are used as the test set. In this way, there are K divisions and trainings. For the c-th division, ω c is the training set and ψ c is the test set. We use different λ δ ( δ = 1 , 2 , , N ) to perform regression analysis on the training set data and obtain the regression coefficient β ^ ( λ δ ) . The obtained regression coefficient is substituted into the test set, and the value of the loss function is obtained using the following equation:
s c = i ψ c { log [ 1 + e β 0 + β ^ ( λ δ ) x i ] y i [ β 0 + β ^ ( λ δ ) x i ] }
The loss function s c under different partitions is added to obtain the statistic S V ( λ δ ) , and λ δ is selected to minimize S V ( λ δ ) as an adjustment parameter. The equation is as follows:
S V ( λ δ ) = c = 1 K s c
In summary, based on Equation (24), the cross-validation method is used to calculate the statistics, and indicator screening is then realized. Then, according to the screening results of the SCAD penalty regression method and the binary logit model, a SCAD-logit model for the identifying economic withholding behavior of the generators is constructed. The SCAD-logit model is as follows:
L [ P ( y i = 1 ) ] = β ˜ 0 + β ˜ x ˜ i
where β ˜ 0 and β ˜ are the indicator coefficients obtained after estimation; x ˜ i are the indicators obtained after screening. According to the aforementioned model, it is possible to identify whether the generator exercises economic withholding behavior and to conduct an indicator significance test.

4.3. Performance Evaluation

In this study, the confusion matrix, ROC curve, and AUC value are used to evaluate the performance of the SCAD-logit model.
(1)
Confusion matrix
The confusion matrix is a standard format used for model accuracy evaluation, as shown in Table 2.
The SCAD-logit model is a two-classification model. Under the generator economic withholding identification task, there are four different combinations of the identify behavior and the real behavior. TP in Table 2 indicates that the real behavior is 1, and the identify behavior is 1, which means that the generator has successfully identified the economic withholding behavior. TN means that the real behavior is 0, and the identify behavior is 0; that is, it has successfully identified that the generator has not exercised economic withholding behavior. FP means that the real behavior is 0, and the identify behavior is 1, which means that the generator that has not exercised economic withholding is mistakenly identified as exercising economic withholding. FN indicates that the real behavior is 1, the identify behavior is 0, and the generator that is about to exercise economic withholding is mistakenly identified as not exercising economic withholding. The first two, TP and TN, indicate successful model identification and positively affect the supervision of the electricity spot market; the latter two, FP and FN, indicate misidentification. There is no positive effect on the supervision of the electricity spot market, and excessive supervision increases market risks.
(2)
ROC curve
The ROC curve is generally used to evaluate model identification ability as a coordinate schema analysis tool.
The two main indicators in the ROC curve are the true positive rate and the false positive rate, also called sensitivity and specificity. Generally, the false positive rate is the abscissa, and the true positive rate is its ordinate. The true positive rate formula is TP/(TP + FN), and the false-positive rate is TN/(FP + TN). For model evaluation, it is natural to expect that a high true positive rate and low false positive rate indicate success. Therefore, if the model identification ability is good, the ROC curve is close to the upper-left corner.
(3)
AUC value
The AUC value represents the area under the ROC curve. As an application of the evaluation model, its value range is between 0 and 1. Because the ROC curve is closer to the upper-left corner, the ideal value of AUC is 1, and the closer it is to 1, the better is the model identification effect [43].

4.4. SCAD-Logit Model’s Process

The SNA method is used to screen the generators to obtain a set of key monitoring generators. An indicator system for identifying economic withholding behavior is constructed using SCP theory, combined with the SCAD penalty regression method and logit discrete selection model to identify the economic withholding behavior of generators. The specific process is shown in Figure 3 and proceeds as follows:
(1)
The SNA method is used to construct an equity relationship graph G ( V , N ) with the generator as the vertex and the equity relationship of the generators as the edge. Then, the cohesive subgroup analysis method and intermediate central potential indicator screening are used to obtain the key monitoring generators.
(2)
According to the SCP theory, an indicator system for economic withholding behavior is constructed from three aspects: market structure, bidding strategy, and clearing results.
(3)
The generator sample set and test set are constructed, the indicators values of the key monitoring generators are calculated, and dimensionless processing of the indicators data is performed. Then, the logit discrete selection model is established. The SCAD penalty regression method and collinearity test are used to select indicators and eliminate collinearity indicators.
(4)
Indicators that pass the collinearity test are selected and the SCAD-logit model is established to identify the economic withholding based on the sample set data. Then, a significance test of each indicator in the model is carried out to verify its explanatory effect on the identification of economic withholding behavior. If the significance test is not passed, the corresponding indicators are eliminated; otherwise, we proceed to the next step.
(5)
The SCAD-logit model is used to identify the test set data and obtain generators that exercise economic withholding behavior.
(6)
The confusion matrix, ROC curve, and AUC value are used to evaluate the model’s performance.

5. Case Analysis

In this section, the data of the electricity spot market are selected and the 3-day transaction data of 216 generators are employed as the identification objective.

5.1. Key Monitoring Generators Selection

Based on the equity structure information of 216 generators, the equity data are calculated using Equation (1) to obtain the adjacency matrix M and normalize it. The results are shown in Table 3.
M is the adjacency matrix constructed by the weight of the equity relationship of the generators. The data of adjacency matrix M in Table 3 are imported into the Ucinet software. The equity relationship between generators is visualized to generate a graph G ( V , N ) for the equity analysis. Figure 4 is a schematic diagram of part of the graph G ( V , N ) . The equity relationship data of the generator are shown in Table A2.
The SNA method is used to conduct cohesive subgroup analysis on the generator equity analysis network and obtain multiple generator subsets V f ( f = 1 , 2 , , F ) . Based on Equations (2) and (3), the intermediate central potential of the generator is calculated. The resulting fleet is sorted according to the magnitude of the intermediate central potential. The lower quartile of the intermediate central potential φ f is used as the threshold. Generators larger than the threshold are selected. The union of the obtained generators is the key monitoring generators V * . A total of 168 key monitoring generators are screened.

5.2. Indicators Selection

Based on the obtained 168 monitoring generators, the data from 96 clearings per day are selected as the experiment cycle. The indicators listed in the indicator system to identify economic withholding behavior based on transaction data in the selected electricity spot market are calculated. Then, min-max dimensionless processing is performed on the generator indicator data as the basic data for subsequent indicator selection. The proportion of the sample set that exercises economic withholding behavior and the part that does not exercise it is considerably uneven in the sample data. A total of 500 samples are randomly selected from the exercise and non-exercise economic withholding behavior. The indicator data are shown in Table A1.
The 10-fold cross-validation method is used based on the SCAD penalty regression method. Different mean square errors are obtained from different λ values as the basis. Specific results as shown in Figure 5.
In Figure 5, the vertical and horizontal coordinates represent the mean square error and λ values, respectively. The mean square error changes with changes of λ . When λ = 6 , the error is the smallest, and there are four indicators whose coefficients are not 0.
To show the process of indicator selection more intuitively, the effect diagram of the change in each indicator coefficient with the penalty parameter is shown in Figure 6.
In Figure 6, the ordinate is the indicators coefficient, the abscissa is the value of λ , and the red vertical line is the cross-validation result, that is, λ = 6 . The indicators change dynamically with λ . When λ is 0, the penalty function p λ ( β j ) is the smallest, and all indicators have not been filtered out; as λ increases, the indicator coefficients gradually approach zero and finally stabilize at zero, thus satisfying the expected purpose of indicator selection.
The indicator selection results are obtained, and a collinearity test is carried out among the indicators. The results are listed in Table 4.
If the selected indicator variance inflation factor is greater than 10 or the tolerance is less than 0.1, it indicates collinearity between the indicators. From the results of the collinearity test in Table 5, the variance inflation factors of the quotation fluctuation degree, market share, high quotation index, and capacity price index are all less than 10, and the tolerance of all is greater than 0.1; hence, there is no collinearity in the selected indicators.
In summary, the selected indicators are the quotation fluctuation degree, market share, high quotation index, and capacity price index. The economic withholding behavior identification model is constructed based on the four indicators obtained.

5.3. Identification Results

Based on the results of indicator selection, the SCAD-logit model is established using the quotation fluctuation degree, market share, high quotation index, and capacity price index, and the significance of the model is tested. The estimated results are presented in Table 5.
Table 5 shows that the quotation fluctuation degree, market share, high quotation index, and capacity price index all passed the significance test at a significance level of 0.05. The four indicators in the model all have a significant linear relationship with economic withholding and prove that the built model can have a good effect on identifying the economic withholding behavior. The SCAD-logit model for economic withholding identification of generators is as follows:
L [ P ( y i = 1 ) ] = β ˜ 0 + β ˜ x ˜ i = - 6.223 + 3.718 Q ˜ i , t + 2.739 S i , t + 52.233 W i , t + 2.625 E i , t
The obtained model shows that the quotation fluctuation degree, market share, high quotation index, and capacity price index all positively impact the economic withholding behavior of generators. The obtained indicator is consistent with the behavioral mechanism of the economic withholding of generators. As the aforementioned four indicators increase, the more favorable factors for the economic withholding behavior of generators become, and the possibility of withholding increases.
The coefficient of the high quotation index is the largest, among the regression coefficients of the obtained model, indicating that it has the largest contribution to the identification of generator economic withholding behavior. Under the condition that the electricity spot market adopts the unified clearing of the marginal electricity price, it is reasonable for the generator to declare a high price for a part of the electricity to benefit itself. Similarly, the degree of quotation fluctuation also has a greater marginal contribution to economic withholding. During the quotation period, generators need to adopt various quotation strategies to raise the market-clearing price, so the monitoring quotation fluctuation degree is positive for economic withholding behavior. The last two indicators, the marginal contribution of market share and capacity price index, are smaller than the high quotation index and quotation fluctuation degree. However, they also positively impact economic withholding behavior, which cannot be ignored.
Therefore, the power market supervision department can monitor the degree of quotation fluctuation, market share, high quotation index, and capacity price index. Moreover, the regression coefficients of the aforementioned four indicators are all positive; that is, all have a positive impact on economic withholding behavior. Therefore, economic withholding can be effectively controlled by suppressing the corresponding generator indicators.
Based on the established model, to identify a certain transaction process from 168 generators in the electricity spot market in the region, the probability of economic withholding behavior of the generators is as follows:
P ( y i = 1 ) = 1 1 + e ( β ˜ 0 + β ˜ x ˜ i , t ) = 1 1 + e ( 6.223 + 3.718 Q ˜ i , t + 2.739 S i , t + 52.233 W i , t + 2.625 E i , t )
This case assumes that when the probability of economic withholding behavior of the generator is P ( y i = 1 ) 0.5 , the generator is considered to exercise economic withholding behavior; otherwise, the generator is not considered to exercise economic withholding behavior. The identification results of economic withholding of generators are shown in Table 6. More indicator data and identification results are shown in Table A3.
The indicator data of the six generators are analyzed, and the relevant data are shown in Table 7.
The impact of the four indicators on economic withholding behavior is further analyzed. Further analyze the changes in the economic withholding probability. The results are shown in Figure 7.
Figure 7 shows that the degree of quotation fluctuation, market share, high quotation index, and capacity price index have an important impact on the economic withholding probability. The probability increases with the dynamic changes in various indicators until it reaches a peak. The slope of the curve in descending order is the high quotation index, quotation fluctuation degree, market share, and capacity price index, and the marginal contributions also increase sequentially, when the probability is 0.5, which is consistent with the results obtained from the aforementioned model analysis.

5.4. Performance Results of the Model

To evaluate the performance of the SCAD-logit model, it is compared with the ordinary logit model, and Lasso-logit. The aforementioned sample set is used, 70% of which is randomly selected as the training set, and the remaining 30% is used as the test set. The aforementioned model is run 100 times. The overall correct rate, mean, and standard deviation of the FP and FN errors in the confusion matrix are calculated. The results of the performance evaluation of the model are presented in Table 8.
Table 8 shows that for the evaluation results of the overall correct rate, FP, and FN, the results of the SCAD-logit model are better for both the training set and the testing set. The overall correct rate is higher, and the error rates of FP and FN are lower. At the same time, the standard deviation of the SCAD-logit model is also smaller, indicating that the SCAD-logit model is better than the ordinary logit model and Lasso-logit model in terms of stability.
The ROC curve is compared to verify the effectiveness of the model. The results are shown in Figure 8.
Figure 8 shows that the model performance of the SCAD-logit model is better, because its ROC curve is closer to the upper left corner of the graph. Next, we calculate the area under the ROC curve and the AUC value. The results are listed in Table 9.
When the AUC value of the model increases, the performance of the model and accuracy are improved. Table 9 shows that the performance of the SCAD-logit model is better, as it is 9.25% higher than that of the ordinary logit model and 4.96% higher than that of the Lasso-logit model.
Based on the aforementioned analysis, the performance of the SCAD-logit model is superior to that of ordinary logit model and Lasso-logit model, and the result is consistent with that of a previous study. The SCAD-logit model uses the SCAD penalty regression method to compress some collinearity indicators coefficients to 0. The interference of some weakly correlated indicators on the model is avoided, thereby improving the performance of the model.
This method can identify the generators with economic withholding behavior. After excluding the generators that exercise economic withholding, then recalculating the quotation fluctuation degree, the high quotation index and the capacity price index. The quotation fluctuation degree Q ˜ i , t is reduced by an average of 6.7%, the high quotation index W i , t is reduced by an average of 19.4%, the capacity price index E i , t is reduced by an average of 8.2%. Therefore, the method can regulate the market order, make the market run fairly and justly, and prevent the electricity price from overflowing the upper limit. Market players can compete fairly and consumer property losses are avoided.

6. Conclusions

As the pilot electricity spot market has increased in recent years, market data have gradually become high-dimensional and complex. This study uses the SNA method to screen generators, based on the SCP theory, to construct an indicator system for identifying economic withholding behavior; it integrates the SCAD penalty regression method into the logit discrete choice model; and proposes a SCAD-logit identification method based on economic withholding behavior of generators. The main conclusions are as follows:
(1)
Facing complex and high-dimensional data in the electricity spot market, the SNA method can use cohesive subgroup analysis and intermediate central potential to screen out key generators, narrowing the search scope for data analysis and daily supervision. The SCAD penalty regression method can filter and obtain important identification indicators of economic withholding from various indicators, such as market share, quotation fluctuation degree, high quotation index, and capacity price index. Moreover, the collinearity and weakly related indicators are eliminated, thereby reducing the data dimension and the supervision cost of market managers.
(2)
The identification method is used in the electricity spot market, and the generators that exercise economic withholding are identified with a correct rate of 96.83%.
(3)
The SCAD-logit model clearly shows the contribution of each indicator to the identification of economic withholding behavior. Furthermore, effective control of economic withholding can be achieved by restricting relevant indicators.
Electricity markets around the world have great differences in market structure, bidding strategy and clearing mechanism. Therefore, when using this method, it is necessary to modify the indicator system to identify economic withholding behavior of the generators according to local characteristics. Moreover, with the further development of the electricity market, the economic withholding behavior of generators will become more concealed and challenging to be monitored. Other factors need to be considered according to the particularity of the region, such as network structure and congestion transmission congestions, profit margins of the generation groups and strategic biddings, etc.

Author Contributions

Validation, J.X.; Writing—original draft, S.C.; Writing—review & editing, B.S. and X.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 51507099).

Institutional Review Board Statement

Not applicable for studies not involving humans or animals.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are not publicly available due to trade secrets involved.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

m i h equity relationship between the generator i and the generator h
β i installed capacity of the generator i (MW)
φ f the intermediate central potential of the generator subset f
δ f , i the betweenness centrality of the generator i in the generator subset f
τ f the number of generators in the subset f
η j v the number of edges directly connected between the generator h and the generator v
q i , t the declared capacity (MW)
I i , t RS the remaining supply index
D t the total market demand at time t
C i , t the marginal cost
P ˜ i , t the highest quotation
P i , t , w the w-segment price quoted by the generator i at time t (Yuan/MWh)
Q i , t , w the declared volume of w-segment quoted price quoted (WM)
d i , t the total number of quotations
Q ˜ i , t the quotation fluctuation degree
U i , t the out-of-sequence capacity index
Q i , t the bid winning power
E i , t the volume and price index
B i , t the bid winning rate
R i , t the quotation and cost deviation range
Q h i , t the electricity that the generator i declared as high price and won the bid at time t
Q c i , t the declared electricity
Q i , t OC the unsuccessful power
H i , t the average rate of return of the generator subset where the generator i is located at time t
L i the number of generators in the generator subset where the generator i is located
x i j explanatory variables
y i explained variables
ε i random error term
β j regression coefficients
x i the economic withholding identification indicator vector of the generator i
β the coefficient vector corresponding to the indicator

Appendix A

Table A1. A part of the indicator data in the case analysis.
Table A1. A part of the indicator data in the case analysis.
Serial Number of GeneratorsMarket ShareResidual Supply IndexWeighted Average QuotationQuotation Fluctuation DegreeQuotation and Cost Deviation RangeHigh Quotation IndexCapacity Price IndexHigh Bid Winning RateOut-of-Sequence Capacity IndexAverage Rate of Return
10.7860.4060.9520.3220.4790.8220.3680.6730.9920.633
20.4100.4620.9560.3150.8811.0000.3710.6500.6420.867
30.1200.8210.9510.3260.5570.4790.3530.6260.6750.652
40.9850.6670.9340.3520.2660.4360.3161.0000.4670.310
50.8540.8170.8020.3690.0800.0270.2500.0090.7080.924
60.5390.6020.7990.1760.1410.0310.2600.5540.7500.057
70.6950.2990.6940.6690.4130.0010.2870.9550.3330.367
80.5490.2240.8050.6780.4610.0470.7920.2760.2000.633
90.4010.2440.0000.7620.2120.0460.7310.3160.4080.696
100.5990.6830.8100.8730.8080.0460.8930.0960.9420.361
110.2620.8070.9110.0900.7510.0280.4830.1600.2080.057
120.8570.7670.3790.3220.3870.0310.0760.9290.8250.734
130.6210.1830.2040.9640.6600.0300.4730.9090.2920.696
140.8800.4540.2750.7390.4870.0460.9750.0400.1830.335
150.6340.3340.0760.3380.4140.0370.3300.8290.2580.241
160.2420.7750.2190.4480.4470.0180.4850.2680.1750.203
170.4290.6340.4010.8750.0250.0270.0500.5360.3080.082
180.3330.4940.1380.1320.3400.0270.8990.0680.0330.228
190.0910.3990.9360.2890.1040.0190.0970.1080.6330.703
200.1400.3670.9150.8170.4290.0020.3430.9010.7000.209
210.6970.2510.0490.2670.6830.0280.8240.3480.1000.234
220.1040.7150.2160.6370.4410.0190.0390.5360.6330.905
230.3840.7700.7500.0070.5570.0140.5320.7650.6000.209
240.6310.8210.9350.9790.8250.0340.2800.3040.9000.703
250.0810.7900.0100.2200.0030.0400.2250.2200.1420.278
260.6280.6220.3400.0660.5590.0010.0720.8490.0000.462
270.0880.2060.0640.5360.7580.0400.5790.3280.7080.759
280.3060.8940.3780.7430.4160.0420.8780.3600.6920.070
290.8010.6520.7400.7070.7730.0460.7150.3480.7250.316
300.8890.0650.8320.2520.1850.0040.7680.0400.0330.468
310.8130.0860.0870.3490.4340.0280.9060.7490.3420.006
320.8650.1700.5230.5720.4450.0220.6230.3560.6920.316
330.7130.9890.5110.3690.8780.0330.1700.9250.7830.551
340.7580.8610.5750.5030.2240.0190.9290.8290.3000.354
350.2400.4450.9810.5710.7700.0200.1620.0720.8750.399
360.0590.5030.9470.1840.0600.0240.3430.1760.8830.576
370.4100.3610.0910.2200.4680.0320.3940.6520.9420.551
380.3890.7230.9780.8500.6390.0100.5740.0000.2670.658
390.7020.8120.0270.4140.8590.0320.3720.2520.9420.943
400.7740.1130.4270.3630.4450.0200.7080.1360.7750.930
410.2610.5810.1210.9440.6910.0080.1820.3000.3080.310
420.5120.8970.4630.3980.9950.0040.6920.3640.9670.937
430.1630.2540.7040.3660.2620.0140.0090.3280.6420.962
440.0230.2090.7060.6280.0570.0260.1620.1480.4830.329
450.8570.9410.0980.9580.2980.0260.3540.0040.6080.108
460.6560.6980.9810.8660.8610.0440.0060.7930.0420.228
470.2580.1980.6020.8630.7420.0210.3120.9370.0420.677
480.8880.0160.1480.0030.6780.0230.9490.5241.0000.222
490.4370.4890.3440.1070.8650.0050.3170.7850.6250.259
500.1380.0380.6830.9100.5810.0130.0870.7200.5500.633
510.7310.1990.5980.5200.1740.0450.4580.3760.2670.291
520.7310.9640.7160.7630.9710.0170.8750.2920.6330.342
530.8620.4180.7180.6390.9740.0460.5860.2480.4000.228
540.8480.3370.5951.0000.5740.0400.0060.3760.1080.076
550.0430.3210.1640.7390.6970.0120.9380.4000.1920.133
560.2540.1060.0960.0850.2330.0250.3890.3840.4670.696
570.7250.1860.7040.7200.8640.0370.4160.4520.0750.930
580.8200.5850.6000.0140.6200.0250.8130.8650.8500.203
590.6860.5980.6750.6550.4460.0190.1720.0200.9000.937
600.5900.1470.7870.8340.9250.0450.7220.2760.7170.354
Table A2. A part of the data of the equity relationship of the generators.
Table A2. A part of the data of the equity relationship of the generators.
123456789101112131415161718
11.000
20.8511.000
30.6940.7831.000
40.3830.3760.4001.000
50.1510.1510.1610.7941.000
60.0000.1450.1160.6060.9781.000
70.0000.0000.0000.0000.0000.0001.000
80.5700.4360.5520.2880.1190.0000.0001.000
90.7510.5490.6810.3860.1560.0000.0000.5461.000
101.0000.6220.8120.3500.1410.0000.0000.7880.9641.000
110.3870.4420.4910.4010.1630.0000.0000.1890.3780.4291.000
120.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0001.000
130.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0001.000
140.3670.4870.0550.8730.0510.0970.9410.5050.6470.6930.2160.1960.889
150.4030.2480.9540.2900.5620.2870.5720.5640.1120.4140.6930.6280.4360.297
160.9750.3580.7110.6410.8780.3370.0810.9860.0470.5980.8280.8980.5320.4440.582
170.6050.2370.5380.0000.9960.4070.8410.1610.5820.3680.8470.5040.5860.2340.1970.162
180.5680.8780.8950.6900.1380.5660.1550.4970.4980.1330.3280.6080.3640.4490.6750.5040.355
Table A3. A part of the indicator data of generators and identification results.
Table A3. A part of the indicator data of generators and identification results.
Serial Number of GeneratorsQuotation Fluctuation DegreeMarket ShareHigh Quotation IndexCapacity Price IndexActual CategoryIdentification Category
10.0970.8780.6830.90300
20.1500.7930.7210.84500
30.9430.8800.8010.98611
40.9400.7560.6420.90811
50.2510.0200.5740.11100
60.1900.1950.6710.86800
70.8400.7710.4790.34100
80.3070.4110.4460.83400
90.6400.7640.4520.37700
100.0410.9280.7540.22211
110.6390.8620.5140.58600
120.1040.6070.4580.61300
130.0580.0190.8160.17700
140.7270.7800.4400.89000
150.0130.3390.5640.85000
160.0620.7820.8280.19400
170.9860.3130.8420.21611
180.1680.0130.4150.29800
190.3980.8900.4870.19900
200.1920.3310.7850.72900
210.1190.4550.6980.79300
220.5830.4650.4890.58600
230.3320.1730.4500.24300
240.3050.7220.4170.98100
250.1670.1030.6500.52500
260.9300.9150.4380.69010
270.8160.9850.6030.79111
280.7500.5520.6810.71200
290.6030.1670.6520.91900
300.7330.2850.5510.12800
310.3310.1940.7060.26500
320.0900.2030.4710.25600
330.8150.3490.6360.97600
340.4910.4020.8280.27400
350.6830.8590.7700.36400
360.6010.2160.7870.05400
370.7430.2510.7350.02900
380.4340.5090.4320.21100
390.0860.1120.7720.45800
400.3390.4200.5780.32100
410.1770.5720.5620.81300
420.6040.9670.5760.11711
430.1240.3490.6320.28900
440.1660.0330.8340.16600
450.2220.6450.5080.52500
460.2570.4930.8360.70100
470.0090.5150.5410.71900
480.4340.8700.5760.49700
490.2640.5110.7450.27300
500.1010.4300.4870.30500

References

  1. Peng, X.; Tao, X. Cooperative game of electricity retailers in China’s spot electricity market. Energy 2018, 145, 152–170. [Google Scholar] [CrossRef]
  2. Guo, H.; Chen, Q.; Zhang, Y.; Liu, K.; Xia, Q.; Kang, C. Constraining the oligopoly manipulation in electricity market: A vertical integration perspective. Energy 2020, 194, 116877. [Google Scholar] [CrossRef]
  3. Ameri, M.; Rahimiyan, M.; Latify, M.A. Capacity withholding constrained by operational limits of generation under financial virtual divestiture in a day-ahead market. IEEE Trans. Power Syst. 2018, 33, 771–780. [Google Scholar] [CrossRef]
  4. Joskow, P.L. California’s electricity crisis. Oxf. Rev. Econ. Policy 2001, 17, 365–388. [Google Scholar] [CrossRef]
  5. Fabra, N.; Toro, J. Price wars and collusion in the Spanish electricity market. Int. J. Ind. Organ. 2005, 23, 155–181. [Google Scholar] [CrossRef] [Green Version]
  6. Li, C.; Xia, Q.; Kang, C.; Jiang, J. Novel approach to assess local market power considering transmission constraints. Int. J. Electr. Power Energy Syst. 2008, 30, 39–45. [Google Scholar] [CrossRef]
  7. Dolmatova, M.; Kozlovskiy, D.; Khrustaleva, O.; Sultanova, T.; Vasin, A. Market parameters dependent indices for competition evaluation in electricity market. Electr. Power Syst. Res. 2021, 190, 106762. [Google Scholar] [CrossRef]
  8. Zhang, J.; Zhang, L.; Yu, L. Layered index system for evaluating the market power in electricity market. Proc. CSEE 2006, 26, 123–128. [Google Scholar]
  9. Bergler, J.; Heim, S.; Hüschelrath, K. Strategic capacity withholding through failures in the German-Austrian electricity market. Energy Policy 2017, 102, 210–221. [Google Scholar] [CrossRef] [Green Version]
  10. Salarkheili, S.; Nazar, M.S. Capacity withholding analysis in transmission-constrained electricity markets. IET Gener. Transm. Distrib. 2016, 10, 487–495. [Google Scholar] [CrossRef]
  11. Nappu, M.B.; Bansal, R.; Saha, T.K. Market power implication on congested power system: A case study of financial withheld strategy. Int. J. Electr. Power Energy Syst. 2013, 47, 408–415. [Google Scholar] [CrossRef] [Green Version]
  12. Kwoka, J.; Sabodash, V. Price spikes in energy markets: “Business by usual methods” or strategic withholding? Rev. Ind. Organ. 2011, 38, 285–310. [Google Scholar] [CrossRef]
  13. Huang, C.; Yan, Z.; Chen, S.; Yang, L.; Li, X. Two-stage market clearing approach to mitigate generator collusion in Eastern China electricity market via system dynamics method. IET Gener. Transm. Distrib. 2019, 13, 3346–3353. [Google Scholar] [CrossRef]
  14. Bataille, M.; Bodnar, O.; Steinmetz, A.; Thorwarth, S. Screening instruments for monitoring market power—The Return on Withholding Capacity Index (RWC). Energy Econ. 2019, 81, 227–237. [Google Scholar] [CrossRef] [Green Version]
  15. Salarkheili, S.; Setayesh Nazar, M. New indices of capacity withholding in power markets. Int. Trans. Electr. Energy Syst. 2015, 25, 180–196. [Google Scholar] [CrossRef]
  16. Salarkheili, S.; Foroud, A.A. Market power assessment in electricity markets: Supply function equilibrium-based model. Int. Trans. Electr. Energy Syst. 2013, 23, 553–569. [Google Scholar] [CrossRef]
  17. Tabatabaei, M.; Nazar, M.S.; Shafie-Khah, M.; Catalão, J.P. An integrated framework for dynamic capacity withholding assessment considering commitment strategies of generation companies. Int. J. Electr. Power Energy Syst. 2022, 134, 107321. [Google Scholar] [CrossRef]
  18. Tabatabaei, S.M.; Nazar, M.S. Dynamic capacity withholding assessment of generation companies in wholesale electricity markets considering collusion. Int. J. Eng. Technol. 2019, 11, 702–715. [Google Scholar] [CrossRef] [Green Version]
  19. Shen, W.; Liang, H.; Dong, L.; Ren, J.; Wang, G. Synergistic CO2 reduction effects in Chinese urban agglomerations: Perspectives from social network analysis. Sci. Total Environ. 2021, 798, 149352. [Google Scholar] [CrossRef]
  20. Beni, S.A.; Sheikh-El-Eslami, M.-K. Market power assessment in electricity markets based on social network analysis. Comput. Electr. Eng. 2021, 94, 107302. [Google Scholar] [CrossRef]
  21. Liu, K.; Liu, W.; Xu, Q. Research on the financial crisis early warning of listed companies in China—Based on Probit model and Logit model. Acad. J. Bus. Manag. 2020, 2, 27–35. [Google Scholar]
  22. Filippopoulou, C.; Galariotis, E.; Spyrou, S. An early warning system for predicting systemic banking crises in the Eurozone: A logit regression approach. J. Econ. Behav. Organ. 2020, 172, 344–363. [Google Scholar] [CrossRef]
  23. Sun, B.; Deng, R.; Ren, B.; Teng, M.; Cheng, S.; Wang, F. Identification method of market power abuse of generators based on lasso-logit model in spot market. Energy 2022, 238, 121634. [Google Scholar] [CrossRef]
  24. Wu, C.; Zhong, P.-S.; Cui, Y. Additive varying-coefficient model for nonlinear gene-environment interactions. Stat. Appl. Genet. Mol. Biol. 2018, 17, 20170008. [Google Scholar] [CrossRef] [PubMed]
  25. Zhou, F.; Ren, J.; Lu, X.; Ma, S.; Wu, C. Gene–Environment interaction: A variable selection perspective. In Epistasis; Springer: Cham, Switzerland, 2021; pp. 191–223. [Google Scholar]
  26. He, Y.; Li, Z.; Li, D. Lessons of California electricity crisis for Zhejiang electricity market design and implementation. In Proceedings of the Annual Meeting of the Electric Power Market Committee of the Chinese Society of Electrical Engineering, Shanghai, China, 12–13 October 2018; pp. 90–98. [Google Scholar]
  27. Frank, K.A. Identifying cohesive subgroups. Soc. Netw. 1995, 17, 27–56. [Google Scholar] [CrossRef]
  28. Johnson, J.D. UCINET: A software tool for network analysis. Commun. Educ. 1987, 36, 92–94. [Google Scholar] [CrossRef]
  29. Kening, C.; Qu, H.; Ma, Q.; Luo, Z.; Wenjun, Z. Design of China’s Electricity Market Evaluation Indicators Based on SCP Model. In IOP Conference Series: Earth and Environmental Science, Proceedings of the 2020 International Symposium on Energy Environment and Green Development, Chongqing, China, 20–22 November 2020; IOP Publishing: Bristol, UK, 2021; p. 012092. [Google Scholar]
  30. Bao, M.; Ding, Y.; Shao, C.; Song, Y. Review of Nordic electricity market and its suggestions for China. Proc. CSEE 2017, 37, 4881–4892. [Google Scholar]
  31. Swinand, G.; Scully, D.; Ffoulkes, S.; Kessler, B. Modeling EU electricity market competition using the residual supply index. Electr. J. 2010, 23, 41–50. [Google Scholar] [CrossRef]
  32. Liu, D.; Zhang, Q.; Li, X.; Gu, Y.; Tan, Z. Identification of potential harmful behaviors in electricity market based on cloud model and fuzzy petri net. Autom. Electr. Power Syst. 2019, 43, 25–37. [Google Scholar]
  33. Yan, Y.; Lin, J.; Hou, Y. Construction of the framework for adaptability evaluation of electricity market. Electr. Power 2018, 51, 149–157. [Google Scholar]
  34. Zhou, H.; Chen, B.; Han, Z.; Zhang, F. Study on economic withholding in electricity market of Zhejiang Province, China. Commun. Nonlinear Sci. Numer. Simul. 2009, 14, 2495–2501. [Google Scholar] [CrossRef]
  35. Gao, X.; Zhang, S.; Zou, Y.; Li, F. Analysis and comprehensive judgement of illegitimate bidding among power suppliers. Power Syst. Prot. Control 2010, 38, 56–59+72. [Google Scholar]
  36. Kang, J.; Zhou, H. Analyzing bidding behavior of power suppliers by applying out-of-merit capacity. Proc. Electr. Power Syst. Autom. 2005, 17, 63–66. [Google Scholar]
  37. Yu, L.; Lin, N.; Wang, L. A parallel algorithm for large-scale nonconvex penalized quantile regression. J. Comput. Graph. Stat. 2017, 26, 935–939. [Google Scholar] [CrossRef]
  38. Yang, X.; Zhang, A.; Zhang, F. Farmers’ heterogeneous willingness to pay for farmland non-market goods and services on the basis of a mixed Logit model—A case study of Wuhan, China. Int. J. Environ. Res. Public Health 2019, 16, 3876. [Google Scholar] [CrossRef] [Green Version]
  39. Fang, K.; Zhang, G.; Zhang, H. Individual credit risk prediction method: Application of a lasso-logistic model. J. Quant. Tech. Econ. 2014, 1, 125–136. [Google Scholar]
  40. Fan, J.; Li, R. Variable selection via nonconcave penalized likelihood and its oracle properties. J. Am. Stat. Assoc. 2001, 96, 1348–1360. [Google Scholar] [CrossRef]
  41. Pötscher, B.M.; Leeb, H. On the distribution of penalized maximum likelihood estimators: The LASSO, SCAD, and thresholding. J. Multivar. Anal. 2009, 100, 2065–2082. [Google Scholar] [CrossRef] [Green Version]
  42. Zou, H. The adaptive lasso and its oracle properties. J. Am. Stat. Assoc. 2006, 101, 1418–1429. [Google Scholar] [CrossRef] [Green Version]
  43. Huang, J.; Ling, C.X. Using AUC and accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 2005, 17, 299–310. [Google Scholar] [CrossRef] [Green Version]
Figure 1. An indicator system for identify economic withholding behavior of the generators.
Figure 1. An indicator system for identify economic withholding behavior of the generators.
Energies 15 04135 g001
Figure 2. Diagram of the price declaration interval for generators.
Figure 2. Diagram of the price declaration interval for generators.
Energies 15 04135 g002
Figure 3. Flow chart of identifying the economic withholding behavior of generators.
Figure 3. Flow chart of identifying the economic withholding behavior of generators.
Energies 15 04135 g003
Figure 4. Graph of equity analysis (partial).
Figure 4. Graph of equity analysis (partial).
Energies 15 04135 g004
Figure 5. Cross-validation effect chart.
Figure 5. Cross-validation effect chart.
Energies 15 04135 g005
Figure 6. Effect diagram of change in indicators with the value of λ.
Figure 6. Effect diagram of change in indicators with the value of λ.
Energies 15 04135 g006
Figure 7. Impact of various indicators on the economic withholding probability of generators.
Figure 7. Impact of various indicators on the economic withholding probability of generators.
Energies 15 04135 g007
Figure 8. ROC curves.
Figure 8. ROC curves.
Energies 15 04135 g008
Table 1. An indicator system for identifying economic withholding behavior of the generators.
Table 1. An indicator system for identifying economic withholding behavior of the generators.
IrregularitiesFirst Level IndicatorSecondary IndicatorsIndicator Symbol
The economic withholdingMarket structure indicatorsMarket share S i , t
Residual supply index I i , t RS
Bidding strategy indicatorsWeighted average quotation P ¯ i , t
Quotation fluctuation degree Q ˜ i , t
Quotation and cost deviation range R i , t
High quotation index W i , t
Capacity price index E i , t
Clearance result indicatorsHigh bid winning rate B i , t
Out-of-sequence capacity index U i , t
Average rate of return H i , t
Table 2. Confusion matrix of SCAD-logit model.
Table 2. Confusion matrix of SCAD-logit model.
Category of Real BehaviorCategory of Identify Behavior
01
0TNFP
1FNTP
Table 3. Data of adjacency matrix M.
Table 3. Data of adjacency matrix M.
Serial Number of
Generators
1234……216
11.0000.8510.6940.383……0.887
20.8511.0000.7830.376……0.007
……………………………………
880.3040.9260.1670.554……0.574
890.4150.8540.6610.836……0.421
……………………………………
2150.4670.1520.8810.027……0.513
2160.8870.0070.9040.903……1.000
Table 4. Indicator selection results and collinearity test.
Table 4. Indicator selection results and collinearity test.
IndicatorsVariance Inflation FactorTolerance
Quotation fluctuation degree1.020.983
Market share1.010.986
High quotation index1.010.993
Capacity price index1.000.997
Table 5. SCAD-logit model estimation results.
Table 5. SCAD-logit model estimation results.
IndicatorsCoefficientsWaldSignificance
Quotation fluctuation degree3.71858.640.00
Market share2.73931.300.00
High quotation index52.23325.680.00
Capacity price index2.62535.230.00
Constant−6.223105.870.00
Table 6. Identification results of economic withholding of generators.
Table 6. Identification results of economic withholding of generators.
Identification Category01
Number of generators1626
Table 7. Indicator data for generators that exercise economic withholding behavior.
Table 7. Indicator data for generators that exercise economic withholding behavior.
Serial Number of
Generators
Quotation Fluctuation DegreeMarket ShareHigh Quotation IndexCapacity Price Index
10.7860.4060.8220.479
20.9850.6670.4360.266
30.7390.8800.0460.975
40.7070.8000.0460.715
50.9420.9030.0390.803
60.9270.4180.0430.973
Table 8. Results of performance evaluation.
Table 8. Results of performance evaluation.
Evaluation ItemSCAD-Logit ModelOrdinary Logit ModelLasso-Logit Model
Training SetTesting SetTraining SetTesting SetTraining SetTesting Set
overall correct ratemean0.96830.96750.95430.95380.95860.9574
standard deviation0.0010.00140.01650.01960.01270.0143
FPmean0.03560.03540.05340.05140.04370.0476
standard deviation0.00210.00150.02270.02470.01380.0146
FNmean0.02780.03140.03470.03860.02930.0349
standard deviation0.00070.00120.01520.02140.00850.0128
Table 9. AUC values.
Table 9. AUC values.
ModelAUC Value
SCAD-logit model0.9274
Ordinary logit model0.8489
Lasso-logit model0.8837
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sun, B.; Cheng, S.; Xie, J.; Sun, X. Identification of Generators’ Economic Withholding Behavior Based on a SCAD-Logit Model in Electricity Spot Market. Energies 2022, 15, 4135. https://doi.org/10.3390/en15114135

AMA Style

Sun B, Cheng S, Xie J, Sun X. Identification of Generators’ Economic Withholding Behavior Based on a SCAD-Logit Model in Electricity Spot Market. Energies. 2022; 15(11):4135. https://doi.org/10.3390/en15114135

Chicago/Turabian Style

Sun, Bo, Siyuan Cheng, Jingdong Xie, and Xin Sun. 2022. "Identification of Generators’ Economic Withholding Behavior Based on a SCAD-Logit Model in Electricity Spot Market" Energies 15, no. 11: 4135. https://doi.org/10.3390/en15114135

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop