Next Article in Journal
Effect of Fly Ash on Compressive Strength, Drying Shrinkage, and Carbonation Depth of Mortar with Ferronickel-Slag Powder
Previous Article in Journal
Studies on a Piezoelectric Cylindrical Transducer for Borehole Dipole Acoustic Measurements
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Online Methodology for Separating the Power Consumption of Lighting Sockets and Air-Conditioning in Public Buildings Based on an Outdoor Temperature Partition Model and Historical Energy Consumption Data

Institute of Building Energy, Dalian University of Technology, Ling Gong Rd, Dalian 116024, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(3), 1031; https://doi.org/10.3390/app11031031
Submission received: 16 December 2020 / Revised: 7 January 2021 / Accepted: 13 January 2021 / Published: 24 January 2021

Abstract

:
Among sub-items of energy consumption in public buildings, lighting sockets play an important role in energy-saving analysis. So, the energy consumption data quality of lighting sockets is important. However, limited by the initial cost of energy monitoring platform, it is difficult to install electricity meters covering all branches and to retrofit the incompact classification electricity branches, which results in a mixture of the lighting socket energy consumption and other components. In this study, a separation methodology is proposed. First, the abnormal data in the energy monitoring platform are cleaned and screened using a clustering algorithm. Second, the average outdoor air temperature partitioning model (OATPM) method and the k-nearest neighbor (KNN) clustering algorithm method are proposed for identifying and separating the abnormal data. These two methods have complementary advantages in the best applicable scenarios, including calculation accuracy and other aspects. The verification results for three buildings show that the relative error of this separation methodology is less than 15%. Finally, this paper presents the optimization parameters of the KNN method. Through this methodology, building managers need only historical data in an energy monitoring platform to separate the combined power consumption of the lighting sockets and air-conditioning online, independent of detailed information statistics.

1. Introduction

With increasing urbanization, research in building energy efficiency is becoming more important on a global basis. In 2018, total building energy consumption was 899 million tons of standard coal equivalent (tce), accounting for 20.62% of the total energy consumption in China. In addition, public building energy consumption was 346 million tce, accounting for 38.53% of the total building energy consumption [1,2]. Thus, energy efficiency in public buildings has become a necessity to study the energy-saving potential of different sub-items and energy efficiency strategies. Scholars from various countries study the electricity consumption characteristics of public buildings with different functions. Lee et al. found that the ratio of electricity consumption for air-conditioning, office equipment, elevators, and lighting sockets in commercial buildings was 43%, 17%, 7%, and 34%, respectively, according to an investigation of 16,000 commercial buildings in Hong Kong [3,4,5]. These results indicate that air-conditioning and lighting sockets in public buildings play an important role in building energy consumption, of which operation characteristics are worthy of special attention. Benavente-Peces et al. used various machine learning technique classifiers to analyze and classify building energy efficiency, and they demonstrated that reliable classification is feasible with a few featured parameters [6]. Suzane A. Monteiro et al. developed a methodology of energy efficiency for lighting and air-conditioning systems in buildings using a multi-objective optimization algorithm [7].
In recent years, a large number of studies have focused on the study of human behavior models, because occupant behavior is complex and requires an interdisciplinary approach to be fully understood. On the one hand, occupant behavior is influenced by external factors such as culture, economy, and climate, as well as internal factors such as individual comfort preference, physiology, and psychology. On the other hand, the occupants’ interactions with building systems strongly influence building operations and thus energy use/cost and indoor comfort; this in turn influences occupant behavior, thus forming a closed loop. Therefore, in the study of building energy consumption, it is difficult to avoid the study of human behavior in buildings. For example, Yan Zhang et al. clustered research keywords on human behavior and presented them with visual images, pointing out the close relationship among building energy consumption, human behavior, and environmental comfort [8]. Qindi Li et al. studied the influence of human behavior in the construction of a low-carbon campus in Shanghai, and the results showed that individual behavior directly affected the overall energy consumption and carbon emissions of the campus. Although the relevant energy-saving policies were introduced, energy consumption still increased by 5% per year [9]. Hoes P. et al. simulated human behavior in buildings and pointed out that architectural design should be optimized according to the actual human behavior and architectural characteristics for special buildings, and the description of human behavior should be more detailed [10]. Ioannou A. et al. analyzed the Monte Carlo sensitivity analysis results of factors affecting annual heating energy consumption and predicted mean vote (PMV) comfort index (related to human behavior) in Northern Europe [11]. Chuang Wang presented the environment- and event-related driving forces behind lighting usage in office buildings, described the degree to which these factors influence lighting use, and used a function and probability relations to describe the random process of lighting use [12]. The problem of energy consumption data quality cannot be ignored. In fact, in other related fields, some scholars have also conducted relevant research, for example, Gang Zhou et al. developed an iterative online fault identification framework utilizing a novel lost data repair technique [13].
To obtain the energy consumption monitoring data for guiding the research about human behavior and energy consumption, it is necessary to accurately obtain the lighting power consumption and socket power consumption, which have an obvious correlation with human behavior, as shown in Figure 1. It is worth noting that most public buildings in cold and severely cold regions of China use municipal hot water for central heating, which is charged by area rather than volume used actually and does not consume electricity from lighting sockets. It means that each room in winter does not consume electricity from lighting sockets to create a thermal environment, and the environment is created using the radiator with the hot water prepared centrally. This means that the research about separation methodology focuses not on energy consumption of heating but on electricity consumption.
In the past decade, the Chinese government supported the establishment of an energy monitoring platform to monitor and analyze energy consumption data online, especially electricity consumption. The energy monitoring platform has become an important technical carrier for building energy efficiency [14,15], but in most of the existing building energy monitoring platforms, these two items of power consumption data are often difficult to obtain. First, due to cost constraints and construction age, the construction process of the building electrical system is not designed according to the correct design of electrical items. The monitoring branches of lighting, sockets, and air-conditioning, shown in red in Figure 2, are mixed. Second, due to the changes in the functions and partitions of a building, the distribution circuit scheme also changes accordingly. In this case, even if the lighting is separated from sockets in the design scheme, the air-conditioning consumption is often mixed into the socket consumption in the renovation process.
So, especially for public buildings, it is difficult to retrofit electric circuits massively, and at the same time, the aforementioned problems arise when the room function or room user changes. As a result, with this background, the itemized energy consumption data, which we obtain using an electrical loop monitoring instrument, will be inaccurate, because we fail to completely separate the power consumption of the lighting, socket, and air-conditioning terminal. Taking the study of human behavior as an example, if the power consumption of lighting sockets and air-conditioning cannot be accurately separated, it is difficult to evaluate the role of human behavior, outdoor weather, and envelope structure.
Therefore, how to separate the mixed power consumption of lighting sockets and air-conditioning without large-scale reconstruction is worthy of investigation. Scholars have presented a type of non-intrusive load monitoring (NILM) method. By measuring the curve of real-time power consumption, the start and stop electrical signals (current, voltage, active power, reactive power, power factor, etc.) of the equipment can be identified for abnormal power consumption data identification and separation [16,17,18]. However, these relevant studies mainly focus on residential buildings with a few types of equipment. A possible reason for this is that the types of equipment in public buildings are complicated and their number is extremely large. Thus, it is extremely difficult to judge the state of equipment only by the total energy consumption curve. Therefore, Akbari [19] developed an end-use disaggregation algorithm based on the hybrid technology of simulation software and statistical analysis. This algorithm can adjust the estimation results according to the error size and can be used to separate the electrical energy consumption of multiple buildings in the same power grid, which overcomes some disadvantages of the NILM method. Newer studies, such as that of Li et al., used the outlier detection method to eliminate abnormal energy consumption data and then used canonical variable analysis to classify and predict energy consumption [20]. Wang proposed an energy consumption separation algorithm for an equipment load rate model. The estimation results of the equipment power consumption were obtained by inputting the rated power of the equipment, which means that the equipment power consumption is separated from total power consumption [21]. Doherty proposed a socket power consumption model, which was developed by comparing power consumption data and the corresponding equipment information. This model is applicable in separating various types of power consumption in sockets, but it is difficult to separate the air-conditioning power consumption, which is mixed with socket power consumption [22]. Anand proposed a new parameter, energy consumption per capita (K), to explain the stochastic relationship between energy consumption and utilization, which was developed by multiple nonlinear regression and deep neural network algorithms. This model can be used to estimate the energy consumption of different sub-items according to human behavior, which means that the sub-item power consumption is separated from total power consumption [23]. However, the common point of these energy separation methods [24,25,26,27,28,29,30] is to obtain the actual energy consumption data or to estimate the energy consumption of some sub-items according to the detailed information of buildings, equipment, and human behavior. The energy consumption data of different equipment can then be calculated, which means that this type of equipment power consumption is separated from total power consumption. If detailed information on buildings, equipment, and human behavior cannot be obtained, or is for public buildings, of which equipment and human behavior are complex, these methods still have some defects.
From the literature [1,2,3,4,5,6,7], we can discover the necessity of building energy efficiency, especially for public buildings. Therefore, how to use energy consumption data in monitoring platform is important, such as studying human behavior for an energy retrofit shown in the literature [8,9,10,11,12,13]. This means that accurate itemization measurement is important in the research of public building energy efficiency. However, in the face of various practical problems, it is often difficult to make accurate itemization measurement. Therefore, researchers have developed various algorithms to repair energy consumption data, such as those found in the literature [16,17,18,19,20,21,22,23], but most of the above separation methods rely on the detailed information of buildings, equipment, and human behavior. It is usually time-consuming and inefficient to survey this information. Thus, under the condition of generally abnormal energy consumption monitoring data and high labor cost, research on separation methodology, which is driven only by historical data and applied online with a universality of multiple scenarios without detailed information of buildings and equipment, is critical. Therefore, this study proposes an online separation methodology for separating the power consumption of lighting sockets and air-conditioning in public buildings. This methodology is driven only by historical data, without detailed information of buildings and equipment.
The rest of this paper is organized as follows. Section 2 introduces the basic information in this research, including the sample campus, building energy monitoring platform, and actual problems with data quality with two aspects. Section 3 introduces methods of pre-processing power consumption data with the k-means clustering algorithm. Section 4 introduces the methods and cases for identifying abnormal conditions for daily data and hourly data, compares the identification effect of the OATPM method and the KNN method, and then introduces the method and case for separating abnormal data after identifying data. Section 5 analyzes the applicability of the separation methodology. Section 6 discusses the selection of important parameters in the clustering algorithm. Finally, conclusions are presented in Section 7.

2. Background and Data for Power Monitoring

In this research, an office building in Dalian (referred to as Building A), an office building in Jinzhou (Building B), and a commercial building in Anshan (Building C) are used as the research sample set. Table 1 shows the basic information, Figure 3 and Figure 4 the temperature information.
While using the above platforms, there are many problems with data quality, which are also common problems of Chinese building energy consumption platforms, and which are reflected in the following two aspects.
(1) When problems occur in the process of data collection and transmission, they cannot be dealt with in a short period of time, resulting in missing data, data mutation, etc.
(2) When the energy measurement of low-power terminal equipment is ignored, or when different functions of low-power terminal equipment are collected by the same meter, the power consumption of lighting sockets and air-conditioning are mixed. For example, lighting, sockets, and air-conditioning are monitored by the same branch monitoring. In Building A, the peak value of lighting socket power consumption in the cooling season is 1.5 times that of the peak value in the transition season, as shown in Figure 5, which exceeds the reasonable power consumption range of the lighting socket equipment. This indicates that the data of lighting socket power consumption is mixed with the data of air-conditioning power consumption.
Therefore, cleaning abnormal data and separating mixed data is important for improving data quality, and it is also a crucial link for energy efficiency retrofits. There are two main dimensions to cleaning and separating abnormal data: one is for abnormal daily data, and the other is for abnormal hourly data. For daily data, the granularity of measurements is one day, and it means one day is one point. For hourly data, the granularity of measurements is one hour, and it means one hour is one point. Different methods are proposed to address the two dimensions, including the OATPM and KNN methods, and the details are in Section 4.

3. Pre-Processing of Power Consumption Data

It is necessary to identify and clean abnormal data before separating the power consumption data of lighting sockets and air-conditioning. The abnormal conditions mainly include the data missing and data mutation storage. Among them, the values of missing data are displayed as “0,” which means that they are easy to identify and clean. As for data mutation, the partial power consumption data are too big or small, exceeding the actual threshold of energy consumption, as shown in Figure 5 and its description above. In this paper, the k-means clustering algorithm [30,31,32,33,34] is adopted to identify and clean the mutated data. The steps are as follows.
  • The K sets of points are randomly selected as the initial clustering center in the sample data set. According to the three kinds of air-conditioning state (closed/slightly open/fully open) and three kinds of data state (located/above/below), K is evaluated at 3 for KNN throughout the paper.
  • The distance between other points and the initial clustering center point is calculated, and these other points are allocated to the nearest neighbor cluster.
  • After the preliminary clustering is completed, the averages of all sample points in different clusters are selected as the new clustering center, and then steps 1 and 2 are repeated.
  • The clustering center and clusters of the sample points are updated iteratively, until the clustering center no longer changes, which means the end of this clustering algorithm. Next, we can output the clustering center and K pieces of the clusters of the sample points.
What needs illustration is that the distance between the points in step (2) can be calculated by the Minkowski distance [30,31,32,33,34], as shown in Equation (1), where p is 2, when we compute the two-dimensional point, and x 1 ,   x 2 are the value of the two points.
D i s t ( x 1 , x 2 ) = [ k = 1 n | x 1 , k x 2 , k | p ] 1 p
Taking Building A as an example, workdays, weekends, and holidays are selected as different types, and abnormal data are identified and cleaned day by day, as shown in Table 3 and Figure 6. Holidays are different from weekends. Holidays mean rest days, which are usually for some festivals and for summer or winter vacations, such as Christmas vacation. If the number of samples in the largest and smallest cluster centers accounts for less than 5% of the total sample number, or if the difference between the cluster centers is too large, it will be judged as a data mutation.
From Figure 6, it is seen that the k-means method of cleaning daily data screens abnormal data effectively. The results of cleaning the abnormal hourly data in Building A are shown in Figure 7.

4. Separation Methodology

The proposed methodology needs to identify the abnormal days or hours first, and then use historical data to separate abnormal data. There are five main steps, as shown in Figure 8. The details are shown in the other sections that follow.
For the ways to discern the abnormal hours or day and operating state of air-conditioning, mentioned in Figure 8, there are three main steps as follows:
  • Use the model of average outdoor temperature and lighting socket daily power consumption, or the k-means clustering algorithm, to obtain the eigenvalues of the method, including clustering center (c), outdoor temperature threshold ( T M i n ), and power consumption data threshold ( E ¯ M i n ), etc.
  • Use the above eigenvalues and the lighting socket power consumption with air-conditioning opened to identify the state of the air-conditioning.
  • Use appropriate historical data of lighting sockets (without mixing the power consumption of the air-conditioning) to predict and interpolate the actual power consumption of the lighting socket when the air-conditioning is running.
The OATPM method is appropriate for identifying the state of air-conditioning for daily data, because it can only identify the closed or open state. However, the KNN method is appropriate for identifying the state of air-conditioning for daily data or hourly data, because it can identify the closed, slightly open, or fully open states.

4.1. Method for Identifying Abnormal Conditions for Daily Data

According to the ratio of air-conditioning running hours (p), the state of air-conditioning is specified. If p < 40%, it is judged that the air-conditioning was closed; if 40% < p < 70%, it is judged that the air-conditioning was slightly open; if p > 70%, it is judged that the air-conditioning was fully open. The steps of these two methods are shown in Figure 9a,b.
It is worth noting that the KNN method also uses the model of average outdoor temperature and daily lighting socket power consumption for the eigenvalues (cluster center and maximum and minimum values of the cluster interval, such as c, T m i n ,   E ¯ m i n ) whether identifying abnormal daily or hourly data. Thus, it will not repeat the same process in Section 4.1, Section 4.2, Section 4.3, Section 4.4 and Section 4.5.

4.2. The Case of Identifying Abnormal Conditions for Daily Data

From April to August 2018, Building A is selected to verify the results for identifying and separating the normal data by the above methodology. The parameter a is used to evaluate the results, as follows:
a = n N
For the OATPM method, taking daily power consumption as the ordinate and the daily average outdoor temperature as the abscissa, the model of daily average outdoor temperature and daily power consumption of lighting sockets (mixed with power consumption of air-conditioning) is shown in Figure 10. The three clusters are displayed in three colors.
The data are fitted into a curve, and the second derivative is calculated to obtain the inflection point (24 °C, 1445 kWh). The average outdoor temperature and power consumption are, respectively, represented by black dotted lines. As shown in Figure 10, when the outdoor temperature is lower than 24 °C the daily power consumption is within the range of 750–1500 kWh, and the maximum value is 1445 kWh.
When the outdoor temperature is higher than 24 °C, the daily cumulative electricity consumption is higher than 1445 kWh. According to the properties of the second derivative, at 24 °C, the plus and minus signs of the first derivative change, that is, the local extreme value appears. Therefore, 24 °C is presumed to be the point of the power consumption characteristic change. At the same time, analyzing this point by the correlation coefficient, the power consumption of the lighting socket equipment is weakly correlated with the outdoor temperature, but the power consumption of the air-conditioning is strongly correlated with outdoor meteorological conditions. The conditions when T i ¯ is higher than 24 °C and T i ¯ is lower than 24 °C are calculated, respectively, about the correlation coefficient between the average outdoor temperature and the daily power consumption, as follows:
r X Y = c o v ( X i , Y ) v a r ( X i ) v a r ( Y )
where X i is the average daily outdoor temperature and Y is the power consumption of the lighting socket (mixed with the power consumption of air-conditioning). The value of the correlation coefficient R ranges from 0 to 1. The closer it is to 1, the stronger the correlation between variables; the closer it is to 0, the weaker the correlation. After the calculation, R ( T i ¯ 24   ) = 0.709 ;   R ( T i ¯ 24   ) = 0.196 .
Therefore, when the average outdoor temperature is lower than the judgment temperature (24 °C), the power consumption is small, and the relationship between the average outdoor temperature and the lighting socket power consumption is relatively weak, so it can be considered that the air-conditioning power consumption is not mixed with the lighting socket power consumption. When the average outdoor temperature is higher than the judgment temperature (24 °C), the power consumption increases suddenly, and the maximum value of the power consumption curve appears. The relationship between the average daily outdoor temperature and the power consumption is relatively large. Thus, it can be considered that the air-conditioning power consumption is mixed with the lighting socket power consumption.
Regarding the KNN method, it is calculated by the k-means algorithm, of which the parameter K, the number of clusters, is also evaluated at 3, as discussed above. That is, closed (p < 40%), slightly open (40% < p < 70%), and fully open (p > 70%). The three types of cluster distributions are also marked in three different colors, as shown in Figure 10.
The identification results with different methods are calculated for analyzing the advantages and disadvantages of the two proposed methods, as shown in Figure 11a,b. The accuracy a of identification results for the OATPM method is 83.9%, and the accuracy for the KNN method is 93.1%. This shows that the accuracy of the OATPM method is slightly lower than that of the KNN method, but it still reaches a high level. At the same time, the OATPM method only needs to calculate the second derivative without a complex algorithm, so the calculation speed is high. A more detailed comparison is made in Table 4.

4.3. Method for Identifying Abnormal Conditions for Hourly Data

According to the above analysis, only the clustering algorithm is suitable for this situation, of which the steps are similar to those shown in Figure 9b. The steps of this method for hourly data are not exactly the same as the steps for daily data.
(1)
Using the clustering algorithm to generate the critical eigenvalues, T I I I ,   M i n and E ¯ I I , M i n , when the power consumption and the outdoor temperature are less than T I I I , M i n and E ¯ I I , M i n , respectively, the state of the air-conditioning is closed. For the contrary state, the air-conditioning is open.
However, the state of the air-conditioning does not completely depend on the outdoor climate, and human behavior is also a type of complex influencing factor. For example, when the power consumption is higher than E ¯ I I , M i n and the outdoor temperature is less than T I I I , M i n , it is possible that air-conditioning is closed and the lighting socket power consumption is higher, as shown in Figure 12 (the fuzzy area in the upper left corner). When the outdoor temperature is higher than T I I I , M i n and the power consumption is less than E ¯ I I , M i n , it is possible that people do not use air-conditioning due to their habits, even if the outdoor temperature reaches the degree of using air-conditioning, as shown in Figure 12 (the fuzzy area in the bottom right corner). Therefore, after the end of step (1), the next step should be identified.
(2)
The theoretical maximum power consumption e i , M a x of the lighting socket in this special hour is calculated and compared with the actual power consumption e i . If e i is higher than e i , M a x , it is considered that the power consumption of the lighting socket is mixed with that of the air conditioner; otherwise, it is not mixed.

4.4. The Cases of Identifying Abnormal Conditions for Hourly Data

The data of Building A in August 2018 are selected for the case analysis, as shown in Figure 12. The three cluster centers and eigenvalues, such as large and small thresholds, are shown in Table 5.
The comparison diagram between the simulation conditions and the actual conditions for hourly data is shown in Figure 13, with 93.1% accuracy. It is divided into three states (closed, slightly open, and fully open) for daily data, but this is not necessary for the complex steps for hourly data. It can also be divided into two states (closed and open) without affecting the results.

4.5. Comparison of the OATPM Method and the KNN Method

Together, the OATPM method and the KNN method constitute the online identifying methodology, which is the first step of the separation methodology. A comparison of their advantages and disadvantages is displayed in Table 5. According to different application conditions, different methods can be selected to identify the air-conditioning state, to better separate the mixed data.

4.6. The Method and Case of Separating Abnormal Data after Identifying Data

After identifying the normal data and their corresponding hours or days, the work of separating abnormal data begins.
Abnormal data appear on abnormal days or hours, which need to be separated. The normal power consumption for the nearest days or hours of the abnormal days or hours are selected for interpolation. If the nearest data are also abnormal, the second-nearest data need to be selected for interpolation. By analogy, the normal power consumption data of lighting sockets without air-conditioning can finally be obtained.
Equation (4) is used to calculate the relative error between the lighting socket power consumption of the separation method and the actual situation, as follows:
δ = Δ p i , j × 100 % = | e i , j p i , j | p i , j × 100 %
where δ is the relative error when separating mixed data (%), e i , j is the lighting socket power consumption of the separation method (kWh), and p i , j is the lighting socket power consumption of the actual situation (kWh). The data of Building A from April to August 2018 are used for verifying the results of the separation. The error of 355 data points is shown in Figure 14, and some parts of the details are shown in Figure 15.
It is seen from Figure 15 that the relative error when separating mixed data is mostly within 10%. The numbers of samples for which the relative error is more than 10% accounts for 3.94%. Therefore, this method is feasible to a certain extent.

5. Discussion of the Separation Methodology

The human behavior, type of equipment, climates, and functions of different buildings are different, which leads to obvious differences in the power consumption characteristics of different buildings. To verify the applicability of the methodology for identifying and separating mixed data, on the premise of the successful verification of Building A, Buildings B and C, having different areas and functions, are selected as samples for further verification.
The results show that the methodology is also applicable to buildings with other functions and climates, of which the identification accuracy is above 90%. A comparison of identifying and separating data is shown in Figure 16 and Table 6. As seen in Figure 16, the trend of separation is basically consistent with the actual situation, and the separation power consumption is close to the actual value.
The relative errors of separation in Buildings A, B, and C are shown in Figure 17. The relative error of buildings having different functions is within 15%, and the relative error less than 10% in each building is more than 85%, which indicates the high applicability of the proposed method.
At present, there is no accurate specification for energy consumption data processing, but there are clear specifications in the related field of energy consumption prediction, as shown in Table 7 [35,36]. From the strictest point of view, the error of this method is within the acceptable range.
For future research, on the one hand, the results of the study show that the effect is obvious in cold regions and severely cold regions. As this method is purely data-driven, its applicability to other climatic zones is of value for further study. On the other hand, the kind of methodology that is driven by historical data with nearest-neighbor clustering algorithm is also widely cited in other fields about building energy efficiency—for example, the research about the effects of human behavior or meteorology on energy consumption. So, it also has reference value for the research in other fields.

6. Selection of Important Parameters in the Clustering Algorithm

In the process of identification and separation by the KNN method, the selection of some parameters is critical for accuracy, primarily the selection of clustering samples and the number of clusters, K.
As for the number of clusters, K, it should be selected according to actual demand. When one needs to identify and separate abnormal data by the KNN method for daily data, the three states, that is, closed, slightly open, or fully open, should be considered. When one needs to identify and separate abnormal data for daily data, it is enough to divide into two states, that is, closed and open.
Regarding the selection of clustering samples, the number of clustering samples, such as samples in a whole year or in a cooling season, directly affects the clustering results. In severely cold and cold climate areas, outdoor temperature values vary greatly in different seasons. Thus, in the case of clustering with outdoor temperature and power consumption values as variables, the outdoor temperature values vary with the number of clustering samples, which means that there is a difference between the identification and separation results. To obtain the best size of samples with strong applicability and more accuracy, the sample in a whole year, a cooling season, or a “transition season + cooling season” are selected for validation.
Samples for a whole year of Building A are selected for identifying and separating using the k-means clustering algorithm. The detailed information for these samples is shown in Table 8.
According to the data in Table 7, the following problems are noted. First, the variation trend of power consumption and the outdoor temperature are different, which means that identifying the three states (closed, slightly open, fully open) is difficult. Second, it is difficult to guarantee the eligible identification accuracy. If the power consumption value is used to define the three states of air-conditioning, the accuracy of identification results is 73.25%. If the outdoor temperature value is used to define the three states of air-conditioning, the accuracy of identification results is 22.42%. Finally, the outdoor temperature threshold difference of each cluster is too large, which means that identifying the state of air-conditioning is difficult.
Similarly, samples of the cooling season (from June to August) for Building A, and samples of the “transition season + cooling season” (from April to August) for Building A are also selected. The different results of identification are shown in Figure 18, with the detailed data shown in Table 9 and Table 3 separately. It is seen that the identification results of air-conditioning with the data of “cooling season + transition season” is the closest to the actual conditions, followed by the results of the cooling season.

7. Conclusions

(1)
The OATPM and KNN methods are driven by the historical data of the energy monitoring platform, which can effectively separate the power consumption of lighting sockets and air-conditioning in public buildings. According to the three kinds of air-conditioning state (closed/slightly open/fully open) and three kinds of data state (located/above/below), K is evaluated at 3 for KNN throughout the paper. The identification error for three public buildings utilizing the method was less than 15%, and the proportion of error greater than 10% was less than 15%.
(2)
The OATPM method is suitable for identifying and separating daily data, and the calculation speed is high, but it cannot identify and separate hourly data. Thus, it is suitable for scenes with low identification accuracy, such as research on total energy consumption statistics. The KNN method is suitable for identifying and separating not only daily data but also hourly data; however, the calculation is complex and slow. Therefore, it is suitable for scenes with high identification accuracy, such as research on the correlation between human behavior characteristics and energy consumption.
(3)
The methodology proposed in this study is suitable for public buildings with different functions and climates, especially for buildings with high power consumption values for lighting sockets or large differences between power consumption characteristics of air-conditioning and lighting sockets. For example, in commercial buildings, the power consumption of air-conditioning is relatively large. Therefore, the power consumption of lighting sockets is far less than the power consumption of lighting sockets (mixed air-conditioning) in the cooling season, which means that the number of clustering iterations is fewer, and the distances between different clusters is greater. Thus, the identification results of this separation methodology for commercial buildings are accurate.

Author Contributions

Methodology, validation, writing—original draft, T.Z.; investigation and modeling, C.Z. and T.U.; conceptualization and supervision, L.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key Research and Development Project of China (Grant No. 2017YFC0704203), National Natural Science Foundation of China (Grant No. 52078096) and “the Fundamental Research Funds for the Central Universities” (Grant No. DUT20JC47).

Institutional Review Board Statement

Studies do not involve humans or animals.

Informed Consent Statement

Studies do not involve humans.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. China Association of Building Energy Efficiency. Report on Chinese Building Energy Consumption in 2018. Available online: http://www.cabee.org/site/content/22960.html (accessed on 9 December 2018).
  2. China Association of Building Energy Conservation, Energy Consumption Statistics Committee of China Association of Building Energy Conservation. Report of China Building Energy Consumption Research. Available online: https://www.cabee.org/site/content/23565.html (accessed on 10 April 2020).
  3. Deng, S.; Burnett, J. A study of energy performance of hotel buildings in Hong Kong. Energy Build. 2000, 31, 7–12. [Google Scholar] [CrossRef]
  4. Lee, W.L.; Yik, F.W.H.; Jones, P.; Burnett, J. Energy saving by realistic design data for commercial buildings in Hong Kong. Appl. Energy 2001, 70, 59–75. [Google Scholar] [CrossRef]
  5. Philip, C.H.; Chow, W.K. Energy use in commercial buildings in Hong Kong. Energy 2001, 69, 243–255. [Google Scholar]
  6. Benavente-Peces, C.; Ibadah, N. Buildings Energy Efficiency Analysis and Classification Using Various Machine Learning Technique Classifiers. Energies 2020, 13, 3497. [Google Scholar] [CrossRef]
  7. Monteiro, S.A.; Monteiro, F.P.; Tostes, M.E.L.; Carvalho, C.M. Methodology for Energy Efficiency on Lighting and Air Conditioning Systems in Buildings Using a Multi-Objective Optimization Algorithm. Energies 2020, 13, 3303. [Google Scholar] [CrossRef]
  8. Zhang, Y.; Bai, X.; Mills, F.P.; Pezzey, J.C.V. Rethinking the role of occupant behavior in building energy performance: A review. Energy Build. 2018, 172, 279–294. [Google Scholar] [CrossRef]
  9. Li, Q.; Sun, X.; Chen, C.; Yang, X. Characterizing the household energy consumption in heritage Nanjing Tulou buildings, China: A comparative field survey study. Energy Build. 2012, 49, 317–326. [Google Scholar] [CrossRef]
  10. Hoes, P.; Hensen, J.L.M.; Loomans, M.G.L.C.; de Vries, B.; Bourgeois, D. User behavior in whole building simulation. Energy Build. 2009, 41, 295–302. [Google Scholar] [CrossRef] [Green Version]
  11. Ioannou, A.; Itard, L.C. Energy performance and comfort in residential buildings: Sensitivity for building parameters and occupancy. Energy Build. 2015, 92, 216–233. [Google Scholar] [CrossRef]
  12. Wang, C. Simulation Research on Occupant Energy-Related Behaviors in Building; Tsinghua University: Beijing, China, November 2014. [Google Scholar]
  13. Zhou, G.; Han, Z.; Fu, J.; Xu, G.H.; Ye, C. Iterative Online Fault Identification Scheme for High-Voltage Circuit Breaker Utilizing a Lost Data Repair Technique. Energies 2020, 13, 3311. [Google Scholar] [CrossRef]
  14. Ministry of House and Urban-Rural Development of People’s Republic of China. Implementation Opinions on Strengthening the Administration of Energy Conservation in State Organs Office Buildings and Large Public Buildings. 2007. Available online: http://www.mohurd.gov.cn/wjfb/200710/t20071026_158566.html (accessed on 23 October 2007).
  15. Ministry of House and Urban-Rural Development of People’s Republic of China. Interim Measures for the Acceptance and Operation Management of Provincial-Level Public Building Energy Consumption Monitoring Platform. 2016. Available online: http://www.mohurd.gov.cn/wjfb/201604/t20160415_227217.html (accessed on 11 April 2016).
  16. Marceau, M.L.; Zmeureanu, R. Nonintrusive load disaggregation computer program to estimate the energy consumption of major end uses in residential buildings. Energy Convers. Manag. 2000, 41, 1389–1403. [Google Scholar] [CrossRef] [Green Version]
  17. Pihala, H. Non-intrusive Appliance Load Monitoring System Based on a Modern kWh- Meter; VTT Publications: Espoo, Finland, 1998; p. 356. [Google Scholar]
  18. Norford, L.K.; Leeb, S.B. Non-intrusive electrical load monitoring in commercial buildings based on steady-state and transient load-detection algorithms. Energy Build. 1996, 24, 51–64. [Google Scholar] [CrossRef]
  19. Akbari, H.; Konopacki, S.J. Application of an End-Use Disaggregation Algorithm for Obtaining Building Energy-Use Data. J. Sol. Energy Eng. 1998, 120, 205–210. [Google Scholar] [CrossRef]
  20. Li, X.; Bowers, C.P.; Schnier, T. Classification of Energy Consumption in Buildings with Outlier Detection. IEEE Trans. Ind. Electron. 2010, 57, 3639–3644. [Google Scholar] [CrossRef]
  21. Wang, X. Studies on Key Technology of Sub-Metering in Commercial Buildings; Tsinghua University: Beijing, China, November 2010. [Google Scholar]
  22. Doherty, B.; Trenbath, K. Device-level plug load disaggregation in a zero energy office building and opportunities for energy savings. Energy Build. 2019, 204, 109480. [Google Scholar] [CrossRef]
  23. Anand, P.; Cheong, D.; Sekhar, C.; Santamouris, M.; Kondepudi, S. Energy saving estimation for plug and lighting load using occupancy analysis. Renew. Energy 2019, 143, 1143–1161. [Google Scholar] [CrossRef]
  24. Aiad, M.; Lee, P.H. Energy disaggregation of overlapping home appliances consumptions using a cluster separating approach. Sustain. Cities Soc. 2018, 44, 487–494. [Google Scholar] [CrossRef]
  25. Aiad, M.; Lee, P.H. Unsupervised approach for load disaggregation with devices interactions. Energy Build. 2016, 116, 96–103. [Google Scholar] [CrossRef]
  26. Aiad, M.; Lee, P.H. Non-intrusive load disaggregation with adaptive estimations of devices main power effects and two-way interactions. Energy Build. 2016, 130, 131–139. [Google Scholar] [CrossRef]
  27. Farinaccio, L.; Zmeureanu, R. Using a pattern recognition approach to disaggregate the total electricity consumption in a house into the major end-uses. Energy Build. 1999, 30, 245–259. [Google Scholar] [CrossRef]
  28. Gabaldón, A.; Molina, R.; Marín-Parra, A.; Valero-Verdu, S.; Alvarez, C. Residential end-uses disaggregation and demand response evaluation using integral transforms. J. Mod. Power Syst. Clean Energy 2017, 5, 91–104. [Google Scholar] [CrossRef] [Green Version]
  29. Kolter, J.; Jaakkola, T. Approximate inference in additive factorial HMMs with application to energy disaggregation. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), La Palma, Canary Islands, Spain, 21–23 April 2012; pp. 1472–1482. [Google Scholar]
  30. Zhou, Y.; Shi, Z.; Shi, Z.; Gao, Q.; Wu, L. Disaggregating power consumption of commercial buildings based on the finite mixture model. Appl. Energy 2019, 243, 35–46. [Google Scholar] [CrossRef]
  31. Leonard, K.; Peter, J.R. Finding Groups in Data: An Introduction to Cluster Analysis; John Wiley & Sons: Hoboken, NJ, USA, 1990. [Google Scholar]
  32. Forgy, E. Cluster analysis of multivariate data: Efficiency vs. interpretability of classifications. Biometrics 1965, 21, 768. [Google Scholar]
  33. Jain, A.K. Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 2010, 31, 651–666. [Google Scholar] [CrossRef]
  34. Steinley, D. K-Means clustering: A half-century synthesis. Brit. J. Math. Stat. Psychol. 2006, 59, 1–34. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. ASHRAE Standards Committee. Measurement of Energy and Demand Savings: ASHRAE Guidelines 14—2002; ASHRAE Inc.: Atlanta, GA, USA, 2002; p. 17. [Google Scholar]
  36. US Department of Energy Office of Energy Efficiency and Renewable Energy. M&V guidelines: Measurement and verification for federal energy projects. Islington 2008. [Google Scholar] [CrossRef]
Figure 1. The indoor energy consumption component and its influencing factors in public buildings.
Figure 1. The indoor energy consumption component and its influencing factors in public buildings.
Applsci 11 01031 g001
Figure 2. Construction drawing of electric energy monitoring for a sample building studied in this paper.
Figure 2. Construction drawing of electric energy monitoring for a sample building studied in this paper.
Applsci 11 01031 g002
Figure 3. Daily maximum air temperature for three cities.
Figure 3. Daily maximum air temperature for three cities.
Applsci 11 01031 g003
Figure 4. Daily minimum air temperature for three cities. The energy consumption monitoring platforms of Buildings A, B, and C are listed in Table 2.
Figure 4. Daily minimum air temperature for three cities. The energy consumption monitoring platforms of Buildings A, B, and C are listed in Table 2.
Applsci 11 01031 g004
Figure 5. Time-series characteristics of the electricity consumption of lighting sockets.
Figure 5. Time-series characteristics of the electricity consumption of lighting sockets.
Applsci 11 01031 g005
Figure 6. Scatter diagram of daily electricity consumption of lighting sockets in Building A.
Figure 6. Scatter diagram of daily electricity consumption of lighting sockets in Building A.
Applsci 11 01031 g006
Figure 7. Scatter diagram of hourly electricity consumption of lighting sockets in Building A.
Figure 7. Scatter diagram of hourly electricity consumption of lighting sockets in Building A.
Applsci 11 01031 g007
Figure 8. Diagram of the overall model of implementing the identification and separation method.
Figure 8. Diagram of the overall model of implementing the identification and separation method.
Applsci 11 01031 g008
Figure 9. Identification method of air-conditioning usage condition on the designated day.
Figure 9. Identification method of air-conditioning usage condition on the designated day.
Applsci 11 01031 g009
Figure 10. Model of average outdoor temperature versus daily electricity consumption.
Figure 10. Model of average outdoor temperature versus daily electricity consumption.
Applsci 11 01031 g010
Figure 11. Comparison of identification results using different methods.
Figure 11. Comparison of identification results using different methods.
Applsci 11 01031 g011
Figure 12. Characteristic value recognition results.
Figure 12. Characteristic value recognition results.
Applsci 11 01031 g012
Figure 13. Comparison of identification results using the KNN method.
Figure 13. Comparison of identification results using the KNN method.
Applsci 11 01031 g013
Figure 14. Comparison of separating the mixed data of the separation method and actual data.
Figure 14. Comparison of separating the mixed data of the separation method and actual data.
Applsci 11 01031 g014
Figure 15. Relative error number distribution when separating mixed data.
Figure 15. Relative error number distribution when separating mixed data.
Applsci 11 01031 g015
Figure 16. Comparison of identifying and separating abnormal data by the proposed method.
Figure 16. Comparison of identifying and separating abnormal data by the proposed method.
Applsci 11 01031 g016
Figure 17. Distribution of the relative error between the separation data and the actual values.
Figure 17. Distribution of the relative error between the separation data and the actual values.
Applsci 11 01031 g017
Figure 18. Recognition results of air-conditioning usage conditions with different variable durations.
Figure 18. Recognition results of air-conditioning usage conditions with different variable durations.
Applsci 11 01031 g018
Table 1. Basic information for Buildings A, B, and C.
Table 1. Basic information for Buildings A, B, and C.
Building CodesFloor Height (m)Floor Area (m2)Energy Intensity (kWh/m2·Year)Climate
Partition
Heating SchemeCooling Scheme
Building A11.837,59313.9Cold regionMunicipal waterSeparate air-conditioning
Building B5626,00046.6Severely cold regionMunicipal waterSeparate air-conditioning
Building C41100,00093.9Severely cold regionMunicipal waterFan coil unit
Annotation: China can be divided into five climatic zones: severely cold region, cold region, hot-summer and cold-winter region, hot-summer and warm-winter region, and moderate region.
Table 2. Basic information on building energy consumption monitoring platforms.
Table 2. Basic information on building energy consumption monitoring platforms.
Energy Consumption Monitoring PlatformBuilding SampleTotal Number of Buildings (Blocks)Floor Area of Buildings
(×1000 m2)
Monitoring Point
(Pieces)
Dalian public institution energy consumption monitoringA15300951
Liaoning public institution building energy consumption monitoring platformB, C5020301659
Table 3. K-means clustering results of daily lighting socket consumption.
Table 3. K-means clustering results of daily lighting socket consumption.
State WorkdaysHolidays
The number of the cluster123123
Cluster center (kWh)1848.651263.52485.151037.01778.52151.75
Sample numbers (pieces)8130721513
Proportion (%)4.8890.854.2728.0068.004.00
State conclusionAbnormalReasonableAbnormalReasonableReasonableAbnormal
Table 4. Comparison of the OATPM and KNN methods.
Table 4. Comparison of the OATPM and KNN methods.
MethodIdentification AccuracyApplicable OccasionsDegree of IdentificationCan It Be Used to Predict Energy Consumption?
Temperature model83.9%For daily dataClosed/openYes
Clustering
algorithm
93.1%
96.2%
For daily data
For hourly data
Closed/slightly open/fully openNo
Note: The temperature models can be used to predict energy consumption.
Table 5. Eigenvalues of three clusters by the KNN method.
Table 5. Eigenvalues of three clusters by the KNN method.
Cluster CategoriesNumber of SamplesEigenvaluesPower Consumption of Lighting (kWh)Outdoor Temperature (°C)
Cluster Ⅰ: closed47C1152.5017.28
MAX1282.6524.55
MIN805.473.30
Cluster Ⅱ: slightly open27C1448.1119.21
MAX1686.7524.83
MIN1287.3319.89
Cluster Ⅲ: fully open13C2066.5528.87
MAX2280.9032.05
MIN1787.0425.38
Table 6. Accuracy of identifying operating conditions of air-conditioning.
Table 6. Accuracy of identifying operating conditions of air-conditioning.
BuildingClimate ZoneBuilding FunctionWorking TimeForm of Air-ConditioningAccuracy
For Daily Data (%)For Hourly Data (%)
Building AColdOffice building7:00–22:00Separate air-conditioning93.196.3
Building BSeverely coldOffice building7:00–17:00Separate air-conditioning91.293.4
Building CSeverely coldCommercial building8:00–21:00Fan coil unit 95.696.0
Table 7. Relevant international regulations.
Table 7. Relevant international regulations.
Technical Code in China (JGJ176-2009)IPMVPFEMP
Error in one month±15%±20%±15%
Error in one yearNo Standard No Standard ±10%
Note: Technical code in China (JGJ176-2009) is “Technical code for the retrofitting of public building on energy efficiency”; IPMVP is “International Performance Measurement and Verification Protocol”; and FEMP is “US Federal Energy Management Program.”
Table 8. Detailed information of the samples for a whole year by KNN method.
Table 8. Detailed information of the samples for a whole year by KNN method.
Cluster CategoriesNumber of SamplesEigenvaluesPower Consumption of Lighting (kWh)Outdoor Temperature (°C)
Cluster Ⅰ: closed124C1182.5216.00
MAX1329.9024.95
MIN805.47−8.72
Cluster Ⅱ: slightly open74C1517.242.52
MAX1787.0430.47
MIN1353.89−12.10
Cluster Ⅲ: fully open25C2089.8429.09
MAX2280.9032.05
MIN1610.4925.38
Table 9. The detailed information of the samples for a whole year by KNN method.
Table 9. The detailed information of the samples for a whole year by KNN method.
Cluster CategoriesNumber of SamplesEigenvaluesPower Consumption of Lighting (kWh)Outdoor Temperature (°C)
Cluster Ⅰ: closed25C1190.1320.79
MAX1329.9024.95
MIN805.4717.35
Cluster Ⅱ: slightly open17C1495.0424.62
MAX1686.7530.47
MIN1353.8922.41
Cluster Ⅲ: severely open13C2066.5528.84
MAX2280.9032.05
MIN1787.0425.38
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhao, T.; Zhang, C.; Ujeed, T.; Ma, L. Online Methodology for Separating the Power Consumption of Lighting Sockets and Air-Conditioning in Public Buildings Based on an Outdoor Temperature Partition Model and Historical Energy Consumption Data. Appl. Sci. 2021, 11, 1031. https://doi.org/10.3390/app11031031

AMA Style

Zhao T, Zhang C, Ujeed T, Ma L. Online Methodology for Separating the Power Consumption of Lighting Sockets and Air-Conditioning in Public Buildings Based on an Outdoor Temperature Partition Model and Historical Energy Consumption Data. Applied Sciences. 2021; 11(3):1031. https://doi.org/10.3390/app11031031

Chicago/Turabian Style

Zhao, Tianyi, Chengyu Zhang, Terigele Ujeed, and Liangdong Ma. 2021. "Online Methodology for Separating the Power Consumption of Lighting Sockets and Air-Conditioning in Public Buildings Based on an Outdoor Temperature Partition Model and Historical Energy Consumption Data" Applied Sciences 11, no. 3: 1031. https://doi.org/10.3390/app11031031

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop