Minute-Scale Models for the Diffuse Fraction of Global Solar Radiation Balanced between Accuracy and Accessibility

Paulescu, Eugenia; Paulescu, Marius

doi:10.3390/app13116558

by Eugenia Paulescu and Marius Paulescu *

Faculty of Physics, West University of Timisoara, V. Parvan 4, 300223 Timisoara, Romania

* Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(11), 6558; https://doi.org/10.3390/app13116558

Submission received: 3 May 2023 / Revised: 23 May 2023 / Accepted: 25 May 2023 / Published: 28 May 2023

(This article belongs to the Special Issue Solar Radiation: Measurements and Modelling, Effects and Applications—Volume III)

Abstract:

Separation models are tools used in solar engineering to estimate direct normal (DNI) and diffuse horizontal (DHI) solar irradiances from measurements of global solar irradiance (GHI). This paper proposes two empirical separation models that stand out owing to their simple mathematical formulation: a rational polynomial equation. Validation of the new models was carried out against data from 36 locations, covering the four major climatic zones. Five current top minute-scale separation models were considered references. The tests were performed on the final products of the estimation: DNI and DHI. The first model (M1) operates with eight predictors (evaluated from GHI post-processed measurements and clear-sky counterpart estimates) and consistently outperforms the already established models. The second model (M2) operates with three predictors based only on GHI measurements, which gives it a high degree of accessibility. Based on a statistical linear ranking method according to the models’ performance at every station, M1 leads the hierarchy, ranking first in both DNI and DHI estimation. The high accessibility of M2 does not compromise accuracy; it proves to be a real competitor to the best-performing current models.

Keywords:

diffuse solar irradiance; direct normal solar irradiance; separation model; diffuse fraction; clearness index

1. Introduction

Clean energy generation is a very important issue in the development of any community that aspires to become sustainable. In this context, we are witnessing an exponential development of the photovoltaic (PV) energy sector [1]. The feasibility study as well as the technical design of a PV system are based on an on-site, detailed simulation of the system’s operation. For running such simulations, accurate in-plane estimates of solar irradiance are required. Even today, measurements of solar irradiance in various spatial directions are very scarce. Thus, in-plane solar irradiance is commonly estimated on the basis of solar irradiance measured on the horizontal surface [2]. To apply a transposition model, both direct-normal (DNI) and diffuse (DHI) solar irradiance components are required [3]. However, the component of solar radiation measured with the highest spatial density at the planetary level is global horizontal solar irradiance (GHI). This fact assigns the separation models a privileged place in modeling solar resources at the Earth’s surface. In a broad sense, a separation model estimates DNI and DHI from GHI measurements (see, e.g., [4] for a recent treatment of the topic).

The term irradiance defines the solar energy flux, expressed in W/m². GHI denotes the incoming solar energy flux incident on a unitary horizontal surface from the entire celestial vault. GHI is the result of summing two components in the horizontal plane: the beam and the diffuse solar irradiance. The beam solar irradiance is the solar energy flux coming directly from the Sun disk to the Earth, measured on a horizontal plane. If θ_z denotes the zenithal angle, then the beam solar irradiance is expressed by the cosine law: DNI · cos θ_z. Diffuse solar irradiance, or DHI, defines the solar radiation that has been scattered by the atmospheric constituents but still reaches the surface of the Earth. The three components are related by the closure equation:

\mathrm{GHI} = \mathrm{DNI} \cos θ_{z} + \mathrm{DHI}

(1)

The problem of separating global irradiance into its direct and diffuse components is a current topic of interest in solar energy research (see, e.g., [5]), with the stated aim of increasing the performance of the separation process.

Generally, the output of a separation model is the diffuse fraction, defined by equation [6]:

k_{d} = \frac{\mathrm{DHI}}{\mathrm{GHI}}

(2)

Technically speaking, GHI is directly measured, then DHI is straightforwardly obtained from the estimated diffuse fraction, and finally DNI is estimated through Equation (1):

\mathrm{DNI} = \frac{1 - k_{d}}{\cos θ_{z}} \mathrm{GHI}

(3)
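As an illustration only, Equations (2) and (3) can be sketched in Python as follows. The function name `split_ghi`, its argument names, and the sample values are assumptions made for this example; they do not come from the original study.

```python
import math


def split_ghi(ghi, kd, zenith_deg):
    """Return (DHI, DNI) from GHI and an estimated diffuse fraction kd.

    ghi        : global horizontal irradiance (e.g., in W/m^2)
    kd         : diffuse fraction, Equation (2)
    zenith_deg : zenithal angle in degrees
    """
    cos_z = math.cos(math.radians(zenith_deg))
    if cos_z <= 0.0:
        raise ValueError("Sun at or below the horizon; Equation (3) is undefined")
    dhi = kd * ghi                    # Equation (2) solved for DHI
    dni = (1.0 - kd) * ghi / cos_z    # Equation (3)
    return dhi, dni


# Hypothetical sample: GHI = 600 W/m^2, kd = 0.25, zenithal angle = 60 degrees
dhi, dni = split_ghi(600.0, 0.25, 60.0)
# The closure relation, Equation (1), should hold up to rounding
closure_error = dni * math.cos(math.radians(60.0)) + dhi - 600.0
```

By construction, the returned pair satisfies the closure relation of Equation (1), so the only source of error in DHI and DNI is the estimate of the diffuse fraction itself.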

Beyond this basic application, i.e., the evaluation of DHI and DNI on the basis of GHI, the diffuse fraction is currently acquiring new uses, such as a quantifier of the uncertainty induced by aerosols in estimating clear-sky DHI [7] or a proxy for classifying the sky conditions into clear, intermediate, and overcast [8].

The key stage in DHI estimation is the choice and application of a separation model, i.e., the estimation of diffuse fraction from one or more atmospheric parameters, frequently called predictors. The accuracy level in diffuse fraction estimation determines the overall accuracy of a separation model. The diffuse fraction is typically estimated as a function of clearness index [6]:

k_{t} = \frac{\mathrm{GHI}}{G_{ext}}

(4)

where G_ext represents the solar irradiance deterministically computed in a horizontal plane at the top of the atmosphere. The clearness index encapsulates information about atmospheric transparency, and its measured time series captures the stochastic part of solar irradiance [9].
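A minimal sketch of Equation (4) is given below. The text does not state how G_ext is computed, so the sketch assumes a solar constant of 1361 W/m² and a common first-order correction for the Sun–Earth distance; both are assumptions of this example, as are the function names.

```python
import math

SOLAR_CONSTANT = 1361.0  # W/m^2; assumed value, not stated in the paper


def extraterrestrial_horizontal(day_of_year, zenith_deg):
    """Approximate G_ext on a horizontal plane at the top of the atmosphere."""
    # First-order eccentricity correction (assumption of this sketch)
    e0 = 1.0 + 0.033 * math.cos(2.0 * math.pi * day_of_year / 365.0)
    cos_z = max(0.0, math.cos(math.radians(zenith_deg)))
    return SOLAR_CONSTANT * e0 * cos_z


def clearness_index(ghi, g_ext):
    """Equation (4): k_t = GHI / G_ext; NaN when G_ext is not positive."""
    if g_ext <= 0.0:
        return float("nan")
    return ghi / g_ext


# Hypothetical sample: day 172 of the year, zenithal angle of 30 degrees
g_ext = extraterrestrial_horizontal(172, 30.0)
kt = clearness_index(570.0, g_ext)
```

In practice, the zenithal angle would come from a solar position algorithm evaluated at the timestamp of each GHI record.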

In the last few decades, separation models have experienced continuous development [6]. The main engine of the development was constituted by the needs of the solar industry, with the research focusing on the practical modeling of the diffuse fraction based on the atmospheric clearness index. At the same time, there was also a constant rush to develop more and more accurate separation models, even if their practical applicability proved to be extremely limited. In fact, the development of a better performing model is the goal per se of the research. In this study, both aspects of the research were addressed. On the one hand, we propose a new entry in the race for the best-performing separation model that maintains a large degree of accessibility in practice. On the other hand, we propose an applicative separation model from the class of the simplest models, whose performance is comparable to the best-performing current models.

In summary, this paper reports two new minute-scale separation models, which are rational functions of the clearness index and other predictors. The first model operates with eight predictors and consistently outperforms already established models. The second model is highly accessible because it uses only three predictors that are determined using GHI values. The models’ accessibility also resides in their structure, with each model being defined by a single equation and a unique set of coefficients. The predictors are easily computable in any location where GHI is measured. With their flexible construction, the two proposed models are meant to bridge the gap between high accuracy and accessibility.

The paper is organized as follows: Section 2 surveys the current state-of-the-art separation models. The relevant datasets used in this study are described in Section 3. The proposed separation models are introduced in Section 4. The models’ performance is evaluated in Section 5. Section 6 summarizes the main conclusions.

2. A Survey on the Separation Model’s Performance

A general perspective on the separation models’ equations is given by [10]. Almost all the separation models have an empirical basis, ranging from purely empirical models [11] to the very rare models incorporating physical features [12]. Many empirical models are fitted to data collected from a single station (e.g., [13]), but there are also models fitted to data collected from stations spread all over the world (e.g., [14]). The practical separation models consist of linear or non-linear equations (see, e.g., the large variety of separation models gathered by [4]). The most populated classes of separation model equations are polynomial and logistic.

The first separation model, proposed by Liu and Jordan in 1960 [15], linearly relates the daily diffuse solar irradiation to the daily global solar irradiation. After this opening work, hundreds of single-predictor polynomial models have been proposed, e.g., [16] in 1977, [17] in 1982, [18] in 2006, and so on. Most of the polynomial separation models contain a single equation defined over the entire range of the clearness index. Some separation models comprise more than one equation, each equation being defined over a sub-domain of the clearness index [11]. The first logistic model was proposed in 2001 [19]. It was motivated by the attempt to abandon the branched structure of many previous models. Some of the latest and most popular separation models are from the class of logistic models [5,20,21].

The separation models are not significantly distinctive in their formal nature. What differentiates them mainly is the number and nature of the predictors. The search for an ideal combination of predictors can be easily understood by looking at the scattering of the diffuse fraction with respect to the clearness index (e.g., Figure 1, Figure 2a). Because single predictor models cannot explain the dispersion of the diffuse fraction at a fixed value of the clearness index, the proposed polynomial models started to operate with more predictors [21,22,23].

From a timescale perspective, there is a large variety of separation models that are applicable to GHI time series sampled from 1 min [11] to monthly [24] intervals.

The separation models are comprehensively reviewed in [6]. The paper compares the performance of 140 separation models using data from 54 stations spread across all climate zones. The authors claim an exhaustive study, involving all separation models published prior to 2016. The paper concluded that no separation model can outclass all the others in every location. However, the model Engerer2 [20] (further denoted E2) was indicated to be quasi-universal based on the statistical results in different climate zones. The current state-of-the-art in separation models is highlighted by recent work [5]. It compares the 10 separation models that claimed to perform better than E2 against a huge dataset collected from all climate zones. The study identifies four minute-scale separation models, developed after 2016, that outperform Engerer2: Yang4 (Y4) [5], Starke1 (S1) [21], Starke3 (S3) [25], and Paulescu (PB) [11]. The abbreviation used in this study is indicated in brackets.

On the basis of the outcomes from [5,6], in the present study we considered only the top five models from [5]. The ensemble of the five models represents the state-of-the-art in separating diffuse from global solar irradiance. The performance of these models is considered a benchmark in the evaluation of the performance of the separation models proposed in this study. The five models are briefly introduced in Appendix A.

3. Dataset

The radiometric data used for developing and testing the proposed models were collected from the Baseline Surface Radiation Network (BSRN) [26]. Most of the stations in this network provide GHI, DHI, and DNI measured at 1 min resolution. The general database developed for this study includes two datasets. The first dataset, denoted D_FIT, was used for developing the proposed models. The second dataset, denoted D_TEST, was designed for comparing the performance of the proposed models with the performance of the reference models introduced in Section 2.

D_FIT contains 193,294 lines of data, recorded for one year (2020) at 1 min resolution at the station of Payerne, Switzerland (46.815 N, 6.944 E, BSRN indicative PAY). This location is situated in a temperate climate region classified as Cfb according to the Köppen–Geiger system [27]. D_TEST contains nearly 7 million lines. Data were collected from 36 locations spread across four climatic zones: tropical (A), arid (B), temperate (C), and continental (D). Only data measured at the Sun’s elevation angle h > 5° were included in D_FIT and D_TEST. This is a common practice in research conducted with radiometric data because the pyranometer’s accuracy is questionable close to sunrise and sunset.

The data in D_TEST are characterized by diversity; they were recorded in locations with latitudes between −45 and +58 deg, and the time span extends from 2002 to 2021. Table 1 summarizes the stations’ metadata.

There is some inherent overlap between the locations included in D_TEST and the origin locations of the models. Thus, Payerne (PAY), the origin location of the proposed models, is included in D_TEST with a data share of 2.7%. However, there is no data overlap, i.e., the fit and the test were performed on data collected in different years. Testing an empirical model on data collected from the origin location in a different period is common practice (e.g., [22]). The data used to develop the PB model were recorded in Palaiseau (PAL) during 2014–2016. Palaiseau is present in D_TEST with the same share of 2.7%, with data collected in 2017. E2 was developed with data from Australia, with none of the stations included in D_TEST. S1 was fitted with data recorded at stations in Australia. Only the Darwin (DAR) station is part of both the fitting dataset and D_TEST, noting that the data were collected in different years. S3 proposes different empirical coefficients for each of the five climate zones. S3 was developed based on data collected from 51 BSRN stations, almost all from the network. Thirty-two of these stations are also present in D_TEST, which gives S3 some advantage. Five of the seven stations used to fit the coefficients of Y4 are also included in D_TEST. It is worth noting that even if there is some spatial overlap, there is no temporal overlap.

4. A Proposal for Minute-Scale GHI Separation Models

In this section, two innovative separation models are introduced. The models were built on the basis of the D_FIT dataset. Most of the existing separation models are linear, polynomial, logistic, and/or piecewise defined [4,5,6]. In contrast, the proposed models are defined by rational equations. Basically, the two models estimate the diffuse fraction based on common predictors included in the equations of the best-performing separation models (see, e.g., Appendix A).

The first proposed model, further denoted M1, is defined by an elaborate equation calling on eight predictors. M1 is the result of intensive research on the effect of various predictors on the diffuse fraction. It gathers together almost all the predictors successfully accounted for by the previous separation models: (1) the clearness index k_t, (2) the deviation of the clearness index from its estimated value under clear skies Δk_tc (defined by Equation (A3)), (3) the part of diffuse fraction k_d that is attributable to cloud enhancement k_de (defined by Equation (A4)), (4) daily average of the clearness index k_day (defined by Equation (A1)), (5) hourly average of the clearness index k_hour, (6) the persistence factor ψ, defined as the average of a lag and a lead of the clearness index values, (7) the clear sky global solar irradiance G_cs in MJ/(h∙m²), and (8) the ratio of measured GHI and the estimated global clear sky irradiance K_csi. M1 is defined by the empirical equation (R² = 0.929 and nMBE = 0.011):

k_{d} = \frac{1 + 7.217 K_{csi} k_{t} (Δk_{tc})^{2} - 0.656 K_{csi}^{0.5} k_{t}^{2} + 7.947 k_{de} + 0.187 G_{cs} - 2.747 k_{t}^{2} k_{hour}^{3} - 15.379 k_{t} ψ (Δk_{tc})^{3}}{1 + 1.655 K_{csi} (Δk_{tc})^{3} + 14.008 (k_{day} k_{t})^{3} + 17.477 k_{de} + 0.16 G_{cs}}

(5)
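For readers who wish to experiment with Equation (5), a direct transcription into Python might look as follows. The function and argument names are assumptions of this sketch, and the sample predictor values are hypothetical. Note that G_cs must be supplied in MJ/(h∙m²); 1 W/m² corresponds to 0.0036 MJ/(h∙m²).

```python
def diffuse_fraction_m1(kt, dktc, kde, kday, khour, psi, gcs, kcsi):
    """Diffuse fraction k_d from Equation (5) (model M1).

    kt    : clearness index k_t
    dktc  : deviation from the clear-sky clearness index, Equation (A3)
    kde   : cloud-enhancement part of k_d, Equation (A4)
    kday  : daily average of the clearness index, Equation (A1)
    khour : hourly average of the clearness index
    psi   : persistence factor
    gcs   : clear-sky global irradiance G_cs in MJ/(h*m^2)
    kcsi  : ratio of measured GHI to clear-sky global irradiance (>= 0)
    """
    numerator = (
        1.0
        + 7.217 * kcsi * kt * dktc**2
        - 0.656 * kcsi**0.5 * kt**2
        + 7.947 * kde
        + 0.187 * gcs
        - 2.747 * kt**2 * khour**3
        - 15.379 * kt * psi * dktc**3
    )
    denominator = (
        1.0
        + 1.655 * kcsi * dktc**3
        + 14.008 * (kday * kt) ** 3
        + 17.477 * kde
        + 0.16 * gcs
    )
    return numerator / denominator


# Hypothetical predictor values for a mostly clear minute
kd_m1 = diffuse_fraction_m1(
    kt=0.70, dktc=0.05, kde=0.0, kday=0.60,
    khour=0.65, psi=0.70, gcs=2.5, kcsi=0.93,
)
```

The sketch applies no bounding of the result; whether and how the output should be limited to a physically meaningful interval is left to the user.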

The second proposed model, further denoted M2, is definitely simpler and more accessible than M1. It uses only three predictors, all related to the clearness index: (1) the clearness index itself k_t, (2) the daily average of the clearness index k_day, and (3) the hourly average of the clearness index k_hour. M2 is defined by the equation (R² = 0.917 and nMBE = 0.007):

k_{d} = \frac{1 - 2.525 k_{t} + 1.834 k_{t}^{2} - 0.204 (k_{t} k_{hour})^{2}}{1 - 2.578 k_{t} + 1.902 k_{t}^{2} + 1.054 (k_{t} k_{day})^{2}}

(6)
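Equation (6) is simple enough to be coded in a few lines. The following Python sketch is one possible transcription; the function name and the sample inputs are assumptions made for illustration.

```python
def diffuse_fraction_m2(kt, kday, khour):
    """Diffuse fraction k_d from Equation (6) (model M2).

    kt    : clearness index k_t
    kday  : daily average of the clearness index
    khour : hourly average of the clearness index
    """
    numerator = 1.0 - 2.525 * kt + 1.834 * kt**2 - 0.204 * (kt * khour) ** 2
    denominator = 1.0 - 2.578 * kt + 1.902 * kt**2 + 1.054 * (kt * kday) ** 2
    return numerator / denominator


# Hypothetical clear-sky minute within a fairly clear day
kd_clear = diffuse_fraction_m2(kt=0.75, kday=0.70, khour=0.75)

# Same clearness index within a cloudier day and hour: k_d is higher
kd_cloudy = diffuse_fraction_m2(kt=0.75, kday=0.30, khour=0.30)
```

The quadratic 1 − 2.578 k_t + 1.902 k_t² has a negative discriminant, so the denominator stays positive for any real k_t. For very low clearness index values the raw output may slightly exceed 1, so a user may choose to bound the result; that choice is not part of Equation (6).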

The simplicity of M2 does not reduce its ability to capture the essential behavior of the diffuse fraction with respect to the clearness index. This is well illustrated by the sensitivity analysis presented in Figure 1, which displays k_d with respect to k_t for the boundary values of the hourly average of the clearness index: overcast during the whole hour, k_hour = 0.3 (Figure 1a), and clear sky, k_hour = 1.0 (Figure 1b). The curve parameter is k_day. In a broad sense, k_day and k_hour measure the cloud cover at two different time scales, daily and hourly, respectively. Figure 1 shows the two parameters acting as a locator for the geometrical place of the k_d = k_d(k_t) curve in the plane k_t − k_d. As the daily/hourly average of cloudiness increases (measured by a decrease in k_day and k_hour), the share of DHI in GHI increases as well. In other words, as the day and the hourly interval within which the diffuse fraction is estimated become cloudier, Equation (6) increases the likelihood for the diffuse fraction to take higher values. This discussion is valid for the atmospheric column content as well (e.g., atmospheric aerosol loading). Generally, k_day and k_hour capture the whole atmospheric transmittance (atmospheric column content and clouds), but the cloud cover is the most influential parameter. Looking again at Figure 2a, it can be seen that the two parameters, k_day and k_hour, give M2 the ability to cover the geometrical place occupied by the measurements in the plane k_t − k_d. This perspective will be discussed next in more detail.

Figure 1. Diffuse fraction k_d estimated with M2 (Equation (6)) with respect to the clearness index k_t for two values of the hourly average of the clearness index k_hour (a) 0.3 and (b) 1.0. The curve parameter is the daily average of the clearness index k_day.

Figure 2. Diffuse fraction k_d vs. the clearness index k_t at the station in Payerne, Switzerland, in 2020. Measured (in gray) and estimated (in red) values, with the tested separation models, are displayed.

Figure 2 shows how the estimates of the seven models cover the measurements in the plane k_t − k_d. Figure 2a, already referred to, displays the measured diffuse fraction vs. the clearness index for D_FIT. This is a typical picture whose merit is to illustrate the wide dispersion of k_d with respect to the classical predictor k_t. The estimates issued by an equation of the form k_d = k_d(k_t), irrespective of its complexity, will always be accompanied by a substantial amount of uncertainty. Figure 2b–h displays the estimates issued by the reference models PB (Equation (A2)), E2 (Equation (A5)), S1 (Equation (A6)), S3 (Equation (A7)), and Y4 (Equation (A8)) and by the two proposed models M1 (Equation (5)) and M2 (Equation (6)), superimposed over the measurements. This allows a visual intercomparison of the models’ flexibility with the variation of the atmospheric conditions captured by the input parameters. PB (Equation (A2)) appears to overlap the data reasonably well (Figure 2b) but shows a relative underestimation of the diffuse fraction. The overlay of E2 over the measurements is lower, but the appearance and position of the estimates cluster indicate an accurate capture of the data mean (Figure 2c). S1 captures almost all the areas of data (Figure 2d). S3 best covers the space occupied by the measured data (Figure 2e). Y4 also appears to overlap the data reasonably well while displaying a lower dispersion than S3 (Figure 2f). As expected, the proposed models M1 and M2 cover the measurements to a large extent (Figure 2g,h), since the two models are fitted on D_FIT. The flexibilities of M1 and S3 are comparable, which is explained by their relatively high complexity: M1 operates with eight predictors, while S3 operates with seven predictors on two branches. Looking at Figure 2 as a whole, it can be concluded that the proposed M1 and M2 models achieve, with rational polynomial equations, a behavior similar to that of the reference logistic models.

Figure 3 displays the density plot of k_d vs. k_t at the station in Payerne. It highlights the area in the plane k_t − k_d where the measurements (Figure 3a) and the estimates issued by each model (Figure 3b–h) are clustered. Thus, Figure 3 gives a better perspective on how the models locate the estimated pairs (k_t, k_d) compared to the measured ones. In general, the models demonstrate the ability to concentrate the estimates in the regions of the k_t − k_d space where the measurements show the highest density. It is useful to remember that the reference models are evaluated as high performing by independent studies [5]. A visual comparison between Figure 3a and Figure 3g indicates that the M1 estimates follow almost the same distribution as the measured joint probability k_t − k_d, especially in areas with high probability density. Visible differences appear in the area with intermediate values of the clearness index, but fortunately the probability density is much lower here.

5. Performance Assessment

Aiming to efficiently validate the performance of the proposed separation models, a detailed intercomparison of the M1 and M2 accuracy to that of the reference models (PB, E2, S1, S3, and Y4) was performed. The estimates were evaluated against data from D_TEST (described in Section 3). The tests were focused on the final product, i.e., DNI and DHI, with the diffuse fraction regarded just as a proxy.

5.1. Statistical Indicators

The models’ accuracy was evaluated in terms of three statistical indicators: the determination coefficient R², the normalized root mean square error nRMSE, and the normalized mean bias error nMBE. The indicators are defined in Appendix B.
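Since Appendix B is not reproduced here, the sketch below uses the commonly adopted definitions of nRMSE and nMBE, both normalized by the mean of the measurements. This normalization is an assumption of the example and may differ in detail from the definitions in Appendix B; the function names and sample values are hypothetical as well.

```python
import math


def nrmse(estimated, measured):
    """Root mean square error normalized by the mean of the measurements."""
    n = len(measured)
    mean_measured = sum(measured) / n
    mse = sum((e - m) ** 2 for e, m in zip(estimated, measured)) / n
    return math.sqrt(mse) / mean_measured


def nmbe(estimated, measured):
    """Mean bias error normalized by the mean of the measurements."""
    n = len(measured)
    mean_measured = sum(measured) / n
    bias = sum(e - m for e, m in zip(estimated, measured)) / n
    return bias / mean_measured


# Hypothetical DHI values in W/m^2
measured = [100.0, 200.0, 300.0]
estimated = [110.0, 190.0, 330.0]
sample_nrmse = nrmse(estimated, measured)
sample_nmbe = nmbe(estimated, measured)
```

With this sign convention, a positive nMBE indicates that the model overestimates on average.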

First, we look closer at the performance of the M1 and M2 models in estimating DNI. For M1, the determination coefficient R² falls between 0.745 and 0.949. The minimum value of R² is obtained at the arid climate station in Tamanrasset, Algeria. The maximum value of R² is achieved at Payerne, Switzerland, with a temperate continental climate. This was expected, with Payerne being the origin location of M1. Similarly, for M2, the determination coefficient R² falls between 0.720 and 0.937. The minimum and maximum values were also achieved at Tamanrasset and Payerne, respectively. This means that reducing the number of predictors from eight (M1) to three (M2) roughly preserves the fraction of variance in D_TEST explained by the model.

The models’ performance in estimating DNI from the diffuse fraction estimates through Equations (2) and (3) is displayed in Figure 4a in terms of nRMSE. M1 achieves the best performance at the arid station in Gobabeb, Namib Desert, Namibia (nRMSE = 14.0%), while the worst performance is reached at Lindenberg, Germany, a station with a temperate climate (nRMSE = 41.2%). At the different stations in D_TEST, M1 exhibits both positive and negative biases. The minimum nMBE was achieved at the tropical station on Cocos Island (nMBE = 0.19%). The highest positive bias is experienced at the continental station in Regina, Canada (nMBE = 15.3%), while the highest negative bias is experienced at the temperate continental station in Boulder, USA (nMBE = −13.9%). M2 achieves the best performance at Gobabeb (nRMSE = 14.8%) and the worst at Lindenberg (nRMSE = 48.1%). In terms of nMBE, M2 achieves the best result at the temperate station in Tateno, Japan (nMBE = 0.40%). Similarly to M1, M2 experiences the highest positive bias at Regina (nMBE = 19.0%) and the highest negative bias at the station in Boulder (nMBE = −12.0%).

At first glance, the accuracy of M1 and M2 seems to be relatively low. However, this is not the case. With all the progress registered by the separation models, there is plenty of room for improvement. Visual inspection of Figure 4 reveals high values of nRMSE when the reference models were applied to estimate DNI: nRMSE ranges between 15.2% and 49.4%, both values being reached by S3. A large bias is also present, with nMBE taking values between −19.0% (reached by E2) and +18.2% (reached by PB).

Secondly, the models’ performance in estimating DHI from k_d estimates through Equation (2) was tested. The results are summarized in Figure 4b in terms of nRMSE. Overall, the nRMSE range is wider than in the case of DNI, making the distinction between the models more visible. The determination coefficient R² of M1 falls between 0.428 and 0.904. The minimum value is reached at the arid climate station in Solar Village, Saudi Arabia. The maximum value is achieved at the continental climate station in Toravere, Estonia. The determination coefficient R² of M2 falls between 0.427 and 0.892. The minimum value is reached at the arid climate station in Tamanrasset, Algeria. The maximum value is also achieved at Toravere.

In terms of nRMSE, M1 achieves the best result at the temperate station in Cabauw, Netherlands (nRMSE = 23.1%), while the worst is obtained at Tamanrasset (nRMSE = 51.9%). M1 experiences the minimum bias at the continental station in Budapest, Hungary (nMBE = −0.18%), while the worst result is obtained at the arid station in Alice Springs, Australia (nMBE = 26.79%). M2 achieves the best nRMSE at Toravere (nRMSE = 25.4%), while the worst is obtained at Tamanrasset (nRMSE = 52.9%). The lowest bias is achieved by M2 at Toravere (nMBE = 0.25%), while the maximum is obtained at Tamanrasset (nMBE = 29.2%). Although this last value denotes a very large bias, we underline that the nMBE of M2 exceeds 10% at only six of the thirty-six stations.

Overall, the performance of the reference models in estimating DHI is modest, with nRMSE falling between 23.7% (achieved by S1) and 82.5% (reached by E2). At some stations, DHI estimates are significantly biased, with nMBE falling between −26.0% (reached by PB) and +54.5% (reached by E2). At most stations, the reference models experience a reasonable bias, with the 1st and 3rd quartiles of nMBE taking the following values, respectively: −5.8% and +3.8% for PB, −10.7% and +0.7% for E2, −5.2% and +4.6% for S1, −3.7% and +4.7% for S3, and −8.9% and +0.8% for Y4.

5.2. Model Ranking

We conclude the models’ evaluation with a statistical ranking. The overall performance of a model was calculated on the basis of the linear ranking method [5]. Thus, the mean rank of a model is calculated as a weighted average:

m_{i} = \sum_{k = 1}^{7} p_{k} \cdot k

(7)

where m_i is the mean rank of the i-th model (i = PB, E2, S1, S3, Y4, M1, and M2), and p_k = n/36, with n being the number of stations where the model i is in the nRMSE hierarchy at position k. For each station, the best model is ranked 1st, and the most ineffective model is ranked 7th.
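The linear ranking of Equation (7) can be sketched as follows. The data layout (a dictionary of per-station nRMSE values), the function name, and the sample numbers are assumptions of this example. The paper uses 36 stations and seven models, whereas the sketch accepts any number of each; ties are broken alphabetically here, a choice the source does not specify.

```python
def mean_ranks(nrmse_by_station):
    """Mean rank m_i of each model, following Equation (7).

    nrmse_by_station : {station: {model: nRMSE}}, same models at every station
    """
    models = sorted(next(iter(nrmse_by_station.values())))
    n_stations = len(nrmse_by_station)

    # counts[model][k - 1] = number of stations where the model is at position k
    counts = {model: [0] * len(models) for model in models}
    for scores in nrmse_by_station.values():
        # Lowest nRMSE is ranked 1st; stable sort keeps alphabetical order on ties
        ordered = sorted(models, key=lambda model: scores[model])
        for position, model in enumerate(ordered, start=1):
            counts[model][position - 1] += 1

    # m_i = sum over k of p_k * k, with p_k = n_k / number of stations
    return {
        model: sum((n_k / n_stations) * k for k, n_k in enumerate(counts[model], start=1))
        for model in models
    }


# Hypothetical nRMSE values for three models at two stations
sample = {
    "station_A": {"M1": 0.20, "S1": 0.30, "E2": 0.50},
    "station_B": {"M1": 0.20, "S1": 0.10, "E2": 0.40},
}
ranks = mean_ranks(sample)
```

The weighted sum in Equation (7) is equivalent to averaging a model's rank over all stations, so a lower mean rank indicates a better overall performance.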

Table 2 presents the ranking of the new and reference models based on the estimation of DNI and DHI. Each column in the table corresponds to the mean ranking (Equation (7)) of a particular model. The proposed model M1 is ranked first in estimating both DNI and DHI. For DNI estimation, M1 achieves the first position with a mean rank of m_M1 = 2.02, and is closely followed by S1 (m_S1 = 2.30). For DHI estimation, M1 also achieves the first position with a mean rank of m_M1 = 2.05, closely followed by S3 (m_S3 = 2.11). The detailed ranking of the models at the stations from D_TEST, on which the mean ranking (Table 2) was built, is displayed in Figure 5. Visual inspection shows that M1 performs best at the largest number of stations, taking first place at 16 stations for DNI estimation and at 17 stations for DHI estimation. The high accessibility of M2 does not compromise accuracy. Figure 5 emphasizes M2 as a real competitor among the best-performing current models, taking second place at three stations for DNI estimation and first place at five stations for DHI estimation.

Figure 5 also displays the models’ sensitivity to climate. The stations are clustered by climate zones, according to the Köppen–Geiger climate classification [27]. Except for the tropical climate A, where S3 is the most accurate model (with a mean rank (Equation (7)) of m_S3 = 2.0 for DNI estimation and m_S3 = 1.2 for DHI estimation), the proposed model M1 performs with the highest accuracy in all the other climates. Thus, for DNI estimation, M1 achieves the following mean ranks: m_M1 = 2.0 in arid climate B and m_M1 = 1.5 in both temperate climate C and continental climate D. For DHI estimation, M1 achieves the following mean ranks: m_M1 = 2.3 in arid climate B, m_M1 = 1.6 in temperate climate C, and m_M1 = 1.1 in continental climate D.

6. Conclusions

This paper introduced two new separation models, which split global solar irradiance GHI into its direct-normal DNI and diffuse horizontal DHI components. The models effectively estimate the diffuse fraction at minute-scale resolution. Unlike the already established minute-scale equations, the new models are defined by rational polynomial equations. The first model, M1, operates with eight predictors. The M1 equation fails the beauty test but successfully passes the performance test. By contrast, the second model, M2, is defined by a beautiful equation. It operates with only three predictors defined on GHI measurements, which gives it high accessibility.

Validation of the new models was carried out against data collected from 36 stations covering the four major climatic zones. Five current top minute-scale separation models were considered references. The tests were performed on the final product estimations, i.e., DNI and DHI. Based on a statistical linear ranking method according to the models’ performance at every station, M1 leads the hierarchy, ranking first in both DNI and DHI estimation. The high accessibility of the M2 does not compromise accuracy. The results place M2 among the best-performing current models.

Finally, we can conclude that all the models, both the proposed and the reference ones, seem to evolve in tandem (more or less similarly accurate), which could suggest a limit that cannot be crossed in the traditional estimation process. In order to further increase the accuracy of separating the global solar irradiance into its primary components, a new approach to the development of algorithms is necessary. Future work will follow this direction.

Author Contributions

Conceptualization E.P.; methodology, E.P. and M.P.; software E.P.; formal analysis, E.P. and M.P.; data curation, E.P.; writing—original draft preparation, E.P. and M.P.; writing—review and editing, E.P. and M.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available data from BSRN (https://bsrn.awi.de/ accessed on 15 May 2023) were processed and analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The five models that [5] found to perform best, and which serve as references in this study, are briefly introduced next.

PB model

PB [11] estimates the diffuse fraction using only two predictors: the clearness index k_t and the daily average of the clearness index k_day. The latter captures the variability in solar irradiance time series during a day. k_day is defined as:

k_{day} = \frac{1}{n} \sum_{i = 1}^{n} k_{t, i}

(A1)

where n is the daylight duration expressed in minutes. The model was built with data from the BSRN station in Palaiseau, France. The model is defined by a linear piecewise equation [11]:

\begin{matrix} k_{d} = 1.011925 - 0.0316193 k_{t} - 0.0293827 k_{day} - 1.6567278 (k_{t} - 0.367) θ (k_{t} - 0.367) \\ + 1.8982144 (k_{t} - 0.734) θ (k_{t} - 0.734) - 0.8547682 (k_{day} - 0.462) θ (k_{day} - 0.462) \end{matrix}

(A2)

where

θ (x) = \begin{cases} 0, & x < 0 \\ 1, & x \geq 0 \end{cases}

denotes the Heaviside step function.
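Equations (A1) and (A2) can be evaluated directly, since all the coefficients are given. The following is a minimal Python sketch, not the authors' code; the function names are illustrative, and it assumes that the input to `daily_clearness_index` already contains only the daylight minutes of one day.

```python
import numpy as np

def daily_clearness_index(kt_daylight):
    """Equation (A1): mean of the 1-min clearness index over the daylight minutes."""
    kt = np.asarray(kt_daylight, dtype=float)
    return float(kt.mean())

def heaviside(x):
    """Step function as defined for Equation (A2): 0 for x < 0, 1 for x >= 0."""
    return np.where(np.asarray(x, dtype=float) >= 0.0, 1.0, 0.0)

def pb_diffuse_fraction(kt, kday):
    """Equation (A2): piecewise-linear PB estimate of the diffuse fraction k_d."""
    kt = np.asarray(kt, dtype=float)
    return (1.011925 - 0.0316193 * kt - 0.0293827 * kday
            - 1.6567278 * (kt - 0.367) * heaviside(kt - 0.367)
            + 1.8982144 * (kt - 0.734) * heaviside(kt - 0.734)
            - 0.8547682 * (kday - 0.462) * heaviside(kday - 0.462))
```

The sketch returns the value of Equation (A2) as printed, without clipping; whether and how the estimate is bounded to the physical range of the diffuse fraction is left to the user.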

E2 model

The first minute-scale separation models were reported by the author of [20]. The models were built using data collected from six stations in the network of the Australian Bureau of Meteorology. Among the models proposed in [20], the best is Engerer2, here denoted E2. The model estimates k_d based on the following predictors: k_t, the zenith angle θ_z, the apparent solar time AST, and the deviation of the measured clearness index k_t from its clear-sky estimate k_tc:

Δk_{tc} = k_{tc} - k_{t}

(A3)

Δk_tc provides information about cloud enhancement, which is signaled by Δk_tc < 0, i.e., a measured clearness index that exceeds its clear-sky estimate. The last predictor used by E2 is k_de, the part of k_d that is attributable to cloud enhancement. It is given by the equation:

k_{d e} = \max (0, 1 - \frac{G_{c s}}{G})

(A4)

where G is the measured global horizontal irradiance and G_cs is its clear-sky counterpart.
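The two cloud-enhancement predictors of Equations (A3) and (A4) can be computed as in the short Python sketch below. It is an illustration under stated assumptions: the function and argument names are hypothetical, and the handling of non-positive measured irradiance (k_de set to 0) is a choice made here, not something specified in [20].

```python
import numpy as np

def cloud_enhancement_predictors(ghi, ghi_clear, kt, kt_clear):
    """Return (delta_ktc, kde) following Equations (A3) and (A4)."""
    ghi = np.asarray(ghi, dtype=float)
    ghi_clear = np.asarray(ghi_clear, dtype=float)
    # Equation (A3): negative values indicate cloud enhancement.
    delta_ktc = np.asarray(kt_clear, dtype=float) - np.asarray(kt, dtype=float)
    # Assumption: where the measured GHI is not positive the ratio is undefined,
    # so it is replaced by +inf, which yields k_de = 0 at those samples.
    with np.errstate(divide="ignore", invalid="ignore"):
        ratio = np.where(ghi > 0.0, ghi_clear / ghi, np.inf)
    # Equation (A4): k_de = max(0, 1 - G_cs / G).
    kde = np.maximum(0.0, 1.0 - ratio)
    return delta_ktc, kde
```

For example, a measured GHI of 1000 W/m² against a clear-sky value of 800 W/m² gives k_de = 0.2, while any sample with GHI below its clear-sky value gives k_de = 0.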

E2 is defined by a logistic equation [20]:

k_{d} = C + \frac{1 - C}{1 + \exp (β_{0} + β_{1} k_{t} + β_{2} AST + β_{3} θ_{z} + β_{4} Δk_{tc})} + β_{5} k_{de}

(A5)

C, β_0, β_1, β_2, β_3, β_4, and β_5 are empirical coefficients [20].

In this study, for running E2, the clear-sky solar irradiance components, DNI and DHI, have been estimated with the Biga and Rosa model [28].
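The structure of Equation (A5) is sketched below in Python. The coefficients are deliberately left as arguments, since their numeric values are given in [20] and are not reproduced here; the function name is illustrative, and the units of AST and θ_z are assumed to match those used when the coefficients were fitted.

```python
import math

def e2_diffuse_fraction(kt, ast, zenith, delta_ktc, kde, C, beta):
    """Equation (A5): logistic E2 form.

    beta is a sequence (b0, b1, b2, b3, b4, b5); C and beta must be
    taken from [20]. No values are hard-coded in this sketch.
    """
    b0, b1, b2, b3, b4, b5 = beta
    z = b0 + b1 * kt + b2 * ast + b3 * zenith + b4 * delta_ktc
    return C + (1.0 - C) / (1.0 + math.exp(z)) + b5 * kde
```

With all β set to zero and C = 0, the logistic term reduces to 0.5, which is a convenient check that the expression is wired correctly before real coefficients are supplied.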

S1 model

The authors of [21] also proposed a logistic model, referred to in this study as S1. Unlike E2, S1 replaces the predictors that capture the cloud enhancement events with predictors that manage the variability in the state of the sky. Cloud enhancement is still accounted for, but as an indicator that switches between two branches of the model. Two different sets of empirical coefficients are fitted on the basis of two sets of data collected from Australia and Brazil, respectively. The model equation is [21]:

k_{d} = \begin{cases} {[1 + \exp (β_{7} + β_{8} k_{t} + β_{9} AST + β_{10} h + β_{11} k_{day} + β_{12} ψ + β_{13} G_{cs})]}^{- 1} & \text{if } K_{csi} \geq 1.05 \text{ and } k_{t} > 0.65 \\ {[1 + \exp (β_{0} + β_{1} k_{t} + β_{2} AST + β_{3} h + β_{4} k_{day} + β_{5} ψ + β_{6} G_{cs})]}^{- 1} & \text{otherwise} \end{cases}

(A6)

where AST denotes the apparent solar time in hours, h is the solar elevation angle in degrees, ψ is the persistence factor, G_cs is the clear-sky solar irradiance in MJ/(hour·m²), and K_csi is the ratio of the measured global horizontal irradiance to the modeled clear-sky global irradiance.

In this study, the model was implemented using the coefficients fitted to data collected in Australia.
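The branch switch of Equation (A6) can be illustrated with the following Python sketch. It only reproduces the structure of the equation: the 14 coefficients are passed in by the caller and must be taken from [21], and the function name is hypothetical.

```python
import math

def s1_diffuse_fraction(kt, ast, h, kday, psi, gcs, kcsi, beta):
    """Equation (A6): two-branch logistic S1 form.

    beta is a sequence of 14 coefficients (b0..b13) from [21];
    b7..b13 apply to the cloud-enhancement branch, b0..b6 otherwise.
    """
    if len(beta) != 14:
        raise ValueError("S1 expects 14 coefficients (b0..b13)")
    # Cloud enhancement acts as a switch between the two branches.
    enhanced = (kcsi >= 1.05) and (kt > 0.65)
    b = beta[7:14] if enhanced else beta[0:7]
    z = (b[0] + b[1] * kt + b[2] * ast + b[3] * h
         + b[4] * kday + b[5] * psi + b[6] * gcs)
    return 1.0 / (1.0 + math.exp(z))
```

Both conditions must hold for the enhancement branch to be selected; a sample with K_csi ≥ 1.05 but k_t ≤ 0.65 falls back to the second branch.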

S3 model

S3 [25] is an alternative version of S1. An additional predictor, the hourly clearness index k_hour, was introduced:

k_{d} = \begin{cases} {[1 + \exp (β_{0} + β_{1} k_{t} + β_{2} AST + β_{3} h + β_{4} k_{day} + β_{5} ψ + β_{6} G_{cs} + β_{7} k_{hour})]}^{- 1} & \text{if } K_{csi} \geq 1.05 \text{ and } k_{t} > 0.75 \\ {[1 + \exp (β_{8} + β_{9} k_{t} + β_{10} AST + β_{11} h + β_{12} k_{day} + β_{13} ψ + β_{14} G_{cs} + β_{15} k_{hour})]}^{- 1} & \text{otherwise} \end{cases}

(A7)

For each climate in the Köppen–Geiger climate classification [27], the authors estimated a set of empirical coefficients [25].

Y4 model

The author of [5] reports an innovation in E2 by introducing an unusual predictor, namely the hourly value of the diffuse fraction estimated with E2, k_{d,hour}^{E2}. The model equation reads:

k_{d} = C + (1 - C) / [1 + \exp (β_{0} + β_{1} k_{t} + β_{2} AST + β_{3} θ_{z} + β_{4} Δk_{tc} + β_{6} k_{d,hour}^{E2})] + β_{5} k_{de}

(A8)

The numeric values of the coefficients are given in [5]. This model is referred to as Y4.

Appendix B

The accuracy of the models is measured with three statistical indicators commonly used in solar radiation modeling, i.e., coefficient of determination (R²), normalized root mean square error (nRMSE), and normalized mean bias error (nMBE):

R^{2} = 1 - \frac{\sum_{i = 1}^{M} {(m_{i} - c_{i})}^{2}}{\sum_{i = 1}^{M} {(m_{i} - \bar{m})}^{2}}

(A9)

nRMSE = \frac{{[M \sum_{i = 1}^{M} {(c_{i} - m_{i})}^{2}]}^{1 / 2}}{\sum_{i = 1}^{M} m_{i}}

(A10)

nMBE = \frac{\sum_{i = 1}^{M} (c_{i} - m_{i})}{\sum_{i = 1}^{M} m_{i}}

(A11)

where c and m refer to the estimated and measured values, respectively, \bar{m} is the mean of the measured values, and M denotes the number of measurements.
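Equations (A9)–(A11) translate almost literally into code. The Python sketch below is illustrative (the function name is an assumption) and expects paired arrays of measured and estimated values of the same irradiance component.

```python
import numpy as np

def accuracy_indicators(measured, estimated):
    """Return (R2, nRMSE, nMBE) following Equations (A9)-(A11)."""
    m = np.asarray(measured, dtype=float)
    c = np.asarray(estimated, dtype=float)
    M = m.size
    # (A9): coefficient of determination.
    r2 = 1.0 - np.sum((m - c) ** 2) / np.sum((m - m.mean()) ** 2)
    # (A10): RMSE normalized by the mean of the measurements,
    # written as sqrt(M * sum of squared errors) / sum of measurements.
    nrmse = np.sqrt(M * np.sum((c - m) ** 2)) / np.sum(m)
    # (A11): mean bias normalized by the mean of the measurements.
    nmbe = np.sum(c - m) / np.sum(m)
    return float(r2), float(nrmse), float(nmbe)
```

For measured values 100, 200, 300 and estimates 110, 190, 310, the sketch gives R² = 0.985, nRMSE = 0.05, and nMBE ≈ 0.0167; a positive nMBE indicates overestimation on average.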

References

1. Haas, R.; Duic, N.; Auer, H.; Ajanovic, A.; Ramsebner, J.; Knapek, J.; Zwickl-Bernhard, S. The photovoltaic revolution is on: How it will change the electricity system in a lasting way. Energy 2023, 265, 126351.
2. Kambezidis, H.D.; Psiloglou, B.E. Estimation of the Optimum Energy Received by Solar Energy Flat-Plate Convertors in Greece Using Typical Meteorological Years. Part I: South-Oriented Tilt Angles. Appl. Sci. 2021, 11, 1547.
3. Perez, R.; Ineichen, P.; Seals, R.; Michalsky, J.; Stewart, R. Modeling daylight availability and irradiance components from direct and global irradiance. Sol. Energy 1990, 44, 271–289.
4. Tan, Y.; Wang, Q.; Zhang, Z. Algorithms for separating diffuse and beam irradiance from data over the East Asia-Pacific region: A multi-temporal-scale evaluation based on minute-level ground observations. Sol. Energy 2023, 252, 218–233.
5. Yang, D. Estimating 1-min beam and diffuse irradiance from the global irradiance: A review and an extensive worldwide comparison of latest separation models at 126 stations. Renew. Sustain. Energy Rev. 2022, 159, 112195.
6. Gueymard, C.A.; Ruiz-Arias, J.A. Extensive worldwide validation and climate sensitivity analysis of direct irradiance predictions from 1-min global irradiance. Sol. Energy 2016, 128, 1–30.
7. Blaga, R.; Calinoiu, D.; Stefu, N.; Boata, R.; Sabadus, A.; Paulescu, E.; Pop, N.; Mares, O.; Bojin, S.; Paulescu, M. Quantification of the aerosol-induced errors in solar irradiance modeling. Meteorol. Atmos. Phys. 2021, 133, 1395–1407.
8. Kambezidis, H.D.; Kampezidou, S.I.; Kampezidou, D. Mathematical Determination of the Upper and Lower Limits of the Diffuse Fraction at Any Site. Appl. Sci. 2021, 11, 8654.
9. Ayodele, T.; Ogunjuyigbe, A. Prediction of monthly average global solar radiation based on statistical distribution of clearness index. Energy 2015, 90, 1733–1742.
10. Paulescu, E.; Paulescu, M. A Semi-Analytical Model for Separating Diffuse and Direct Solar Radiation Components. Appl. Sci. 2022, 12, 12759.
11. Paulescu, E.; Blaga, R. A simple and reliable empirical model with two predictors for estimating 1-minute diffuse fraction. Sol. Energy 2019, 180, 75–84.
12. Hollands, K.G.T.; Crha, S.J. An improved model for diffuse radiation: Correction for atmospheric back-scattering. Sol. Energy 1987, 38, 233–236.
13. Lewis, G. Estimates of monthly mean daily diffuse irradiation in the Southeastern United States. Renew. Energy 1995, 6, 983–988.
14. Yang, D. Temporal-resolution cascade model for separation of 1-min beam and diffuse irradiance. J. Renew. Sustain. Energy 2021, 13, 056101.
15. Liu, B.Y.; Jordan, R.C. The interrelationship and characteristic distribution of direct, diffuse and total solar radiation. Sol. Energy 1960, 4, 1–19.
16. Orgill, J.; Hollands, K. Correlation equation for hourly diffuse radiation on a horizontal surface. Sol. Energy 1977, 19, 357–359.
17. Erbs, D.; Klein, S.; Duffie, J. Estimation of the diffuse radiation fraction for hourly, daily and monthly-average global radiation. Sol. Energy 1982, 28, 293–302.
18. Jacovides, C.P.; Tymvios, F.S.; Assimakopoulos, V.D.; Kaltsounides, N.A. Comparative study of various correlations in estimating hourly diffuse fraction of global solar radiation. Renew. Energy 2006, 31, 2492–2504.
19. Boland, J.; Scott, L.; Luther, M. Modelling the diffuse fraction of global solar radiation on a horizontal surface. Environmetrics 2001, 12, 103–116.
20. Engerer, N. Minute resolution estimates of the diffuse fraction of global irradiance for southeastern Australia. Sol. Energy 2015, 116, 215–237.
21. Starke, A.R.; Lemos, L.F.; Boland, J.; Cardemil, J.M.; Colle, S. Resolution of the cloud enhancement problem for one-minute diffuse radiation prediction. Renew. Energy 2018, 125, 472–484.
22. Furlan, C.; Oliveira, A.; Soares, J.; Codato, G.; Escobedo, J.F. The role of clouds in improving the regression model for hourly values of diffuse solar radiation. Appl. Energy 2012, 92, 240–254.
23. Paulescu, E.; Blaga, R. Regression models for hourly diffuse solar radiation. Sol. Energy 2016, 125, 111–124.
24. Karakoti, I.; Das, P.K.; Singh, S. Predicting monthly mean daily diffuse radiation for India. Appl. Energy 2012, 91, 412–425.
25. Starke, A.R.; Lemos, L.F.; Barni, C.M.; Machado, R.D.; Cardemil, J.M.; Boland, J.; Colle, S. Assessing one-minute diffuse fraction models based on worldwide climate features. Renew. Energy 2021, 177, 700–714.
26. Ohmura, A.; Dutton, E.G.; Forgan, B.; Frohlich, C.; Gilgen, H.; Hegner, H.; Heimo, A.; König-Langlo, G.; McArthur, B.; Muller, G.; et al. Baseline Surface Radiation Network (BSRN/WCRP): New precision radiometry for climate research. Bull. Am. Meteorol. Soc. 1998, 79, 2115–2136.
27. Kottek, M.; Grieser, J.; Beck, C.; Rudolf, B.; Rubel, F. World Map of the Köppen-Geiger climate classification updated. Meteorol. Z. 2006, 15, 259–263.
28. Biga, A.J.; Rosa, R. Contribution to the study of the solar radiation climate of Lisbon. Sol. Energy 1979, 23, 61–67.

Figure 3. Density plot of diffuse fraction k_d vs. the clearness index k_t at the station in Payerne, Switzerland, in 2020. Measured and estimated values of k_d with the tested separation models are displayed.

Figure 4. Models’ accuracy is evaluated in terms of nRMSE at every station from D_TEST when the solar irradiance components (a) direct normal DNI and (b) diffuse DHI are estimated.

Figure 5. Ranking of the proposed (M1 and M2) and the reference (PB, E2, S1, S3, and Y4) separation models built on the basis of nRMSE reached at the estimation of (a) direct-normal DNI and (b) diffuse DHI irradiances at every station from D_TEST. The stations are clustered by climate [27]: (A) tropical, (B) arid, (C) temperate and (D) continental.

Table 1. List of the stations providing data in the D_TEST dataset. At each station, data recorded over one year are used, as indicated in the last column.

No.	BSRN Indicative	Station	Latitude [deg]	Longitude [deg]	Elevation [m]	Climate	Year
1	BRB	Brasilia	−15.601	−47.713	1023	Aw	2011
2	COC	Cocos Island	−12.193	96.835	6	Aw	2009
3	DAR	Darwin	−12.425	130.891	30	Aw	2011
4	DWN	Darwin Met Office	−12.424	130.893	32	Aw	2010
5	KWA	Kwajalein	8.720	167.731	10	Af	2000
6	MAN	Momote	−2.058	147.425	6	Af	2010
7	NAU	Nauru Island	−0.521	166.917	7	Af	2007
8	ASP	Alice Springs	−23.798	133.888	547	BSh	2015
9	FPE	Fort Peck	48.316	−105.100	634	BSk	2009
10	GOB	Gobabeb	−23.561	15.042	407	BWk	2015
11	SBO	Sede Boqer	30.860	34.779	500	BWh	2011
12	SOV	Solar Village	24.910	46.410	650	BWh	2002
13	TAM	Tamanrasset	22.790	5.529	1385	BWh	2008
14	BER	Bermuda	32.267	−64.667	8	Cfa	2008
15	BIL	Billings	36.605	−97.516	317	Cfa	2007
16	BOU	Boulder	40.050	−105.007	1577	Cfb	2004
17	CAB	Cabauw	51.971	4.927	0	Cfb	2021
18	CAM	Camborne	50.217	−5.317	88	Cfb	2003
19	CAR	Carpentras	44.083	5.059	100	Cfb	2005
20	CNR	Cener	42.816	−1.601	471	Cfb	2012
21	FUA	Fukuoka	33.582	130.376	3	Cfa	2013
22	GCR	Goodwin Creek	34.254	−89.872	98	Cfa	2009
23	ISH	Ishigakijima	24.337	124.164	5.7	Cfa	2013
24	LAU	Lauder	−45.045	169.689	350	Cfb	2007
25	LIN	Lindenberg	52.210	14.122	125	Cfb	2019
26	PAL	Palaiseau	48.713	2.208	156	Cfb	2017
27	PAY	Payerne	46.815	6.944	491	Cfb	2019
28	SMS	Sao Martinho da Serra	−29.443	−53.823	489	Cfa	2007
29	TAT	Tateno	36.058	140.126	25	Cfa	2010
30	BON	Bondville, Illinois	40.066	−88.366	213	Dfb	2009
31	BUD	Budapest-Lorinc	47.429	19.182	139.1	Dfb	2020
32	PSU	Rock Springs	40.720	−77.933	376	Dfb	2009
33	REG	Regina	50.205	−104.713	578	Dfb	2011
34	SAP	Sapporo	43.060	141.329	17.2	Dfb	2013
35	SXF	Sioux Falls	43.730	−96.620	473	Dfa	2009
36	TOR	Toravere	58.254	26.462	70	Dfc	2019

Table 2. Mean rank (Equation (7)) of the proposed (M1 and M2) and the reference (PB, E2, S1, S3, and Y4) separation models evaluated for the estimation of direct-normal DNI and diffuse DHI irradiances at all 36 stations from D_TEST.

Model		PB	E2	S1	S3	Y4	M1	M2
Mean rank	DNI	4.86	6.58	2.30	3.44	4.30	2.02	4.47
Mean rank	DHI	4.83	6.94	4.33	2.11	4.61	2.05	3.11
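The hierarchy implied by Table 2 can be recovered by sorting the mean ranks. The short Python sketch below simply re-types the table values and assumes, consistently with M1 being reported as ranking first, that a lower mean rank indicates a better model; the names are illustrative.

```python
# Mean ranks copied from Table 2 (lower is assumed to be better).
mean_rank = {
    "DNI": {"PB": 4.86, "E2": 6.58, "S1": 2.30, "S3": 3.44,
            "Y4": 4.30, "M1": 2.02, "M2": 4.47},
    "DHI": {"PB": 4.83, "E2": 6.94, "S1": 4.33, "S3": 2.11,
            "Y4": 4.61, "M1": 2.05, "M2": 3.11},
}

def hierarchy(component):
    """Return the model names ordered from best to worst mean rank."""
    ranks = mean_rank[component]
    return sorted(ranks, key=ranks.get)

for component in ("DNI", "DHI"):
    print(component, hierarchy(component))
```

Under this reading, M1 is first for both components, while M2 is third for DHI and fifth for DNI.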

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
