Since the 1980s, a considerable number of studies have been presented for the generation of TMY using equations from Finkelstein-Schafer (FS) statistics according to the methodology proposed by Sandia National Laboratories. These studies are mostly established with different climatic indices, weighting coefficients, and persistence criteria in the final process of selecting the appropriate sequences. In this work, to determine the importance of meteorological parameters in the growing period of microalgae in HRAP for WWT or plants in agriculture for crops, the Sandia methodology was used considering different scenarios of weighting coefficients and dividing the dataset into intervals to define the FS statistic.
2.1. Sandia National Laboratories Methodology
The Sandia methodology is widely present in the literature and turns out to be one of the most common methods for calculating a TMY [
8,
17,
18,
19,
20]. The TMY is obtained from multiannual historical series, for instance, 30 years (climate cycle), of different meteorological parameters: among others, temperature (mean, maximum, minimum, and range) and solar irradiance (global horizontal irradiance). At first, these parameters were data measured at the study site (26 SOLMET stations) for 23 years beginning in 1953 and extending through 1975 [
8]. From the available daily time series, the Sandia methodology selected 12 Typical Meteorological Months (TMM) to establish information on the annual variability of the parameters studied. Using the FS statistic, a TMM is chosen for each of the 12 calendar months of all the years available in the time series. This was done by assigning a weighting factor (wf) to the meteorological parameters considered, resulting in a reduction in the amount of data, losing the least amount of information as possible [
21,
22]. The wf can vary, depending on the importance of the variable [
23]. The dataset achieved represents a typical year of reliable data in the simulation of energy of renewable energy technologies [
20].
In addition to using FS statistics for generating a TMY, some studies introduced other approaches, such as the principal component analysis or genetic algorithms [
24,
25]. There are other methodologies, different from those listed above, based on the availability of meteorological data and the application of the generated sequence. Among them are the Test Reference Year (TRY) [
26,
27], the Design Reference Year (DRY) [
28], and the Short Reference Years (SRY) [
29]. To date, these methodologies have had remarkable results compared to average long-term weather data from meteorological stations [
19,
21,
30,
31].
2.2. Case of Study
Crop simulation is important to know the morphological characteristics of the crop according to the meteorological parameters and to anticipate in decision-making on agriculture, food security, climate change, energy saving, etc. [
32,
33]. To address the importance of meteorological influence, in this work, the application of a modified methodology to generate a typical weather sequence is applied; in this case, a TMW is applied in order to be used in the growth simulation of microalgae in a HRAP in the Madrid region. On the other hand, studies have been done on microalgae productivity as a raw material in the generation of high value-added products or as a source of energy. Therefore, some authors have used estimates of climate variables (Cligen) to incorporate them into microbial growth models to estimate microalgae production [
34,
35].
Microalgae are phototrophic microorganisms that grow rapidly and reproduce in hours. Therefore, microalgae generate a large amount of biomass in a relatively short time compared to other living species.
Biomass production and WWT are affected by uncontrollable meteorological parameters that vary throughout the cultivation period. Among these parameters, the temperature and solar irradiance between 400 and 700 nm (PAR) [
36] are indispensable for microalgae growth [
37,
38,
39,
40,
41]. The work carried out in [
37] shows that the observed reduction in the mean daily PAR radiation entering the greenhouse affects the plant metabolism. The same effect is observed when the temperature stress is applied to the crop [
42].
Therefore, due to the short hydraulic retention time for microalgae development, a TMS per meteorological season is studied using the data for the Madrid region. The four TMWs to be generated, one for each meteorological season, are based on the PAR and temperature in 18 wastewater treatment plants (WWTP) that already exist in the Madrid region, as shown in
Figure 1.
Due to the availability of simultaneous PAR and temperature data in these locations of WWTPs, a 15-year set of PAR and daily mean, maximum, and minimum air temperature values was used. PAR has been obtained from Kato bands, provided by the spectral resolved irradiance (SRI) of the Satellite Application Facility on Climate Monitoring (CM-SAF), which belongs to the European Organization for the Exploitation of Meteorological Satellites (EUMETSAT) [
43]. The daily mean temperature was obtained from the European Center for Medium-Range Weather Forecasts (ECMWF) [
44]. The period used in the present study was 1991 to 2005, with a spatial resolution of 0.125° × 0.125°.
These four parameters are then grouped into a matrix in which two additional columns, the temperature range and the solar day length, were added by calculations. The latter is used to take into account the photoperiod; that is, the number of solar hours during which microalgae are exposed to PAR and the maximum possible duration of the solar day [
45].
Madrid is almost located in the center of the Iberian Peninsula (between 41.15° N and 39.88° N latitude and between 3.05° W and 4.57° W longitude) on the Central Plateau, and the altitude ranges from 476 to 2428 m and the average is 678 m above sea level, with a surface area of approximately 8000 km
2. The orography of the Madrid region is characterized by the presence of the Central Mountain Range in the north and west of the territory, while the remaining areas are the plains and the Tajo River Valley. The climate of the region is strongly influenced by its orography. Therefore, in the range and its surroundings, there is a mountain climate (Dsc according to the Köppen-Geiger classification) [
46,
47,
48] and an oceanic-Mediterranean climate (Csb). On the other hand, there is a typical Mediterranean climate (Csa) in the plains and a semi-arid climate (BSk) in the southern areas and around the Tajo River Valley. This variation in climate could be estimated by generating TMS that reflect this variability in the growth of microalgae.
2.3. Applying the TMS Methodology
The growth and productivity of microalgae is challenged by multiple cultivation parameters, such as pH, nutrients, light, temperature, agitation, cultivation medium, etc. For our case, only physical parameters, solar irradiance (PAR and solar day length), and temperatures are considered in this scientific approach to study the effects of both parameters on microalgae activity.
Therefore, the existence of values of these parameters at which the culture is at its optimum level or not leads us to consider different weight factor cases for temperature and irradiance. For this purpose, an approach based on TMY methodologies is used to determine the importance of the meteorological parameters in the growth period of microalgae. Moreover, as these two cultivation parameters change significantly between two meteorological seasons, a seasonal approach is adopted. Each season is examined separately, following a multistage process.
Firstly, the whole set of data available (15 years in our case study) has been distributed in the four seasons (Ew = 1, 2, 3, 4): 1 = spring (March–May), 2 = summer (June–August), 3 = autumn (September–November), and 4 = winter (December–February) in that order. These seasons are based on the annual temperature cycle and not on the astronomical seasons, so there is a clear transition between them. For each location, there is a time series corresponding to 14 seasons (Ay = 1, 2, …, 14) for each of these four weather seasons. Thereafter, a period of time is identified as follows EwAy. For example, E1A4 represents the spring season (E1) of 1994 (A4), which is the first spring of the 14 spring seasons that we have between March 1991 and May 2004. Additionally, E4A14 represents the winter season that starts in December 2004 and ends in February 2005.
Since it is intended to characterize one week (S
p) over a season, and a week is a set of seven consecutive days (not necessarily beginning on Monday and ending on Sunday), the proposal is based on an increase in the available data so that the number of candidate weeks over the study period increases. In addition, for a given season, the weeks are constituted in such a way that there is a discontinuity when passing from one year to the next. In other words, in a sequence (week in our case), we cannot have days that come from two different years. This procedure will generate p weeks from the q available days per meteorological season (d
q) in the following way:
where
p = q − 6. This represents an increase of nearly 700% in the number of candidate weeks for a season in each year (season) of the time series. The data obtained (all 7-day packages) represent, for example, all candidate weeks for all spring seasons between March 1991 and May 2004. The same has been done for the other meteorological seasons. Therefore, this procedure generated a good number (q) of candidate weeks for each of these four weather seasons: 1204 for spring, 1204 for summer, 1190 for autumn, and finally, 1184 for winter.
Thereafter, for the entire dataset corresponding to each season and for each week (of each season), a Cumulative Distribution Function (CDF), Equation (1), is determined for each one of the six selected meteorological parameters: PAR, solar day length, mean, maximum, minimum, and temperature range.
The CDF of each meteorological parameter (x) was calculated by classifying the dataset into equally sized intervals, often called lags, because the size of the long-term data is different from that of short-term data. This is why it is interesting to use lags to perform Equation (2). Thereafter, the number n of observations is equal to the number of lags (n). Finally, the observations are arranged in ascending order x
1, x
2, …, x
n. The CDF of each observation is given by a monotonically increasing step function defined by:
where k is the order number from 1 to n − 1.
Then, the FS statistics of each sequence (in our case one week) for each given parameter (x) are obtained from the following Equation (2). In other words, the FS statistics for the candidate week are obtained by calculating the differences between the CDF (defined in Equation (1)) for this week (short-term) with the CDF for all the weeks contained in the corresponding weather season (long term) for each parameter and location.
with CDF
lt and CDF
st as the long-term and short-term CDF of parameter x.
A weighted sum (WS) of the FS statistics corresponding to each parameter (FS
j) of each week is calculated by applying a weight factor (wf
j), where m is the number of meteorological parameters:
The weighting factor chosen will depend on the importance that each parameter has on the growth of microalgae and must comply with:
Following the process, the ‘best’ candidate weeks (applying different options of wfs) are chosen according to a proportion determining the impact on the growth of microalgae. Thus, the proposal is to analyze the influence that these wfs have on the ranking of candidate weeks for a TMW.
Indeed, the generation of TMYs was done using different climate parameters and different weighting factors [
10,
17,
25,
45,
49,
50]. All these proposals are essentially similar; the main differences are the climate parameters to be included (type and quantity) and their corresponding weighting factors.
The studied parameters: temperature (mean, maximum, minimum, and range); PAR; and solar day length play an important role in the development of a TMS. However, in our case study, they do not have the same impact on microalgae productivity. Therefore, some meteorological parameters may be more important than others. The most influential parameters receive the highest weighting factor (wfj), which is considered representative of their impact on microalgae growth.
In
Table 1, nine scenarios with different wfs are proposed to test different options of wfs. This will allow us to check the robustness of the methodology. The idea is to give equal importance to the temperature parameters—maximum (Tmx), minimum (Tmn), mean (Tme), range (Trg), solar irradiance, PAR, and solar day length (Nsol).
Finally, the most representative sequence—week, in our case—among the five best-candidate weeks is obtained by determining the frequency of repetition of the candidate weeks, taking into account their persistence according to the different lags and wfs. Furthermore, the final decision on the choice of the TMW is also affected by its position in the particular season period. This position is validated by calculating the difference in Nsol between the day in the middle of the weather season and the fourth day of the candidate week, Equation (5). This was done to avoid extreme values for a season that could compromise expected results. The difference in Nsol is defined as follows:
where
represents the Nsol of the fourth day of the week in the middle of the considered weather season (w = 1, 2, 3, and 4 for spring, summer, autumn, and winter).
is Nsol of the fourth day of the given candidate week.