Genetic Algorithm-Optimized Extreme Learning Machine Model for Estimating Daily Reference Evapotranspiration in Southwest China

Liu, Quanshan; Wu, Zongjun; Cui, Ningbo; Zhang, Wenjiang; Wang, Yaosheng; Hu, Xiaotao; Gong, Daozhi; Zheng, Shunsheng

doi:10.3390/atmos13060971


by Quanshan Liu ¹, Zongjun Wu ¹, Ningbo Cui ¹,*, Wenjiang Zhang ¹, Yaosheng Wang ², Xiaotao Hu ³, Daozhi Gong ² and Shunsheng Zheng ¹

¹ State Key Laboratory of Hydraulics and Mountain River Engineering, College of Water Resource and Hydropower, Sichuan University, Chengdu 610065, China

² Institute of Environment and Sustainable Development in Agriculture, Chinese Academy of Agricultural Sciences, Beijing 100081, China

³ Key Laboratory of Agricultural Soil and Water Engineering in Arid and Semiarid Areas, Ministry of Education, Northwest A & F University, Yangling, Xianyang 712100, China

* Author to whom correspondence should be addressed.

Atmosphere 2022, 13(6), 971; https://doi.org/10.3390/atmos13060971

Submission received: 28 April 2022 / Revised: 7 June 2022 / Accepted: 10 June 2022 / Published: 15 June 2022

(This article belongs to the Section Biosphere/Hydrosphere/Land–Atmosphere Interactions)


Abstract:

Reference evapotranspiration (ET₀) is an essential component of hydrological and ecological processes. The Penman–Monteith (PM) model recommended by the Food and Agriculture Organization of the United Nations (FAO) requires a number of meteorological parameters, so there is an urgent need for high-precision and computationally efficient ET₀ models with fewer parameter inputs. This study used the genetic algorithm (GA) to optimize the extreme learning machine (ELM), and evaluated the performance of ELM, GA-ELM, and empirical models for estimating daily ET₀ in Southwest China. Daily meteorological data including maximum temperature (T_max), minimum temperature (T_min), wind speed (u₂), relative humidity (RH), net radiation (R_n), and global solar radiation (R_s) during 1992–2016 from meteorological stations were used for model training and testing. The results of the FAO-56 Penman–Monteith formula were used as the reference. The results showed that GA-ELM models (with R² ranging 0.71–0.99, RMSE ranging 0.036–0.77 mm·d⁻¹) outperformed the standalone ELM models (with R² ranging 0.716–0.99, RMSE ranging 0.08–0.77 mm·d⁻¹) during training and testing, and both were superior to empirical models (with R² ranging 0.36–0.91, RMSE ranging 0.69–2.64 mm·d⁻¹). ET₀ prediction accuracy varied with the input combination. The machine learning models using T_max, T_min, u₂, RH, and R_n/R_s (GA-ELM5/GA-ELM4 and ELM5/ELM4) obtained the best ET₀ estimates, with R² ranging 0.98–0.99 and RMSE ranging 0.03–0.21 mm·d⁻¹, followed by models with T_max, T_min, and R_n/R_s (GA-ELM3/GA-ELM2 and ELM3/ELM2) as inputs. The machine learning models using R_n outperformed those using R_s when the number of input parameters was the same. Overall, GA-ELM5 (T_max, T_min, u₂, RH and R_n as inputs) outperformed the other models during training and testing, and was thus recommended for daily ET₀ estimation. With estimation accuracy, computational cost, and availability of input parameters taken into account, GA-ELM2 (T_max, T_min, and R_s as inputs) was determined to be the most effective model for estimating daily ET₀ with limited meteorological data in Southwest China.

Keywords:

reference evapotranspiration; extreme learning machine; optimization algorithm; empirical model; solar radiation

1. Introduction

Reference evapotranspiration (ET₀) is an important component of hydrological and ecological processes and a key parameter in determining crop water requirements for agricultural water management [1,2]. The Penman–Monteith (PM) model, based on both thermodynamics and aerodynamics, is theoretically rigorous and gives accurate estimates. Therefore, the PM model has been recommended by the Food and Agriculture Organization of the United Nations (FAO) as the standard equation for estimating ET₀ and is widely accepted [3]. However, the FAO-56 PM model requires long-term records of a number of meteorological parameters as input variables, and its data integrity requirements are extremely high, which limits its application in areas where complete meteorological data are lacking. Hence, there is an urgent need to develop high-precision ET₀ models with fewer parameter inputs.

Many empirical ET₀ models with limited meteorological inputs have been developed, e.g., temperature-based, radiation-based, and mass transfer-based models [4]. The Hargreaves model [5] is a classic temperature-based model for ET₀ estimation, popular for its high prediction accuracy with only maximum and minimum temperatures and radiation data as inputs. According to Priestley, Taylor, and Samani [6,7], about 80% of ET₀ can be explained by air temperature and solar radiation, which allows radiation-based models to predict ET₀ accurately. Feng et al. [8] evaluated the computational performance of five empirical ET₀ models (Hargreaves model, Modified Hargreaves model, Makkink model, Priestley–Taylor model and Ritchie model) in the humid region of Southwest China, and found that the radiation-based empirical models (Priestley–Taylor and Ritchie models) had higher prediction accuracy. The Romanenko model [9] is a mass transfer-based model that adds relative humidity (RH) to the temperature input. ET₀ predictions using data from 203 weather stations distributed across Brazil showed that adding relative humidity as an input to ET₀ models yielded better prediction performance at a low cost. Nonetheless, because empirical models are sensitive to climate zone and geographic location, applying them directly to other regions usually reduces their estimation accuracy. Djaman et al. [10] evaluated 16 empirical models for estimating ET₀, and found that the empirical models either overestimated or underestimated ET₀. Zhang et al. [11] found that the calibrated Makkink model performed better in ET₀ prediction than uncalibrated models.

Artificial intelligence techniques have developed rapidly in recent decades and have been applied in many fields. Due to their excellent performance in tackling nonlinear regression problems, machine learning models have been widely used to describe complex hydrological phenomena and estimate ET₀ [12,13,14]. Tabari et al. [15] evaluated the performances of support vector machine (SVM), adaptive neuro-fuzzy inference system (ANFIS), multiple linear regression (MLR), multiple non-linear regression (MNLR), four temperature-based models, and eight radiation-based empirical models for ET₀ estimation in the semi-arid climate of Iran. It was found that the SVM and ANFIS models performed best. Chen et al. [16] used three deep learning methods, namely deep neural network (DNN), temporal convolution neural network (TCN), and long short-term memory neural network (LSTM), and two machine learning models, namely SVM and random forest (RF), to predict daily ET₀ under limited meteorological data conditions in the Northeast Plain of China. The results showed that the prediction accuracy of the deep learning and machine learning models was higher than that of empirical models based on radiation and humidity data. The extreme learning machine (ELM) model, proposed by Huang et al. [17], consists of three layers: an input layer, a hidden layer (neurons), and an output layer, and can reduce computational costs. Abdullah et al. [18] analyzed the performance of ELM and back propagation neural network (BPNN) models in estimating ET₀ in Iraq. They found that the two models had similar accuracy, but ELM showed better computational efficiency. Chia et al. [19] hybridized the ELM with three swarm-based optimization algorithms, namely particle swarm optimization (PSO), moth-flame optimization (MFO), and the whale optimization algorithm (WOA), to predict ET₀ in East Malaysia. It was concluded that WOA-ELM outperformed PSO-ELM and MFO-ELM. Wu et al. [20] evaluated the performances of ELM models optimized by four bio-inspired algorithms (i.e., genetic algorithm (GA), ant colony optimization (ACO), cuckoo search algorithm (CSA), and flower pollination algorithm (FPA)) for estimating daily ET₀. The results supported the capability of bio-inspired optimization algorithms, especially the FPA and CSA algorithms, to improve the performance of the conventional ELM model in daily ET₀ prediction in contrasting climates of China.

Southwest China is not only an important grain production base, but also an ecological barrier in China. In recent years, the frequency and intensity of seasonal droughts have been increasing in Southwest China due to global warming. In the future, water resources in Southwest China will become more fragile under the impacts of climate change and human activities, resulting in decreased precipitation and intensified dry-hot weather [21,22]. It is apparent from the studies reviewed above that the ELM model is well suited to ET₀ estimation. Although high prediction accuracy is very important when employing the ELM model, low computational effort is also essential [23]. Therefore, tuning ELM model parameters with optimization algorithms can provide more reliable ET₀ estimates, which is of great significance for achieving precise irrigation, optimizing water resources management, and maintaining ecological security in Southwest China. This study employed GA to optimize the ELM model in order to improve the accuracy of daily ET₀ estimation and reduce the calculation cost of the ELM model under different input combinations. The aims of the study were to: (1) build GA-ELM and ELM models for modeling ET₀ in Southwest China based on daily meteorological data during 1992–2016; (2) optimize the weights and thresholds of the GA-ELM model with five input combinations; (3) identify the optimal daily ET₀ forecasting model among ELM, GA-ELM, and empirical models in Southwest China to provide a scientific basis for agricultural water management.

2. Materials and Methods

2.1. Study Area and Data Sets

Southwest China (20°54′–36°53′ N, 78°28′–112°04′ E) covers 2.6 million km², accounting for 27% of China’s territorial area. The climate in Southwest China is dominated by subtropical monsoon and plateau mountain climates. The altitude of the study area varies greatly, and the terrain is complex and changeable, consisting mainly of basins, mountainous and hilly areas, and plateau mountain areas. Southwest China is rich in water resources, with an annual precipitation of 800 mm. However, the annual precipitation varies significantly in space, with a decreasing tendency from east to west.

Considering the topography, elevation, and landform characteristics, Southwest China is further divided into five sub-zones [24,25], i.e., the Qinghai–Tibet Plateau (QTP), the Northwest Sichuan Plateau (NSP), the Sichuan Basin (SB), the Yunnan–Guizhou Plateau (YGP), and the Guangxi Basin (GB) (Figure 1). QTP is an ecologically fragile region with an average altitude of 4000 m and sparse vegetation. NSP is one of the five largest pastoral areas in China and the largest animal husbandry base in Sichuan. The SB is one of the main grain-producing areas in China, with dense population and developed agriculture. YGP is one of the four plateaus in China and experiences many types of disastrous weather. The GB has an average elevation of less than 200 m and is high in the northwest and low in the southeast.

The daily meteorological data of fifteen representative meteorological stations in Southwest China from 1992 to 2016 were selected from the China Meteorological Administration (http://data.cma.cn, accessed on 1 November 2021), including maximum temperature (T_max), minimum temperature (T_min), wind speed (u₂), relative humidity (RH), net surface radiation (R_n), and global solar radiation (R_s). The annual mean values of meteorological parameters at the studied meteorological stations can be found in Table 1.

2.2. Penman–Monteith Model

The PM model incorporates aerodynamic effects and the principle of energy balance, and has been recommended by FAO as the reference model for ET₀ calculation because of its rigorous theory and high calculation accuracy. The specific expression [3] of the formula is:

ET_{0} = \frac{0.408 ∆ (R_{n} - G) + γ \frac{900}{T_{mean} + 273} u_{2} (e_{s} - e_{a})}{∆ + γ (1 + 0.34 u_{2})}

(1)

where ET₀ is the reference evapotranspiration (mm·d⁻¹); R_n is the net surface radiation (MJ·m⁻²·d⁻¹); G is the soil heat flux density (MJ·m⁻²·d⁻¹); T_mean is the mean daily air temperature (°C); u₂ is the wind speed at 2 m height (m·s⁻¹); e_s is the saturation vapor pressure (kPa); e_a is the actual vapor pressure (kPa); Δ is the slope of the saturation vapor pressure curve (kPa/°C); γ is the psychrometric constant (kPa/°C). The detailed calculation processes can be found in FAO-56 [3].
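As a minimal sketch, Equation (1) can be written as a short Python function. The function and argument names are illustrative and are not taken from the paper. The two helper functions use the standard FAO-56 expressions for saturation vapor pressure and for the slope Δ; they are included here as an assumption about how the intermediate terms would typically be obtained, since the text defers those details to FAO-56 [3].

```python
import math


def saturation_vapor_pressure(t_celsius):
    """Saturation vapor pressure (kPa) at air temperature t_celsius (FAO-56 form)."""
    return 0.6108 * math.exp(17.27 * t_celsius / (t_celsius + 237.3))


def slope_vapor_pressure_curve(t_celsius):
    """Slope of the saturation vapor pressure curve (kPa/degC), FAO-56 form."""
    return 4098.0 * saturation_vapor_pressure(t_celsius) / (t_celsius + 237.3) ** 2


def penman_monteith_et0(rn, g, t_mean, u2, es, ea, delta, gamma):
    """Daily reference evapotranspiration (mm/d) following Equation (1).

    rn, g  : net radiation and soil heat flux density (MJ m-2 d-1)
    t_mean : mean daily air temperature (degC)
    u2     : wind speed at 2 m height (m/s)
    es, ea : saturation and actual vapor pressure (kPa)
    delta  : slope of the vapor pressure curve (kPa/degC)
    gamma  : psychrometric constant (kPa/degC)
    """
    radiation_term = 0.408 * delta * (rn - g)
    aerodynamic_term = gamma * 900.0 / (t_mean + 273.0) * u2 * (es - ea)
    return (radiation_term + aerodynamic_term) / (delta + gamma * (1.0 + 0.34 * u2))
```

With zero wind speed the aerodynamic term vanishes and the result reduces to the radiation term divided by (Δ + γ), which gives a quick sanity check on any implementation.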

2.3. Empirical Models

2.3.1. Romanenko Model

The Romanenko model [9] predicts ET₀ from temperature and humidity, and is expressed as follows:

ET_{0} = 0.00006 {(T_{mean} + 25)}^{2} (100 - RH)

(2)

where T_mean is the mean daily air temperature (°C); RH is the relative humidity (%).

2.3.2. Makkink Model

The Makkink model [26] is a radiation-based model for predicting ET₀, expressed as follows:

ET_{0} = 0.61 \left( \frac{∆}{∆ + γ} \right) \frac{R_{s}}{λ} - 0.12

(3)

where Δ and γ are defined in Equation (1); R_s is the global solar radiation (MJ·m⁻²·d⁻¹); λ is the latent heat of vaporization (MJ/kg).

2.3.3. Tabari Model

Hossein Tabari [27] proposed an ET₀ prediction model based on solar radiation under humid conditions:

ET_{0} = 0.156 R_{s} - 0.0112 T_{max} + 0.0733 T_{min} - 0.478

(4)

where T_max and T_min are maximum and minimum air temperatures (°C), respectively.

2.3.4. Irmak–Allen Model

Irmak [28] proposed an ET₀ prediction model based on net surface radiation in humid climate:

ET_{0} = 0.489 + 0.289 R_{n} + 0.023 T_{mean}

(5)

2.3.5. Priestley–Taylor Model

Priestley and Taylor [6] derived the Priestley–Taylor formula from net radiation and soil heat flux, and the specific expression is:

ET_{0} = 1.26 \frac{∆}{∆ + γ} \frac{R_{n} - G}{λ}

(6)

where Δ, γ, R_n and G were defined in Equation (1).
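The five empirical formulas in Equations (2)–(6) translate directly into code. The sketch below is illustrative only: the function names are hypothetical, and the default value of λ (2.45 MJ/kg, the constant commonly used in FAO-56) is an assumption, since the text does not state which value was used.

```python
LAMBDA = 2.45  # latent heat of vaporization (MJ/kg); assumed value, not given in the text


def romanenko(t_mean, rh):
    """Equation (2): temperature- and humidity-based ET0 (mm/d)."""
    return 0.00006 * (t_mean + 25.0) ** 2 * (100.0 - rh)


def makkink(rs, delta, gamma, lam=LAMBDA):
    """Equation (3): radiation-based ET0 (mm/d) using global solar radiation R_s."""
    return 0.61 * (delta / (delta + gamma)) * rs / lam - 0.12


def tabari(rs, t_max, t_min):
    """Equation (4): ET0 (mm/d) from R_s and maximum/minimum temperature."""
    return 0.156 * rs - 0.0112 * t_max + 0.0733 * t_min - 0.478


def irmak_allen(rn, t_mean):
    """Equation (5): ET0 (mm/d) from net radiation R_n and mean temperature."""
    return 0.489 + 0.289 * rn + 0.023 * t_mean


def priestley_taylor(rn, g, delta, gamma, lam=LAMBDA):
    """Equation (6): ET0 (mm/d) from net radiation R_n and soil heat flux G."""
    return 1.26 * delta / (delta + gamma) * (rn - g) / lam
```

Each function takes daily values in the units defined for Equations (1)–(6), so the outputs can be compared directly with FAO-56 PM results for the same day.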

2.4. Extreme Learning Machine and Optimization Algorithms

2.4.1. Extreme Learning Machine

The ELM model, proposed by Huang et al. [17], is a single-hidden-layer feedforward neural network. In ELM, the input weights and hidden-layer thresholds are randomly generated and the number of hidden-layer neurons is set in advance; the unique optimal solution for the output weights can then be obtained by a simple matrix calculation [8]. The ELM model has the advantages of fast learning speed and good generalization performance; it can solve regression and fitting problems and has also been widely used in classification, pattern recognition, and other fields. As shown in Figure 2, the ELM model consists of three layers: input layer, hidden layer, and output layer. Further details about the ELM model can be found in reference [17].
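The training principle described above (random input weights and thresholds, then a single matrix solve for the output weights) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the class name, the sigmoid activation, the uniform initialization range, and the number of hidden neurons are all assumptions chosen for illustration.

```python
import numpy as np


class ELM:
    """Minimal single-hidden-layer ELM regressor (illustrative sketch)."""

    def __init__(self, n_hidden=30, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation of the hidden layer (assumed activation function).
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        n_features = X.shape[1]
        # Input weights and thresholds are drawn at random and never updated.
        self.W = self.rng.uniform(-1.0, 1.0, size=(n_features, self.n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, size=self.n_hidden)
        # Output weights are the least-squares solution via the pseudoinverse.
        H = self._hidden(X)
        self.beta = np.linalg.pinv(H) @ y
        return self

    def predict(self, X):
        return self._hidden(X) @ self.beta


def demo_training_rmse():
    """Fit the sketch on synthetic data and return the training RMSE."""
    rng = np.random.default_rng(1)
    X = rng.uniform(-1.0, 1.0, size=(300, 2))
    y = np.sin(X[:, 0]) + 0.5 * X[:, 1]
    model = ELM(n_hidden=40).fit(X, y)
    return float(np.sqrt(np.mean((model.predict(X) - y) ** 2)))
```

Because only the output weights are solved for, training cost is dominated by one pseudoinverse, which is the source of the speed advantage mentioned above. In practice the inputs would be the scaled meteorological variables rather than the synthetic data used in `demo_training_rmse`.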

2.4.2. Extreme Learning Machine Optimized by Genetic Algorithm

Genetic algorithm (GA) was developed by Holland [29] to search the optimal solution by simulating the process of biological evolution, and is usually used to deal with nonlinear optimization problems [30]. The basic logic of GA is Darwin’s evolution theory. GA begins with a population representing a potential solution to an optimization problem, while the population consists of many individuals with genetic codes. The principle of GA is to convert the solution of the problem into chromosomes. According to the principle of survival of the fittest, the information in chromosomes is transformed by selection, crossover, and mutation, and eventually evolves into the target chromosome. In other words, the evolution of generations produces the approximate optimal solution which is close to the objective function.

The internal parameters of the ELM are randomly initialized, which can lead to poor stability and generalization performance. Estimation of the kernel-based ELM can be improved by fine-tuning the internal parameters. Specifically, GA can be applied to optimize the parameters of the ELM model, thereby improving model performance, because it is capable of finding optimal solutions and improving the computational speed. In the GA-ELM model, the input weights and hidden-layer node thresholds form the chromosome vectors of GA. In short, GA was used to search for the optimal input weights and thresholds of ELM. The optimization process is presented in Figure 3. For more details on GA-ELM, see reference [29].
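The following sketch shows one way the GA-ELM idea could be coded: each chromosome holds the flattened input weights and hidden-layer thresholds, the fitness is the RMSE of the resulting ELM, and selection, crossover, and mutation produce the next generation. The population size, rates, tournament selection, single-point crossover, Gaussian mutation, and elitism are all assumptions made for illustration; the text does not specify the GA settings used in the study.

```python
import numpy as np


def elm_rmse(chrom, X, y, n_hidden):
    """Fitness: training RMSE of an ELM built from the chromosome's weights/thresholds."""
    n_features = X.shape[1]
    W = chrom[: n_features * n_hidden].reshape(n_features, n_hidden)
    b = chrom[n_features * n_hidden:]
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))
    beta = np.linalg.pinv(H) @ y  # output weights solved analytically, as in ELM
    return float(np.sqrt(np.mean((H @ beta - y) ** 2)))


def ga_optimize_elm(X, y, n_hidden=10, pop_size=20, generations=15,
                    crossover_rate=0.8, mutation_rate=0.05, seed=0):
    """Search ELM input weights and thresholds with a simple GA (hypothetical settings)."""
    rng = np.random.default_rng(seed)
    n_genes = X.shape[1] * n_hidden + n_hidden
    pop = rng.uniform(-1.0, 1.0, size=(pop_size, n_genes))
    history = []  # best RMSE found in each generation
    for _ in range(generations):
        fitness = np.array([elm_rmse(c, X, y, n_hidden) for c in pop])
        pop = pop[np.argsort(fitness)]  # sort so that index 0 is the fittest
        history.append(float(fitness.min()))
        children = [pop[0].copy()]  # elitism: keep the best chromosome unchanged
        while len(children) < pop_size:
            # Binary tournament: of two random individuals, the lower index is fitter.
            p1 = pop[min(rng.integers(pop_size, size=2))]
            p2 = pop[min(rng.integers(pop_size, size=2))]
            child = p1.copy()
            if rng.random() < crossover_rate:
                cut = rng.integers(1, n_genes)  # single-point crossover
                child[cut:] = p2[cut:]
            mask = rng.random(n_genes) < mutation_rate
            child[mask] += rng.normal(0.0, 0.1, size=int(mask.sum()))  # Gaussian mutation
            children.append(child)
        pop = np.array(children)
    fitness = np.array([elm_rmse(c, X, y, n_hidden) for c in pop])
    best = int(np.argmin(fitness))
    history.append(float(fitness[best]))
    return pop[best], history
```

Because the best chromosome is copied into each new generation, the recorded best RMSE cannot get worse from one generation to the next. A real application would more likely evaluate fitness on a validation subset rather than on the training data, to limit overfitting.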

2.5. Input Combinations of Meteorological Parameters

In this study, the daily data during 1992–2011 from 15 representative meteorological stations were used to train the ELM and GA-ELM models, and the remaining data (2012–2016) were used for testing. As shown in Figure 4, R_n and R_s had the highest correlation with ET₀, followed by temperature, while u₂ and RH had the lowest correlation with ET₀, which was consistent with the results of Chia and Sharma [19,31]. Previous studies have shown that global solar radiation and temperature were significant meteorological factors affecting ET₀ in Southwest China [8,25,32]. Therefore, the temperature-related and radiation-related factors were selected as the input parameters in this study. The number and combination of input variables affect the performance of a machine learning model. In the case of limited meteorological parameters, parameters with reliable correlation can effectively improve the accuracy of models for estimating daily ET₀ [33]. This study proposed appropriate input combinations based on the correlation between different climate variables and ET₀. The input combinations are shown in Table 2.
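The chronological split and the input combinations can be sketched as follows. The column labels and the function name are hypothetical. Only the combinations for models 2–5 are listed, as stated in the abstract; the two-input combination (model 1) is defined in Table 2 and is not reproduced here.

```python
import numpy as np

# Input combinations for models 2-5 as described in the abstract (labels are illustrative).
INPUT_COMBINATIONS = {
    "model2": ["T_max", "T_min", "R_s"],
    "model3": ["T_max", "T_min", "R_n"],
    "model4": ["T_max", "T_min", "u_2", "RH", "R_s"],
    "model5": ["T_max", "T_min", "u_2", "RH", "R_n"],
}


def chronological_split(years, features, target, last_train_year=2011):
    """Split daily records into training (<= last_train_year) and testing sets."""
    years = np.asarray(years)
    train = years <= last_train_year
    return (features[train], target[train]), (features[~train], target[~train])
```

Splitting by year rather than at random keeps the testing period (2012–2016) strictly after the training period, which matches the setup described above.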

2.6. Model Evaluation

Five commonly used statistical indicators were used to analyze and compare the accuracy and performance of different models in estimating daily ET₀, including coefficient of determination (R²), root mean square error (RMSE), relative root mean square error (RRMSE), mean absolute error (MAE), and global performance indicator (GPI), which were defined in the following equations:

R^{2} = \frac{{[\sum_{i = 1}^{n} (X_{i} - \bar{X}) (Y_{i} - \bar{Y})]}^{2}}{\sum_{i = 1}^{n} {(X_{i} - \bar{X})}^{2} \sum_{i = 1}^{n} {(Y_{i} - \bar{Y})}^{2}}

(7)

RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(Y_{i} - X_{i})}^{2}}

(8)

RRMSE = \frac{\sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(Y_{i} - X_{i})}^{2}}}{\bar{X}}

(9)

MAE = \frac{1}{n} \sum_{i = 1}^{n} | Y_{i} - X_{i} |

(10)

GPI_{i} = \sum_{k = 1}^{4} α_{k} (Z_{i k} - Z_{k})

(11)

where X_i and Y_i are the measured and predicted ET₀ values, respectively; \bar{X} and \bar{Y} are the corresponding mean ET₀ values; n is the number of data points. GPI_i is the GPI value of model i; k indexes the four statistical indicators (R², RMSE, RRMSE, and MAE); Z_ik is the value of indicator k for model i; Z_k is the median of indicator k. α_k equals −1 for RMSE, RRMSE, and MAE, and 1 for R². The closer R² is to 1, the better the model performs; the closer RMSE, RRMSE, and MAE are to 0, the smaller the model’s simulation error. The best-performing model with the highest GPI was ranked No.1, while the model corresponding to the lowest GPI was ranked No.15.
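Equations (7)–(11) can be sketched in NumPy as below. The function names are illustrative. The GPI function follows Equation (11) exactly as written, taking the median of each indicator across models; if the indicators are meant to be scaled to a common range before this step, that scaling would need to be added, since the text does not describe it.

```python
import numpy as np


def r2(x, y):
    """Equation (7): squared correlation between reference x and predicted y."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    dx, dy = x - x.mean(), y - y.mean()
    return float(np.sum(dx * dy) ** 2 / (np.sum(dx ** 2) * np.sum(dy ** 2)))


def rmse(x, y):
    """Equation (8): root mean square error."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    return float(np.sqrt(np.mean((y - x) ** 2)))


def rrmse(x, y):
    """Equation (9): RMSE relative to the mean of the reference values."""
    return rmse(x, y) / float(np.mean(x))


def mae(x, y):
    """Equation (10): mean absolute error."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    return float(np.mean(np.abs(y - x)))


def gpi(scores):
    """Equation (11): GPI per model.

    scores has one row per model and four columns ordered R2, RMSE, RRMSE, MAE.
    """
    z = np.asarray(scores, float)
    alpha = np.array([1.0, -1.0, -1.0, -1.0])  # +1 for R2, -1 for the error measures
    return (alpha * (z - np.median(z, axis=0))).sum(axis=1)
```

Ranking the models by descending GPI, for example with `np.argsort(-gpi(scores))`, puts the best-performing model first.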

3. Results

3.1. Performances of Reference Evapotranspiration Models in the Five Zones

The statistical results of ELM, GA-ELM, and empirical models for predicting ET₀ in each sub-zone during training and testing are presented in Table 3. It can be observed from Table 3 that the R² and RMSE of ELM models were 0.59–0.99 and 0.08–0.77 mm·d⁻¹, respectively; the corresponding values of GA-ELM models were 0.56–0.99 and 0.03–0.77 mm·d⁻¹, respectively; the corresponding values of empirical models were 0.36–0.91 and 0.69–2.64 mm·d⁻¹, respectively. GA-ELM models showed the highest prediction accuracy in each sub-zone, followed by the ELM models, while the empirical models had the lowest overall accuracy.

As seen from Table 3, GA-ELM and ELM models with complete inputs showed the best prediction accuracy in each sub-zone during training and testing. The R² and RMSE of the GA-ELM and ELM models with complete inputs were 0.98–0.99 and 0.03–0.21 mm·d⁻¹, respectively; the corresponding values of machine learning models with three input parameters were 0.83–0.98 and 0.34–0.63 mm·d⁻¹, and those of models with two input parameters were 0.56–0.95 and 0.60–0.77 mm·d⁻¹.

The prediction accuracy of two-input models was the worst, and the GPI rank of two-input models in SB and YGP was even lower than that of IA and PT models, respectively. The accuracy of three-input models was significantly improved compared with that of the two-input models, while the accuracy of the complete input models was slightly higher than that of the three-parameter input models, indicating that three-input models can also predict ET₀ with acceptable accuracy in areas where meteorological parameters were difficult to obtain completely.

ELM and GA-ELM models using R_n always gave more accurate results than those using R_s for the same number of inputs. With T_max, T_min, R_n/R_s, u₂, and RH as inputs, the R² of the GA-ELM5 and ELM5 models was above 0.99, and their RMSE ranged 0.03–0.07 mm·d⁻¹ and 0.08–0.15 mm·d⁻¹, respectively; the R² of the GA-ELM4 and ELM4 models was above 0.98, and their RMSE ranged 0.11–0.15 mm·d⁻¹ and 0.14–0.21 mm·d⁻¹, respectively. The ranking was therefore GA-ELM5 > GA-ELM4 and ELM5 > ELM4. With T_max, T_min, and R_n/R_s as inputs, GA-ELM3 (using R_n) exhibited the best prediction accuracy in QTP, NSP, SB and GB, while GA-ELM2 (using R_s) outperformed the other models in YGP. Similar results were found for the empirical models. The accuracy of the IA and PT models was better than that of the other empirical models. The IA model was the optimal empirical model in NSP, SB, and GB. The PT model outperformed the other empirical models in QTP and YGP.

It can be observed from Table 3 that, in terms of prediction accuracy during training and testing, the studied models ranked in the order GA-ELM, ELM, and empirical models, indicating that GA could effectively improve the performance of ELM models. The statistical indicators of each station during training were superior to those during testing. The main reason was the difference between the training and testing datasets, as well as in the time series of the meteorological data. It might also be related to the lack of accurate meteorological data caused by changes in environment and climate, resulting in differences in the estimation accuracy of the ET₀ models. During the training and testing stages, the models with complete inputs produced satisfactory ET₀ estimates. The GPI of the GA-ELM5 model ranked first during testing and training, so the GA-ELM5 model (with R² above 0.99 and RMSE ranging 0.03–0.07 mm·d⁻¹) was recommended as the daily ET₀ estimation model in QTP. However, the three-input models also exhibited good accuracy in ET₀ estimation. Considering the availability of input parameters, GA-ELM2 was also capable of predicting daily ET₀ with limited input parameters, with R² ranging 0.83–0.97 and RMSE ranging 0.34–0.63 mm·d⁻¹.

3.2. Performances of Reference Evapotranspiration Models in Southwest China

The statistical results of the ELM, GA-ELM, and empirical models for predicting ET₀ in Southwest China during training and testing are presented in Table 4. During training, the R² and RMSE of ELM models were 0.94–0.99 and 0.09–0.41 mm·d⁻¹, respectively; the corresponding values of GA-ELM models were 0.95–0.99 and 0.07–0.41 mm·d⁻¹, respectively. During testing, the R² and RMSE of the ELM models were 0.73–0.99 and 0.10–0.77 mm·d⁻¹, respectively; the corresponding values of GA-ELM models were 0.73–0.99 and 0.03–0.77 mm·d⁻¹, respectively; the corresponding values of empirical models were 0.36–0.83 and 1.01–2.64 mm·d⁻¹, respectively. The GA-ELM5 model also showed the highest accuracy during testing, with R² and RMSE of 0.99 and 0.03 mm·d⁻¹, respectively. The IA model performed best among the empirical models in Southwest China. GA-ELM models had the highest accuracy, followed by ELM and empirical models. The GA-ELM5 model performed better than the other models during both the testing and training stages, so this model was recommended for daily ET₀ estimation in Southwest China. Considering the availability of input parameters, GA-ELM2 was also capable of predicting daily ET₀ with limited input parameters.

The accuracy of machine learning models with complete inputs was better than that of models with three input parameters in Southwest China (Table 4). The two-input models had the lowest prediction accuracy. When the number of input parameters was the same, ELM and GA-ELM models using R_n always gave more accurate results than those using R_s, and the ranking was GA-ELM5 > GA-ELM4, ELM5 > ELM4, GA-ELM3 > GA-ELM2, ELM3 > ELM2. Among the empirical models, the IA and PT models, which use R_n, outperformed the temperature- and R_s-based models. Zhang et al. [34] found that solar radiation was a highly influential meteorological factor in ET₀ calculation, and that small changes in it considerably affect ET₀ prediction [35]. Most areas in Southwest China (such as SB, GB, and YGP) are low-radiation areas in China, and air pollution caused by human activities may further weaken the solar radiation. Therefore, compared with R_s, R_n had a greater impact on ET₀ in Southwest China and was an important meteorological factor affecting ET₀ estimation.

Figure 5 shows boxplots of the performance of the 15 ET₀ prediction models. As seen from Figure 5, accuracy differed among ELM, GA-ELM, and empirical models when the input parameters were not the same. For the same inputs, the average median values of R² and RMSE of GA-ELM models were 0.91 (ranging 0.77–0.99) and 0.36 mm·d⁻¹ (ranging 0.059–0.70 mm·d⁻¹), respectively; the corresponding values of ELM models were 0.91 (ranging 0.77–0.99) and 0.39 mm·d⁻¹ (ranging 0.08–0.53 mm·d⁻¹), respectively; the corresponding values of empirical models were 0.79 (ranging 0.62–0.85) and 1.24 mm·d⁻¹ (ranging 0.81–1.44 mm·d⁻¹), respectively. In conclusion, the GA-ELM models showed the best accuracy, with the highest R² and lowest RMSE and MAE, indicating that GA-ELM models were highly suitable for predicting daily ET₀ in Southwest China.

For the same number of input parameters, the models using R_n were more accurate than those using R_s. Compared with the complete-input models based on R_s, the GA-ELM/ELM models with complete inputs including R_n improved the accuracy of daily ET₀ prediction during the testing stage, with the average median R² increasing by 0.79%/0.89% and the average median RMSE and MAE decreasing by 62.65%/37.96% and 67.69%/41.26%, respectively. The average median R² of the GA-ELM/ELM models with three input parameters including R_n increased by 0.25% and 0.06%, respectively, and that of MAE decreased by 0.34% and increased by 0.42%, respectively. Similar results were found among the empirical models, where the average median R² and RMSE of the IA model were 0.82 and 0.81 mm·d⁻¹, respectively, and the corresponding values of the PT model were 0.79 and 1.18 mm·d⁻¹, respectively. Therefore, GA-ELM and ELM models with R_n showed better simulation performance, which indicated that R_n played a key role in ET₀ forecasting in Southwest China.

The scatter plots of ET₀ values predicted by ELM models against FAO-56 PM values during testing (2012–2016) at fifteen representative meteorological stations are presented in Figure 6. As seen from the figure, ELM1 with two input parameters produced more scattered estimates than the other ELM models, with R² ranging 0.56–0.89. ELM2 and ELM3 with three input parameters were closer to the FAO-56 PM values, and the fitting accuracy of the two models was very similar, with R² ranging 0.77–0.99 and 0.78–0.99, respectively. The ELM3 model had higher fitting accuracy at all stations except those in YGP, which may be because YGP is an area with high altitude but low solar radiation, so R_s may affect ET₀ in YGP more than R_n. ELM4 and ELM5 with complete inputs outperformed the other models, with R² ranging 0.97–0.99 and 0.98–0.99, respectively. ELM5 (with T_max, T_min, R_n, u₂ and RH as inputs) showed the highest accuracy for predicting ET₀. Figure 7 shows the scatter plots of ET₀ values predicted by GA-ELM models against FAO-56 PM values during the testing stage at the fifteen representative meteorological stations. As seen from Figure 7, GA-ELM1 with two input parameters produced more scattered ET₀ points than the other GA-ELM models, with R² ranging 0.56–0.89. GA-ELM2 and GA-ELM3 with three input parameters were closer to the FAO-56 PM values, and the fitting accuracy of the two models was also very similar, with R² ranging 0.78–0.99 and 0.79–0.99, respectively. Similarly, the fitting accuracy of the GA-ELM3 model was slightly higher than that of the GA-ELM2 model at all stations except those in YGP. The distributions of ET₀ values predicted by GA-ELM4 and GA-ELM5 were close to those of the FAO-56 PM ET₀ values, further highlighting their better prediction accuracy for daily ET₀, with R² ranging 0.98–0.99 and 0.994–0.99, respectively. GA-ELM5 (with T_max, T_min, R_n, u₂ and RH as inputs) produced the best accuracy for predicting ET₀. In summary, GA-ELM models exhibited better accuracy than ELM models. ELM and GA-ELM models with complete input parameters had higher simulation accuracy at each station. ELM and GA-ELM models using R_n (ELM3, GA-ELM3, ELM5, and GA-ELM5) showed better prediction accuracy than those using R_s at each station.

Figure 8 shows the GPI rank of each model at all stations. The figure shows that GA-ELM ranked higher than ELM at most stations, which indicated that GA-ELM was broadly applicable for predicting daily ET₀ in Southwest China. The top four ET₀ prediction models by GPI rank during testing were GA-ELM5, ELM5, GA-ELM4, and ELM4, which showed that data-driven models with complete meteorological inputs gave more accurate estimates [36]. GA-ELM5 ranked 1st at all stations. The GPI ranks of the models using R_n (GA-ELM5, ELM5, GA-ELM3, and ELM3) were higher than those of the models using R_s (GA-ELM4, ELM4, GA-ELM2, and ELM2) when the number of input parameters was the same, which showed that R_n was an important meteorological parameter affecting ET₀ in Southwest China. Considering the availability of input parameters and prediction accuracy, GA-ELM5 could be recommended for estimating ET₀ if the meteorological data are complete, with R² ranging 0.994–0.99 and RMSE ranging 0.03–0.11 mm·d⁻¹. GA-ELM2 was capable of predicting daily ET₀ with limited input parameters, with R² ranging 0.78–0.99 and RMSE ranging 0.14–0.67 mm·d⁻¹.

4. Discussion

4.1. ELM Models Produced More Accurate ET₀ Estimates Than Empirical Models in Southwest China

In the present study, ELM models offered better accuracy than empirical models for daily ET₀ prediction in Southwest China. The daily ET₀ values predicted by the five empirical models were generally underestimated at most stations, and the prediction accuracy of the empirical models was lower than that of the ELM models. The calculation of ET₀ can be considered a complicated, nonlinear regression process depending on a large number of meteorological variables. The simplified empirical models require fewer input meteorological parameters, and are vulnerable to climatic changes and geographical conditions. Therefore, it is difficult to develop accurate empirical models for ET₀ prediction with limited meteorological inputs. Empirical models have obvious regional characteristics, so they are suitable for estimating weekly or monthly ET₀ changes [37].

ELM models handle nonlinear relationships between variables well [38], which can significantly improve the accuracy of ET₀ prediction. Previous studies have shown that the ELM model is superior for ET₀ estimation and has good stability [18,39,40]. Zhu et al. [41] found that ELM provided more accurate ET₀ estimates than six empirical models (including radiation-, temperature-, and mass transfer-based empirical models). Feng et al. [8] evaluated the computational performance of ELM and five empirical models for predicting ET₀ in the humid area of Southwest China, and the results indicated that the accuracy of the ELM model was much better than that of the empirical models.

4.2. Combination of Input Parameters Decided Accuracy of ET₀ Prediction Models

The combination of input parameters had an important impact on the prediction accuracy of the ELM model for estimating ET₀. In the present study, the accuracy of ELM increased with the number of input parameters, and the results of ELM and GA-ELM with complete input parameters were in good agreement with those of the FAO-56 PM model. The results were generally in accordance with the previous results reported by Antonopoulos and Torres [37,42]. The more meteorological parameters were input, the more information was available for ET₀ prediction, which improved the prediction accuracy of machine learning models. Feng et al. [43] found that RF and GRNN models with complete inputs of T_max, T_min, T_mean, RH, u₂ and R_s obtained satisfactory results in ET₀ prediction. Yu et al. [40] applied different combinations of T_max, T_min, u₂, RH, and R_s to predict ET₀ in the arid Altay Prefecture, and the results confirmed that the ELM model with complete inputs exhibited the best prediction accuracy. Chia et al. [19] employed six input combinations to evaluate the accuracy of hybridized ELM models; the models with T_max, T_min, T_mean, RH, u₂ and R_s as input parameters accurately predicted ET₀, and the prediction accuracy of the ELM model decreased as the number of input parameters was reduced. Generally, more input parameters improved the ET₀ prediction accuracy of machine learning models, but the contribution of each meteorological parameter to the ET₀ estimation was different.

The input combination strategy of meteorological data also affected the performance of machine learning models. Previous studies have shown that solar radiation and sunshine duration were important meteorological factors affecting ET₀ in tropical and subtropical monsoon zones of China [4]. In the present study, the accuracy of models with radiation parameters was clearly higher than that of models with temperature parameters only. In addition, when the same number of parameters was input, the ELM and GA-ELM models with R_n as input performed slightly better than those with R_s in most sub-zones, indicating that R_n was a more influential factor in ET₀ prediction in Southwest China. Feng et al. [8,43] also found that models employing R_n performed better than models employing R_s in SB. According to Irmak et al. [28], the prediction accuracy of the R_n-based model was better than that of the R_s-based model in humid climates. Nevertheless, the contribution of each input parameter to the ET₀ prediction varied among zones in Southwest China. In the present study, models with R_s performed better than models with R_n in YGP. As a result of the changeable terrain and climate, as well as uneven radial distribution of solar radiation in YGP, R_s was the main meteorological factor affecting ET₀ [44]. In actual observations, both R_s and R_n can be obtained by pyranometer at the meteorological observation station [45]. However, R_s and R_n are usually calculated from other meteorological parameters in areas lacking radiation observations. R_s can be calculated from extraterrestrial radiation and sunshine duration [46]. R_n is the difference between net shortwave radiation and net longwave radiation, and net shortwave radiation is calculated from R_s [3]. ET₀ was more sensitive to R_n in Southwest China, but R_s was easier to obtain than R_n and still gave ET₀ predictions of acceptable accuracy. Therefore, R_s is a better choice as an input parameter for ET₀ estimation, especially in YGP.
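The dependence of R_n on R_s described above can be made concrete with a short Python sketch of the FAO-56 radiation chain [3]. It assumes the default Ångström coefficients (a_s = 0.25, b_s = 0.50), a reference-crop albedo of 0.23, and the FAO-56 clear-sky and net longwave expressions; the function names and the input values are illustrative and are not taken from the study's stations.

```python
import math

SIGMA = 4.903e-9  # Stefan-Boltzmann constant, MJ K-4 m-2 d-1

def solar_radiation(ra, sunshine_h, daylight_h, a_s=0.25, b_s=0.50):
    """R_s from extraterrestrial radiation ra and relative sunshine duration."""
    return (a_s + b_s * sunshine_h / daylight_h) * ra

def net_radiation(rs, ra, t_max_c, t_min_c, ea_kpa, elevation_m, albedo=0.23):
    """R_n = net shortwave - net longwave, all in MJ m-2 d-1."""
    rns = (1.0 - albedo) * rs                       # net shortwave
    rso = (0.75 + 2e-5 * elevation_m) * ra          # clear-sky radiation
    tk4 = ((t_max_c + 273.16) ** 4 + (t_min_c + 273.16) ** 4) / 2.0
    cloud = 1.35 * min(rs / rso, 1.0) - 0.35        # cloudiness correction
    rnl = SIGMA * tk4 * (0.34 - 0.14 * math.sqrt(ea_kpa)) * cloud
    return rns - rnl

# Illustrative inputs: ra = 25.1 MJ m-2 d-1, 7.1 h sunshine out of 10.9 h daylight.
rs = solar_radiation(ra=25.1, sunshine_h=7.1, daylight_h=10.9)
rn = net_radiation(rs, ra=25.1, t_max_c=25.1, t_min_c=19.1,
                   ea_kpa=2.1, elevation_m=0.0)
print(round(rs, 2), round(rn, 2))
```

R_n needs temperature, humidity, and elevation in addition to R_s, which is why R_s is the cheaper input when radiation is not observed directly.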

4.3. GA Improved the Performance of ELM Models

It is well known that the prediction of the ELM model mainly depends on its kernel function. The internal parameters of the ELM model are randomly initialized, which may lead to non-optimal solutions and local optima and thus degrade the performance of the ELM model [41,47]. Therefore, applying an optimization algorithm to tune the parameters of machine learning models can improve the ET₀ prediction accuracy of ELM. Zhu et al. [41] utilized a PSO algorithm to determine the optimal parameters of the ELM model under limited input conditions. The results showed that the PSO-ELM model accurately predicted daily ET₀ in Northwest China. Wu et al. [48] proposed three hybrid models for predicting daily ET₀ in different climatic zones of China, and the results indicated that the biological heuristic algorithms effectively improved the performance of the ELM model. The present study showed that GA effectively improved the performance of ELM in predicting ET₀, which was consistent with the above conclusions.
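The random initialization mentioned above is easiest to see in code. The following is a minimal Python sketch of a basic ELM regressor in the sense of Huang et al. [17]: input weights and hidden thresholds are drawn at random and never trained, and only the output weights are solved with the Moore–Penrose pseudoinverse. The class name, the sigmoid activation, the uniform [−1, 1] initialization, and the synthetic data are assumptions for illustration, not the configuration used in this study.

```python
import numpy as np

class ELM:
    """Single-hidden-layer ELM regressor with random, untrained hidden weights."""

    def __init__(self, n_hidden, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation of the hidden layer.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        # Random input weights and thresholds (assumed uniform in [-1, 1]).
        self.W = self.rng.uniform(-1.0, 1.0, (X.shape[1], self.n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, self.n_hidden)
        # Output weights: least-squares solution via the pseudoinverse.
        self.beta = np.linalg.pinv(self._hidden(X)) @ y
        return self

    def predict(self, X):
        return self._hidden(X) @ self.beta

# Synthetic stand-in for scaled meteorological inputs and a nonlinear target.
rng = np.random.default_rng(1)
X = rng.random((300, 3))
y = 2.0 * X[:, 0] + np.sin(3.0 * X[:, 1]) + X[:, 2] ** 2
model = ELM(n_hidden=30).fit(X[:200], y[:200])
rmse_test = float(np.sqrt(np.mean((model.predict(X[200:]) - y[200:]) ** 2)))
print(rmse_test)
```

Changing the seed changes W and b and therefore the fitted model, which is the sensitivity that a search over these parameters is meant to reduce.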

Previous studies showed that the number of hidden neurons was affected by the number of input parameters. Meanwhile, the computational cost of ELM was associated with the hidden neurons [49]. Optimization efficiency is a major factor in the computational cost of optimized ELM models [49]. In the context of this study, the task of GA was to search for the optimal hidden neurons for the ELM model. To further evaluate the comprehensive performance of the GA-ELM model, the averaged computational time of machine learning models with different inputs was recorded (Figure 9). In each sub-zone, the computational cost was significantly reduced under each input combination. The averaged computational cost savings of combinations 1–5 (see Table 2) were 42%, 45%, 44%, 39%, and 41%, respectively. Increasing the number of input parameters to the ELM model improved the prediction accuracy of ET₀ estimation but decreased optimization efficiency, which was in agreement with the conclusions of Chia et al. [19] and Rosado-Muñoz and Frances-Villora [50]. The internal kernel function of ELM is too complex to be determined in a single step [17]. GA has strong global convergence ability and does not require gradient information [51], so the input weights and node thresholds of ELM could be optimized by GA. In conclusion, GA effectively reduced the computational cost of the ELM model and improved the prediction accuracy, and it can be recommended for optimizing machine learning models for ET₀ estimation.
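A gradient-free search over the ELM input weights and node thresholds can be sketched as a small real-coded GA in Python. This is a minimal illustration and not the GA-ELM configuration of Section 2.4.2: the population size, number of generations, tournament selection, uniform crossover, Gaussian mutation, elitism, and the use of validation RMSE as fitness are all assumptions, and the names elm_rmse and ga_elm are hypothetical.

```python
import numpy as np

def elm_rmse(chrom, n_hidden, X_tr, y_tr, X_va, y_va):
    """Decode a chromosome into (W, b), solve beta, return validation RMSE."""
    n_in = X_tr.shape[1]
    W = chrom[: n_in * n_hidden].reshape(n_in, n_hidden)
    b = chrom[n_in * n_hidden:]
    hidden = lambda X: 1.0 / (1.0 + np.exp(-(X @ W + b)))
    beta = np.linalg.pinv(hidden(X_tr)) @ y_tr
    return float(np.sqrt(np.mean((hidden(X_va) @ beta - y_va) ** 2)))

def ga_elm(X_tr, y_tr, X_va, y_va, n_hidden=10, pop_size=20,
           generations=15, mutation_rate=0.1, seed=0):
    rng = np.random.default_rng(seed)
    n_genes = X_tr.shape[1] * n_hidden + n_hidden
    pop = rng.uniform(-1.0, 1.0, (pop_size, n_genes))
    score = lambda P: np.array(
        [elm_rmse(c, n_hidden, X_tr, y_tr, X_va, y_va) for c in P])
    history = []                                   # best RMSE per generation
    for _ in range(generations):
        fitness = score(pop)
        history.append(float(fitness.min()))
        children = [pop[np.argmin(fitness)].copy()]        # elitism
        while len(children) < pop_size:
            a = rng.integers(pop_size, size=2)             # tournament 1
            c = rng.integers(pop_size, size=2)             # tournament 2
            p1 = pop[a[np.argmin(fitness[a])]]
            p2 = pop[c[np.argmin(fitness[c])]]
            child = np.where(rng.random(n_genes) < 0.5, p1, p2)  # uniform crossover
            hit = rng.random(n_genes) < mutation_rate            # genes to mutate
            children.append(child + hit * rng.normal(0.0, 0.2, n_genes))
        pop = np.array(children)
    fitness = score(pop)
    k = int(np.argmin(fitness))
    return pop[k], float(fitness[k]), history

# Synthetic stand-in data; real use would pass scaled station records.
rng = np.random.default_rng(1)
X = rng.random((240, 3))
y = 2.0 * X[:, 0] + np.sin(3.0 * X[:, 1]) + X[:, 2] ** 2
best, best_rmse, history = ga_elm(X[:160], y[:160], X[160:], y[160:])
print(history[0], best_rmse)
```

Because the best chromosome is copied unchanged into each new generation, the best validation RMSE cannot get worse from one generation to the next; a separate held-out test set would still be needed to report accuracy.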

However, this study only explored the accuracy and computational cost for the ET₀ estimation of the ELM/GA-ELM model under different input combinations. Using different machine learning models to predict daily ET₀ under limited meteorological data input, evaluating the potential of different machine learning models for ET₀ prediction, and applying a variety of more advanced optimization algorithms to optimize ELM or other machine learning models to improve the accuracy of ET₀ estimation will be important research aspects of ET₀ prediction. In addition, the accuracy and computational cost of GA-ELM models for ET₀ prediction in satellite-based ET retrievals should be further explored.

5. Conclusions

In the present study, ELM and GA-ELM models for ET₀ estimation were developed based on daily meteorological data from 15 representative meteorological stations in Southwest China. The prediction accuracy of the ELM and GA-ELM models under five input combinations of meteorological data, and of five empirical models, was evaluated against the ET₀ calculated from the PM model. The results showed that:

(1) GA-ELM models performed best in estimating ET₀ in Southwest China, with R² ranging 0.56–0.99 and RMSE ranging 0.033–0.771 mm·d⁻¹, followed by ELM models, with R² ranging 0.59–0.99, RMSE ranging 0.080–0.77 mm·d⁻¹, and the empirical models performed worst, with R² ranging 0.36–0.91, RMSE ranging 0.69–2.64 mm·d⁻¹.

(2) The GA-ELM5 model performed best among the models with five input parameters (T_max, T_min, u₂, RH, R_n/R_s), with R² ranging 0.9939–0.9996, RMSE ranging 0.03–0.11 mm·d⁻¹. The GA-ELM3 model performed best with input combinations of three parameters (T_max, T_min, R_n/R_s), with R² ranging 0.79–0.99, RMSE ranging 0.15–0.65 mm·d⁻¹. For the input combination of T_max and T_min, the GA-ELM1 model outperformed the ELM1 model, with R² ranging 0.56–0.89, RMSE ranging 0.47–0.87 mm·d⁻¹. Models involving R_n outperformed those involving R_s for the same number of input parameters.

(3) GA-ELM5 model (T_max, T_min, u₂, RH, R_n) could be recommended as the ET₀ estimation model for Southwest China under complete meteorological data, with R² ranging 0.9939–0.9996, RMSE ranging 0.03–0.11 mm·d⁻¹. Considering that R_s is easier and cheaper to obtain than R_n, the GA-ELM2 model (T_max, T_min, R_s) was also acceptable to estimate daily ET₀ in Southwest China with high accuracy under limited meteorological data conditions, with R² ranging 0.78–0.99, RMSE ranging 0.14–0.67 mm·d⁻¹.

This study can provide a reliable method for accurate prediction of daily ET₀ in Southwest China under the condition of missing or limited meteorological parameters, and provide reference for optimizing ELM model or other machine learning models to predict ET₀. In addition, applying multiple advanced optimization algorithms to optimize machine learning models to improve the estimation accuracy will become an important research direction for ET₀ prediction.

Author Contributions

Q.L.: conceptualization, methodology, investigation, writing—original draft. Z.W.: formal analysis, investigation. N.C.: conceptualization, methodology, writing—review and editing, funding acquisition. W.Z.: writing—review and editing. Y.W.: resources, writing—review and editing. X.H.: writing—review and editing. D.G.: methodology. S.Z.: investigation, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (51922072, 51779161) and the National Key Research and Development Program of China (No.2016YFC0400206).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The meteorological data used in this study are available here: http://data.cma.cn, accessed on 1 November 2021.

Acknowledgments

The authors would like to thank the National Climatic Centre of the China Meteorological Administration for providing the climate database used in this study. Thanks also go to the reviewers and editors for thorough comments that really helped to improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Fan, J.; Wu, L.; Zhang, F.; Xiang, Y.; Zheng, J. Climate change effects on reference crop evapotranspiration across different climatic zones of China during 1956–2015. J. Hydrol. 2016, 542, 923–937.
2. Lai, C.; Chen, X.; Zhong, R.; Wang, Z. Implication of climate variable selections on the uncertainty of reference crop evapotranspiration projections propagated from climate variables projections under climate change. Agric. Water Manag. 2022, 259, 107273.
3. Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration-Guidelines for Computing Crop Water Requirements-FAO Irrigation and Drainage Paper 56; FAO: Rome, Italy, 1998.
4. Fan, J.; Yue, W.; Wu, L.; Zhang, F.; Cai, H.; Wang, X.; Lu, X.; Xiang, Y. Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric. For. Meteorol. 2018, 263, 225–241.
5. Hargreaves, G.H.; Samani, Z.A. Reference Crop Evapotranspiration from Temperature. Appl. Eng. Agric. 1985, 1, 96–99.
6. Priestley, C.H.B.; Taylor, R.J. On the Assessment of Surface Heat Flux and Evaporation Using Large-Scale Parameters. Mon. Weather Rev. 1972, 100, 81–92.
7. Samani, Z. Estimating solar radiation and evapotranspiration using minimum climatological data. J. Irrig. Drain. Eng. 2000, 126, 265–267.
8. Feng, Y.; Cui, N.; Zhao, L.; Hu, X.; Gong, D. Comparison of ELM, GANN, WNN and empirical models for estimating reference evapotranspiration in humid region of Southwest China. J. Hydrol. 2016, 536, 376–383.
9. Ferreira, L.B.; da Cunha, F.F.; de Oliveira, R.A.; Filho, E.I.F. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM–A new approach. J. Hydrol. 2019, 572, 556–570.
10. Balde, A.B.; Sow, A.; Muller, B.; Irmak, S.; N’Diaye, M.K.; Baboucarr, M.; Moukoumbi, Y.D.; Futakuchi, K.; Saito, K. Evaluation of sixteen reference evapotranspiration methods under sahelian conditions in the Senegal River Valley. J. Hydrol. Reg. Stud. 2015, 3, 139–159.
11. Zhang, Q.; Cui, N.; Feng, Y.; Gong, D.; Hu, X. Improvement of Makkink model for reference evapotranspiration estimation using temperature data in Northwest China. J. Hydrol. 2018, 566, 264–273.
12. Kisi, O.; Alizamir, M. Modelling reference evapotranspiration using a new wavelet conjunction heuristic method: Wavelet extreme learning machine vs. wavelet neural networks. Agric. For. Meteorol. 2018, 263, 41–48.
13. Mehdizadeh, S. Estimation of daily reference evapotranspiration (ETo) using artificial intelligence methods: Offering a new approach for lagged ETo data-based modeling. J. Hydrol. 2018, 559, 794–812.
14. Yassin, M.A.; Alazba, A.; Mattar, M.A. Artificial neural networks versus gene expression programming for estimating reference evapotranspiration in arid climate. Agric. Water Manag. 2016, 163, 110–124.
15. Tabari, H.; Kisi, O.; Ezani, A.; Talaee, P.H. SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment. J. Hydrol. 2012, 444, 78–89.
16. Chen, H.; Huang, J.J.; McBean, E. Partitioning of daily evapotranspiration using a modified shuttleworth-wallace model, random Forest and support vector regression, for a cabbage farmland. Agric. Water Manag. 2019, 228, 105923.
17. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501.
18. Abdullah, S.S.; Malek, M.; Abdullah, N.S.; Kisi, O.; Yap, K.S. Extreme Learning Machines: A new approach for prediction of reference evapotranspiration. J. Hydrol. 2015, 527, 184–195.
19. Chia, M.Y.; Huang, Y.F.; Koo, C.H. Swarm-based optimization as stochastic training strategy for estimation of reference evapotranspiration using extreme learning machine. Agric. Water Manag. 2020, 243, 106447.
20. Wu, L.; Zhou, H.; Ma, X.; Fan, J.; Zhang, F. Daily reference evapotranspiration prediction based on hybridized extreme learning machine model with bio-inspired optimization algorithms: Application in contrasting climates of China. J. Hydrol. 2019, 577, 123960.
21. Zhao, L.; Zhao, X.; Zhou, H.; Wang, X.; Xing, X. Prediction model for daily reference crop evapotranspiration based on hybrid algorithm and principal components analysis in Southwest China. Comput. Electron. Agric. 2021, 190, 106424.
22. Wang, P.; Wu, X.; Hao, Y.; Wu, C.; Zhang, J. Is Southwest China drying or wetting? Spatiotemporal patterns and potential causes. Theor. Appl. Climatol. 2019, 139, 1–15.
23. Hassan, M.A.; Khalil, A.; Kaseb, S.; Kassem, M. Exploring the potential of tree-based ensemble methods in solar radiation modeling. Appl. Energ. 2017, 203, 897–916.
24. Wu, Z.; Cui, N.; Zhu, B.; Zhao, L.; Wang, X.; Hu, X.; Wang, Y.; Zhu, S. Improved Hargreaves Model Based on Multiple Intelligent Optimization Algorithms to Estimate Reference Crop Evapotranspiration in Humid Areas of Southwest China. Atmos.-Basel 2021, 12, 15.
25. Zeng, Z.; Wu, W.; Zhou, Y.; Li, Z.; Hou, M.; Huang, H. Changes in Reference Evapotranspiration over Southwest China during 1960–2018: Attributions and Implications for Drought. Atmos.-Basel 2019, 10, 705.
26. Makkink, G.F. Testing the Penman Formula by Means of Lysimeters. J. Inst. Water Eng. 1957, 11, 277–288.
27. Tabari, H.; Grismer, M.E.; Trajkovic, S. Comparative analysis of 31 reference evapotranspiration methods under humid conditions. Irrig. Sci. 2013, 31, 107–117.
28. Irmak, S.; Irmak, A.; Allen, R.G.; Jones, J.W. Solar and Net Radiation-Based Equations to Estimate Reference Evapotranspiration in Humid Climates. J. Irrig. Drain. Eng. 2003, 129, 336–347.
29. Holland, J.H. Adaptation in Natural and Artificial Systems; University of Michigan Press: Ann Arbor, MI, USA, 1975.
30. Liu, W.; Chung, C.E. Enhancing the Predicting Accuracy of the Water Stage Using a Physical-Based Model and an Artificial Neural Network-Genetic Algorithm in a River System. Water-Sui. 2014, 6, 1642–1661.
31. Sharma, G.; Singh, A.; Jain, S. A hybrid deep neural network approach to estimate reference evapotranspiration using limited climate data. Neural Comput. Appl. 2021, 34, 4013–4032.
32. Mokhtar, A.; He, H.; Alsafadi, K.; Li, Y.; Zhao, H.; Keo, S.; Bai, C.; Abuarab, M.; Zhang, C.; Elbagoury, K.; et al. Evapotranspiration as a response to climate variability and ecosystem changes in southwest, China. Environ. Earth Sci. 2020, 79, 1–21.
33. Koca, A.; Oztop, H.F.; Varol, Y.; Koca, G.O. Estimation of solar radiation using artificial neural networks with different input parameters for Mediterranean region of Anatolia in Turkey. Expert Syst. Appl. 2011, 38, 8756–8762.
34. Zhang, Y.; Cui, N.; Feng, Y.; Gong, D.; Hu, X. Comparison of BP, PSO-BP and statistical models for predicting daily global solar radiation in arid Northwest China. Comput. Electron. Agric. 2019, 164, 104905.
35. Citakoglu, H. Comparison of artificial intelligence techniques via empirical equations for prediction of solar radiation. Comput. Electron. Agric. 2015, 118, 28–37.
36. Feng, Y.; Jia, Y.; Zhang, Q.; Gong, D.; Cui, N. National-scale assessment of pan evaporation models across different climatic zones of China. J. Hydrol. 2018, 564, 314–328.
37. Yang, Y.; Chen, R.; Han, C.; Liu, Z.; Wang, X. Optimal Selection of Empirical Reference Evapotranspiration Method in 36 Different Agricultural Zones of China. Agronomy 2022, 12, 31.
38. Wang, L.; Kisi, O.; Zounemat-Kermani, M.; Li, H. Pan evaporation modeling using six different heuristic computing methods in different climates of China. J. Hydrol. 2017, 544, 407–427.
39. Yin, Z.; Feng, Q.; Yang, L.; Deo, R.; Wen, X.; Si, J.; Xiao, S. Future Projection with an Extreme-Learning Machine and Support Vector Regression of Reference Evapotranspiration in a Mountainous Inland Watershed in North-West China. Water-Sui. 2017, 9, 880.
40. Yu, H.; Wen, X.; Li, B.; Yang, Z.; Wu, M.; Ma, Y. Uncertainty analysis of artificial intelligence modeling daily reference evapotranspiration in the northwest end of China. Comput. Electron. Agric. 2020, 176, 105653.
41. Zhu, B.; Feng, Y.; Gong, D.; Jiang, S.; Zhao, L.; Cui, N. Hybrid particle swarm optimization with extreme learning machine for daily reference evapotranspiration prediction from limited climatic data. Comput. Electron. Agric. 2020, 173, 105430.
42. Antonopoulos, V.Z.; Antonopoulos, A.V. Daily reference evapotranspiration estimates by artificial neural networks technique and empirical equations using limited input climate variables. Comput. Electron. Agric. 2017, 132, 86–96.
43. Feng, Y.; Jia, Y.; Cui, N.; Zhao, L.; Li, C.; Gong, D. Calibration of Hargreaves model for reference evapotranspiration estimation in Sichuan basin of southwest China. Agric. Water Manag. 2017, 181, 1–9.
44. Zhao, R.; Wang, K.; Wu, G.; Zhou, C. Temperature annual cycle variations and responses to surface solar radiation in China between 1960 and 2016. Int. J. Climatol. 2021, 41, E2959–E2978.
45. Besharat, F.; Dehghan, A.A.; Faghih, A.R. Empirical models for estimating global solar radiation: A review and case study. Renew. Sustain. Energy Rev. 2013, 21, 798–821.
46. Ångström, A. Solar and Terrestrial Radiation. 19. Mon. Weather Rev. 1924, 52, 83.
47. Mohapatra, P.; Chakravarty, S.; Dash, P.K. An improved cuckoo search based extreme learning machine for medical data classification. Swarm Evol. Comput. 2015, 24, 25–49.
48. Wu, Z.; Cui, N.; Hu, X.; Gong, D.; Wang, Y.; Feng, Y.; Jiang, S.; Lv, M.; Han, L.; Xing, L.; et al. Optimization of extreme learning machine model with biological heuristic algorithms to estimate daily reference crop evapotranspiration in different climatic regions of China. J. Hydrol. 2021, 603, 127028.
49. Tejada, A.T.; Ella, V.B.; Lampayan, R.M.; Reaño, C.E. Modeling Reference Crop Evapotranspiration Using Support Vector Machine (SVM) and Extreme Learning Machine (ELM) in Region IV-A, Philippines. Water-Sui. 2022, 14, 754.
50. Rosado-Muñoz, A.; Frances-Villora, J.V. Hardware implementation of real-time Extreme Learning Machine in FPGA: Analysis of precision, resource occupation and performance R. Comput. Electr. Eng. 2016, 51, 139–156.
51. Whitley, D. A Genetic algorithm tutorial. Stat. Comput. 1994, 4, 65–85.

Figure 1. Geographic locations of the representative meteorological stations and five zones in Southwest China.

Figure 2. Topological structure of extreme learning machine.

Figure 3. The flowchart of GA-ELM.

Figure 4. R² of meteorological parameters for ET₀ estimated using the Penman–Monteith model.

Figure 5. Boxplots of statistical indicators for each daily ET₀ prediction model in Southwest China. Blue: ELM models; green: GA-ELM models; yellow: empirical models.

Figure 6. Scatter plots of predicted ET₀ values using ELM models against their corresponding FAO56-PM values during testing in Southwest China. (Note: the fine line represents 1:1 line).

Figure 7. Scatter plots of predicted ET₀ values using GA-ELM models against their corresponding FAO56-PM values during testing in Southwest China. (Note: the fine line represents 1:1 line).

Figure 8. GPI ranking of each model at all stations.

Figure 9. Percentage of computational cost saving for GA-ELM models under different input combinations.

Table 1. Annual means of the main meteorological parameters at each station during 1992–2016.

Zone	Station	Lon (°)	Lat (°)	H (m)	T_max (°C)	T_min (°C)	RH (%)	u₂ (m·s⁻¹)	R_s (MJ·m⁻²·d⁻¹)	R_n (MJ·m⁻²·d⁻¹)
QTP	Shiquanhe	80.05	32.30	4278	9.02	−5.98	31	1.89	20.03	11.54
	Gaize	84.25	32.09	4414	8.90	−7.19	34	2.51	18.97	11.04
	Anduo	91.06	32.21	5200	5.25	−8.04	53	2.50	17.50	10.40
	Zedang	91.46	29.16	3560	17.28	2.93	42	1.51	18.26	10.41
NSP	Hongyuan	102.33	32.48	3491	10.72	−4.57	70	1.72	15.67	9.29
	Ganzi	100	31.37	3393	14.70	0.11	56	1.32	16.44	9.54
	Zuogong	97.5	29.40	3780	13.26	−1.10	55	1.00	15.91	3.15
SB	Bazhong	106.46	31.52	417	21.72	13.71	77	0.65	12.77	7.60
SB	Dazu	105.42	29.42	394	21.49	14.28	83	2.6	11.52	7.17
YGP	Dali	100.11	25.42	1990	21.49	10.66	67	1.76	16.31	10.65
	Huize	103.15	26.24	2188	19.33	9.02	69	1.92	16.63	10.84
	Meitan	107.28	27.46	792	19.68	12.51	80	1.32	11.95	8.29
	Yuanjiang	101.59	23.36	400	31.01	19.45	67	1.61	17.06	11.43
GB	Laibin	109.09	23.46	96	25.72	18.15	75	1.12	13.64	8.12
GB	Bama	107.15	24.08	254	26.15	17.36	80	0.95	13.87	8.25

Table 2. Summary of the input data for each ET₀ model.

Extreme Learning Machine	Extreme Learning Machine Optimized by Genetic Algorithm	Input Data	Empirical Models
ELM1	GA-ELM1	T_max, T_min	Romanenko (Ro)
ELM2	GA-ELM2	T_max, T_min, R_s	Makkink (MK)
ELM3	GA-ELM3	T_max, T_min, R_n	Tabari (TAB)
ELM4	GA-ELM4	T_max, T_min, R_s, u₂, RH	Irmak–Allen (IA)
ELM5	GA-ELM5	T_max, T_min, R_n, u₂, RH	Priestley–Taylor (PT)

Table 3. Statistical performance of ET₀ models in sub-zones.

Sub-zone	Model	Training						Testing
Sub-zone	Model	R²	RMSE (mm·d⁻¹)	RRMSE	MAE (mm·d⁻¹)	GPI	Rank	R²	RMSE (mm·d⁻¹)	RRMSE	MAE (mm·d⁻¹)	GPI	Rank
QTP	ELM1	0.94	0.62	0.17	0.49	−0.40	10	0.85	0.60	0.15	0.48	−0.50	10
	ELM2	0.97	0.41	0.11	0.33	0.06	7	0.93	0.40	0.10	0.31	0.00	8
	ELM3	0.98	0.40	0.11	0.31	0.10	6	0.94	0.40	0.10	0.31	0.02	6
	ELM4	0.99	0.20	0.20	0.16	0.38	4	0.98	0.21	0.05	0.16	0.45	4
	ELM5	0.99	0.15	0.04	0.11	0.63	3	0.99	0.12	0.03	0.09	0.63	2
	GA-ELM1	0.95	0.62	0.17	0.49	−0.39	8	0.86	0.61	0.15	0.48	−0.50	9
	GA-ELM2	0.97	0.42	0.11	0.33	0.06	9	0.94	0.40	0.10	0.31	0.01	7
	GA-ELM3	0.98	0.40	0.11	0.31	0.10	5	0.94	0.39	0.10	0.30	0.03	5
	GA-ELM4	0.99	0.14	0.04	0.11	0.64	2	0.99	0.15	0.04	0.11	0.58	3
	GA-ELM5	0.99	0.05	0.01	0.04	0.83	1	0.99	0.07	0.02	0.04	0.75	1
	Ro	/	/	/	/	/	/	0.69	1.26	0.33	1.05	−2.06	13
	MK	/	/	/	/	/	/	0.91	1.33	0.34	1.23	−2.11	14
	TAB	/	/	/	/	/	/	0.91	1.50	0.49	1.84	−3.03	15
	IA	/	/	/	/	/	/	0.83	0.74	0.19	0.61	−0.83	12
	PT	/	/	/	/	/	/	0.91	0.72	0.18	0.59	−0.70	11
NSP	ELM1	0.92	0.61	0.19	0.50	−0.47	10	0.76	0.60	0.18	0.48	−0.57	10
	ELM2	0.96	0.42	0.13	0.33	0.00	8	0.90	0.39	0.12	0.31	0.00	8
	ELM3	0.96	0.41	0.12	0.32	0.03	6	0.91	0.38	0.11	0.30	0.04	6
	ELM4	0.99	0.17	0.05	0.13	0.55	4	0.98	0.17	0.05	0.13	0.55	4
	ELM5	0.99	0.10	0.03	0.07	0.71	2	0.99	0.09	0.03	0.07	0.72	2
	GA-ELM1	0.92	0.61	0.19	0.50	−0.47	9	0.77	0.60	0.18	0.47	−0.56	9
	GA-ELM2	0.96	0.42	0.13	0.32	0.00	7	0.90	0.3	0.11	0.31	0.01	7
	GA-ELM3	0.96	0.40	0.12	0.31	0.03	5	0.91	0.38	0.11	0.29	0.05	5
	GA-ELM4	0.99	0.14	0.04	0.11	0.61	3	0.99	0.14	0.04	0.11	0.62	3
	GA-ELM5	0.99	0.03	0.01	0.03	0.84	1	0.999	0.03	0.01	0.03	0.84	1
	Ro	/	/	/	/	/	/	0.59	1.26	0.37	1.07	−2.20	13
	MK	/	/	/	/	/	/	0.74	1.52	0.42	1.31	−2.61	14
	TAB	/	/	/	/	/	/	0.57	1.97	0.55	1.83	−3.86	15
	IA	/	/	/	/	/	/	0.74	0.81	0.22	0.62	−0.99	11
	PT	/	/	/	/	/	/	0.80	0.93	0.26	0.72	−1.19	12
SB	ELM1	0.85	0.70	0.23	0.55	−0.36	10	0.81	0.75	0.22	0.56	−0.45	11
	ELM2	0.92	0.56	0.18	0.41	0.01	7	0.89	0.57	0.17	0.42	0.00	8
	ELM3	0.94	0.54	0.18	0.39	0.08	5	0.90	0.56	0.16	0.42	0.03	6
	ELM4	0.99	0.18	0.06	0.14	0.88	4	0.99	0.18	0.05	0.13	0.92	4
	ELM5	0.99	0.07	0.02	0.06	1.10	2	0.99	0.08	0.03	0.06	1.10	2
	GA-ELM1	0.87	0.70	0.23	0.55	−0.35	9	0.81	0.74	0.22	0.56	−0.44	10
	GA-ELM2	0.92	0.57	0.19	0.42	0.01	8	0.89	0.57	0.17	0.42	0.01	7
	GA-ELM3	0.93	0.54	0.18	0.40	0.07	6	0.90	0.56	0.16	0.41	0.03	5
	GA-ELM4	0.99	0.14	0.04	0.10	0.97	3	0.99	0.13	0.04	0.10	0.99	3
	GA-ELM5	0.99	0.04	0.01	0.03	1.18	1	0.99	0.04	0.01	0.03	2.00	1
	Ro	/	/	/	/	/	/	0.69	1.32	0.39	1.09	−1.8	14
	MK	/	/	/	/	/	/	0.85	1.49	0.44	1.36	−2.2	15
	TAB	/	/	/	/	/	/	0.89	1.26	0.37	1.13	−1.60	13
	IA	/	/	/	/	/	/	0.88	0.69	0.20	0.52	−0.26	9
	PT	/	/	/	/	/	/	0.89	0.88	0.26	0.73	−0.71	12
YGP	ELM1	0.80	0.76	0.23	0.58	−0.93	10	0.71	0.76	0.22	0.58	−0.95	11
	ELM2	0.95	0.35	0.10	0.25	0.10	6	0.93	0.36	0.10	0.26	0.13	6
	ELM3	0.93	0.39	0.11	0.27	0.00	8	0.91	0.41	0.12	0.30	0.00	8
	ELM4	0.99	0.14	0.04	0.11	0.54	4	0.99	0.15	0.04	0.11	0.60	4
	ELM5	0.99	0.10	0.03	0.07	0.64	2	0.99	0.11	0.03	0.08	0.68	2
	GA-ELM1	0.80	0.75	0.23	0.58	−0.92	9	0.71	0.76	0.22	0.58	−0.94	10
	GA-ELM2	0.95	0.34	0.10	0.25	0.10	5	0.93	0.35	0.10	0.26	0.14	5
	GA-ELM3	0.94	0.38	0.11	0.27	0.01	7	0.92	0.39	0.11	0.28	0.04	7
	GA-ELM4	0.99	0.12	0.04	0.09	0.59	3	0.99	0.11	0.03	0.09	0.67	3
	GA-ELM5	0.99	0.07	0.02	0.05	0.71	1	0.99	0.07	0.02	0.05	0.76	1
	Ro	/	/	/	/	/	/	0.77	1.17	0.33	0.92	−1.73	15
	MK	/	/	/	/	/	/	0.88	0.94	0.27	0.78	−1.20	13
	TAB	/	/	/	/	/	/	0.85	0.96	0.28	0.75	−1.23	14
	IA	/	/	/	/	/	/	0.85	0.81	0.25	0.72	−1.02	12
	PT	/	/	/	/	/	/	0.84	0.73	0.22	0.60	−0.80	9
GB	ELM1	0.59	0.77	0.20	0.61	−0.63	10	0.73	0.77	0.19	0.60	−0.40	10
	ELM2	0.87	0.60	0.15	0.46	0.01	6	0.83	0.63	0.15	0.48	0.00	8
	ELM3	0.88	0.61	0.16	0.46	0.01	7	0.83	0.63	0.15	0.47	0.01	6
	ELM4	0.99	0.18	0.05	0.14	0.99	4	0.98	0.18	0.04	0.13	1.06	4
	ELM5	0.99	0.11	0.03	0.07	1.15	2	0.99	0.10	0.03	0.07	1.22	2
	GA-ELM1	0.56	0.62	0.20	0.61	−0.50	9	0.73	0.77	0.19	0.59	−0.39	9
	GA-ELM2	0.89	0.60	0.15	0.46	0.03	5	0.83	0.63	0.15	0.48	0.00	7
	GA-ELM3	0.87	0.61	0.15	0.47	0.01	8	0.84	0.62	0.15	0.47	0.02	5
	GA-ELM4	0.99	0.15	0.04	0.12	1.05	3	0.99	0.15	0.04	0.11	1.12	3
	GA-ELM5	0.99	0.03	0.01	0.03	1.29	1	0.99	0.03	0.01	0.02	1.36	1
	Ro	/	/	/	/	/	/	0.36	1.61	0.39	1.28	−2.48	13
	MK	/	/	/	/	/	/	0.80	1.88	0.45	1.76	−2.87	14
	TAB	/	/	/	/	/	/	0.82	1.54	0.37	1.41	−2.08	12
	IA	/	/	/	/	/	/	0.82	1.01	0.24	0.81	−0.82	11
	PT	/	/	/	/	/	/	0.83	2.64	0.63	2.24	−4.25	15

Table 4. Statistical performance of ET₀ models in Southwest China.

Model	Training						Testing
Model	R²	RMSE (mm·d⁻¹)	RRMSE	MAE	GPI	Rank	R²	RMSE (mm·d⁻¹)	RRMSE	MAE	GPI	Rank
ELM1	0.9409	0.4093	0.1282	0.3425	0.2608	10	0.7311	0.7744	0.1873	0.5979	−0.3987	10
ELM2	0.9887	0.1944	0.0604	0.1520	0.2316	8	0.8279	0.6285	0.1520	0.4772	0.0000	8
ELM3	0.9874	0.1724	0.0536	0.1316	0.1934	6	0.8344	0.6263	0.1514	0.4742	0.0124	6
ELM4	0.9957	0.1077	0.0335	0.0861	0.1878	4	0.9843	0.1769	0.0428	0.1321	1.0623	4
ELM5	0.9977	0.0886	0.0275	0.0694	0.0541	2	0.9947	0.1034	0.0251	0.0720	1.2241	2
GA-ELM1	0.9450	0.4080	0.1278	0.3340	0.0492	9	0.7325	0.7707	0.1863	0.5935	−0.3882	9
GA-ELM2	0.9886	0.1898	0.0590	0.1476	0.0116	7	0.8288	0.6267	0.1516	0.4760	0.0043	7
GA-ELM3	0.9905	0.1716	0.0534	0.1308	0.0013	5	0.8359	0.6238	0.1508	0.4704	0.0207	5
GA-ELM4	0.9965	0.1053	0.0327	0.0845	−0.5054	3	0.9892	0.1460	0.0353	0.1143	1.1235	3
GA-ELM5	0.9985	0.0746	0.0232	0.0593	−0.5197	1	0.9995	0.0326	0.0079	0.0244	1.3645	1
Ro	/	/	/	/	/	/	0.3605	1.6057	0.3875	1.2791	−2.4820	13
MK	/	/	/	/	/	/	0.7965	1.8755	0.4532	1.7643	−2.8667	14
TAB	/	/	/	/	/	/	0.8177	1.5410	0.3722	1.4144	−2.0800	12
IA	/	/	/	/	/	/	0.8212	1.0084	0.2437	0.8147	−0.8158	11
PT	/	/	/	/	/	/	0.8266	2.6417	0.6319	2.2367	−4.2538	15

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, Q.; Wu, Z.; Cui, N.; Zhang, W.; Wang, Y.; Hu, X.; Gong, D.; Zheng, S. Genetic Algorithm-Optimized Extreme Learning Machine Model for Estimating Daily Reference Evapotranspiration in Southwest China. Atmosphere 2022, 13, 971. https://doi.org/10.3390/atmos13060971
