Next Article in Journal
A Comparison of Ensemble and Deep Learning Algorithms to Model Groundwater Levels in a Data-Scarce Aquifer of Southern Africa
Previous Article in Journal
A Review on Evapotranspiration Estimation in Agricultural Water Management: Past, Present, and Future
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multiple Linear Regression Models with Limited Data for the Prediction of Reference Evapotranspiration of the Peloponnese, Greece

by
Stavroula Dimitriadou
and
Konstantinos G. Nikolakopoulos
*
Department of Geology, University of Patras, 26504 Patras, Greece
*
Author to whom correspondence should be addressed.
Hydrology 2022, 9(7), 124; https://doi.org/10.3390/hydrology9070124
Submission received: 31 May 2022 / Revised: 8 July 2022 / Accepted: 11 July 2022 / Published: 12 July 2022

Abstract

:
The aim of this study was to investigate the utility of multiple linear regression (MLR) for the estimation of reference evapotranspiration (ETo) of the Peloponnese, Greece, for two representative months of winter and summer during 2016–2019. Another objective was to test the number of inputs needed for satisfactorily accurate estimates via MLR. Datasets from sixty-two meteorological stations were exploited. The available independent variables were sunshine hours (N), mean temperature (Tmean), solar radiation (Rs), net radiation (Rn), wind speed (u2), vapour pressure deficit (es − ea), and altitude (Z). Sixteen MLR models were tested and compared to the corresponding ETo estimates computed by FAO-56 Penman–Monteith (FAO PM) in a previous study, via statistical indices of error and agreement. The MLR5 model with five input variables outperformed the other models (RMSE = 0.28 mm d−1, adj. R2 = 98.1%). Half of the tested models (two to six inputs) exhibited very satisfactory predictions. Models of one input (e.g., N, Rn) were also promising. However, the MLR with u2 as the sole input variable presented the worst performance, probably because its relationship with ETo cannot be linearly described. The results indicate that MLR has the potential to produce very good predictive models of ETo for the Peloponnese, based on the literature standards.

1. Introduction

Reference evapotranspiration (ETo) as a climate parameter plays a pivotal role in climate crisis research and in water resources management. There are complex interactions between evapotranspiration and key environmental components, such as groundwater and streamflow, as well as with anthropogenic and climatic impacts, such as wildfires, air pollution, land use/land cover (LULC) change, crop intensification, and construction [1,2,3,4]. Moreover, ETo is directly related to the productive sector of agriculture, since crop water needs are usually estimated as a function of ETo with a crop-specific coefficient [5]. Irrigation system design and precision irrigation techniques demand accurate determination of ETo [6,7]. Since the management of finite resources such as water is challenging, the exploitation of accurate predictive tools of ETo, which are still easy to apply, is of major importance in an interdisciplinary context [8,9,10]. Another important contribution of ETo is its use in computing actual evapotranspiration, which is difficult to acquire [11,12].
The measurement of the ETo is demanding. Therefore, several methods for estimating ETo have been developed, ranging from simple empirical or physically based models [13,14] to complex algorithms and techniques, such as fuzzy logic and machine learning (ML) [15,16,17,18,19,20,21]. These methods employ data from meteorological stations, or retrieved data via remote sensors [22,23,24,25,26,27,28,29,30,31,32]. The FAO-56 Penman–Monteith (FAO PM) equation (Equation (1)) is the most established method used to compute ETo worldwide. However, it requires a large number of meteorological variables, which are not always easy to retrieve [33,34].
ETo = 0.408 Δ Rn G + γ 900 T + 273 u 2 es ea Δ + γ 1 + 0.34 u 2
where Rn is the net radiation (MJ m−2 d−1), G is the soil heat flux (MJ m−2 d−1), T is the air temperature (°C), es − ea is the vapour pressure deficit (kPa), Δ is the slope of the vapour pressure curve (kPa °C−1), u2 is the wind speed at 2 m distance from the surface (m s−1), and γ is the psychrometric constant (kPa °C−1) [33].
On the other hand, there is a handful of empirical methods in the literature utilising limited climatic parameters (e.g., the Hargreaves–Samani equation [35]). However, FAO PM usually outperforms them in terms of accuracy and consistency, especially over different seasons of the year [36]. As a general rule, the level of prediction accuracy generally improves with the increase in input parameters [37]. The cost of the necessary sensors, the equipment, and the maintenance of meteorological stations, along with the difficult environmental conditions at several regions, result in data unavailability or at least scarcity, especially for the developing countries. Therefore, an effort to reduce the quantity of data required for predictive models is highly recommended, provided that satisfactory accuracy is secured [38]. Air temperature (minimum, maximum, and mean values) is the most basic parameter. In a parsimonious prospect, parameters that are suitable explanatory variables for ETo modelling can be easily measured (i.e., air temperature), or can be either measured or computed (i.e., wind speed, solar and net radiation, sunshine hours, vapour pressure deficit). The computation of any missing data can be carried out using air temperature, Julian day, and latitude, according to FAO guidelines [33,39,40].
Among the highly complex methods that have been tested lately, multiple linear regression (MLR) has been suggested by a number of recent studies as an adequate method for estimating climatic parameters such as air temperature, growing degree days, and solar radiation [41,42,43,44]. Remotely sensed data, especially MODIS and Landsat products, have gained ground in MLR modelling lately, as inputs for the estimation of climatic parameters, such as land surface temperature and surface urban heat island [45,46,47]. The added value of MLR is that it is a simple, comprehensible method, producing easily interpretable results, while requiring short real-time control and low computational effort [48]. The challenge lies in the ability of MLR to produce sufficiently accurate predictions of ETo over an area with inhomogeneous LULC types and relief, and which presents distinguished differences in meteorological parameters over short distances (http://climatlas.hnms.gr/sdi/?lang=EN (accessed on 3 May 2022)) [36].
The recent relevant literature is rather limited compared to more complex methods such as artificial neural networks (ANNs). Perugu et al. developed MLR models to estimate ETo for Andhra Pradesh, India, based on temperature, sunshine hours, wind speed, and relative humidity as independent variables [49]. They put forward these models for regions with similar climatic conditions. The regression models developed earlier by Mallikarjuna et al. for the same area, utilizing the same explanatory variables, also showed a satisfactory performance in ETo estimation at daily scale (i.e., R2 between 89.1–97% and RMSE between 0.26–0.49 mm d−1) [50]. Khoshravesh et al. applied non-linear and Bayesian regression models for three arid regions in Iran, and the best results they yielded were R2 > 95% and a RMSE of about 4.00 mm d−1 [51]. Tabari et al. evaluated the performances of neuro-fuzzy models, multiple linear regression, and multiple non-linear regression models, along with temperature-based and radiation-based models for a semi-arid region in Iran [52]. The model with mean temperature, solar radiation, wind speed, and relative humidity as inputs displayed the best performance (RMSE = 0.552 mm d−1, R2 = 96%, and MAE = 0.442 mm d1). The model with only mean air temperature as an input produced the poorest estimates. The non-linear regression improved RMSE, R2, and MAE to 0.452 mm d−1, 97.6%, and 0.331 mm d−1, respectively. These values were considered acceptable for MLR predictions of ETo in a semi-arid region [52]. Shirsath and Singh found that all MLR models outperformed the climate-based Penman, Priestley–Taylor, and Stephens and Stewart models in predicting daily pan evaporation, although the best values of the evaluating measures were R2 = 61% and RMSE = 1.597 mm d−1 [53]. These MLR models were slightly less accurate than the corresponding comparative ANN models. Sanford et al. developed MLR equation with climate parameters (minimum and maximum temperature and precipitation) at an annual scale (1971–2000) for Virginia watersheds with R2 = 84.4% [54]. The MLR equation was modified in a following study using these parameters to estimate the ET/P ratio, yielding an R2 value of 86.74% and an RMSE = 0.067 mm d−1 for the best-fit parameters for the conterminous US [55]. The obtained values were considered satisfactory for explaining most of the variation in long-term average ETo across the conterminous US [55]. Although most recent articles do not employ MLR to the assessed models, Niaghi et al. applied MLR to datasets of six stations at Red River Valley [56]. They grouped the input combinations into air temperature-based (Tmax, Tmin), mass transfer-based (Tmax, Tmin, wind speed), and radiation-based (solar radiation, Tmax, Tmin) measurements. The best performance for MLR models was for the latter inputs (RMSE = 0.68 mm d−1, MAE = 0.51 mm d−1, and R2 = 88%). The MLR models were not sensitive to differences between local and spatial applications compared to the heuristic models, due to their inability to depict complex relationships [56]. Ohana-Levi et al. found that the six linear regression models had much higher RMSE values than the corresponding non-linear multivariate adaptive regression spline (MARS) models [57]. The correlation coefficients during training and testing were 0.84 and 0.85, respectively, for the linear models [57]. In MLR models where leaf area index (LAI) was added as an explanatory variable, the testing RMSE profoundly improved when using the Kc approach (the RMSE of 1.05 mm d−1 without LAI reduced to 0.81 mm d−1) [57]. Sharafi and Ghaleni compared the performance of six MLR models to empirical methods. They found that all the MLR models outperformed the latter [58]. The obtained RMSE values, ranging between 0.36 mm d−1 and 0.50 mm d−1, were considered excellent, so that MLR was recommended for all climate types of Iran, from arid desert to humid [58]. An MLR model of precipitation, temperature, soil moisture, and NDVI as explanatory variables was applied to four basins in India [59]. The ETo variance was explained by 82−91% (adj. R2) via the utilised MLR model [59]. The MLR stepwise modelling applied at the Inner Mongolia Autonomous Region (1971–2016) demonstrated that the ETo values in different regions were affected by different climatic parameters [60]. It is noteworthy that ETo was found insensitive to mean air temperature for the whole area [60]. For the Megecha catchment in Ethiopia, sunshine hours, wind speed, maximum temperature, and relative humidity were proven to be the best explanatory variables of ETo in MLR modelling [61].
This study examines sixteen MLR models utilizing data from sixty-two meteorological stations in the Peloponnese, a southwestern region of Greece (Table A1). The Peloponnese constitutes a challenging testbed for ETo research, since it presents considerable inhomogeneities regarding relief, LULC, altitude, etc. The empirical methods of a previous study exhibited inconsistency in terms of accuracy, with larger errors found for August (summertime) [36]. As described, MLR is not very popular in recent literature of ETo. This is probably because of the complex nature of ETo. However, the added value of MLR is the simplicity and the comprehensibility of the method and the equations produced. Furthermore, it is of importance to examine whether limited data (even one sole input variable) can predict ETo satisfactorily. The latter would be useful for cases with sparse data availability. Therefore, MLR for ETo estimation needs to be further investigated since it would be a promising alternative, being easily applicable and interpretable for interdisciplinary research and management purposes.

2. Materials and Methods

2.1. The Study Area

The Peloponnese peninsula of southwest Greece occupies about 1/6th of the Greek territory (21,439 km2), with a population of 1,086,935 (census 2011; https://www.statistics.gr/el/statistics/-/publication/SAM03/2011 (accessed on 19 April 2022)). The area is mostly hilly and mountainous. Moving from coastline to the mainland, the altitude reaches 2407 m. Its lithology, along with tectonic activity and climatic conditions, has resulted in the formation of relief. The hydrographic network is well-developed, though with few large rivers [62]. The most populated urban area is located at the northernmost edge, while the broadest plain covers the northwest. Except from urban areas, which are sparsely distributed primarily along the coastline, the main LULC types are forests, transitional vegetation, and various crop plots (Figure 1) [63]. According to the Köppen–Geiger classification, Peloponnese’s climate is Mediterranean, warm, with dry summers and mild winters (classified as Csa) [64]. The annual average (1971–2000) precipitation, air temperature, and sunshine hours range between 400 to over 2000 mm, 8–20 °C, and 1900–3100 h, respectively (http://climatlas.hnms.gr/sdi/?lang=EN (accessed on 3 May 2022)).

2.2. Methods

Ground-based datasets of daily scale from sixty-two meteorological stations running under the National Observatory of Athens, for the months August and December of 2016–2019 were utilised (Figure 1, Table A1). The selected period exhibits interest in the context of climate crisis, since the two warmest years (2016 and 2019) since the pre-industrial era are included (https://climate.copernicus.eu/copernicus-2019-was-second-warmest-year-and-last-five-years-were-warmest-record (accessed on 23 June 2022)). August and December were selected as representative months of summertime and wintertime respectively (http://climatlas.hnms.gr/sdi/?lang=EN (accessed on 3 May 2022)), in methodological consistency with our previous study [36], aiming the results to be directly comparable. In the former study, ETo was computed for the Peloponnese by FAO PM, which serves as our reference method. In this study, multiple linear stepwise regression has been applied with seven explanatory (independent) variables, namely mean air temperature (Tmean), wind speed at 2 m distance from the surface (u2), solar radiation (Rs), net radiation (Rn), sunshine hours (N), and vapour pressure deficit (es − ea). In addition to the climate variables, altitude (Z) was also used as an input variable, since it indirectly affects ETo [65]. Rs, Rn, N, and es − ea were previously calculated as in Zanetti et al. [40], based on FAO guidelines for missing climate data (functions of Julian date and station latitude) [33]. Since the need to limit the required variables in ETo modelling is underlined in the recent literature [66], the possibility of only a few variables or even only the most easy-to-acquire variable, Tmean, to produce satisfactorily accurate predictions of ETo for the Peloponnese was investigated. MLR as a statistical technique employs several explanatory variables and one response (dependent) variable, which is ETo in the present study. The aim of MLR is to model the linear relationship between the explanatory variables and the response variable, as an extension of ordinary least squares regression, since it incorporates more than one explanatory variable. In other words, MLR aims to find the linear function that minimizes the sum of the squares of errors (SSE) between the observed and the predicted data. An advantage of this method is the easy interpretation of the coefficients, which are generated in the model with low computational effort, in comparison to more complex techniques, such as energy balance methods and artificial intelligence algorithms [13,14,15,16,17,18,19,20,21,24,25,26,27,28,29,30,37,38,39,40,41,42,43,67,68,69,70,71,72,73,74,75]. For the MLR model, the response (dependent) variable y is assumed to be a function of k independent variables xi. The general form of the equation is computed as follows (Equation (2)):
y i = b 0 + b 1 x 1 + + b k x k + e
where b0 and bi stand for the fitting constants; xi represents the ith observation of each of the explanatory variables, yi stands for the ith prediction of ETo, and ei is a random error term representing the remaining effects of variables on y, which are not covered by the model (residuals). The least squares criterion for the minimum sum of squares of error terms is usually applied to determine the fitting constants [61]. Stepwise MLR has the advantage of presenting the MLR models, beginning from the most influential parameter of the input combination (which explains the larger percentage of ETo variability, expressed by adj. R2) [60]. It presents the models in ascending order of adj. R2, and it may omit several input variables in case they do not meaningfully contribute to the variability explanation. This meaningful contribution of one variable is determined via the adj. R2, as opposed to R2, which generally increases with the number of inputs. Therefore, adj. R2 and R2 are not directly comparable, since the latter would generally have a greater value than the former (Table 1).
The predicted ETo values from the regression models were then compared against the values of FAO PM via statistical indices (Table 1). Specifically, the error levels between predicted and reference values were computed via measures such as root mean square error (RMSE, mm d−1), normalised root mean square error (NRMSE, %), mean absolute error (MAE, mm d−1), and standard error of the predicted value (SE, mm d−1) (Table 1, Equations (2)–(4)). The error values follow the magnitude of the computed ETo values, except for NRMSE, which is expressed in *100% (Table 1, Equation (3)). Moreover, two measures that express the percentage of ETo variability explained by the independent variables of the model and the agreement between predicted and reference values were utilised, namely the adjusted coefficient of determination (adj. R2) and the index of agreement (IoA) (Table 1, Equations (5) and (7)). These measures are suitable for statistical analyses of evapotranspiration, [57,76,77]. At last, residual analysis was performed using the Durbin–Watson index (D–W), Cook’s Distance (Cook’s D), and Centred Leverage. D–W is used to detect the presence of autocorrelation in the residuals. Values around 2.0 (usually between 1.50 and 2.50) express normality, meaning negligible correlation among residuals. Lower (higher) values express positive (negative) correlation. Cook’s D values greater than 0.5 indicate potential outlier, and C. Leverage (between 0 and (k − 1)/k) is used to detect particular influential points.
In Table 1, pi stands for the ith value predicted by the regression model, ri stands for the ith reference value computed by FAO PM, r ¯ is the mean reference value, k is the number of the independent variables, and the sample size (n) is 290.

3. Results

MLR models with all the potential input combinations, from one to six explanatory variables, were tested. In total, sixteen models were examined. Models with altitude or latitude as sole input variables were not applicable, whereas models with seven inputs exhibited statistically insignificant results (p > 0.05). The obtained results are presented in Table 2.
As portrayed in Table 2, half of the tested models (8) yielded RMSE values below 0.35 mm d−1 and adj. R2 values greater than 97.1% (Table 2). Among these models, MLR4, MLR5, and MLR6 obtained an RMSE below 0.29 mm d−1, and an adj. R2 between 98% and 98.1%. The results for the majority of the models are statistically significant at the 99% confidence level. Only four models (MLR10, MLR11, MLR13, MLR15) yielded one coefficient that was statistically significant at the 95% confidence level. Regarding the statistical analysis of the residuals performed for each model, C. Leverage and Cook’s D were generally low, as were the SEs of the predicted values (Table 2).
MLR5 and MLR6 with five and six independent variables, respectively, have the (same) highest adj. R2 (98.1%). As anticipated, MLR6, which includes one extra independent variable, obtained slightly better RMSE/NRMSE values than MLR5 (0.276 mm d−1/8.2% vs. 0.280 mm d−1/8.3%). The two models have the same MAE (0.206 mm d−1) and IoA (99.5%). The model with the highest adj. R2 (98.1%) with the fewer inputs is therefore MLR5, which is a function of N, Tmean, u2, es − ea, and Rn. The former means that these five variables play a significant role in the prediction of ETo for the Peloponnese. The addition of altitude (Z) as an extra independent variable of the model (MLR6) deteriorated the aforementioned indices, without increasing the explanation of the variance (adj. R2 = 98.1%). The contribution of Z to the adj. R2 change was about 0.03%. The equation produced by the model with the best performance (MLR5) is presented in Table 3.
Models MLR1, MLR7, MLR14, MLR15, and MLR16 were one-input models, which means simple linear regression models. The climate variables, namely N, Tmean, Rn, Rs, es − ea, and u2 were the inputs of the aforementioned models, respectively. Among these models, MLR1 (N input) produced the best results (adj. R2 = 96.0% and RMSE = 0.409 mm d−1), followed by MLR7 (Rn input) (Table 2). The Tmean input model (MLR12) exhibited inferior performance. The produced MLR equations for the two best one-input models are displayed in Table 3. The model with the poorest performance was MLR15, with u2 as the sole explanatory variable, with an adj. R2 equal to 3%. The model with es − ea as the sole input (MLR16) also showed poor performance (Table 2).

4. Discussion

The ETo plays a critical role in the hydrological cycle, the climate crisis, as well as in water resource management and irrigation design. The established methods of estimation require the availability of a wide range of climate parameters, which is a serious drawback. MLR has been examined in search of a simple, affordable, and satisfactorily accurate modelling technique of ETo. Therefore, datasets of Tmean, Rn, Rs, es − ea, and u2, retrieved from sixty-two meteorological stations for the Peloponnese (Table A1) have been utilised. The altitude of the stations was also employed, since it affects ETo [65]. The months August and December of 2016–2019 were selected as typical months of summertime and wintertime, respectively, with considerable differences in climate variables between them (http://climatlas.hnms.gr/sdi/?lang=EN (accessed on 3 May 2022)). Furthermore, these two months were selected in methodological consistency with a previous study [36], the FAO PM estimates of which served as reference values for this study. MLR models with several input combinations of the seven available input parameters were tested. In total, sixteen models were examined, ranging from one to six explanatory variables, since models with seven inputs exhibited statistically insignificant results (p > 0.05), and were therefore omitted. As displayed in Table 2, the results of the majority of the tested models are significant at the 99% level of confidence. Models with altitude or latitude as sole input variables were not applicable. As a general rule, the values of error and agreement improve with an ascending number of input parameters. However, the magnitude of change differentiates based on the influence of each added parameter on ETo. As anticipated, the best performance was exhibited by the model with the most (six) inputs (MLR6). However, the MLR5 model is most recommended, since the extra parameter of MLR6 (Z) contributes to the interpretation of the variance only by 0.03% (adj. R2 change). The variance of the ETo observations is explained by 98.1% in both cases. For MLR5, RMSE is only 0.280 mm d−1 and MAE is 0.206 mm d−1, with an agreement between the FAO PM and the MLR5 ETo values of 99.5% (Table 2). Consequently, MLR5 is a satisfactorily accurate tool for predicting ETo for August and December of 2016–2019, with a simple linear formula (Table 3). The major contribution of sunshine hours to the explanation of the data in MLR is in line with the results of Yirga for Ethiopia [61].
An interesting group of MLR models demonstrated in Table 2 are those with one sole independent variable as an input. In this case, the MLR transmits to simple linear regression. From a parsimonious perspective, the potential of only one variable being able to satisfactorily represent the ETo values would be useful in terms of (low) complexity, computational load and cost, as well as of increased applicability. The usefulness is amplified due to the rather inhomogeneous characteristics of the region, and the selected months that belong to different seasons. The only prerequisite, in this case, is that input data are available or easily retrieved. Among one-input models, MLR1 with N as the input variable exhibited the best performance, with 96% of the ETo values explained and an IoA of 99%. The RMSE/MAE and NMRSE are equal to 0.409/0.320 mm d−1 and 12.1%, respectively (Table 2). According to the literature, these values are considered very good to excellent in terms of accuracy [58]. Sunshine hours is a parameter that was computed according to FAO guidelines for missing data, utilizing Julian day and station latitude [33], which might be a limitation in terms of accuracy compared to direct measurements. The methodological choice to include parameters computed based on measured parameters and the FAO procedure in ETo modelling is a common practice (e.g., in ANNs for ETo [40]). The second-best one-input model is MLR7, which is radiation-based (Rn). It explains data variability by 95.5%, with an IoA equal to 98.9%. The RMSE/MAE and NRMSE are 0.429/0.323 mm d−1 and 12.7%, respectively. Residual statistical values (i.e., Cook’s D, C. Leverage) were almost the same as those of MLR1, except from the D–W value, which was better (2.09), (Table 2). The latter value indicates that the residuals belong to a normal distribution.
However, the most easy-to-acquire datasets globally, by either ground-based data or remote sensing, are those of Tmean. MLR12 is a temperature-based, one-input model (Tmean). The variability explained (91.8%) is lower than those explained by N input and Rn input models, but is the same as the corresponding of the Rs input model (MLR14). The RMSE/MAE and NRMSE are equal to 0.582/0.461 mm d−1 and 17.2%. The poor performance of the Tmean input MLR model is also reported by Tabari et al. for Iran [52]. MLR14 (Rs) produced the same or slightly better values (MAE = 0.441 mm d−1) than the Tmean input model (Table 2). However, these values are almost identical to the results of Tabari et al. for the best MLR (four-input) model [52]. This accuracy is not satisfactory for applications such as precise irrigation design, but it remains a useful approach. It is noteworthy that half of the tested models (two to six inputs) yielded an RMSE below 0.35 mm d−1, three of which were even below 0.29 mm d−1, with adj. R2 ≥ 97.1%. The produced linear equations for MLR5, and MLR1 and MLR7, which are, respectively, the model with the best performance, and the two best one-input models, are simple and demand low computational effort and a short time to apply (Table 3). These findings provide flexible potential choices regarding the available datasets. The worst performance was exhibited by MLR15, with u2 as the sole explanatory variable. This model explains only 3% of the ETo observations. The error indices are not acceptable either (RMSE > 2.00 mm d−1, MAE = 1.953, NRMSE = 59.2%, and the SE of the predicted values was greater than 15%) (Table 2). The literature reports that near-surface wind speed is an influential parameter of ETo variability [78]. The influential role of u2 in our study area was also noted in our previous work on ETo variability across the Peloponnese [36]. It was found that, mostly in August, (wind speed is generally very low in summer), in cases where increased u2 values occurred, ETo was directly affected. This deduction is confirmed by the latest study on ETo across the Peloponnese, in which u2 was proven to be the most influential parameter after Tmean [73]. In conclusion, it is probable that the relationship between u2 and ETo is non-linear, thus the MLR model would not depict the established relationship. This inability of MLR models to portray complex relationships is anticipated due to their linear character, and has been denoted in the relevant literature [56].

5. Conclusions

MLR was employed to predict ETo with seven available climate parameters as inputs. Sixteen models, with all the potential input combinations, displayed statistically significant results (p ≤ 0.05). Among the tested models, MLR5 with five explanatory variables exhibited the best performance with less than the available input parameters (p ≤ 0.001). Therefore, MLR5, which constitutes a linear function of N, Tmean, Rn, es − ea, and u2, is recommended as a potential tool to predict ETo for the Peloponnese, after further investigation. Eight out of the sixteen tested models, with two to six inputs, displayed an RMSE below 0.35 mm d−1, and explained the variance of ETo at least by 97.1%. Moreover, three of these models yielded an RMSE below 0.29 mm d−1, and explained the variance by 98−98.1%.
Another noteworthy group of MLR models examined is the one-input models, which are simple linear equations. The N input model (MLR1) outperformed the rest (RMSE = 0.409 mm d−1), while it explained the variance of the ETo by 96%. The Rn model (MLR7) followed closely, with satisfactory results and data variance representation (RMSE = 0.429 mm d−1, adj. R2 = 95.5%). Probably, sunshine hours and net radiation significantly affect the ETo of the Peloponnese. The Tmean input model exhibited poorer performance. On the other hand, the model with the worst performance was MLR15, with u2 as the sole input. Based on previous research, which reported the influential role of u2 on the ETo for the Peloponnese, it seems that MLR could not capture the complex, non-linear relationship between u2 and ETo. Provided that the Peloponnese is an area with distinguished differences over short distances, and that the regimes of the examined months vary considerably, these models have the potential to provide useful tools with flexibility regarding input data, applicable for interdisciplinary purposes. It is, hence, suggested that MLR models should be tested for longer time periods and larger areas in Greece, aiming at generalisation.

Author Contributions

Conceptualisation, S.D. and K.G.N.; methodology, S.D. and K.G.N.; software, S.D.; validation S.D.; formal analysis, S.D.; investigation, S.D., data curation, S.D., writing—original draft preparation, S.D., writing and editing, S.D. and K.G.N.; supervision, K.G.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are private, available only for the review process of the present article.

Acknowledgments

The authors acknowledge the National Observatory of Athens (https://meteosearch.meteo.gr (accessed on 15 April 2022).) for ground-based data availability of sixty-two meteorological stations.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Meteorological stations (62) used (source: https://meteosearch.meteo.gr (accessed on 15 April 2022)).
Table A1. Meteorological stations (62) used (source: https://meteosearch.meteo.gr (accessed on 15 April 2022)).
IDStationXYElevation (m)MunicipalityIDStationXYElevation (m)Municipality
Meteorological Stations for the 3 Empirical Methods (ETo)Meteorological Stations for the 3 Empirical Methods (ETo)
1Kalavrita334,349.94,210,128781Achaia32Oleni282,783.44,177,87261Ilia
2Kato Vlassia317,683.44,2085,58773Achaia33Pineia285,425.34,191,240184Ilia
3Lappa273,5504,218,92815Achaia34Pirgos273,886.94,171,89122Ilia
4Olenia288,845.14,221,65434Achaia35Vartholomio253,773.84,193,12715Ilia
5Panachaiko313,491.44,235,8001588Achaia36Zacharo290,302.64,150,8065Ilia
6Panagopoula318,709.54,243,84215Achaia37Amoni Sofikou424,227.54,186,89855Korinthia
7Panepistimio305,972.34,239,28966Achaia38Derveni362,057.14,221,7375Korinthia
8Patra301,697.84,236,6946Achaia39Isthmos408,645.44,200,4996Korinthia
9Rio305,898.14,242,1772Achaia40Kiato389,163.54,207,72215Korinthia
10Romanos313,476.14,235,744228Achaia41Krioneri378,491.94,203,310887Korinthia
11Sageika280,638.44,219,57526Achaia42Loutraki410,248.74,202,63630Korinthia
12Argos386,329.14,165,05938Argolida43Nemea381,197.94,188,976290Korinthia
13Didima426,936.94,146,702175Argolida44Perigiali397,303.14,199,34438Korinthia
14Kranidi424,615.74,137,411110Argolida45Trikala Korinthias365,493.74,206,8351077Korinthia
15Lagadia326,139.94,172,057970Arkadia46Agioi Theodoroi423,533.64,198,39537Korinthia
16Levidi349,386.54,171,330853Arkadia47Apidia392,819.74,082,655230Lakonia
17Lykochia337,772.64,151,113870Arkadia48Asteri386,527.14,076,7578Lakonia
18Magouliana334,497.74,171,2751256Arkadia49Geraki384,706.64,094,508330Lakonia
19Megalopoli335,3634,140,782432Arkadia50Krokees371,576.24,082,640241Lakonia
20Stemnitsa330,377.84,157,9671094Arkadia51Molaoi397,984.64,072,957128Lakonia
21Tripoli359,989.34,152,250650Arkadia52Monemvasia413,811.44,059,05117Lakonia
22Vytina339,989.84,170,4091013Arkadia53Sparti360,929.94,101,670204Lakonia
23Spetses424,919.54,124,6623Attiki54Alagonia343,840.94,107,863765Messinia
24Taktikoupoli Troizinias443,373.24,152,37415Attiki55Arfara326,299.44,113,66696Messinia
25Ydra452,645.84,133,7272Attiki56Filiatra285,439.94,115,17565Messinia
26Amaliada264,604.94,186,92326Ilia57Kalamata331,1274,098,9745Messinia
27Andritsaina314,220.34,152,125731Ilia58Kalamata Dytika329,347.34,100,00110Messinia
28Archaia Olympia287,981.34,163,85645Ilia59Kardamili347,857.74,074,65113Messinia
29Foloi297,082.74,174,732600Ilia60Kopanaki306,288.64,128,741184Messinia
30Katakolo263,537.24,169,3272Ilia61Kyparissia291,6914,123,58436Messinia
31Lampeia306,840.34,192,041840Ilia62Pylos294,556.84,087,5905Messinia

References

  1. Malamos, N.; Tegos, A. Advances in Evaporation and Evaporative Demand. Hydrology 2022, 9, 78. [Google Scholar] [CrossRef]
  2. Shamshirband, S.; Hashemi, S.; Salimi, H.; Samadianfard, S.; Asadi, E.; Shadkani, S.; Kargar, K.; Mosavi, A.; Nabipour, N.; Chau, K.W. Predicting Standardized Streamflow index for hydrological drought using machine learning models. Eng. Appl. Comput. Fluid Mech. 2020, 14, 339–350. [Google Scholar] [CrossRef]
  3. Xu, S.; Yu, Z.; Yang, C.; Ji, X.; Zhang, K. Trends in evapotranspiration and their responses to climate change and vegetation greening over the upper reaches of the Yellow River Basin. Agric. For. Meteorol. 2018, 263, 118–129. [Google Scholar] [CrossRef]
  4. Dimitriadou, S.; Nikolakopoulos, K.G. Evapotranspiration Trends and Interactions in Light of the Anthropogenic Footprint and the Climate Crisis: A Review. Hydrology 2021, 8, 163. [Google Scholar] [CrossRef]
  5. Djaman, K.; Balde, A.B.; Sow, A.; Muller, B.; Irmak, S.; N’Diaye, M.K.; Manneh, B.; Moukoumbi, Y.D.; Futakuchi, K.; Saito, K. Evaluation of sixteen reference evapotranspiration methods under sahelian conditions in the Senegal River Valley. J. Hydrol. Reg. Stud. 2015, 3, 139–159. [Google Scholar] [CrossRef] [Green Version]
  6. Pereira, L.S.; Perrier, A.; Allen, R.G.; Alves, I. Evapotranspiration: Concepts and Future Trends. J. Irrig. Drain. Eng. 1999, 125, 45–51. [Google Scholar] [CrossRef]
  7. Malamos, N.; Tsirogiannis, I.L.; Tegos, A.; Efstratiadis, A.; Koutsoyiannis, D. Spatial interpolation of potential evapotranspiration for precision irrigation purposes. Eur. Water 2017, 59, 303–309. Available online: https://www.researchgate.net/publication/323030147_Spatial_interpolation_of_potential_evapotranspiration_for_precision_irrigation_purposes (accessed on 19 May 2022).
  8. Saggi, M.K.; Jain, S. Reference evapotranspiration estimation and modelling of the Punjab Northern India using deep learning. Comput. Electron. Agric. 2019, 156, 387–398. [Google Scholar] [CrossRef]
  9. Zhang, Y.; Wei, Z.; Zhang, L.; Du, J. Applicability evaluation of different algorithms for daily reference evapotranspiration model in KBE system. Int. J. Comput. Sci. Eng. 2019, 18, 361–374. [Google Scholar] [CrossRef]
  10. Skarlatos, D.; Kalavrouziotis, I.K.; Montes, C.R.; Melfi, A.J.; Pereira, B.F.F. Wastewater reuse in citrus: A fuzzy logic model for optimum evapotranspiration. Desalination Water Treat. 2015, 55, 315–324. [Google Scholar] [CrossRef]
  11. Ma, N.; Szilagyi, J.; Zhang, Y. Calibration-free complementary relationship estimates terrestrial evapotranspiration globally. Water Resour. Res. 2021, 57, e2021WR029691. [Google Scholar] [CrossRef]
  12. Ma, N.; Szilagyi, J.; Zhang, Y.; Liu, W. Complementary-relationship-based modeling of terrestrial evapotranspiration across China during 1982–2012: Validations and spatiotemporal analyses. J. Geophys. Res. Atmos. 2019, 124, 4326–4351. [Google Scholar] [CrossRef]
  13. Li, M.; Chu, R.; Islam, A.R.M.T.; Shen, S. Reference Evapotranspiration Variation Analysis and Its Approaches Evaluation of 13 Empirical Models in Sub-Humid and Humid Regions: A Case Study of the Huai River Basin, Eastern China. Water 2018, 10, 493. [Google Scholar] [CrossRef] [Green Version]
  14. Tabari, H.; Grismer, M.E.; Trajkovic, S. Comparative analysis of 31 reference evapotranspiration methods under humid conditions. Irrig. Sci. 2013, 31, 107–117. [Google Scholar] [CrossRef]
  15. Roy, D.K.; Sarkar, T.K.; Kamar, S.S.A.; Goswami, T.; Muktadir, M.A.; Al-Ghobari, H.M.; Alataway, A.; Dewidar, A.Z.; El-Shafei, A.A.; Mattar, M.A. Daily Prediction and Multi-Step forward Forecasting of Reference Evapotranspiration Using LSTM and Bi-LSTM Models. Agronomy 2022, 12, 594. [Google Scholar] [CrossRef]
  16. Ferreira, L.B.; da Cunha, F.F. Multi-step ahead forecasting of daily reference evapotranspiration using deep learning. Comput. Electron. Agric. 2020, 178, 105728. [Google Scholar] [CrossRef]
  17. Ravindran, S.M.; Bhaskaran, S.K.; Ambat, S.K. A Deep Neural Network Architecture to Model Reference Evapotranspiration Using a Single Input Meteorological Parameter. Environ. Process. 2021, 8, 1567–1599. [Google Scholar] [CrossRef]
  18. Granata, F.; Di Nunno, F. Forecasting evapotranspiration in different climates using ensembles of recurrent neural networks. Agric. Water Manag. 2021, 255, 107040. [Google Scholar] [CrossRef]
  19. Mosavi, A.; Edalatifar, M. A Hybrid Neuro-Fuzzy Algorithm for Prediction of Reference Evapotranspiration. In Recent Advances in Technology Research and Education; INTER-ACADEMIA 2018; Lecture Notes in Networks and Systems; Laukaitis, G., Ed.; Springer: Cham, Switzerland, 2019; Volume 53. [Google Scholar]
  20. Andreu, A.; Kustas, W.P.; Polo, M.J.; Carrara, A.; González-Dugo, M.P. Modeling Surface Energy Fluxes over a Dehesa (Oak Savanna) Ecosystem Using a Thermal Based Two Source Energy Balance Model (TSEB) II—Integration of Remote Sensing Medium and Low Spatial Resolution Satellite Images. Remote Sens. 2018, 10, 558. [Google Scholar] [CrossRef] [Green Version]
  21. Silva, A.M.; da Silva, R.M.; Santos, C.A.G. Automated surface energy balance algorithm for land (ASEBAL) based on automating end member pixel selection for evapotranspiration calculation in MODIS orbital images. Int. J. Appl. Earth Obs. Geoinf. 2019, 79, 1–11. [Google Scholar] [CrossRef]
  22. Sidiropoulos, P.; Dalezios, N.R.; Loukas, A.; Mylopoulos, N.; Spiliotopoulos, M.; Faraslis, I.N.; Alpanakis, N.; Sakellariou, S. Quantitative Classification of Desertification Severity for Degraded Aquifer Based on Remotely Sensed Drought Assessment. Hydrology 2021, 8, 47. [Google Scholar] [CrossRef]
  23. Raoufi, R.; Beighley, E. Estimating Daily Global Evapotranspiration Using Penman–Monteith Equation and Remotely Sensed Land Surface Temperature. Remote Sens. 2017, 9, 1138. [Google Scholar] [CrossRef] [Green Version]
  24. Allen, R.G.; Tasumi, M.; Trezza, R. Satellite-based energy balance for mapping evapotranspiration with internalized calibration (METRIC)—Model. J. Irrig. Drain. Eng. 2007, 133, 380–394. [Google Scholar] [CrossRef]
  25. Dimitriadou, S.; Nikolakopoulos, K.G. Remote sensing methods to estimate evapotranspiration incorporating MODIS derived data and applications over Greece: A review. In Proceedings of the SPIE 11524, Eighth International Conference on Remote Sensing and Geoinformation of the Environment (RSCy2020), Paphos, Cyprus, 26 August 2020; p. 1152405. [Google Scholar] [CrossRef]
  26. Liu, Y.; Zhang, S.; Jiahua Zhang, J.; Tang, L.S.; Bai, Y. Assessment and Comparison of Six Machine Learning Models in Estimating Evapotranspiration over Croplands Using Remote Sensing and Meteorological Factors. Remote Sens. 2021, 13, 3838. [Google Scholar] [CrossRef]
  27. Proias, G.; Gravalos, I.; Papageorgiou, E.; Poczęta, K.; Sakellariou-Makrantonaki, M. Forecasting Reference Evapotranspiration Using Time Lagged Recurrent Neural Network. Wseas Trans. Environ. Dev. 2020, 16, 699–707. [Google Scholar] [CrossRef]
  28. Kitsara, G.; Papaioannou, G.; Retalis, A.; Paronis, D.; Kerkides, P. Estimation of air temperature and reference evapotranspiration using MODIS land surface temperature over Greece evapotranspiration using MODIS land surface temperature. Int. J. Remote Sens. 2018, 39, 924–948. [Google Scholar] [CrossRef]
  29. Falalakis, G.; Gemitzi, A. A simple method for water balance estimation based on the empirical method and remotely sensed evapotranspiration estimates. J. Hydroinform. 2020, 22, 440–451. [Google Scholar] [CrossRef]
  30. Tsouni, A.; Kontoes, C.; Koutsoyiannis, D.; Elias, P. Estimation of Actual Evapotranspiration by Remote Sensing. Sensors 2008, 8, 3586–3600. [Google Scholar] [CrossRef]
  31. Vasiliades, L.; Spiliotopoulos, M.; Tzabiras, J.; Loukas, A.; Mylopoulos, N. Estimation of crop water requirements using remote sensing for operational water resources management. In Proceedings of the Third International Conference on Remote Sensing and Geoinformation of the Environment (RSCy2015), Paphos, Cyprus, 19 June 2015; Volume 9535. [Google Scholar] [CrossRef]
  32. Dimitriadou, S.; Nikolakopoulos, K.G. Development of GIS models via optical programming and python scripts to implement four empirical methods of reference and actual evapotranspiration (ETo, ETa) incorporating MODIS LST inputs. In Proceedings of the SPIE 11856, Remote Sensing for Agriculture, Ecosystems, and Hydrology XXIII, 118560K, Madrid, Spain, 12 September 2021. [Google Scholar] [CrossRef]
  33. Allen, R.; Pereira, L.; Raes, D.; Smith, M. Crop evapotranspiration–Guidelines for computing crop water requirements. In Irrigation and Drainage, Paper No. 56; FAO: Rome, Italy, 1998; Volume 300. [Google Scholar]
  34. Pereira, L.S.; Allen, R.G.; Smith, M.; Raes, D. Crop evapotranspiration estimation with FAO 56: Past and future. Agric. Water Manag. 2015, 147, 4–20. [Google Scholar] [CrossRef]
  35. Samani, Z. Estimating solar radiation and evapotranspiration using minimum climatological data. J. Irrig. Drain. Eng. 2000, 126, 265–267. [Google Scholar] [CrossRef]
  36. Dimitriadou, S.; Nikolakopoulos, K.G. Reference Evapotranspiration (ETo) Methods Implemented as ArcMap Models with Remote-Sensed and Ground-Based Inputs, Examined along with MODIS ET, for Peloponnese, Greece. ISPRS Int. J. Geo-Inf. 2021, 10, 390. [Google Scholar] [CrossRef]
  37. Djaman, K.; Rudnick, D.; Mel, V.C.; Mutiibwa, D.; Diop, L.; Sall, M.; Kabenge, I.; Bodian, A.; Tabari, H.; Irmak, S. Evaluation of Valiantzas’ Simplified Forms of the FAO-56 Penman-Monteith Reference Evapotranspiration Model in a Humid Climate. J. Irrig. Drain. Eng. 2017, 143, 0001191. [Google Scholar] [CrossRef]
  38. Tegos, A.; Efstratiadis, A.; Malamos, N.; Mamassis, N.; Koutsoyiannis, D. Evaluation of a Parametric Approach for Estimating Potential Evapotranspiration across Different Climates. Agric. Agric. Sci. Procedia 2015, 4, 2–9. [Google Scholar] [CrossRef] [Green Version]
  39. Valiantzas, J.D. Simplified Reference Evapotranspiration Formula Using an Empirical Impact Factor for Penman’s Aerodynamic Term. J. Irrig. Drain. Eng. 2013, 18, 108–114. [Google Scholar] [CrossRef]
  40. Zanetti, S.S.; Sousa, E.F.; Oliveira, V.P.S.; Almeida, F.T.; Bernardo, S. Estimating evapotranspiration using artificial neural network and minimum climatological data. J. Irrig. Drain. Eng. 2007, 133, 83–89. [Google Scholar] [CrossRef]
  41. Noi, P.T.; Degener, J.; Kappas, M. Comparison of Multiple Linear Regression, Cubist Regression, and Random Forest Algorithms to Estimate Daily Air Surface Temperature from Dynamic Combinations of MODIS LST Data. Remote Sens. 2017, 9, 398. [Google Scholar] [CrossRef] [Green Version]
  42. Paparrizos, S.; Matzarakis, A. Present and future responses of growing degree days for Crete Island. Adv. Sci. Res. 2017, 14, 1–5. [Google Scholar] [CrossRef] [Green Version]
  43. Taki, M.; Rohani, A.; Yildizhan, H. Application of machine learning for solar radiation modeling. Theor. Appl. Climatol. 2021, 143, 1599–1613. [Google Scholar] [CrossRef]
  44. Alvarez-Mendoza, C.; Teodoro, A.; Ramirez-Cando, L. Spatial estimation of surface ozone concentrations in Quito Ecuador with remote sensing data, air pollution measurements and meteorological variables. Environ. Monit. Assess. 2019, 191, 155. [Google Scholar] [CrossRef]
  45. Almeida, C.R.D.; Teodoro, A.C.; Gonçalves, A. Study of the Urban Heat Island (UHI) Using Remote Sensing Data/Techniques: A Systematic Review. Environments 2021, 8, 105. [Google Scholar] [CrossRef]
  46. Ghosh, A.; Joshi, P.K. Hyperspectral imagery for disaggregation of land surface temperature with selected regression algorithms over different land use land cover scenes. ISPRS J. Photogramm. Remote Sens. 2014, 96, 76–93. [Google Scholar] [CrossRef]
  47. Mohammad, P.; Goswami, A. A Spatio-Temporal Assessment and Prediction of Surface Urban Heat Island Intensity Using Multiple Linear Regression Techniques Over Ahmedabad City, Gujarat. J. Indian Soc. Remote Sens. 2021, 49, 1091–1108. [Google Scholar] [CrossRef]
  48. ASCE Task Committee on Application of Artificial Neural Networks in Hydrology. Artificial neural networks in hydrology. I: Preliminary concepts. J. Hydrol. Eng. 2000, 5, 115–123. [Google Scholar] [CrossRef]
  49. Perugu, M.; Singam, A.J.; Kamasani, C.S.R. Multiple Linear Correlation Analysis of Daily Reference Evapotranspiration. Water Resour. Manag. 2013, 27, 1489–1500. [Google Scholar] [CrossRef]
  50. Mallikarjuna, P.; Jyothy, S.A.; Sekhar Reddy, K.C. Daily Reference Evapotranspiration Estimation using Linear Regression and ANN Models. J. Inst. Eng. India Ser. A 2012, 93, 215–221. [Google Scholar] [CrossRef]
  51. Khoshravesh, M.; Sefidkouhi, M.A.G.; Valipour, M. Estimation of reference evapotranspiration using multivariate fractional polynomial, Bayesian regression, and robust regression models in three arid environments. Appl. Water Sci. 2015, 7, 1911–1922. [Google Scholar] [CrossRef] [Green Version]
  52. Tabari, H.; Kisi, O.; Ezani, A.; Talaee, P.H. SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment. J. Hydrol. 2012, 444–445, 78–89. [Google Scholar] [CrossRef]
  53. Shirsath, P.B.; Singh, A.K. A Comparative Study of Daily Pan Evaporation Estimation Using ANN, Regression and Climate Based Models. Water Resour. Manag. 2010, 24, 1571–1581. [Google Scholar] [CrossRef]
  54. Sanford, W.E.; Nelms, D.L.; Selnick, J.; Pope, J.P. Quantifying Components of the Hydrologic Cycle in Virginia Using Chemical Hydrograph Separation and Multiple Regression Analysis; U.S. Geological Survey Scientific Investigations Report 2011-5198; USGS Science Publishing Network: Raleigh, NC, USA, 2011; p. 152. [Google Scholar]
  55. Sanford, W.E.; Selnick, D.L. Estimation of Evapotranspiration across the Conterminous United States Using a Regression with Climate and Land-Cover Data. J. Am. Water Resour. Assoc. 2013, 49, 217–230. [Google Scholar] [CrossRef] [Green Version]
  56. Niaghi, A.R.; Hassanijalilian, O.; Shiri, J. Estimation of Reference Evapotranspiration Using Spatial and Temporal Machine Learning Approaches. Hydrology 2021, 8, 25. [Google Scholar] [CrossRef]
  57. Ohana-Levi, N.; Ben-Gal, A.; Munitz, S.; Netzer, Y. Grapevine crop evapotranspiration and crop coefficient forecasting using linear and non-linear multiple regression models. Agric. Water Manag. 2022, 262, 107317. [Google Scholar] [CrossRef]
  58. Sharafi, S.; Ghaleni, M.M. Evaluation of multivariate linear regression for reference evapotranspiration modeling in different climates of Iran. Theor. Appl. Climatol. 2021, 143, 1409–1423. [Google Scholar] [CrossRef]
  59. Soni, A.; Syed, T.H. Analysis of variations and controls of evapotranspiration over major Indian River Basins (1982–2014). Sci. Total Environ. 2021, 754, 141892. [Google Scholar] [CrossRef] [PubMed]
  60. Bian, Y.; Dai, H.; Zhang, Q.; Yang, L.; Du, W. Spatial distribution of potential evapotranspiration trends in the Inner Mongolia Autonomous Region (1971–2016). Theor. Appl. Climatol. 2020, 140, 1161–1169. [Google Scholar] [CrossRef]
  61. Yirga, S.A. Modelling reference evapotranspiration for Megecha catchment by multiple linear regression. Modeling Earth Syst. Environ. 2019, 5, 471–477. [Google Scholar] [CrossRef]
  62. Dimitriadou, S.; Katsanou, K.; Charalabopoulos, S.; Lambrakis, N. Interpretation of the Factors Defining Groundwater Quality of the Site Subjected to the Wildfire of 2007 in Ilia Prefecture, South-Western Greece. Geosciences 2018, 8, 108. [Google Scholar] [CrossRef] [Green Version]
  63. Copernicus Land Monitoring Service. CLC 2018. © European Union, Copernicus Land Monitoring Service 2018, European Environment Agency (EEA). Available online: https://land.copernicus.eu/pan-european/corine-land-cover/clc2018 (accessed on 11 April 2022).
  64. Kottek, M.; Grieser, J.; Beck, C.; Rudolf, B.; Rubel, F. World Map of the Köppen-Geiger climate classification updated. Meteorol. Z. 2006, 15, 259–263. [Google Scholar] [CrossRef]
  65. Wang, K.; Xu, Q.; Li, T. Does recent climate warming drive spatiotemporal shifts in functioning of high-elevation hydrological systems? Sci. Total Environ. 2020, 719, 137507. [Google Scholar] [CrossRef]
  66. Rahimikhoob, A. Estimation of evapotranspiration based on only air temperature data using artificial neural networks for a subtropical climate in Iran. Theor. Appl. Climatol. 2010, 101, 83–91. [Google Scholar] [CrossRef]
  67. Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer: New York, NY, USA, 2013. [Google Scholar] [CrossRef]
  68. Gokmen, M.; Vekerdy, Z.; Verhoef, A.; Verhoef, W.; Batelaan, O.; van der Tol, C. Integration of soil moisture in SEBS for improving evapotranspiration estimation under water stress conditions. Remote Sens. Environ. 2012, 121, 261–274. [Google Scholar] [CrossRef]
  69. Chen, M.; Senay, G.B.; Singh, R.K.; Verdin, J.P. Uncertainty analysis of the Operational Simplified Surface Energy Balance (SSEBop) model at multiple flux tower sites. J. Hydrol. 2016, 536, 384–399. [Google Scholar] [CrossRef] [Green Version]
  70. Krishnashetty, P.H.; Balasangameshwara, J.; Sreeman, S.; Desai, S.; Kantharaju, A.B. Cognitive computing models for estimation of reference evapotranspiration: A review. Cogn. Syst. Res. 2021, 70, 109–116. [Google Scholar] [CrossRef]
  71. Maroufpoor, S.; Bozorg-Haddad, O.; Maroufpoor, E. Reference evapotranspiration estimating based on optimal input combination and hybrid artificial intelligent model: Hybridization of artificial neural network with grey wolf optimizer algorithm. J. Hydrol. 2020, 588, 125060. [Google Scholar] [CrossRef]
  72. Sattari, M.T.; Apaydin, H.; Band, S.S.; Mosavi, A.; Prasad, R. Comparative analysis of kernel-based versus ANN and deep learning methods in monthly reference evapotranspiration estimation. Hydrol. Earth Syst. Sci. 2021, 25, 603–618. [Google Scholar] [CrossRef]
  73. Dimitriadou, S.; Nikolakopoulos, K.G. Artificial Neural Networks for the Prediction of the Reference Evapotranspiration of the Peloponnese Peninsula, Greece. Water 2022, 14, 2027. [Google Scholar] [CrossRef]
  74. Seifi, A.; Riahi, H. Estimating daily reference evapotranspiration using hybrid gamma test-least square support vector machine, gamma test-ANN, and gamma test-ANFIS models in an arid area of Iran. J. Water Clim. Chang. 2020, 11, 217–240. [Google Scholar] [CrossRef]
  75. Ferreira, L.B.; da Cunha, F.F.; de Oliveira, R.A.; Fernandes Filho, E.I. Estimation of reference evapotranspiration in Brazil with limited meteorological data using ANN and SVM—A new approach. J. Hydrol. 2019, 572, 556–570. [Google Scholar] [CrossRef]
  76. Wilks, D.S. Statistical Methods in the Atmospheric Sciences; Academic Press: Oxford, UK; Waltham, MA, USA, 2006. [Google Scholar]
  77. Willmott, C.; Matsuura, K. Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim. Res. 2005, 30, 79–82. [Google Scholar] [CrossRef]
  78. McVicar, T.R.; Roderick, M.L.; Donohue, R.J.; Li, L.T.; Van Niel, T.G.; Thomas, A.; Grieser, J.; Jhajharia, D.; Himri, Y.; Mahowald, N.M.; et al. Global review and synthesis of trends in observed terrestrial near-surface wind speeds: Implications for evaporation. J. Hydrol. 2012, 416–417, 182–205. [Google Scholar] [CrossRef]
Figure 1. LULC map of the Peloponnese (Adapted with permission from ref. [63] 2018, © European Union).
Figure 1. LULC map of the Peloponnese (Adapted with permission from ref. [63] 2018, © European Union).
Hydrology 09 00124 g001
Table 1. The statistical measures employed to evaluate the performance of the regression models.
Table 1. The statistical measures employed to evaluate the performance of the regression models.
Formulae of the Indices
RMSE = i = 1 n p i r i 2 n (2) NRMSE = i = 1 n p i r i 2 n r ¯ (3)
MAE = i = 1 n p i r i n (4) adj . R 2 = 1 1 R 2 1 n n k 1 (5)
R 2 = n i = 1 n p i   o i i = 1 n p i   i = 1 n o i n i = 1 n p i   2 i = 1 n p i   2 n i = 1 n o i   2 i = 1 n p i   2 2 (6) IoA = 1 i = 1 n p i r i 2 i = 1 n ( p i r ¯ r i r ¯ 2 (7)
Table 2. The 16 regression models and the corresponding indices of performance.
Table 2. The 16 regression models and the corresponding indices of performance.
ModelsIndependent VariablesSig.Adj. R2D–WCook’s DC. LeverageSERMSENRMSMAEIoA
MLR1N<0.0010.9601.7420.003 ± 0.0060.003 ± 0.00010.034 ± 0.00030.4090.1210.3200.990
MLR2N, Tmean<0.0010.9711.7320.004 ± 0.0100.007 ± 0.0040.035 ± 0.0060.3430.1020.2480.993
MLR3N, Tmean, u2<0.0010.9751.6750.004 ± 0.0110.010 ± 0.0150.036 ± 0.0110.3210.0950.2380.994
MLR4N, Tmean, u2, es − ea<0.0010.9801.7840.005 ± 0.0140.014 ± 0.0160.036 ± 0.0110.2860.0850.2140.995
MLR5N, Tmean, u2, es − ea, Rn<0.0010.9811.7380.004 ± 0.0140.017 ± 0.0180.039 ± 0.0110.2800.0830.2060.995
MLR6N, Tmean, u2, es − ea, Rn, Z<0.0010.9811.7460.005 ± 0.0140.021 ± 0.0190.044 ± 0.0120.2760.0820.2060.995
MLR7Rn<0.0010.9552.0930.003 ± 0.0060.003 ± 0.0010.036 ± 0.0030.4290.1270.3230.989
MLR8Rn, u2<0.0010.9691.9490.004 ± 0.0070.007 ± 0.0110.035 ± 0.0100.3570.1060.2800.992
MLR9Rn, u2, Tmean<0.0010.9751.9520.005 ± 0.0190.010 ± 0.0130.036 ± 0.0110.3210.0950.2510.994
MLR10Rn, u2, Tmean, Z<0.050.9751.9040.005 ± 0.0150.014 ± 0.0140.041 ± 0.0110.3180.0940.2490.994
MLR11Rn, u2, Tmean, Z, Rs<0.050.9761.9220.005 ± 0.0160.017 ± 0.0160.042 ± 0.0120.3130.0930.2450.994
MLR12Tmean<0.0010.9181.8570.003 ± 0.0040.003 ± 0.0020.048 ± 0.0080.5820.1720.4610.978
MLR13Tmean, u2<0.050.9201.8420.007 ± 0.0460.007 ± 0.0120.056 ± 0.0180.5740.1700.4670.979
MLR14Rs<0.0010.9182.0980.003 ± 0.0060.003 ± 0.0020.048 ± 0.0060.5820.1720.4420.978
MLR15u2<0.050.0300.1220.003 ± 0.0030.003 ± 0.0120.154 ± 0.0642.0010.5921.9530.120
MLR16es − ea<0.0010.8171.8840.003 ± 0.0070.003 ± 0.0030.071 ± 0.0140.8680.2570.6370.947
Note: Sig., significance (p-value); D–W, Durbin–Watson; Cook’s D, Cook’s Distance; SE, standard error of the predicted value; and C. Leverage, Centred Leverage. All values are presented as mean ± SD.
Table 3. Formulae of the MLR model with the best performance (MLR5) and the two most accurate models with one input (MLR1, MLR7).
Table 3. Formulae of the MLR model with the best performance (MLR5) and the two most accurate models with one input (MLR1, MLR7).
ModelMultiple Linear Regression EquationAdj. R2
MLR5 ETo = 9.166 + 1.066 N + 1.284 es     ea + 0.064 u 2 + 0.031 Tmean (2)0.981 *
MLR1 ETo = 8.188 + 1.011 N (3)0.960 *
MLR7 ETo = 0.494 + 0.033 Rn (4)0.955 *
Note: * Sig. < 0.001.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Dimitriadou, S.; Nikolakopoulos, K.G. Multiple Linear Regression Models with Limited Data for the Prediction of Reference Evapotranspiration of the Peloponnese, Greece. Hydrology 2022, 9, 124. https://doi.org/10.3390/hydrology9070124

AMA Style

Dimitriadou S, Nikolakopoulos KG. Multiple Linear Regression Models with Limited Data for the Prediction of Reference Evapotranspiration of the Peloponnese, Greece. Hydrology. 2022; 9(7):124. https://doi.org/10.3390/hydrology9070124

Chicago/Turabian Style

Dimitriadou, Stavroula, and Konstantinos G. Nikolakopoulos. 2022. "Multiple Linear Regression Models with Limited Data for the Prediction of Reference Evapotranspiration of the Peloponnese, Greece" Hydrology 9, no. 7: 124. https://doi.org/10.3390/hydrology9070124

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop