Next Article in Journal
Development of a Vision-Guided Shared-Control System for Assistive Robotic Manipulators
Next Article in Special Issue
UAV Enhanced Target-Barrier Coverage Algorithm for Wireless Sensor Networks Based on Reinforcement Learning
Previous Article in Journal
Flexible Platform of Acoustofluidics and Metamaterials with Decoupled Resonant Frequencies
Previous Article in Special Issue
A Comprehensive Survey on the Internet of Things with the Industrial Marketplace
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Interannual Rainfall Runoff Equation Derived from Ol’Dekop’s Model Using Artificial Neural Networks

1
Research Laboratory in Applied Hydraulics and Environment (LRHAE), Department of Hydraulics, Faculty of Technology, University of Bejaia, Targa Ouzemour, Bejaia 06000, Algeria
2
Laboratory of Biomathematics, Biophysics, Biochemistry, and Scientometric (BBBS), Bejaia University, Bejaia 06000, Algeria
3
Faculty of Computer Science, Free University of Bozen-Bolzano, 39100 Bolzano, Italy
4
Research Center of Agro-Food Technologies (CRTAA), Bejaia 06000, Algeria
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(12), 4349; https://doi.org/10.3390/s22124349
Submission received: 1 May 2022 / Revised: 4 June 2022 / Accepted: 6 June 2022 / Published: 8 June 2022
(This article belongs to the Collection Wireless Sensor Networks towards the Internet of Things)

Abstract

:
In water resources management, modeling water balance factors is necessary to control dams, agriculture, irrigation, and also to provide water supply for drinking and industries. Generally, conceptual and physical models present challenges to find more hydro-climatic parameters, which show good performance in the assessment of runoff in different climatic regions. Accordingly, a dynamic and reliable model is proposed to estimate inter-annual rainfall-runoff in five climatic regions of northern Algeria. This is a new improvement of Ol’Dekop’s equation, which models the residual values obtained between real and predicted data using artificial neuron networks (ANNs), namely by ANN1 and ANN2 sub-models. In this work, a set of climatic and geographical variables, obtained from 16 basins, which are inter-annual rainfall (IAR), watershed area (S), and watercourse (WC), were used as input data in the first model. Further, the ANN1 output results and De Martonne index (I) were classified, and were then processed by ANN2 to further increase reliability, and make the model more dynamic and unaffected by the climatic characteristic of the area. The final model proved the best performance in the entire region compared to a set of parametric and non-parametric water balance models used in this study, where the R2Adj obtained from each test gave values between 0.9103 and 0.9923.

1. Introduction

Precipitations are the origin of water resources, which undergo different quantitative and qualitative transformations on the slopes. The losses of rainwater are almost observed as a form of infiltration, retention in the soil, and evaporation. Some part of this quantity can also flow into wadis until it moves into the sea. The estimation of actual evapotranspiration is the component most required for estimating water and energy balance equations [1,2]. This resource can be accessed on the inter-annual scale, according to the history of climatic and hydrometric measurements that could be given in the outlet of some watersheds. For ungauged watersheds, the lack of some information posed a big problem in the estimation of the mean annual flow, which is the main scientific challenge for many hydrologists [3,4,5]. The water balance concept was considered to study the hydrological behavior of watersheds and to describe the relationship between water and thermal components of the earth. This relativity was defined by a mathematical ratio between rainfall (R), rainfall-runoff (RR), and real evapotranspiration (Ea) [6]. The estimation of rainfall-runoff is necessary as the first step to search for the best evaluation of Ea. In literature, the first attempts were started by Schreiber [7] and Ol’Dekop [8]. Then, Budyko [9] proposed an average model of both previous equations to minimize the estimation errors that were given by Schreiber and Ol’Dekop. On the other hand, Pike [10] also gave a new formula to estimate Ea, in the form of a power equation. However, in 1974, Budyko and Miller developed a new theory, which describes the main factors to determine the actual rate of evapotranspiration on the inter-annual scale [6]. This work sparked the interest to understand the effect of watershed climate characteristics, vegetation, and capacity of watershed storage on Ea estimation [11,12,13]. In this regard, several conceptual models cited in the literature show the best performance when using physical indices (Soil, vegetation, and the capacity of basin storage) as input variables to assess Ea on daily and intra-daily scales, including black-box models (NARX-NN), machine learning models (ANN, random forest), the GR6J-WGANN2 model, etc. [14,15,16]. On the other hand, climatic variability also impacts the choice and the performance of runoff models applied on the annual and interannual scale, in which different models were developed regionally according to the rainfall characteristics of basins [17,18]. The proposed technique by Budyko was proven to be an effective tool to explain the interactions between a set of climatic factors and the characteristics of watersheds, and also their impacts on the water balance energy; however, its limit of application has always posed a problem for ungauged watersheds [5,19,20]. Furthermore, climatic variability and the aridity ratio of catchment areas also posed a problem for hydrologists trying to propose a dynamic and reliable model for estimating a rainfall-runoff in different regions of the world. Du et al. explained that the Budyko hypothesis was generally used to estimate the rainfall-runoff in arid regions [21]. Moreover, Wu et al. proved that applying Budyko’s model to different climates and watersheds of China, using different time scales, performs well at a mean annual scale where the climate area is arid, which is better than wet areas [22]. Xiong and Guo [23], deduced that Budyko’s non-parametric model, despite being easier for application, has never given good results in the context of long-term water balance studies in humid watersheds. Contrariwise, the parametric models are more performant in humid regions with local optimization of those input parameters. Thus, we can understand that no single model is perfect for an easy application in estimating inter-annual rainfall-runoff, especially in areas characterized by climatic diversity.
This work aims to propose a dynamic model that will be used to estimate inter-annual rainfall-runoff based on modeling residual data distribution (IARR’) given by Ol’Dekop’s model [8] in different areas. The new equation form was obtained by applying an artificial neural network (ANN) on several climatic and geographical input variables, which proved a good correlation with IARR’, such as inter-annual rainfall (IAR), De Martone index (I), watershed area (S), and watercourse (WC), respectively. Furthermore, the model was applied to five climatic floors of northern Algeria according to the climatic and geophysical diversity that characterized this area, which helped to determine the parameters of the new model. Generally, the article is structured in five sections, where a presentation of the study area, data, models, and metrics of comparison are given in the Section 2. Then, the statistical analysis, the bioclimatic classification, the ANNs design, and modelling steps are shown in Section 3. On the other hand, the comparison and the performance analysis with a set of parametric and nonparametric models that are mostly applied to assess IARR are given in Section 4. Finally, the conclusions were drawn in Section 5.

2. Material and Methods

2.1. Study Area and Data

Algeria is an important country in Northern Africa, which is located on the southern shore of the Mediterranean Sea. It is bordered on the east by Tunisia and Libya, in the west by Morocco, and in the southwest by Mauritania and Western Sahara; moreover, in the southern part, it overlooks Niger and Mali. It extends over an area of 2,381,741 km2, where 85% of the surface represents the desert region. Our study relates to the northern Algeria area, which includes a surface of 480,000 km2 [24]. It is bordered in the north by the Mediterranean Sea, while the southern part overlooks the Great Sahara. It is located between a longitude of −2.2° and 8.6°, and a latitude of 33° and 37°, which is distributed over 16 watersheds numbered from 01 to 17, except basin 13 which represents the southern part of the country (Figure 1A). In this region, the spatial precipitation distribution is very heterogeneous, characterized by a strong gradient from north to south, and a weak one from east to west [25]. North Algeria has a typically Mediterranean climate, which is hot and dry in summer. Contrariwise, in winter, it is mild and rainy. Annual precipitation data varies between 400 mm and 1500 mm. In the highlands and the Saharan Atlas the climate is generally semi-arid, and the rainfall does not exceed 500 mm annually. The pre-Saharan and Saharan regions have a very arid climate, which is almost devoid of precipitation, and the mean annual rainfall there varies between 50 mm and 200 mm [26,27]. The temperature of the country varies between day and night, as well as between summer and winter [28]. According to the data, the mean daily temperature is observed in January with a value between around 10 °C and 12 °C. In July it is comprised between 25 °C and 27 °C. In the highlands and the Saharan Atlas, the temperatures recorded a thermal amplitude higher than that of the coastal regions, given by an average daily temperature in January which is around 2 °C and 9 °C. However, in July these are between 19 °C and 33 °C.
Spatial variation maps displaying the maximum and the minimum altitudes observed in 16 watersheds in northern Algeria are shown in Figure 1B,C, respectively. The maps show that the northern Algeria morphology is higher on the southern side than on the northern, where the minimum altitudes observed on the marine side reach measurements between 17.9 m and 271 m in the basins numbered 3, 2, and 4. However, the maximum altitudes in this region are between 262.2 m and 1079 m. In the southeastern region, the maximum elevations of basins numbered 06, 05, 07, and 12 almost reach values between 1080 m and 1896 m. Contrariwise, in the middle region of basins numbered 6 and 7, the highest points in this region arrive between 1897 m and 2304 m. However, the minimum altitudes are between 524 m and 1280 m. In the western area, the maximum altitudes reach the interval values given between 1080 m and 1487 m, especially in basins 08, 11, 16, and 01. Where the minimum elevations are between 272 m and 523 m.
The hydrometric network of northern Algeria consists of 200 measuring stations, which are distributed over 16 watersheds. The present work aims to estimate the runoff of 102 sub-basins obtained by defining the extreme outlets of the wadis in the entire region. In the middle location of each sub-basin, we chose a meteorological and a hydrological station in the same coordinate to provide sufficient data that were used to assess rainfall flow between 1965 and 2020. The climatic data used in this study are precipitation (IAR) and temperature (IAT). The data was obtained from the available dataset on the climatic stations provided by the National Centers for Environmental Information (NCEI-NOAA). https://doiwww.ncdc.noaa.gov/ (accessed on 12 May 2021), and Climate Knowledge Portal https://climateknowledgeportal.worldbank.org/ (accessed on 17 May 2021), whereas the hydrological and geographical data, which are the watershed area (S) and the length of the watercourse (WC), were obtained from hydrometric stations provided by the Algerian National Agency of Hydrological Resources (A.N.R.H).
The De Martonne aridity index (I) was used in this study as input data of the dynamic model given by the artificial neuron network (ANN2) and to classify the bioclimatic floor of the northern Algeria area. The I index values were obtained according to the following Equation (1) [29]:
I = IAR IAT + 10  
The actual measurement of inter-annual rainfall-runoff (IARRR) was obtained from the A.N.R.H service. This data was used to estimate the computational residues (IARR’) obtained by Ol’Dekop’s model and to control the performance of the proposed model in the study area [21], that is:
  IARR E = IAR IAE a  
IARR = IARR R IARR E  
where IARRE represents the inter-annual rainfall-runoff data estimated by the Ol’Dekop model, IAEa is the mean annual real evapotranspiration, and IARRR is the inter-annual real rainfall-runoff.

2.2. Artificial Neuron Network

An artificial neuron network (ANN) is a data-driven process with a flexible mathematical algorithm capable of solving the complex nonlinear relationships between input and output datasets. In fact, it mimics the biological neuron architecture [30]. It is a family of parallel architectures used to solve the most complex mathematical problems in modeling, optimization, and prediction [31,32]. In practice, using ANN involves taking into consideration three main elements:
  • The interconnection between the input data ensures good results through the process.
  • The transfer function controls the generation of the neural output.
  • The summing function and the statistical parameters describe how the weights of input data are adjusted during the treatment [30].
During computation the ANN receives data from the input layer, then a combination between selected data is performed by the hidden layer using the summing function and a number of statistical control parameters. In general, the summing function formula is represented by:
net j =   W ij x i
where netj is the mean of weighted input for the jth neuron; Wij is the weight from the ith neuron in the previous layer to the jth neuron in the current layer, and Xi is the input from the ith to the jth neuron.
The transfer function Ψ is also used by the hidden layer to generate the final result, which is called by output data Yj, using the function (5). The training stops when the error obtained by the validation test reaches the minimum.
Y j = Ψ ( net j + θ j )
where netj represents the obtained data from the summing function (4), θj is the external threshold that is obtained from the summing step. In literature, hydro-climatic modeling using ANN typically uses the feed-forward network structure with either one or multiple layers, depending on the objective of the study [33,34]. When using ANN for modeling climatic and hydrological phenomena, a set of fundamental decisions need to be made [33], which are:
  • The choice to use the appropriate neural network architecture.
  • The best selection of a suitable training algorithm, study periods, and network structure.
  • The way to pre-process and post-process the input and output data, respectively.

ANN Transfer Function

In the experimental part, the transfer function used by the ANN model to estimate the output data was given by the trend equation of the multiple linear regression model (MLR). This latter one was applied to a set of input data (Xi*) that could prove a good regression with IARR’*. The test was conducted after linearizing the data series using the Ln equation. In this machine learning, the output data represents the predicted residual values obtained by the Ol’Dekop model. The concept of multiple linear regression used to study the linear relationship between the dependent variable Y and the vector of regressors (X, X2, …, Xk) is given by the following function [35]:
Y = α + β 1 X 1 + β 2 X 2 + + β k X k + Ɛ
where α is the intercept, β is the slope or the coefficient, k is the number of observations and Ɛ represents the estimation error.

2.3. Water Balance Model

A set of nonparametric and parametric models used to compare and analyze the performance of the proposed model are detailed by the Equations (7)–(11), as detailed next.

2.3.1. Schreiber

Schreiber [7] proposed a simple model to estimate inter-annual evapotranspiration (IAEa) in terms of inter-annual precipitation (IAR) and mean annual potential evapotranspiration (IAEo), that is:
IAEa = IAR × [ 1 exp ( IAE o IAR ) ]

2.3.2. Ol’Dekop

Ol’Dekop [36] applied a trigonometric hyperbolic tangent function (8) to show the relationship between potential evapotranspiration (IAEo) and the drying factor ( IAR IAE 0 ).
IAE a = IAE 0 × tanh ( IAR IAE 0 )  

2.3.3. Budyko

Budyko [9] defined an average estimation using Schreiber [7] and Ol’dekop [8] models to reduce errors obtained by both models (9), which is given by following formula,
IAE a = [ IAR × [ 1 exp ( IAE o IAR ) ] × ETR tan h ( IAR IAE o ) ] 0.5

2.3.4. Yang

Yang [37] proposed an alternative model (10) to assess mean annual actual evapotranspiration by entering local parameter (n) characteristics on Budyko’s hypothesis. The model uses the watershed characteristics for a better estimation, that is
IAE a = [ [ [ ( IAE o IAR ) n ] + 1 ] 1 n ] × I
where n > 0.

2.3.5. Sharif

This model is an improvement of the Mezentsev–Choudhury–Yang (MCY) proposition, achieved by replacing b, k, and n parameters with 0, 2, and 1, respectively [38].
IAE a = 2 × IAR × IAE o IAR + 2 × IAE o
where by IAR is the inter-annual rainfall; IAE0 is inter-annual potential evapotranspiration; and IAEa is inter-annual actual evapotranspiration.

2.4. Metrics of Performance

The statistical criteria applied in each modeling step, which are also used to compare the performance of the obtained model with non-parametric and parametric models mostly cited in the literature, are well detailed in Table 1. Where by Qs is the estimated runoff; Qo is the observed runoff; N is the total number of ordinates; K is the number of independent variables; and ei is the residual for the time period i.

3. Results

In this section, we start our study by spatially analyzing and classifying the input data used by each sub-model. Then we show the ANN design, such as the choice of input data by the selection criteria, the regression equation used as a transfer function, and the output results, which are mainly used to estimate the residual data (IARR’). Finally, the computational and calibration steps followed to obtain the runoff equation are given in detail in the last part.

3.1. Data Description and Classification

Data distribution and variability analyses of selected variables used for modeling IARR’ are shown in this section, which aims to study the behavior and the regression relationship between input variables and the IARR’ dataset. This study is based mainly on qualitative and quantitative tests, where the results of the data distribution are given by P–P plot, Q–Q plot (which allowed us to compare the empirical and theoretical distribution of data and cumulative quartiles, respectively, using normal law), and scattergram graph (Figure 2). The Q–Q plot and the scattergram graph are obtained after linearizing the used variables by applying the Ln function, to simplify the comparison between data series distributions, which have different measurement units. Moreover, Table 2 gives us a set of statistical parameters to quantify the variability of the data series. The P–P plot and the Q–Q plot show that IAR, S, and WC data series have a similar variability to the IARR’s dataset. However, IAR and S data distributions are closer to IARR than the WC data series (Figure 2). On the other hand, IAT shows a different behavior of data distribution, where the graphs prove that the stochastic model, which defines the IAT variability, is closer to the normal law. The p-value results obtained by the DKS fit test using a set of distribution laws (Table 2) show that IARR’ and IAR follow the Weibull 3 law, where the p-values equal 0.9789 and 0.8572, respectively. On contrary, IAT data have a GEV distribution, where the p-value is equal to 0.8764. Moreover, S and WC show a similar distribution, in which both variables follow the law of gamma 2. According to the fit results, the last three variables accept the Weibull 3 as a second closest fit model, where p-values equal 0.6136, 0.8363, and 0.8263, respectively. In Figure 2, the scattergram shows a descriptive comparison between the variability of data cited above. The graphs show a similar variability between IARR’, IAR, and S data series, where the majority of the values of each variable are very close between them and below the average of their series (which equals 49.113, 494.6529, and 719.7106, respectively). On the other hand, Table 2 shows that 25% of the dataset, which is bounded between the third quartile and the maximum value, has very large variability, given by the interval of [64.3229, 284.3221], [607.00, 1107.00], and [1028.25, 4050.00], respectively. The WC variable shows a slight difference between data distribution. Inversely, the variability is more similar to IARR’. In this series, the mean and the median are close, and equal 45.60 and 51.10, respectively. However, 25% of the WC values give a very high variability, which is given by a value range of [65.5250, 179.80]. On the other hand, the IAT dataset shows a very low variability given by a variation coefficient of 0.1180. This last dataset proves a convergence between the median and the mean, which are equal to 15.2792 and 15.4569, respectively. Moreover, the scattergram shows that the upper and the lower IAT values, compared to the average, have a similar distance, given by the interval of [−2, 0] and [0, 2], respectively. In Table 2, this similarity is given by [11.5833, 15.2792] and [15.2792, 21.7583], respectively. According to this analysis, the greatest variability was obtained from IARR’ and S datasets, whereby the variation coefficients equal 1.1039 and 1.0379, respectively.
The spatial inter-annual rainfall distribution and the De Martonne index obtained by applying Equation (1) are mapped using the data of 102 meteorological stations to determine the bioclimatic floor of each watershed of northern Algeria between 1965 and 2020. This study helps to provide the application areas used to control the performance of the model proposed in this section. Climate classification analysis is shown in Figure 3. A very large variability of rainfall from north to south is shown in Map (a), wherein the southern part, the IAR reaches up to 200 mm. However, in the north, the rainfall reaches values higher than 800 mm.
According to the bioclimatic classification given by [45], northern Algeria has a climatic diversity spread over five floors, from very humid to very dry (Figure 3B). The figure shows that watershed 02, which is located in the northern part and overlooking the Mediterranean Sea has a very humid climate, characterized by IAR values varying between 700 mm and >800 mm (Figure 3A). On the other hand, watersheds 09, 15, and 10 have the greatest climatic diversity, varying between very humid, humid, semi-humid, Mediterranean, and semi-dry. Where in this area, the rainfall varies between 400 mm and >800 mm. This diversity depends mainly on the geographical and geological characteristics of the region. In the northeastern part, watershed 03 is characterized by a very humid, humid, and semi-humid climate, where the IAR varies between 500 mm and >800 mm. The catchment area 01, 14, and 12 have a climatic diversity which is between humid, semi-humid, Mediterranean, and semi-dry. In this area, the minimum rainfall was observed in watershed 14, which arrives at 300 mm. Differently, in watersheds 01 and 12, the minimum rainfall values reach up to 200 mm in the southern areas.
In addition, watersheds 16, 04, 11, 08, 17, 05, 06, and 07 are characterized by a semi-dry climate, where the rainfall varies generally between 300 mm and 400 mm. Contrariwise, the minimum rainfall of watersheds 08, 17, 05 and 06 reaches up to 200 mm.

3.2. Proposed ANNs

The different steps followed to obtain the best ANNs model for estimating the computational errors of IARR’ given by the Ol’Dekop model are represented in this section. This study allows us to develop a new form of water balance model based on a set of climatic and geomorphological variables, which can be applied in a different area, without being conditioned by the aridity state of the watershed. The statistical tests used in this study are: residual analysis curve, R2, R2Adj, MSE, RMSE, and Durbin–Watson (DW). These latter tests were used to analyze the performance of each transfer equation used by the ANN model, and also to quantify the reliability degree of the proposed model compared to a set of parametrical and non-parametrical water balance models, which are what is mostly used in the literature. Our model is classified into two steps, given as ANN1 and ANN2 (Figure 4), which show that initially a local model (ANN1) was given to estimate IARR in all northern Algeria watersheds. Then an improvement was made to increase the reliability of the previous model in each basin climatic area and to make it more dynamic and applicable in different regions (ANN2).
Figure 4 shows that the two previous sub-models are the type of feed-forward network with the architecture of (3-2-1-1) and (10-5-1-1), respectively. In the first attempts of this modeling, we used IAR, S, and WC variables as input layers in the ANN1 model. After that, we classified the estimation results that were obtained by this model in groups, according to each bioclimatic level. In this step, we also used the aridity index I of each hydrological station as input data in the ANN2 model to determine, in each climatic area, its transfer function, which allows us to deduce the final rainfall-runoff model. Figure 4, shows that the intermediate nodes (denoted by IRR1, IRR2, IRR3, IRR4, IRR5, IRR6, and IRR7, respectively) make it possible to apply sub-processing, using a summing and transfer function on the input data “output layer” to estimate the output data in each step of the ANN model. In our case, the summing function combines the input variables two by two to have the best modeling results. Moreover, the choice of combination between variables was justified by the results of the correlation test, which were applied to the linearized variables compared to the linearized IARR’ (IARR’*) (Table 3). The table shows the degree of correlation between a set of candidate variables that were cited in the previous section of this study and IARR’*, using two correlation forms. A direct correlation was found between the IARR’* and (S*, WC*, IAT*, IAR*, and I*), then an indirect correlation between IARR’* and (WC*, S*), by studying the relationship between IARR’* and (( IARR *   WC * ), ( IARR *   S * )) then between (( IARR *   WC * ), ( IARR *   S * )) and (Wc*, S*), respectively.
Table 3 shows that IARR’* has the best correlation with IAR*, which equals 0.8513. It is also strongly correlated with climatic data obtained from the I* index. Contrariwise, the IAT* shows a weak correlation. However, the geo-hydrologic variables such as WC* and S* are slightly correlated with the response variable (IARR’*), which equals 0.5141 and 0.5023, respectively. The results highlight that the relation proposed by ( IARR *   WC * ) and ( IARR *   S * ) shows a very good correlation with IARR’*, given by 0.8124 and 0.7322, respectively. Furthermore, the new variables also show a strong correlation with WC* and S*, respectively, given by correlation coefficients of 0.6168 and 0.7923, respectively.

3.3. IARR Modeling

Describing different modeling steps used to propose a new equation of IARR estimation is based mainly on the computational error analysis of the Ol’Dekop model in a set of bioclimatic floors. A non-linear regression relationship between IARR’ and selected input variables provided in (Table 3) are shown graphically in Figure 5 as the first step of this analysis. In this regard, a direct regression is applied between the response variable (IARR’) and the input variables (IAR, I). Then, intermediate variables were used to express the indirect relationship between (S, Wc) and IARR’. The results show a similar regression for each pair of the dataset (IARR ‘, IAR) and (IARR’, I), given by an R2 which equals 0.8608 and 0.8134, respectively.
The graphs shown in Figure 5A,B prove some trends of values, which are shown in the range of the maximum value. Regression models cited above are defined by Equations (12) and (13):
IARR = 5 × 10 6 × IAR 2.5538
IARR = 0.024 × I 2.4345
On the other hand, the figure also shows a good nonlinear regression between IARR’ and   IARR S , and also between IARR’ and IARR Wc according to R2 results, which equal 0.7501 and 0.8209, respectively, given by Figure 5C,D. The regression relationship between both input variables are defined by Equations (14) and (15), respectively:
IARR = 99.714 × ( IARR S ) 0.4554
IARR = 37.187 × ( IARR Wc ) 0.6338    
Moreover, both ratios ( IARR S ) and ( IARR Wc ) proved a good regression trend with S and WC variables, respectively. Where R2 equals 0.8103 and 0.6314, respectively. Both statistical relationships between input and response variables are defined by Equations (16) and (17):
IARR S = 443.59 × S 1.445
IARR Wc = 365.71 × Wc 1.666
In this study, we found that all the cases of regression cited in Figure 5 followed the power model trend.
Figure 6 represents the linear regression graphs, which express the degree of correlation between the real values of IARR’* and the estimated values that were obtained by the IRR1 and IRR2 models. In this step, we proved the correlation’s degree and the reliability of the obtained model. We have well explained the different multiple regression models, which are applied to the pair of variables (IAR*, S*) and (IAR*, WC*), using the intermediate variable ( IARR *   S * ) and ( IARR *   WC * ), respectively, which showed a good nonlinear regression with S and WC data, and also with IARR’ (Figure 5). Moreover, Table 4 shows a set of statistical parameters relating to this modeling, which gives information about the reliability and the trend analysis of the sub-models that are noted by (A, B, C, and D), compared to the linearized real data (IARR’*). According to the results, we found that the regression model obtained from (IAR*, IARR *   S * ) and (IAR*, IARR *   WC * ) have a very good regression, proved by an (R2, R2adj) results which equal (0.9301, 0.9286) and (0.9405, 0.9393), respectively. Moreover, the errors given by MSE, RMSE, and DW show that all models did not prove a large trend deviation compared to IARR’* real data. The computational steps of IRR1 and IRR2 models are well detailed by the following Equations (18) and (19):
Ln ( IRR 1 ) = 1.7166 × Ln ( IAR ) + 0.2057 × Ln ( IARR S ) 6.5738
Ln ( IRR 2 ) = 1.4538 × Ln ( IAR ) + 0.3296 × Ln ( IARR Wc ) 5.3946  
To deduce the previous equations as functions of fundamental variables IAR, S, and WC, we start with Equation (18) by replacing ( IARR S ) with (S) using Equation (16). So we obtain:
Ln ( IRR 1 ) = 1.7166 × Ln ( IAR ) + 0.2057 × Ln ( 443.59 × S 1.445 ) 6.5738   = 1.7166 × Ln ( IAR ) 0.2972 × Ln ( S ) 5.3201
On the other hand, we use Equation (17) to replace ( IARR Wc ) with (WC) in Equation (19). We find:
Ln ( IRR 2 ) = 1.4538 × Ln ( IAR ) + 0.3296 × Ln ( 365.71 × Wc 1.666 )         = 1.4538 × Ln ( IAR ) 0.5491 × Ln ( Wc ) 3.4494
Figure 6C,D shows the reliability of the results obtained by the IRR1 and IRR2 models that are given by Equations (18) and (19), respectively. The corresponding graphs show a good fit of regression between the actual and the estimated values of IARR’*. This reliability was shown by R2, and R2Adj statistical parameters, which equal (0.9036, 0.9006) and (0.9105, 0.909), respectively (Table 4). We apply the Exp function in Equations (20) and (21), to obtain the IRR model used by ANN1.
We have,
IRR 1 = Exp ( 1.7166 × Ln ( IAR ) 0.2972 × Ln ( S ) 5.3201 )   = 0.0049 × ( IAR 1.7166 ) × ( S 0.2972 )
  IRR 2 = Exp ( 1.4538 × Ln ( IAR ) 0.5491 × Ln ( Wc ) 3.4494 )   = 0.0318 × ( IAR 1.4538 ) × ( Wc 0.5491 )
In this step, we found two reliable equations to estimate IARR’. Figure 6E shows that the best regression can be obtained as a function of the three variables (IAR, S, and WC), which is given by Equation (24).
According to Equations (22) and (23), the general model IRR is defined as follows:
IRR = IRR 1 × IRR 2                                                                                                                                                                                                                       = 0.0049 × ( IAR 1.7166 ) × ( S 0.2972 ) × 0.0318 × ( IAR 1.4538 ) × ( Wc 0.5491 )                                         = 0.01 × ( IAR 1.5852 ) × ( S 0.1486 ) × ( Wc 0.27455 )
Table 4 shows that the IRR model gives a good estimation, proven by a set of statistical parameters. The results obtained by this model give an R2 and R2Adj, which are equal to 0.9518 and 0.9508, respectively. Moreover, the computational errors given by MSE, RMSE, and DW parameters show values of 208.539, 11.4409, and 0.7094, respectively. The ANN1 model shows that the use of geomorphological parameters increases the reliability of estimation when compared with the simple regression model obtained by IAR only (Figure 5A).
In the second step of this study, we improved the proposed model, which is defined by Equation (24) to be more dynamic and applicable to each bioclimatic region. We applied multiple linear regression to aridity index series (I) and the predicted data obtained previously by the IRR model. This technique was applied separately to each bioclimatic stage in order to find the corresponding estimation model of each area. Table 5 shows all the statistical parameters relating to each local regression model. The results show that the model obtained in a very humid climate area gives a more reliable estimation compared to the performance models of other bioclimatic floors, where the R2 and R2Adj given for the IRR7 model equal 0.9072 and 0.9001, respectively (Figure 4 and Table 5). The table shows that the obtained models have different reliability in each climate region, wherein the Mediterranean area, the R2, and R2Adj were proven to perform well, and equal 0.7804 and 0.7647, respectively.
On the other hand, in the semi-dry, semi-humid, and humid climate floor, the R2, R2Adj equal (0.6501, 0.6420), (0.6820, 0.6675) and (0.6533, 0.6448), respectively. Moreover, the trend pattern obtained by modeling the IARR’ in the dry climate floors is increased when compared to the estimated IARR’ in wet regions. The performance criteria show that the greatest values are given in the semi-dry climate level, where MSE, RMSE, and DW equal 0.0783, 0.298, and 1.6870, respectively. In the humid area, R2 proved lower performance compared to the estimation obtained in the Mediterranean region. Contrariwise, the errors are more remarkable in the humid regression model, where MSE, RMSE, and DW values equal 0.0138, 0.1175, and 0.8625, respectively. On the other hand, on the Mediterranean climate floor, these parameters equal 0.0296, 0.1721, and 1.7861, respectively. The model named IRR3, IRR4, IRR5, IRR6, and IRR7, which were obtained by modeling IARR’ in semi-dry, Mediterranean, semi-humid, humid, and very humid climate floors, respectively, are defined by Equations (25)–(29), as follows:
Ln ( IRR 3 ) = 0.50033 × Ln ( EIRR S ) + 1.16331 × Ln ( I S ) 1.74497
Ln ( IRR 4 ) = 0.42496 × Ln ( EIRR ME ) + 1.17641 × Ln ( I ME ) 1.41471
Ln ( IRR 5 ) = 0.29175 × Ln ( EIRR SH ) + 1.00656 × Ln ( I SH ) 0.2404
Ln ( IRR 6 ) = 0.16179 × Ln ( EIRR H ) + 1.34092 × Ln ( I H ) + 0.45666
Ln ( IRR 7 ) = 0.28185 × Ln ( EIRR VH ) + 1.50295 × Ln ( I VH ) 1.60261
We apply the Exp function on Equations (25)–(29)to generate the final estimation model (IRRF) in terms of IAR, S, WC, and I (Table 5) So we find,
        IRR 3 = 0.1747 × EIRR S 0.50033 × I S 1.16331
IRR 4 = 0.2430 × EIRR ME 0.42496 × I ME 1.17641
IRR 5 = 0.7865 × EIRR SH 0.29175 × I SH 1.00656  
            IRR 6 = 1.5788 × EIRR H 0.16179 × I H 1.34092
              IRR 7 = 0.2014 × EIRR VH 0.28185 × I VH 1.50295
We apply the different models obtained in each bioclimatic area using all the datasets of northern Algeria to show the trend that can be caused by the static models. In this step, we want to propose a dynamic model, by eliminating all constants and finding a mathematical relationship with variables that can prove a good correlation. Table 6 shows statistic results, obtained from IRR3, IRR4, IRR5, IRR6, IRR7, and IRRF models. The table shows that all previous models cited above proved an R2 and R2Adj greater than 0.80. Moreover, the error trends increase in these models, which are proven by MSE, RMSE, and DW parameters. On the other hand, the IRRF model has the best reliability, given by an R2, which equals 0.9841. This model proved a low tendency, where the MSE, RMSE, and DW equal 62.5948, 5.9117, and 0.5250, respectively. The final model (IRRF) was obtained by applying the weighted average using the R2 values that are obtained in Table 6 as weighting coefficients to estimate the a 1 and a2 coefficients of this model. Where the IRRF equation is defined as follows:
IRR F = Cte × EIRR a 1 × I a 2
We have,
  a 1 = ( R IRR 3 2 × a 1 IRR 3 ) + ( R IRR 4 2 × a 1 IRR 4 ) + ( R IRR 5 2 × a 1 IRR 5 ) + ( R IRR 6 2 × a 1 IRR 6 ) + ( R IRR 7 2 × a 1 IRR 7 ) R IRR 3 2 + R IRR 4 2 + R IRR 5 2 + R IRR 6 2 + R IRR 7 2
By replacing variables with values, we find:
a 1 = ( 0.9494 × 0.50033 ) + ( 0.9538 × 0.42496 ) + ( 0.9443 × 0.29175 ) + ( 0.8358 × 0.16179 ) + ( 0.9616 × 0.28185 ) ( 0.9494 + 0.9538 + 0.9443 + 0.8358 + 0.9616 ) = 0.27808                                                                                                                                                                                                                                                                      
We have also,
  a 2 = ( R IRR 3 2 × a 2 IRR 3 ) + ( R IRR 4 2 × a 2 IRR 4 ) + ( R IRR 5 2 × a 2 IRR 5 ) + ( R IRR 6 2 × a 2 IRR 6 ) + ( R IRR 7 2 × a 2 IRR 7 ) R IRR 3 2 + R IRR 4 2 + R IRR 5 2 + R IRR 6 2 + R IRR 7 2
We replace variables with values, we find:
a 2 = ( 0.9494 × 1.16331 ) + ( 0.9538 × 1.17641 ) + ( 0.9443 × 1.00656 ) + ( 0.8358 × 1.34092 ) + ( 0.9616 × 1.50295 ) ( 0.9494 + 0.9538 + 0.9443 + 0.8358 + 0.9616 ) = 1.2364                                                                                                                                                                                                                                                                          
Using the results obtained by Equations (36) and (37) in Equation (35), we find:
EIRR F = Cte × EIRR 0.27808 × I 1.2364
To make the obtained Equation (38) more dynamic, the constant (Cte) is defined in terms of IAR data, which is given by Equation (42). The Cte must be obtained by each watershed to take into consideration the variability of each climate area. For this, we propose in Equation (39) the hypothesis that the predicted and real data of the Ol’dekop residuals (IARR’) are almost equal.
  EIRR F IARR
We replace EIRRF in Equation (39) by the formula defined in Equation (38), by doing so, we obtain
IARR Cte × EIRR 0.27808 × I 1.2364  
Cte IARR EIRR 0.27808 × I 1.2364
Figure 7 shows that Cte data has a very good correlation with IAR, where R2 equals 0.7235. When we use the trend equation obtained from the regression model to represent the Cte variable as a function of IAR, we obtain:
Cte = 0.0026 × IAR 0.7875
When we use Equations (22) and (39) in Equation (36), we find:
  EIRR F = 0.0026 × IAR 0.7875 × EIRR 0.27808 × I 1.2364                                                                                                                                                     = 0.0026 × IAR 0.7875 × ( 0.01 × IAR 1.5852 × S 0.1486 × Wc 0.27455 ) 0.27808 × I 1.2364           = 0.00072 × IAR 1.2283 × Wc 0.07635 × S 0.04132 × I 1.23640
In Equation (43), we used Equations (8) and (42) to define the new form of the Ol’Dekope model used for the IARR estimation.
  IARR E = IAR IAE a + EIRR F   = IAR ETP × tanh ( IAR ETP ) + ( 0.00072 × IAR 1.2283 × Wc 0.07635 × S 0.04132 × I 1.23640 )  

4. Discussion

Figure 8 shows regression graphs obtained from real and predicted IARR data that are estimated by a set of non-parametric and parametric water balance models most cited in the literature review. These are Schreiber, Ol’Dekop, Budyko, Yang (n = 2), and Sharif, as given by Figure 8B–F, respectively. These models were applied to the dataset series used in this modeling to compare the results with the predicted IARR values obtained by the new model, given in Figure 8A. The results demonstrate that the best IARR estimation are obtained by the proposed model, which proved a very good match between the estimated and the actual data (Figure 8A), where the model proved no trend and a very good regression, given by an R2, which equals 0.9924. On the other hand, Schreiber and Yang’s models give similar results in which R2 equals 0.9211 and 0.9334, respectively (Figure 8B,E). Budyko’s model also gives a good result, and performs less than the previous models, given by an R2 equal to 0.9041. On the other hand, Ol’Dekop and Sharif’s models proved a similar performance given by an R2 equal to 0.8525 and 0.8739, respectively.
Moreover, in Figure 8C,F, the predicted data obtained by both models show a high tendency. Trend analysis of the new equation compared to the models cited above is given in Figure 9 and Table 7, whereby the figure represents residual curves obtained between measured and predicted data, and the table shows a set of statistical parameters used to analyze the variability and the performance of each model. Figure 9 shows that the new proposition gives the lowest error and no trend of residuals was obtained in Figure 9A when compared to data processed by all runoff models in the 102 sub-basin. Furthermore, the parametric model given by Yang (n = 2) is also reliable and can be taken as the second choice to estimate IARR in the region that has a great climatic diversity—i.e., the northern Algeria area.
On the other hand, Sharif’s model shows good results compared to Schreiber, Ol’dekop, and Budyko, where the mean residuals obtained by this last one are symmetrical, distributed between negative and positive value ranges for the axis (X = 0). In this study, the Ol’Dekop model shows a very big tendency for residual data compared to other models, reaching up to 300 mm. In addition, Schreiber and Budyko produce a significant trend of data compared to real observations in watersheds, ranked from 61 to 102, which are located in semi-humid, humid, and hyper-humid climate regions (Figure 3). Table 7 shows that the IARR data series, which is obtained by the proposed model has a similar variability to the real data, given by a coefficient of variation, which equals 9292.465 and 9797.2623, respectively. Moreover, the mean values of both series are 81.4977 and 81.1249, respectively.
The results prove that the variability of data obtained by Schreiber’s model is closer to real data compared to the data obtained by other models, where the coefficient of variation given by this method equals 5772.4952. On the other hand, the statistical analysis of data distribution shows that Yang’s model provides data closer to the real dataset compared to data obtained by Schreiber’s model; whereby the first quartile, the median, the mean, and the third quartile results equal 31.2638, 46.9607, 116.9946, and 86.4257, respectively (Table 7). In this analysis, the new model shows the best performance compared to all models used in this study proven by R2, R2Adj, RMSE, and MAE parameters, which equal 0.9924, 0.9923, 8.5073, and 5.2053, respectively. Figure 10 shows a set of performance tests applied on data series obtained by all models used in this study in five climate regions of northern Algeria.
The results show that the new model has the best reliability and more efficiency in estimating IARR in all watersheds. Where R2 and R2Adj show values between 0.9 and 0.99, which was proved to be the best performance in the five climatic regions. Furthermore, the model’s performance increases according to the rainfall condition of the basin, leading it to perform more in humid regions than in the arid. The new model is even able to assess flows produced by little precipitation, which is proven in the figure by an R2Adj greater than 0.9 in the semi-arid region. On the contrary, the other models are regional and they can be reliable in more arid areas than in humid areas. The R2 and the R2Adj show that Yang (n = 2) and Sharif models perform better than non-parametric models, such as Schreiber, Ol’Dekop, and Budyko models in the semi-humid, humid, and very humid areas. Contrariwise, the performance of these models is better than the Yang and Sharif models in the arid region. The histogram of RMSE and MAE shows that the proposed model has the lowest amount of errors, which are closer to the axis (X = 0) during all observations. However, the other models have more computational residuals, which increase more in humid areas. Ol’Dekop’s model has the highest values of RMSE and MAE observed in semi-humid, humid, and very humid climate areas. The results deduced by this analysis show the power of the proposed model to estimate runoff in different regions of the world. The use of the S and Wc input variables and the calibration applied to the model parameters using the De Martonne series is proof of the dynamicity of the new model and its ability to work with the varying climatic and geomorphological conditions of the watershed, even if it is ungauged. This advantage was justified by the steady performance and lack of a significant trend residual estimation during all the processes carried out in the 102 sub-basins.

5. Conclusions and Future Work

This research work aims to propose a new equation derived from Ol’Dekop’s model to estimate the inter-annual rainfall runoff in a large region, which is characterized by climatic diversity. The proposed model was obtained by applying new ANNs to estimate the residual values (IARR’) given when comparing actual and predicted runoff assessed by Ol’Dekop. In this regard, multiple regressions were used as transfer functions and applied to a set of climatic and geomorphological input variables such as IAR, Wc, S, and I. As the first step, the sub-model, namely ANN1, proposed to estimate IARR’ in the entire region. Then an improvement was made by classifying output data in each climatic region using the De Martonne index (I), which was also used to calibrate the coefficients of the final model (IRRF). The performance analysis shows that the use of the IRR sub equation which is obtained by ANN1 as a function of IAR, S, and Wc proved good reliability (R2Adj = 0.9518) with a significant error (RMSE = 208.539 mm) when estimating IARR’ in the entire region. Contrariwise, the IRRF sub equation given by ANN2 shows an improvement in the estimation model performance, where the R2Adj and RMSE equal 0.9789 and 62.5948 mm, respectively. The new equation proposed in this model shows the best performance compared to all parametric and non-parametric water balance models used in this study. This performance is given by an R2Adj equal to 0.9923 obtained for the entire northern region of Algeria, with the absence of significant errors proven by residual analyses. Inversely, the compared models are less performed. They are set by the R2Adj test between 0.8525 and 0.9333, with a tendency of residuals observed in some watersheds that are located in humid and very humid regions. The comparative performance analysis, which was applied in five climatic regions, shows the best and steady performance of the proposed model in all areas, given by an R2Adj more than 0.92, with RMSEs less than 15 mm. However, the other models are regional and perform better in arid regions than in humid. A large trend of residuals is observed by non-parametric models such as Schreiber, Ol’Dekop, and Budyko in the very humid region, where the RMSEs are greater than 100 mm.
Our future work aims to further improve this model designed to estimate rainfall-runoff data in ungauged watersheds using different time scales. We also want to propose a new model using machine learning for estimating rainfall-runoff in sandy basins.

Author Contributions

All authors of this paper have directly participated in the planning, execution, and analysis of this study. O.M. worked on data collection, analysis, and mapping. A.A. worked on the ANN design, statistical modeling, and validation. A.L. worked in statistical modeling, investigation, and validation. K.M. participated in validation and methodology followed in the redaction. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

Our work is supported by the Open Access Publishing Fund of the Free University of Bozen-Bolzano. We thank the Ministry of High Education and Scientific Research of Algeria for their support. We want also to thank the staff of biomathematics, biophysics biochemistry, and Scientometry Laboratory (Algeria), laboratory of Applied Hydraulics and Environment (Algeria), as well as the Faculty of Computer Science, Free University of Bozen-Bolzano (Italy) for their help to understand the hydro-climatic characteristic of northern Algeria.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

IARInter annual rainfall
IATInter annual temperature
SWatershed area
WCWatercourse
IDe Martonne index
IARRInter annual rainfall runoff
IAE0Inter annual potential evapotranspiration
IAEaInter annual actual evapotranspiration
IARRRReal inter annual rainfall runoff
IARREInter annual rainfall runoff estimated by Ol’Dekop model
IARR’Inter annual rainfall-runoff residual data obtained from Ol’Dekop model
(Xi)*Linearized variable
EIRRInter annual rainfall runoff residual data estimated by ANN1
EIRRFInter annual rainfall runoff residual estimated by ANN2
IRRANN transfer function given by regression equation
ANNArtificial neuron network
ANN1Basic artificial neuron network model
ANN2Dynamic artificial neuron network model
SDSemi-dry
MEMediterranean
SHSemi-humid
HHumid
VHVery humid
R2Coefficient of determination
R2AdjAdjusted coefficient of determination
MSEMean squared error
RMSERoot mean square error
MAEMean absolute error
DWDurbin–Watson coefficient

References

  1. Xu, C.-Y.; Singh, V.P. Evaluation of three complementary relationship evapotranspiration models by water balance approach to estimate actual regional evapotranspiration in different climatic regions. J. Hydrol. 2005, 308, 105–121. [Google Scholar] [CrossRef]
  2. Fathi, M.M.; Awadallah, A.G.; Abdelbaki, A.M.; Haggag, M. A new Budyko framework extension using time series SARIMAX model. J. Hydrol. 2019, 570, 827–838. [Google Scholar] [CrossRef]
  3. Zhang, L.; Potter, N.; Hickel, K.; Zhang, Y.; Shao, Q. Water balance modeling over variable time scales based on the Budyko framework – Model development and testing. J. Hydrol. 2008, 360, 117–131. [Google Scholar] [CrossRef]
  4. Al-Rawas, G.; Andréassian, V.; Ao, T.; Archfield, S.A.; Arheimer, B.; Bárdossy, A.; Biggs, T.; Blume, T.; Borga, M.; Bormann, H.; et al. Runoff Prediction in Ungauged Basins; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar] [CrossRef]
  5. Hrachowitz, M.; Savenije, H.H.G.; Bloschl, G.; Mcdonnell, J.J.; Sivapalan, M.; Pomeroy, J.W.; Arheimer, B.; Blume, T.; Clark, M.P.; Ehret, U.; et al. A decade of Predictions in Ungauged Basins (PUB)—A review. Hydrol. Sci. J. 2013, 58, 1198–1255. [Google Scholar] [CrossRef]
  6. Budyko, M.I.; Miller, D.H. Climate and Life; Academic Press: New York, NY, USA, 1974; Volume 508. [Google Scholar]
  7. Schreiber, P. Über die Beziehungen zwischen dem Niederschlag und der Wasserführung der Flüsse in Mitteleuropa. Meteorol. Z. 1904, 21, 441–452. [Google Scholar]
  8. Ol’dekop, E. On Evaporation from the Surface of River Basins: Transactions on Meteorological Observations; University of Tartu: Tartu, Estonia, 1911. (In Russian) [Google Scholar]
  9. Budyko, M. The Heat Balance of the Earth’s Surface; U.S. Department of Commerce: Washington, DC, USA, 1958.
  10. Pike, J. The estimation of annual run-off from meteorological data in a tropical climate. J. Hydrol. 1964, 2, 116–123. [Google Scholar] [CrossRef]
  11. Fu, B. On the calculation of the evaporation from land surface. Sci. Atmos. Sin. 1981, 5, 23–31. [Google Scholar]
  12. Milly, P.C.D. Climate, soil water storage, and the average annual water balance. Water Resour. Res. 1994, 30, 2143–2156. [Google Scholar] [CrossRef]
  13. Atkinson, S.E.; Woods, R.; Sivapalan, M. Climate and landscape controls on water balance model complexity over changing timescales. Water Resour. Res. 2002, 38, 50-1–50-17. [Google Scholar] [CrossRef]
  14. Seo, Y.; Kim, S.; Singh, V. Machine Learning Models Coupled with Variational Mode Decomposition: A New Approach for Modeling Daily Rainfall-Runoff. Atmosphere 2018, 9, 251. [Google Scholar] [CrossRef] [Green Version]
  15. Kadri, I.; Mansouri, R.; Aieb, A. Comparison between NARX-NN and HEC-HMS models to simulate Wadi Seghir catchment runoff events in Algerian northern. Int. J. River Basin Manag. 2021, 1–28. [Google Scholar] [CrossRef]
  16. Sezen, C.; Partal, T. New hybrid GR6J-wavelet-based genetic algorithm-artificial neural network (GR6J-WGANN) conceptual-data-driven model approaches for daily rainfall–runoff modelling. Neural Comput. Appl. 2022, 1–25. [Google Scholar] [CrossRef]
  17. Viola, F.; Caracciolo, D.; Forestieri, A.; Pumo, D.; Noto, L. Annual runoff assessment in arid and semiarid Mediterranean watersheds under the Budyko’s framework. Hydrol. Process. 2017, 31, 1876–1888. [Google Scholar] [CrossRef]
  18. Khedimallah, A.; Meddi, M.; Mahé, G. Characterization of the interannual variability of precipitation and runoff in the Cheliff and Medjerda basins (Algeria). J. Earth Syst. Sci. 2020, 129, 134. [Google Scholar] [CrossRef]
  19. Xu, X.; Liu, W.; Scanlon, B.R.; Zhang, L.; Pan, M. Local and global factors controlling water-energy balances within the Budyko framework. Geophys. Res. Lett. 2013, 40, 6123–6129. [Google Scholar] [CrossRef]
  20. Wang, C.; Wang, S.; Fu, B.; Zhang, L. Advances in hydrological modelling with the Budyko framework. Prog. Phys. Geogr. Earth Environ. 2016, 40, 409–430. [Google Scholar] [CrossRef]
  21. Du, C.; Sun, F.; Yu, J.; Liu, X.; Chen, Y. New interpretation of the role of water balance in an extended Budyko hypothesis in arid regions. Hydrol. Earth Syst. Sci. 2016, 20, 393–409. [Google Scholar] [CrossRef] [Green Version]
  22. Wu, C.; Yeh, J.F.; Xu, K.; Hu, X.B.; Huang, G.; Wang, P. Modeling water balance using the Budyko framework over variable timescales under diverse climates. Hydrol. Earth Syst. Sci. Discuss. 2017, 1–33. [Google Scholar] [CrossRef] [Green Version]
  23. Xiong, L.; Guo, S. Appraisal of Budyko formula in calculating long-term water balance in humid watersheds of southern China. Hydrol. Process. 2011, 26, 1370–1378. [Google Scholar] [CrossRef]
  24. Guezgouz, N.; Boutoutaou, D.; Zeggane, H.; Chefrour, A. Multivariate statistical analysis of the groundwater flow in shallow aquifers: A case of the basins of northern Algeria. Arab. J. Geosci. 2017, 10, 336. [Google Scholar] [CrossRef]
  25. Touazi, M.; Laborde, J.P. Modélisation pluie-débit à l’échelle annuelle en Algérie du nord. Rev. Sci. L’eau 2005, 17, 503–516. [Google Scholar] [CrossRef] [Green Version]
  26. Laborde, P.J.; Gourbesville, P.; Assaba, M.; Abdelmatif, D. Climate evolution and possible effects on surface water resources of North Algeria. Curr. Sci. 2010, 98, 1056–1062. [Google Scholar]
  27. Meddi, M.; Toumi, S. Study of the interannual rainfall variability in northern Algeria. Rev. Ljee. 2014, 23. [Google Scholar]
  28. Zeroual, A.; Assani, A.A.; Meddi, M. Combined analysis of temperature and rainfall variability as they relate to climate indices in Northern Algeria over the 1972–2013 period. Water Policy 2016, 48, 584–595. [Google Scholar] [CrossRef]
  29. Hrnjak, I.; Lukić, T.; Gavrilov, M.B.; Marković, S.B.; Unkašević, M.; Tosic, I. Aridity in Vojvodina, Serbia. Arch. Meteorol. Geophys. Bioclimatol. Ser. B 2013, 115, 323–332. [Google Scholar] [CrossRef]
  30. Tokar, A.S.; Markus, M. Precipitation-Runoff Modeling Using Artificial Neural Networks and Conceptual Models. J. Hydrol. Eng. 2000, 5, 156–161. [Google Scholar] [CrossRef]
  31. Antar, M.A.; Elassiouti, I.; Allam, M.N. Rainfall-runoff modelling using artificial neural networks technique: A Blue Nile catchment case study. Hydrol. Process. 2005, 20, 1201–1216. [Google Scholar] [CrossRef]
  32. Dahmoune, F.; Remini, H.; Dairi, S.; Aoun, O.; Moussi, K.; Bouaoudia-Madi, N.; Adjeroud-Abdellatif, N.; Kadri, N.; Lefsih, K.; Boughani, L.; et al. Ultrasound assisted extraction of phenolic compounds from P. lentiscus L. leaves: Comparative study of artificial neural network (ANN) versus degree of experiment for prediction ability of phenolic compounds recovery. Ind. Crop. Prod. 2015, 77, 251–261. [Google Scholar] [CrossRef]
  33. Dawson, C.; Wilby, R.L. Hydrological modelling using artificial neural networks. Prog. Phys. Geogr. Earth Environ. 2001, 25, 80–108. [Google Scholar] [CrossRef]
  34. Hung, N.Q.; Babel, M.S.; Weesakul, S.; Tripathi, N.K. An artificial neural network model for rainfall forecasting in Bangkok, Thailand. Hydrol. Earth Syst. Sci. 2009, 5, 183–218. [Google Scholar] [CrossRef] [Green Version]
  35. Uyanık, G.K.; Güler, N. A Study on Multiple Linear Regression Analysis. Procedia Soc. Behav. Sci. 2013, 106, 234–240. [Google Scholar] [CrossRef] [Green Version]
  36. Huang, Y.; Yu, M.; Tian, H.; Liu, Y. Decomposition and Attribution Analysis of Runoff Alteration of the Dongting Lake in China. Water 2020, 12, 2729. [Google Scholar] [CrossRef]
  37. Yang, H.; Yang, D.; Lei, Z.; Sun, F. New analytical derivation of the mean annual water-energy balance equation. Water Resour. Res. 2008, 44. [Google Scholar] [CrossRef]
  38. Shan, X.; Li, X.; Yang, H. Towards understanding the mean annual water-energy balance equation based on an Ohms-type approach. Hydrol. Earth Syst. Sci. Discuss. 2019, 1–17. [Google Scholar] [CrossRef]
  39. LeGates, D.R.; McCabe, G.J., Jr. Evaluating the use of “goodness-of-fit” Measures in hydrologic and hydroclimatic model validation. Water Resour. Res. 1999, 35, 233–241. [Google Scholar] [CrossRef]
  40. Srivastava, A.K.; Srivastava, V.K.; Ullah, A. The coefficient of determination and its adjusted version in linear regression models. Econ. Rev. 1995, 14, 229–240. [Google Scholar] [CrossRef]
  41. Wang, Z.; Bovik, A. Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures. IEEE Signal Process. Mag. 2009, 26, 98–117. [Google Scholar] [CrossRef]
  42. Green, I.R.A.; Stephenson, D. Criteria for comparison of single event models. Hydrol. Sci. J. 1986, 31, 395–411. [Google Scholar] [CrossRef] [Green Version]
  43. Cheng, K.S.; Lien, Y.T.; Wu, Y.C.; Su, Y.S. On the criteria of model performance evaluation for real-time flood forecasting. Stoch. Environ. Res. Risk Assess. 2017, 31, 1123–1146. [Google Scholar] [CrossRef] [Green Version]
  44. Dufour, J.-M.; Dagenais, M.G. Durbin-Watson tests for serial correlation in regressions with missing observations. J. Econ. 1985, 27, 371–381. [Google Scholar] [CrossRef] [Green Version]
  45. Baltas, E. Spatial distribution of climatic indices in northern Greece. Meteorol. Appl. 2007, 14, 69–78. [Google Scholar] [CrossRef]
Figure 1. Maps of the northern Algeria region (A) and elevation variation (B,C) of 16 watersheds, showing the hydrometric stations’ locations.
Figure 1. Maps of the northern Algeria region (A) and elevation variation (B,C) of 16 watersheds, showing the hydrometric stations’ locations.
Sensors 22 04349 g001
Figure 2. P–P plot, Q–Q plot, and scattergram graph of the dataset series, used in this modeling. Inter-annual rainfall-runoff residual (IARR’), inter-annual rainfall (IAR), inter-annual temperature (IAT), watershed area (S), and watercourse (WC).
Figure 2. P–P plot, Q–Q plot, and scattergram graph of the dataset series, used in this modeling. Inter-annual rainfall-runoff residual (IARR’), inter-annual rainfall (IAR), inter-annual temperature (IAT), watershed area (S), and watercourse (WC).
Sensors 22 04349 g002
Figure 3. Map of spatial inter-annual rainfall distribution (A), followed by bioclimatic classification (B) of northern Algeria.
Figure 3. Map of spatial inter-annual rainfall distribution (A), followed by bioclimatic classification (B) of northern Algeria.
Sensors 22 04349 g003
Figure 4. Proposed ANNs model architecture, used to estimate residuals computational of the Ol’Dekop model. The local and basic model (ANN1), general and dynamic model (ANN2). Inter-annual rainfall (IAR), watershed area (S), watercourse (WC), De Martonne index (I), Semi-Dry (SD), Mediterranean (ME), Semi-Humid (SH), Humid (H), and Very Humid (VH).
Figure 4. Proposed ANNs model architecture, used to estimate residuals computational of the Ol’Dekop model. The local and basic model (ANN1), general and dynamic model (ANN2). Inter-annual rainfall (IAR), watershed area (S), watercourse (WC), De Martonne index (I), Semi-Dry (SD), Mediterranean (ME), Semi-Humid (SH), Humid (H), and Very Humid (VH).
Sensors 22 04349 g004
Figure 5. Nonlinear regression curve between inter-annual rainfall-runoff residuals data obtained by Ol’Dekop’s model (IARR’) and ANNS input variables which are (A) Inter-annual rainfall (IAR); (B) De martone index (I); (C,E) watershed area (S); (D,F) watercourse (WC).
Figure 5. Nonlinear regression curve between inter-annual rainfall-runoff residuals data obtained by Ol’Dekop’s model (IARR’) and ANNS input variables which are (A) Inter-annual rainfall (IAR); (B) De martone index (I); (C,E) watershed area (S); (D,F) watercourse (WC).
Sensors 22 04349 g005
Figure 6. Multiple linear regression curves between real and estimated Ol’Dekop’s rainfall-runoff residual data (IARR’), obtained by ANN1 transfer functions by using a set of input variables which are (A) (IAR*, IARR*/S*); (B) (IAR*, IARR*/WC*); (C) (IAR*, S*); (D) (IAR*, WC*); (E) (IAR*, S*, WC*). * input variable linearized by the Ln function, as follow: Xi* = Ln Xi. Inter-annual rainfall (IAR), watershed area (S), watercourse (WC).
Figure 6. Multiple linear regression curves between real and estimated Ol’Dekop’s rainfall-runoff residual data (IARR’), obtained by ANN1 transfer functions by using a set of input variables which are (A) (IAR*, IARR*/S*); (B) (IAR*, IARR*/WC*); (C) (IAR*, S*); (D) (IAR*, WC*); (E) (IAR*, S*, WC*). * input variable linearized by the Ln function, as follow: Xi* = Ln Xi. Inter-annual rainfall (IAR), watershed area (S), watercourse (WC).
Sensors 22 04349 g006
Figure 7. Nonlinear regression graph (A), followed by standardized residual analysis histogram (B) between inter-annual rainfall (IAR) and Cte data. Real Inter-annual rainfall-runoff residuals (IARR), estimated inter-annual rainfall-runoff residuals by ANN1 (EIRR).
Figure 7. Nonlinear regression graph (A), followed by standardized residual analysis histogram (B) between inter-annual rainfall (IAR) and Cte data. Real Inter-annual rainfall-runoff residuals (IARR), estimated inter-annual rainfall-runoff residuals by ANN1 (EIRR).
Sensors 22 04349 g007
Figure 8. Linear regression graphs between real and estimated inter-annual rainfall-runoff obtained by applying new method (A), Schreiber (B), Ol’Dekop (C), Budyko (D), Yang (E), and Sharif (F) models. Coefficient of determination (R2).
Figure 8. Linear regression graphs between real and estimated inter-annual rainfall-runoff obtained by applying new method (A), Schreiber (B), Ol’Dekop (C), Budyko (D), Yang (E), and Sharif (F) models. Coefficient of determination (R2).
Sensors 22 04349 g008
Figure 9. Residual analysis curves between real and estimated interannual rainfall-runoff (IARR), which is obtained by applying a new method (A), Schreiber (B), Ol’Dekop (C), Budyko (D), Yang (E), and Sharif (F) models.
Figure 9. Residual analysis curves between real and estimated interannual rainfall-runoff (IARR), which is obtained by applying a new method (A), Schreiber (B), Ol’Dekop (C), Budyko (D), Yang (E), and Sharif (F) models.
Sensors 22 04349 g009
Figure 10. Histograms explaining performance analysis of the new model against a set of parametric (Yang, and Sharif) and non-parametric (Schreiber, Ol’Dekop, and Budyko) water balance models on five climatic regions of Northern Algeria. These were shown by (A) determination coefficient (R2); (B) adjusted coefficient of determination (R2Adj); (C) root mean square error (RMSE); (D) mean absolute error (MAE).
Figure 10. Histograms explaining performance analysis of the new model against a set of parametric (Yang, and Sharif) and non-parametric (Schreiber, Ol’Dekop, and Budyko) water balance models on five climatic regions of Northern Algeria. These were shown by (A) determination coefficient (R2); (B) adjusted coefficient of determination (R2Adj); (C) root mean square error (RMSE); (D) mean absolute error (MAE).
Sensors 22 04349 g010
Table 1. Statistical criteria used to evaluate the performance of sub-models proposed in all climatic regions.
Table 1. Statistical criteria used to evaluate the performance of sub-models proposed in all climatic regions.
CriteriaStatistical FormulaReference
Coefficient of determination (R2) R 2 = i = 1 N ( Qoi Qo ¯ ) × ( Qsi Qs ¯ ) [ i = 1 N ( Qoi Qo ¯ ) 2 ] 0.5 [ i = 1 N ( Qsi Qs ¯ ) 2 ] 0.5 [39]
Adjusted coefficient of determination (R2Adj) R Adj 2 = 1 ( 1 R 2 ) × ( N 1 ) N K 1 [40]
Mean squared error (MSE) MSE = 1 N i = 1 N ( Qsi Qoi ) 2 [41]
Root mean square error (RMSE) RMSE = i = 1 N ( Q si Q oi ) 2 N [42]
Mean absolute error (MAE) MAE = i = 1 N | Qoi Qsi | N [43]
Durbin–Watson coefficient (DW) DW = i = 2 N ( e i e i 1 ) 2 i = 1 N e i 2 [44]
Table 2. Statistical description and data distribution analysis of a hydro-morphological dataset, obtained between 1967 and 2020 in northern Algeria.
Table 2. Statistical description and data distribution analysis of a hydro-morphological dataset, obtained between 1967 and 2020 in northern Algeria.
StatisticIARR’IARIATSWC
No. of data102102102102102
Minimum4.5687222.000011.583316.00007.1000
Maximum284.32211107.000021.75834050.0000179.8000
1st Quartile13.6200332.750014.2313194.500029.0000
Median25.9667415.500015.2792475.000045.6000
3rd Quartile64.3929607.000016.65421028.250065.5250
Mean49.1013494.652915.4569719.710651.4108
Variation coefficient1.10390.40440.11801.03790.6313
Standard deviation (n)54.2053200.06201.8236746.992732.4534
p-value (Exponential)0.1353<0.0001<0.00010.80610.0001
p-value (Gamma 2)0.02010.08850.60430.84750.9799
p-value (GEV)<0.0001<0.00010.8764<0.00010.7764
p-value (Log-normal)0.15800.08050.61050.76880.6973
p-value (Logistic)0.00030.06150.46040.00280.3787
p-value (Normal)0.00030.00610.43580.00120.0580
p-value (Weibull 2)0.07760.02830.19290.79940.6725
p-value (Weibull 3)0.97890.85720.61360.83630.8263
Inter-annual rainfall-runoff residuals (IARR’), Inter-annual rainfall (IAR), inter-annual temperature (IAT), Watershed area (S), Watercourse (WC).
Table 3. Matrix of correlation between inter-annual rainfall-runoff residual data of Ol’Dkop model and hydro-morphological variables selected for the ANNS modeling.
Table 3. Matrix of correlation between inter-annual rainfall-runoff residual data of Ol’Dkop model and hydro-morphological variables selected for the ANNS modeling.
VariableIARR’*S*WC*IAT*IAR*(IARR’*/S*)(IARR’*/WC*)I*
IARR’*10.51410.50230.06060.85130.73220.81240.8052
S*0.514110.89290.00920.14280.79230.57580.1326
WC*0.49230.892910.00590.12660.68780.61680.1180
IAT*0.06060.00920.005910.08040.04450.05520.0126
IAR*0.85130.14280.12660.080410.55390.68760.9505
(IARR’/S)*0.73220.79230.68780.04450.553910.92670.5320
(IARR’/WC)*0.81240.57580.61680.05520.68760.926710.6599
I*0.80520.13260.11800.01260.95050.53200.65991
Inter-annual rainfall-runoff residual data of Ol’Dekop (IARR’), inter-annual rainfall (IAR), inter-annual temperature (IAT), watershed area (S), watercourse (WC), De martone index (I), * input variable linearized by the Ln function, as follow: Xi* = Ln Xi.
Table 4. Statistical parameters and performance tests of multiple regression models, obtained by ANN1.
Table 4. Statistical parameters and performance tests of multiple regression models, obtained by ANN1.
StatisticA 1B 1C 1D 1E 2
YIARR’*IARR’*IARR’*IARR’*IARR’
X1IAR*IAR*IAR*IAR*IAR
X2(IARR’/S)*(IARR’/WC)*S*WC*S
X3----WC
Coef (a1)1.71661.45381.71661.45381.5852
Coef (a2)0.20570.3296−0.2972−0.5491−0.1486
Coef (a3)----−0.2746
Intercept (B)−6.5738−5.3946−5.3201−3.44940.01
No. of observation102102102102102
R20.93010.94050.90360.91050.9518
Adjusted R20.92860.93930.90060.90960.9508
MSE0.07090.06030.09470.0802208.539
RMSE0.26630.24550.30770.283314.4409
DW1.75101.85571.49471.48220.7094
* input variable linearized by the Ln function, as follow: Xi* = Ln Xi, 1 linear model ((a1X1 + a2X2 + B)), 2 final equation represented by power model (B × X1a1 × X2a2), no data (-), panels represented in Figure 1A–E, output response (Y), input variable (X), determination coefficient (R2), adjusted coefficient of determination (R2Adj), mean squared error (MSE), root mean square error (RMSE), Durbin–Watson coefficient (DW).
Table 5. Statistical parameters and performance tests of multiple regression models, obtained by ANN2 for each bioclimatic floor in northern Algeria.
Table 5. Statistical parameters and performance tests of multiple regression models, obtained by ANN2 for each bioclimatic floor in northern Algeria.
StatisticSemi-DryMediterraneanSemi-HumidHumidVery Humid
Y 1(IARR’SD) *(IARR’ME) *(IARR’SH) *(IARR’H) *(IARR’VH) *
X1(EIRRSD) *(EIRRME) *(EIRRSH) *(EIRRH) *(EIRRVH) *
X2(ISD) *(IME) *(ISH) *(IH) *(IVH) *
Coef (a1)0.500330.424960.29175−0.161790.28185
Coef (a2)1.163311.176411.006561.340921.50295
Intercept (B)−1.74497−1.41471−0.24040.45666−1.60261
No. of observations4516151115
R20.65010.78040.68200.65330.9072
Adjusted R20.64200.76470.66750.64480.9001
MSE0.07830.02960.01540.01380.0173
RMSE0.27980.17210.12390.11750.1314
DW1.68701.78611.68030.86250.7707
1 Lineaire model (a1X1 + a2X2 + B), * input variable linearized by the Ln function, as follow: Xi* = Ln Xi, Semi-Dry (SD), Mediterranean (ME), Semi-Humid (SH), Humid (H), Very Humid (VH), output response (Y), input variable (X), determination coefficient (R2), adjusted coefficient of determination (R2Adj), mean squared error (MSE), root mean square error (RMSE), Durbin–Watson coefficient (DW).
Table 6. Performance tests of final and regional inter-annual rainfall-runoff residual models applied to northern Algeria climate regions.
Table 6. Performance tests of final and regional inter-annual rainfall-runoff residual models applied to northern Algeria climate regions.
StatisticIRR3IRR4IRR5IRR6IRR7IRRF
No. of observations102102102102102102
R20.94940.95380.94430.83580.96160.9841
Adjusted R20.94900.95340.94380.83420.96130.9789
MSE151.4110138.3785166.6840492.0888114.905162.5948
RMSE12.304911.763412.910622.183110.71945.9117
DW1.73241.61421.56751.84771.44340.5250
Semi-Dry (3), Mediterranean (4), Semi-Humid (5), Humid (6), Very Humid (7), final inter-annual rainfall-runoff residuals model (IRRF), regional interannual rainfall-runoff residuals model (IRR), determination coefficient (R2), adjusted coefficient of determination (R2Adj), mean squared error (MSE), root mean square error (RMSE), and Durbin–Watson coefficient (DW).
Table 7. Statistical tests and performance analysis of estimated inter-annual rainfall-runoff final results (IARR), obtained by applying a new method, Schreiber, Ol’Dekop, Budyko, Yang, and Sharif models.
Table 7. Statistical tests and performance analysis of estimated inter-annual rainfall-runoff final results (IARR), obtained by applying a new method, Schreiber, Ol’Dekop, Budyko, Yang, and Sharif models.
StatisticReal DataNew MethodSchreiberOl’DekopBudykoYang (n = 2)Sharif
No. of obs102102102102102102102
Minimum77.03710.55532.0391.298417.10063.0295
Maximum497514.9642377.8251237.4534310.7265351.4336296.4772
1st Quartile20.12519.3945.98086.77146.330531.26389.9495
Median3935.96318.680413.995316.404346.960720.2944
3rd Quartile104.2597.877772.774741.415457.3576116.994658.088
Mean81.497781.124952.880732.353942.871586.425744.2095
Variance (n)9292.4659798.26235472.49521805.64233450.31244759.82563013.5653
Std. Dev (n)96.397498.986273.976342.492958.739468.991554.8959
Var. Coef1.18281.22021.31341.37011.39891.24170.7983
R210.99240.92110.85250.90410.93340.8739
R2Adj10.99230.92030.85240.90740.93330.8729
RMSE08.507312.86530.608420.73912.271712.5336
MAE05.20537.871618.728112.6897.50897.6688
Mean A. Dev67.875268.506129.283853.326238.832541.596753.2656
Std. E. Mean9.59199.84954.22826.86495.46245.84487.3609
Standard deviation (Std. Dev), variation coefficient (Var. Coef), determination coefficient (R2), adjusted coefficient of determination (R2Adj), root mean square error (RMSE), mean absolute error (MAE), mean absolute deviation (Mean. A. Dev), standard error of the mean (Std. E. Mean).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mimeche, O.; Aieb, A.; Liotta, A.; Madani, K. A Novel Interannual Rainfall Runoff Equation Derived from Ol’Dekop’s Model Using Artificial Neural Networks. Sensors 2022, 22, 4349. https://doi.org/10.3390/s22124349

AMA Style

Mimeche O, Aieb A, Liotta A, Madani K. A Novel Interannual Rainfall Runoff Equation Derived from Ol’Dekop’s Model Using Artificial Neural Networks. Sensors. 2022; 22(12):4349. https://doi.org/10.3390/s22124349

Chicago/Turabian Style

Mimeche, Omar, Amir Aieb, Antonio Liotta, and Khodir Madani. 2022. "A Novel Interannual Rainfall Runoff Equation Derived from Ol’Dekop’s Model Using Artificial Neural Networks" Sensors 22, no. 12: 4349. https://doi.org/10.3390/s22124349

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop