Next Article in Journal
Wheat Yield Gap Assessment in Using the Comparative Performance Analysis (CPA)
Next Article in Special Issue
An Overview of Machine Learning Applications on Plant Phenotyping, with a Focus on Sunflower
Previous Article in Journal
Genome-Wide Identification and Expression Analysis of RCC1 Gene Family under Abiotic Stresses in Rice (Oryza sativa L.)
Previous Article in Special Issue
Delineation of Soil Management Zone Maps at the Regional Scale Using Machine Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Weather-Based Statistical and Neural Network Tools for Forecasting Rice Yields in Major Growing Districts of Karnataka

by
Mathadadoddi Nanjundegowda Thimmegowda
1,*,
Melekote Hanumanthaiah Manjunatha
1,
Lingaraj Huggi
1,
Huchahanumegowdanapalya Sanjeevaiah Shivaramu
2,
Dadireddihalli Venkatappa Soumya
1,
Lingegowda Nagesha
1 and
Hejjaji Sreekanthamurthy Padmashri
3
1
AICRP on Agrometeorology, University of Agricultural Sciences, GKVK, Bengaluru 560065, India
2
College of Horticulture, University of Agricultural Sciences, Kolar 563103, India
3
Directorate of Research, university of Agricultural Sciences, Gkvk, Bengaluru 560065, India
*
Author to whom correspondence should be addressed.
Agronomy 2023, 13(3), 704; https://doi.org/10.3390/agronomy13030704
Submission received: 26 August 2022 / Revised: 11 October 2022 / Accepted: 12 October 2022 / Published: 27 February 2023
(This article belongs to the Collection Machine Learning in Digital Agriculture)

Abstract

:
Two multivariate models were compared to assess their yield predictability based on long-term (1980–2021) rice yield and weather datasets over eleven districts of Karnataka. Simple multiple linear regression (SMLR) and artificial neural network models (ANN) were calibrated (1980–2019 data) and validated (2019–2020 data), and yields were forecasted (2021). An intercomparison of the models revealed better yield predictability with ANN, as the observed deviations were smaller (−37.1 to 21.3%, 4% mean deviation) compared to SMLR (−2.5 to 35.0%, 16% mean deviation). Further, district-wise yield forecasting using ANN indicated an underestimation of yield, with higher errors in Mysuru (−0.2%), Uttara Kannada (−1.5%), Hassan (−0.1%), Ballari (−1.5%), and Belagavi (−15.3%) and overestimations in the remaining districts (0.0 to 4.2%) in 2018. Likewise, in 2019 the yields were underestimated in Kodagu (−0.6%), Shivamogga (−0.1%), Davanagere (−0.7%), Hassan (−0.2%), Ballari (−5.1%), and Belagavi (−10.8%) and overestimated for the other five districts (0.0 to 4.8%). Such model yield underestimations are related to the farmers’ yield improvement practices carried out under adverse weather conditions, which were not considered by the model while forecasting. As the deviations are in an acceptable range, they prove the better applicability of ANN for yield forecasting and crop management planning in addition to its use for regional agricultural policy making.

1. Introduction

Rice (Oryza sativa L.) is the most important staple food crop of India, next to wheat; is used for food and animal fodder; and is cultivated in a 45.76 million ha area, with a production 124.3 million tons in the country. Karnataka is the major rice-growing state in India, contributing 3 percent of the country’s rice area (1.397 million ha) and 3.45 percent (4.29 million tons) of production [1]. The crop is cultivated in a wide range of soils and rainfall and temperature situations. As a unique example, it is cultivated in areas where rainfall ranges from 600 to 3000 mm per annum [2]. The unique feature of rice culture in Karnataka is that either sowing or transplanting is seen in all seasons of the year, and the durations of cultivated rice varieties vary from 100 to 180 days, depending on the season and agroclimatic conditions. Despite its ability to adapt to a wide range of climatic conditions, the crop suffers from severe yield variability due to changes in weather factors [3]. Weather factors affect the crop both in direct and indirect ways: directly, as a source of water for crop growth and as an energy source for physiological aspects though light and temperature, and indirectly through the mineralization of nutrients, their movement to the plant root zone, etc. [4,5]. Further, these direct and indirect impacts of weather factors are the results of individual weather factors or the interactive effect of two or more weather factors on the crop yield. Previously, a lot of work has been carried out to estimate the impacts of individual weather factors on crop yield [6,7,8], but few recent studies have shown the interactive effects of weather factors on crop performance [9,10]. Studies in this direction will help in understanding crop responses in terms of final yield and provide a forecast of the crop prior to harvest [11].
The estimation of the interactive effects of weather factors on crop yield are aided by advancements in crop yield forecasting techniques, which predict a crop yield based on the weather variables that prevailed during the crop growth period. Such forecasting techniques, commonly called ‘crop models’ are handy in crop planning, as they are developed based on multidisciplinary sources of information such as edaphic (land use, soil physical properties, soil pH, soil fertility, soil moisture, etc.) [12], meteorological (temperature, rainfall, relative humidity, etc.) [13], management (row spacing, seed quantity, the fertilizers and pesticides used, etc.), and crop factors (genotype x environment interaction) [14]. Such models are classified into three groups based on the input requirements [15]: empirical or statistical models, simulation models, and weather analysis models. All of these models rely upon two important aspects: the usage of the traditional approach of mathematical models and the application of artificial intelligence [16].
Previously, in many studies statistical methods such as multiple linear regressions (MLRs) have been employed to develop statistical crop yield prediction models [17,18,19]. These models should be used cautiously, as there is a chance of model overfitting because of the overdependence of the dependent factor (yield) on independent factors (weather variables), as the independent factors are known for multicollinearity [20]. To overcome the problem of model overfitting, methods such as stepwise multiple linear regression (SMLR), artificial neural network (ANN), least absolute shrinkage and selection operator (LASSO), or elastic net (ENET) have been adopted in many previous studies to increase the precision in yield forecasts [21]. The SMLR models only consider major factors responsible for yield formation. The technological advancements have also brought solutions for complex agricultural problems that linear systems are unable to resolve. One such advancement is the use of neural networks; they take into account the multidirectional interactions between independent variables to precisely simulate the dependent variable [22]. Previously, many attempts were made in this direction using statistical and simulation models [23,24,25]. Here, an effort was made to establish and compare statistical (SMLR) and neural network (ANN) models for yield estimation to arrive at a better model with the intention of aiding regional policy making.

2. Materials and Methods

2.1. Study Area

The major rice districts of Karnataka were selected for the study. The majority of the districts were judged based on the area and production of the crop; these eleven districts contribute to roughly 50 percent of the state’s rice area and production. In total, eleven districts (Table 1, Figure 1a,b) were selected, considering their contributions to production over many years. As the crop was introduced to new districts in recent years, the fear of dataset unavailability prevented us from choosing those districts. Among the eleven districts, the highest area, production, and productivity of rice (178.5 thousand ha, 723.0 thousand t, and 4051.2 kg/ha, respectively) were observed in Ballari district, and lower levels were observed in Dakshina Kannada (DK) district (8.1 thousand ha area and 23.4 thousand t of production). Lower productivity of the crop was noticed in Uttara Kannada (UK) district (2149.3 kg/ha).

2.2. Dataset

Long-term (42 years; 1980 to 2021, Table S1) datasets pertaining to the area, production, and productivity of rice in the state were collected from the Directorate of Economics and Statistics, Government of Karnataka. Daily weather data (maximum and minimum temperature, morning and evening relative humidity, and rainfall) pertaining to the study years were collected from the India Meteorological Department, Pune, for the districts under study.

2.3. Methodologies Used for Yield Forecast

Two approaches of crop yield forecast are in vogue recently [26]. One is the data-intensive, cumbersome simulation model. These have limited applicability since they cannot be applied to large spatiotemporal scales due to the unavailability of sufficient input data. Therefore, the other method, statistical models using crop yield and weather data by means of simple regression, can be broadly used as an alternative to process weather-based statistical models [27]. For successful weather-based forecasting, statistical models should first be calibrated and tested using historical datasets. A district-wise yield model for rice in Bihar was developed using meteorological data, and it showed that models were able to predict preharvest crop yield with good accuracy. Most of the statistical models use multiple linear regression (MLR) equations to develop statistical crop yield prediction models [28]. To overcome multi-colinearity between independent variables, feature selection (stepwise multiple linear regression (SMLR), least absolute shrinkage and selection operator (LASSO), or the elastic net (ENET) method) or feature extraction (principal component analysis) statistical techniques can be used. In a few studies, PCA has been used in conjunction with MLR. However, studies on the comparison of the performance of models with and without feature selection, feature extraction, and a combination of both the methods for forecasting crop yield are meagre. In this context, our study has found an opportunity to develop and select a statistical forecasting model using SMLR and ANN for major rice-growing districts of Karnataka, with the objectives to (i) develop district-wise crop yield prediction models using multivariate models and (ii) evaluate the predictive performance of the developed models. The methodology followed for both SMLR and ANN are summarized in the next section.

2.3.1. Generation of Weather Indices

Weather indices were generated based on composite weather variable methods. Two types of weather variables were generated, i.e., unweighted and weighted weather variables. Unweighted weather indices are calculated using the sum of weekly weather variables experienced during a crop period, while the weighted indices are calculated using the sum product of this correlation coefficient and the value of the weekly weather variable. Correlation coefficients between the yield and the weather variables experienced during the respective week were calculated. Similar weather index-based yield forecasting model approaches were used for rice, wheat, sugarcane, and potato in Uttar Pradesh, India [29]. The procedure for the computation of unweighted and weighted weather indices is summarized below. In total, 42 weather variables were generated to determine their effects on the yield of rice
Unweighted weather indices:
Z i j = w = 1 m X i w Z i i j = w = 1 m X i w X i w
Weighted weather indices:
Z i j = w = 1 m r i w j   X i w Z i i j = w = 1 m r i i m j   X i w   X i w
where:
  • Xiw/Xii—the value of the ith/ihth weather variable understudy in the wth week;
  • rjiw/rjii/e—the correlation coefficient of the detrended yield with the ith weather variable/product of the ith and ia th weather variables in the wth week;
  • m—the week of the forecast.

2.3.2. Simple Multiple Linear Regression (SMLR)

ICAR, the Indian Agricultural Statistical Research Institute (IASRI), has developed models that express the effect of weather variables on the yields of the respective correlation coefficients between the yield and weather variables. Here, the yield is considered a dependent variable, and the weekly weather variables are considered independent variables. The weekly weather variables were generated using daily data by averaging the daily maximum temperature (daily TMAX) and minimum temperature (daily TMIN) and the morning relative humidity (daily RHI) and evening relative humidity (daily RHII) and summing up the rainfall (daily RF) and were used for further analysis (Figure 2) and for the generation of the weather indices indicated in Table 2.
Multiple linear regression (MLR) is the standard and simplest approach for the development of calibration models, but its application for datasets with independent variables with large sample numbers is not always successful [30]. However, feature selection in the form of stepwise MLR (SMLR) gives good results over large datasets. A stepwise regression procedure was adopted for the selection of the best regression variable among many independent variables [31].
The SMLR-based statistical inference relies upon the assumption that the sample mean is approximately normally distributed while testing the population mean. This necessitates checking for normality in the sample dataset [32]. Nearly 40 different normality tests have been developed and have proven their vitality in many statistical analyses by their different applicability values because the power would change owing to the sample size and the nature of the data [33]. Hence, one should be cautious when choosing an appropriate test of normality. In this study, the Shapiro–Wilk test [34] was used to test the normality of district-wise yield data.

2.3.3. Artificial Neural Networks (ANN)

Attaining the maximum crop yield at the minimum cost is one of the aims of agricultural production. Hence, the early detection and management of problems associated with crop yield indicators can help to increase yields. Recently, the application of artificial intelligence (AI), such as artificial neural networks (ANNs) (Figure 3), fuzzy systems, and genetic algorithms, has been shown to be more efficient in solving these problems. Using these processes can make models easier to use and more accurate when working with complex natural systems with many inputs. In the present study, we used three layers, namely input, hidden, and output feed-forward artificial neural network. Each layer had neurons or nodes interconnected with each other. The number of nodes in the input and output layers was fixed by the dataset used. There was a need to take care when choosing the optimal number of hidden layers while implementing the ANN for yield forecasting by using the ‘train’ function of the ‘caret’ package using the method ‘nnet’ with 10-fold cross-validation in R software [35]. Here, the analysis was carried out by selecting 80 percent of the data for calibration (training) purposes and the remaining dataset for validation (testing). In the present study, 32 weather indices were used as inputs. Yield was the dependent variable, and the rest were independent variables.

2.4. Tests of Model Performance

Model performance was tested using different statistical model performance evaluation measures. The use of more than one measure helped us to evaluate a single model’s performance and compare multiple models. In this study, the R2, root-mean-square error (RMSE), normalized root-mean-square error (nRMSE), and modeling efficiency (EF) were calculated using the formulae:
R 2 = ( 1 n i = 1 n ( M i   M ¯ ) ( O i   O ¯ ) σ M σ O ) 2
R2 is the proportion of variation in the outcome that is explained by the predictor variables. In multiple regression models, it corresponds to the squared correlation between the observed outcome values and the values predicted by the model. The higher the R-squared (~1), the better is the model prediction.
RMSE = 1 n i = 1 n ( O i M i ) 2
This measures the average magnitude of the errors and is concerned with deviations from the actual value. An RMSE value of zero indicates that the model has a perfect fit. The lower the RMSE, the better the model and its predictions.
nRMSE = 1 n i = 1 n ( O i M i ) 2 × 100   O ¯ EF = 1 i = 1 n ( O i M i ) 2 i = 1 n ( O i   O ¯ ) 2
EF = 1 i = 1 n ( O i M i ) 2 i = 1 n ( O i O ¯ ) 2
Mi: model output;   M ¯ and σ M : mean and standard deviation of model output, respectively; Oi: observations;   O ¯ and σ o : mean and standard deviation of observations, respectively.
The normalized root-mean-square error expresses the spread around the measurements and is used for the classification of model performance into distinct groups (excellent, good, fair, and poor when the values are in the ranges of <10%, 10–20%, 20–30%, and >30%, respectively) [36], while the modeling efficiency indicates whether the model describes the data better than simply using the average of the predictions. Optimal values are the ones that are close to 1.

3. Results

3.1. Observed Variability in Rainfall in the Study Districts

Rice seeding in Karnataka starts in June (24th SMW) and mostly ends in September (39th SMW). The spatial and temporal (weekly) rainfall distribution during the study period over the region was calculated and is depicted in Figure 4a,b. The average rainfall of all the districts indicated the maximum rainfall in the 27th SMW (2–8th July), and the rainfall declined from the 28th SMW (16–22nd July). Most of the districts followed the same trend, except Dakshina Kannada, where the rainfall was comparatively higher throughout the crop growth period. Spatially, the three coastal districts, Udupi, Dakshina Kannada, and Uttara Kannada, received more rainfall during the crop growth period (194.4, 164.0, and 118 mm, respectively) and annually. These were followed by Malnad districts Shivamogga, Chikkamagaluru, and Kodagu (104.3, 80.6, and 102.8 mm, respectively), and the remaining four interior districts, Hassan, Mysuru, Davanagere, and Ballari (41.0, 41.9, 21.7, and 25.3 mm, respectively) received lower rainfall during the crop growth period. Many studies have indicated the impact of such an unequal spatial distribution of rainfall [37,38,39,40] on the yield variability of rainfed crops. However, crops such as rice are capable of producing yields under irrigated situations, and rainfall distribution has a meagre impact, although in Karnataka only a few rice-growing districts are under irrigation (Ballari and Mandya), and in the remaining districts rice is grown in rainfed conditions. Hence, in the present study, only the kharif datasets on area, production, and productivity were collected and used.

3.2. Description of Rice Yield Variability in the Study Districts

The summary statistics of yield data pertaining to the eleven rice-growing districts of Karnataka over the years 1980–2019 are presented in Table 3. The maximum yield from the collected data was observed in Davanagere district (3541.17 kg ha−1), and the minimum yield was observed in Belagavi district (1950.63 kg ha−1). The standard deviation of the yields across the districts varied between 244.57 and 643.37 kg ha−1. Among the districts, higher yield variability (CV) was observed in Ballari and Belagavi (18.4 and 31.7%, respectively). Such variability in the Kharif rice yield in these districts was mainly attributed to the characteristic high temperatures in these districts, causing crop failures under minor changes in intra-annual rainfall distribution. Further, for the sake of fitting SMLR, the normality of the yield data was tested using normal Q–Q plots (Figure 5) and Shapiro–Wilk tests. The yield data were found to be normally distributed, as indicated by Shapiro–Wilk test (p value > 0.05), for all districts except Mysuru (p value = 0.003). The normal Q–Q plots also confirmed normality, as the quintiles almost formed a diagonal line, thereby fulfilling the basic assumption of the parametric models (MLR, LASSO, and ENET).

3.3. Rice Yield Forecasting Models

3.3.1. Stepwise Multiple Linear Regression Model

The Kharif rice yields were forecasted during 2021 at the F3 (preharvest) stage using SMLR with SPSS statistical software for eleven districts (Belgavi, Dakshina Kannada, Davanagere, Hassan, Chikkamaglur, Udupi, Shivamogga, Ballari, Mysore, Kodagu, and Uttara Kannada). Its regression equation, the weather variables influencing the equation, and the standard error (SE) of the estimated values resulting from different weather variables are presented in Table 4. Here, a lower SE (94.99) was observed in Udupi district, and a higher SE (465.48) was seen in Belagavi district. The method of yield forecasting at various crop growth stages had a variable capability of yield estimation. The forecasts given at the midcrop (F2) and preharvest (F3) stages had better similarity to the observed yield compared to that forecasted at the vegetative stage, i.e., F1 [41]. Previously, such models have proven their worth in forecasting the yields of many crops, such as sugarcane and potato [42].
The resulting stepwise multiple linear regression model was validated for the period from 2018 to 2019 at the preharvest stage to determine the accuracy of the models. The district-wise predicted rice yield deviation from the observed yield is depicted in Table 5. The yields were underestimated by the model. The error percentages for Kodagu, Mysuru, Udupi, Uttara Kannada, Dakshina Kannada, Ballari, and Belagavi were found to be −34.5%, −2.1%, −1.4%, −6.2%, −1.2%, −8.4%, and −65.1%, respectively, while for rest of the districts it showed overestimations during 2018. Similarly, during 2019 five districts underestimated the rice yield, with error percentages of −1.2%, −2.4%, −8.5%, −13.0%, and −42.3% in Kodagu, Shivamogga, Uttara Kannada, Ballari, and Belagavi, respectively, whereas for the other six districts the predicted yields were overestimated by the model, ranging from 1.3 to 14.9 percent. The results revealed that there seemed to be less agreement between the observed and the predicted yield, as the error calculated by this model was not found within the acceptable limit, i.e., ±10% for all districts, whereas for a few districts it showed excellent agreement between the observed and predicted yields.
The model performance was evaluated with the R2, RMSE, and correlation coefficient (CC). The RMSE ranged between 83.75 and 447.25. Here, a lower RMSE was observed in Udupi district, and a higher RMSE was found in Belagavi district, as an RMSE value close to 0 indicates better model performance. Meanwhile, the CC ranged between 0.51 and 0.95, and the R2 ranged between 0.60 and 0.98. An R2 above 0.6 is said to be a good fit, whereas an R2 between 0.4 and 0.6 is moderate. From the table, we can observe that all eleven districts showed R2 values above 0.6 (Table 6).

3.3.2. Artificial Neural Network Model

A feed-forward neural network with a single hidden layer with eight neurons was fitted with a ‘logistic activation function’ (default in r package ‘nnet’) using a calibration/training dataset from 2001 to 2017 (Figure 6). The resulting artificial neural network model was validated/tested with a test dataset for the period from 2018 to 2019 to determine the prediction accuracy of the model. Here, the district-wise predicted rice yield deviated from the observed yield, as depicted in Table 7. During 2018, the negative observed error for yields indicated underestimations by the model in Mysuru (−0.2%), Uttara Kannada (−1.5%), Hassan (−0.1%), Ballari (−1.5%), and Belagavi (−15.3%), while for rest of the districts it showed overestimations by the model ranging from 0.0 to 4.2. Likewise, during 2019, six districts had rice yield underestimations, i.e., Kodagu (−0.6%), Shivamogga (−0.1%), Davanagere (−0.7%), Hassan (−0.2%), Ballari (−5.1%), and Belagavi (−10.8%) districts, whereas for the other five districts the predicted yields were overestimated. Figure 7a,b reveals the excellent agreement between the observed and predicted yields using SMLR and ANN. But more accurate estimates were found in ANN model. As the error calculated by this model was within the acceptable limit, i.e., ±10% for all the districts and this model can be used to predict yields and was superior to the SMLR.
The prediction abilities of the fitted ANN models were evaluated in terms of the coefficient of determination (R2), the root-mean-square error (RMSE), and the mean absolute percentage error (MAPE) [43]. Here, the performance analysis included the computation of different statistical parameters, viz. the mean absolute error (MAE), root-mean-square error (RMSE), and normalized root-mean-square error (nRMSE) values for rice crops of different locations (Table 8). A model with smaller RMSE, nRMSE, and MAE values and higher EF values was considered to be the best. The model’s performance using ANN as indicated was validated using RMSE values between 1.60 and 281.01; nRMSE values between 0.06 and 15.40; MAE values between 1.22 and 187.72, and EF values between 0.80 and 1.00. Among the predicted districts yields, the lowest values of RMSE (1.60), NRMSE (0.06), MAE (1.22), and EF (1.00) were found in Shivamogga district, and the highest were observed in Belagavi district with 281.01, 15.40, 187.72, and 0.80 for RMSE, NRMSE, MAE, and EF, respectively. The model was said to perform excellently, with an nRMSE value less than 10 percent, categorized as excellent, for ten out of eleven districts and an nRMSE value categorized as good for one district, as it was between 10 and 20 percent. This observed variability among the models was due to the consideration of less independent parameters by SMLR compared to ANN, which took into account multiple interactions between the weather variables [44].

3.4. Effect of Weather Variables on Rice Yield

Rice, being one of the major food grain crops whose productivity is largely dependent on weather variables, has shown yield variability due to the impacts of individual and interactive effects of different weather variables. The average weekly temperature (Tmean) of the study regions during the rice growing period varied between 22 and 26 °C (Table 9), which is very much within the optimal temperature range required for rice growth from 15–18 to 30–33 °C, [45]. The average weekly maximum temperature (Tmax) ranged between 26 and 30 °C, and previous studies have observed yield variability of rice due to temperature fluctuations [46,47,48]. Sometimes the maximum temperature exceeded 35 °C, which has caused destructive effects on rice growth and yield [49]. This may be due to alterations in enzyme activities, leading to changes in the rate of photosynthesis, respiration, and other physiological aspects [17,50]. Higher temperatures were found to decrease the duration of the crop life cycle, thereby shortening the grain filling period, which might lead to lower crop yields and grain quality. The RHI ranged between 69 and 94%, and the RHII ranged between 59 and 91%. Higher relative humidity has an antagonistic effect on crop yield, as higher humidity causes a reduction in evapotranspiration, thereby lowering the cooling effect due to evaporation [51]. It also supports incidences of pests and diseases, which lead to reductions in crop yield. However, a higher vapor pressure deficit during anthesis leads to a reduction in the panicle temperature due to transpirational cooling, which helps in reducing high-temperature-induced spikelet sterility [52]. The average rainfall in the region varied between 27.7 and 181.8 mm during the crop growth period (Table 9). In districts such as Ballari, the rainfall was much too low to support crop growth, even though the yield levels were high. This was merely due to the availability of sufficient irrigation water through reservoirs. This strengthens crop management, as rainfall during the flowering and ripening stages reduces pollination and causes lodging [53].

3.5. Comparison of SMLR and ANN for the Predictability of Regional Rice Yield

The yields forecasted for rice during 2021 at the preharvest (F3) stage using a stepwise multiple linear regression (SMLR) and an artificial neural network (ANN) for eleven growing districts of Karnataka during kharif season are presented in Table 10. The yields forecasted by SMLR ranged from 2294 to 4093 kg/ha. The higher yield was predicted for Udupi (4093 kg/ha) district, followed by Hassan (3929 kg/ha), whereas the lower yield was predicted in Belagavi (2294 kg/ha), followed by Uttara Kannada (2314 kg/ha) district. Further, the ANN forecasted yields ranging from 2172 to 3919 kg/ha, and the higher yield was predicted in Ballari (3919 kg/ha) district, followed by Shivamogga (3586 kg/ha), whereas the lower yield was predicted in Uttara Kannada (3586 kg/ha), followed by Mysuru (2318 kg/ha) district. Meanwhile, the percentage deviations from the observed yield ranged between −2.5 and 35.0% (mean: 16%) in SMLR, and −37.1 to 21.3 percent (mean: 4%) deviations were observed using ANN. These outcomes prove the usability of ANN over SMLR in crop yield forecasting.
The average district yield forecasted using stepwise multiple linear regression was found to be 3235 kg ha−1, and using artificial neural network (ANN) this value was found to be 2833 kg ha−1 at the preharvest stage compared to the average yield (1980–2019) of 2718 kg ha−1, as depicted in Figure 8. The forecasted mean yield during 2021 using both the methods was found to be higher than the average yield. As the model is purely weather-based, good rains during the crop growing season could be the reason for higher yield estimates for 2021 in all forecasted districts.

4. Discussion

Agriculture is a production sector that is highly dependent on the climatic conditions [54,55,56]. Especially in tropical countries such as India, where the majority of the crop production is dependent on the climate. While agricultural output is dependent on other factors, such as pest and diseases, weeds and their management decisions, etc., those can be modified in an effort to provide the best growing environment for the crops [57]. Even after providing the best growing environment for the crop, variations in the crop yield are observed. Such variations are majorly attributed to spatial and temporal variations in weather factors [58]. This weather-induced production variability impacts regional food security, thus making it necessary to study the major weather factors behind crop production. The quantification of weather impacts on the crop growth is a cumbersome task, as weather factors impart yields through their direct and interactive effects. The present study involved the use of statistical (SMLR) [59,60] and machine learning tools (ANN) [61] to generate a better rice yield forecast model. As the crop is a staple food crop of the majority of the population in Karnataka, its interaction with weather needs to be assessed to have an advance estimate of its production in the region and to plan the alternatives for improving the productivity of the secured food supply. Previous studies have shown the crop’s dependence on weather, but few number studies have been conducted on quantifying the interactive effects of weather on rice crops. Therefore, efforts are being made to assess the interactive effects of weather factors on rice productivity through the generation of weather indices based on composite weather variable methods [62,63] for understanding the joint effect of two variables [64,65,66].
The SMLR and ANN models were calibrated (1980–2017) and validated (2018–2019) using the historical datasets of weather variables (IMD) and crop yield datasets (state agriculture department) to forecast the 2021 yield. In order to fit the SMLR, the normality in the yield data was checked using the Shapiro–Wilk test and Q-Q plots, which indicated that all districts’ yield datasets were normally distributed (p value > 0.05 and quantiles centered around the diagonal line) except Mysuru (p value = 0.001). The comparison of the SMLR and ANN models revealed a higher percentage deviation from the observed yield, ranging between −2.5 and 35.0 (mean: 16%), in SMLR compared to ANN (range: −37.1 to 21.3, mean: 4%), making ANN a better model for forecasting. This might be due to the ability of ANN to take an account of the collinearity between weather variables for yield prediction [67,68] Further, the district-wise yield predictability of ANN was assessed using the mean absolute error (MAE), which ranged between 1.22 and 187.72 with a mean of 42. Among the districts, the lowest value of MAE (1.22) was observed in Shivamogga, and the highest was in Belagavi district, indicating better predictability of ANN in these districts, and the mean RMSE obtained using the ANN model was found to be 70 compared to 213 in the regression model [69]. Hence, the use of machine learning tools such as ANN, LASSO, NNET, etc., paves a path for precision yield forecasting, which could be promising for decision making in future crop management and the planning of policies for improved yield production.

5. Conclusions

Machine learning approaches are promising alternatives or complimentary tools to support the commonly used crop simulation model for yield prediction, but their efficacy has to be evaluated before applying them to a specific crop or cropping system yield prediction. As a crop’s performance is influenced by more than one external factor such as the weather and the interactions between the weather factors themselves, a special method is needed to assess performance. Previously, several linear models were developed based on the direct relationships between the yield and the weather. They failed to measure the impact of multicollinearity between weather factors on the yield. Therefore, an attempt to determine this impact using a machine learning tool (ANN) and compare it with a simple regression model such as SMLR was made in the present study. Two multivariate models, SMLR and ANN, were used to forecast rice yields in major rice-growing districts of Karntaka. The outcomes demonstrated that an artificial neural network (ANN) can be utilized for yield prediction for the area with satisfactory results compared to a stepwise multiple linear regression since a good agreement was realized between the ANN-predicted and the observed yield, which was indicated by the root-mean-square error, normalized root-mean-square error, R2 statistics, and prediction error percentage. There are various advantages of ANNs over conventional approaches; they provide a stable analytical alternative to conventional regression techniques, which are often limited by strict assumptions of normality, linearity, variable independence, etc. As ANNs are able to capture interactions between independent variables, they allow a quick and easy method for modeling the complex agricultural phenomenon that is otherwise nearly impossible to explain.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/agronomy13030704/s1, Table S1: Long-term yield and weather dataset used for the analysis.

Author Contributions

M.N.T. and M.H.M. resources, conceptualization, and validation; L.H. and D.V.S., analysis, investigation, and original draft preparation; H.S.P. and L.N., data curation and original draft preparation; H.S.S., review and editing, visualization, and supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This research was not funded by any externally funded project.

Data Availability Statement

The data presented in this study are available on request from the corresponding authors.

Acknowledgments

The authors are grateful to acknowledge the FASAL-India Meteorological Department and the Directorate of Economics and Statistics for providing the weather and yield data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Anonymous. Area, production and productivity of major crops of Karnataka. In Directorate of Agricultural Economics and Statistics; Government of Karnataka: Bengaluru, India, 2021. [Google Scholar]
  2. Guo, Y.; Fu, Y.; Hao, F.; Zhang, X.; Wu, W.; Jin, X.; Bryant, C.R.; Senthilnath, J. Integrated phenology and climate in rice yields prediction using machine learning methods. Ecol. Indic. 2021, 120, 106935. [Google Scholar] [CrossRef]
  3. Shrestha, S.; Deb, P.; Bui, T.T.T. Adaptation Strategies for Rice cultivation under Climate Change in Central Vietnam. Mitig. Adap. Strateg. Glob. Chang. 2016, 21, 15–37. [Google Scholar] [CrossRef]
  4. Conant, R.T.; Ryan, M.G.; Agren, G.I.; Birge, H.E.; Davidson, E.A.; Eliasson, P.E.; Bradford, M.A. Temperature and soil organic matter decomposition rates-synthesis of current knowledge and a way forward. Glob. Chang. Biol. 2011, 17, 3392–3404. [Google Scholar] [CrossRef]
  5. Kruse, J.S.; Kissel, D.E.; Cabrera, M.L. Effects of drying and rewetting on carbon and nitrogen mineralization in soils and incorporated residues. Nutr. Cycl. Agroecosyst. 2004, 69, 247–256. [Google Scholar] [CrossRef]
  6. Fisher, R.A. The influence of rainfall on the yield of wheat at Roth Amsted. R. Soc. Phil. Trans. Ser. B 1924, 213, 89–142. [Google Scholar]
  7. Huda, A.K.S.; Ghildyal, B.P.; Jain, R.C. Contribution of climatic variables in predicting rice yield. J. Agric. Meteorol. 1975, 15, 71–86. [Google Scholar] [CrossRef]
  8. Jain, R.C.; Agarwal, R.; Jha, M.P. Effects of climatic variables on rice yield and its forecasts. Mausam 1980, 31, 591–596. [Google Scholar] [CrossRef]
  9. Hundal, S.S. Climatic variability and its impact on cereal productivity in Indian Punjab. Curr. Sci. 2007, 92, 506–512. [Google Scholar]
  10. Hatfield, J.L.; Boote, K.J.; Kimball, B.A.; Ziska, L.H.; Izaurralde, R.C.; Ort, D.; Wolfe, D. Climate impacts on agriculture: Implications for crop production. Agronomy 2011, 103, 351–370. [Google Scholar] [CrossRef]
  11. Amrender, K.; Lalmohan, B. Forecasting model for yield of Indian mustard (Brassica juncea) using weather parameters. Indian J. Agric. Sci. 2005, 75, 688–690. [Google Scholar]
  12. Grotelüschen, K.; Gaydon, D.S.; Senthilkumar, K.; Langensiepen, M.; Becker, M. Model-based evaluation of rainfed lowland rice responses to N fertiliser in variable hydro-edaphic wetlands of East Africa. Field Crops Res. 2022, 285, 108602. [Google Scholar] [CrossRef]
  13. Ullah, I.; Ma, X.; Yin, J.; Omer, A.; Habtemicheal, B.A.; Saleem, F.; Liu, M. Spatiotemporal characteristics of meteorological drought variability and trends (1981–2020) over South Asia and the associated large-scale circulation patterns. Clim. Dyn. 2022, 1–24. [Google Scholar] [CrossRef]
  14. Wajid, A.; Hussain, K.; Ilyas, A.; Habib-ur-Rahman, M.; Shakil, Q.; Hoogenboom, G. Crop models: Important tools in decision support system to manage wheat production under vulnerable environments. Agriculture 2021, 11, 1166. [Google Scholar] [CrossRef]
  15. Baier, W. Crop Weather Models and Their Use in Yield Assessments; (Technical Note No.151); WMO: Geneva, Switzerland, 1977; p. 48. [Google Scholar]
  16. Manideep, A.P.S.; Kharb, S. A Comparative Analysis of Machine Learning Prediction Techniques for Crop Yield Prediction in India. Turk. J. Comput. Math. Educ. 2022, 13, 120–133. [Google Scholar]
  17. Rai, Y.K.; Ale, B.B.; Alam, J. Impact assessment of climate change on paddy yield: A case study of Nepal agriculture research council (NARC), Tarahara, Nepal. J. Inst. Eng. 2012, 8, 147–167. [Google Scholar] [CrossRef]
  18. Verma, U.; Piepho, H.P.; Goyal, A. Role of climatic variables and crop condition term for mustard yield prediction in Haryana. Int. J. Agric. Stat. Sci. 2016, 12, 45–51. [Google Scholar]
  19. Das, B.; Sahoo, R.N.; Pargal, S. Comparison of different uni- and multi-variate techniques for monitoring leaf water status as an indicator of water-deficit stress in wheat through spectroscopy. Biosyst. Eng. 2017, 160, 69–83. [Google Scholar] [CrossRef]
  20. Gandhi, N.; Armstrong, L.J.; Petkar, O.; Tripathy, A.K. Rice crop yield prediction in India using support vector machines. In Proceedings of the 2016 13th International Joint Conference on Computer Science and Software Engineering (JCSSE), Khon Kaen, Thailand, 13–15 July 2016; pp. 1–5. [Google Scholar]
  21. Qian, B.; De Jong, R.; Warren, R.; Chipanshi, A.; Hill, H. Statistical spring wheat yield forecasting for the Canadian Prairie Provinces. Agric. Meteorol. 2009, 149, 1022–1031. [Google Scholar] [CrossRef]
  22. Amoghavarsha, C.; Pramesh, D.; Sridhara, S.; Patil, B.; Shil, S.; Naik, G.R.; Prasannakumar, M.K. Spatial distribution and identification of potential risk regions to rice blast disease in different rice ecosystems of Karnataka. Sci. Rep. 2022, 12, 7403. [Google Scholar] [CrossRef]
  23. Kumar, N.; Pisal, R.R.; Shukla, S.P.; Pandye, K.K. Regression technique for South Gujarat. Mausam 2014, 65, 361–364. [Google Scholar] [CrossRef]
  24. Ravindran, A. Comparison of Different Weather-Based Models for Forecasting Rice Yield in Central Zone of Kerala. Master’s Thesis, Kerala Agricultural University, Thrissur, India, 2018. [Google Scholar]
  25. Wickramasinghe, L.; Weliwatta, R.; Ekanayake, P.; Jayasinghe, J. Modeling the relationship between rice yield and climate variables using statistical and machine learning techniques. J. Math. 2021, 2021, 6646126. [Google Scholar] [CrossRef]
  26. Bocca, F.F.; Rodrigues, L.H.A. The effect of tuning, feature engineering, and feature selection in data mining applied to rainfed sugarcane yield modelling. Comput. Electron. Agric. 2016, 128, 67–76. [Google Scholar] [CrossRef]
  27. Lobell, D.B.; Burke, M.B. On the use of statistical models to predict crop yield responses to climate change. Agric. Meteorol. 2010, 150, 1443–1452. [Google Scholar] [CrossRef]
  28. Dutta, S.; Patel, N.K.; Srivastava, S.K. District wise yield models of rice in Bihar based on water requirement and meteorological data. J. Indian Soc. Remote Sens. 2001, 29, 175–182. [Google Scholar] [CrossRef]
  29. Mehta, S.C.; Pal, S.; Kumar, V. Weather Based Models for Forecasting Potato Yield in Uttar Pradesh; IASRI: New Delhi, India, 2010. [Google Scholar]
  30. Balabin, R.M.; Lomakina, E.I.; Safieva, R.Z. Neural network (ANN) approach to biodiesel analysis: Analysis of biodiesel density, kinematic viscosity, methanol and water contents using near infrared (NIR) spectroscopy. Fuel 2011, 90, 2007–2015. [Google Scholar] [CrossRef]
  31. Singh, R.S.; Patel, C.; Yadav, M.K.; Singh, K.K. Yield forecasting of rice and wheat crops for eastern Uttar Pradesh. J. Agrometeorol. 2014, 16, 199–202. [Google Scholar] [CrossRef]
  32. Hazelton, M.L. A graphical tool for assessing normality. Am. Stat. 2003, 57, 285–288. [Google Scholar] [CrossRef]
  33. Dufour, J.M.; Hallin, M.; Mizera, I. Generalized runs tests for heteroscedastic time series. J. Nonparametric Stat. 1998, 9, 39–86. [Google Scholar] [CrossRef]
  34. Shapiro, S.S.; Wilk, M.B. An analysis of variance test for normality (complete samples). Biometrika 1965, 52, 591–611. [Google Scholar] [CrossRef]
  35. Kuhn, M. Building Predictive Models in R Using the caret Package. J. Stat. Softw. 2008, 28, 1–26. [Google Scholar] [CrossRef]
  36. Jamieson, P.D.; Porter, J.R.; Wilson, D.R. A test of the computer simulation model ARCWHEAT1 on wheat crops grown in New Zealand. Field Crops Res. 1991, 27, 337–350. [Google Scholar] [CrossRef]
  37. Guhathakurta, P.; Sanap, S.; Menon, P.; Ashwini Kumar Prasad, S.T.; Sable; Advani, S.C. Observed Rainfall Variability and Changes Over Karnataka State; Pune Met Monograph No.: ESSO/IMD/HS/Rainfall Variability/; India Meteorological Department: Pune, India, 2020; Volume 13, p. 37. [Google Scholar]
  38. Kalbarczyk, R.; Kalbarczyk, E. Research into Meteorological Drought in Poland during the Growing Season from 1951 to 2020 Using the Standardized Precipitation Index. Agronomy 2022, 12, 2035. [Google Scholar] [CrossRef]
  39. Achli, S.; Epule, T.E.; Dhiba, D.; Chehbouni, A.; Er-Raki, S. Vulnerability of Barley, Maize and Wheat Yields to Variations in Growing Season Precipitation in Morocco. Appl. Sci. 2022, 12, 3407. [Google Scholar] [CrossRef]
  40. Han, Y.; Liu, B.; Xu, D.; Yuan, C.; Xu, Y.; Sha, J.; Xu, Z. Temporal and Spatial Variation Characteristics of Precipitation in the Haihe River Basin under the Influence of Climate Change. Water 2021, 13, 1664. [Google Scholar] [CrossRef]
  41. Agrawal, R.; Jain, R.C.; Mehta, S.C. Yield forecast based on weather variables and agricultural input on agroclimatic zone basis. Ind. J. Agric. Sci. 2001, 71, 487–490. [Google Scholar]
  42. Tripathi, M.K.; Mehra, B.; Chattopadhyay, N.; Singh, K.K. Yield prediction of sugarcane and paddy for districts of Uttar Pradesh. J. Agrometeorol. 2012, 14, 173–175. [Google Scholar] [CrossRef]
  43. Stangierski, J.; Weiss, D.; Kaczmarek, A. Multiple regression models and Artificial Neural Network (ANN) as prediction tools of changes in overall quality during the storage of spreadable processed Gouda cheese. Eur. Food Res. Technol. 2019, 245, 2539–2547. [Google Scholar] [CrossRef]
  44. Das, B.; Nair, B.; Reddy, V.K.; Venkatesh, P. Evaluation of multiple linear, neural network and penalised regression models for prediction of rice yield based on weather parameters for west coast of India. Int. J. Biometeorol. 2018, 62, 1809–1822. [Google Scholar] [CrossRef]
  45. Nishiyama, I. Effects of temperature on the vegetative growth of rice plants. In Symposium on Climate and Ric; International Rice Research Institute: Los baonos, Philippines, 1976; pp. 159–185. [Google Scholar]
  46. Sridevi, V.; Chellamuthu, V. Impact of weather on rice-A review. Int. J. Appl. Res. 2015, 1, 825–831. [Google Scholar]
  47. Cai, C.; Yin, X.; He, S.; Jiang, W.; Si, C.; Struik, P.C.; Luo, W.; Li, G.; Xie, Y.; Xiong, Y. Responses of wheat and rice to factorial combinations of ambient and elevated CO2 and temperature in FACE experiments. Glob. Chang. Biol. 2016, 22, 856–874. [Google Scholar] [CrossRef]
  48. Talla, A.; Swain, D.K.; Tewari, V.K.; Biswal, M.P. Significance of weather variables during critical growth stages for hybrid rice production in subtropical India. Agron. J. 2017, 109, 1891–1899. [Google Scholar] [CrossRef]
  49. Sun, W.; Huang, Y. Global warming over the period 1961–2008 did not increase high-temperature stress but did reduce low-temperature stress in irrigated rice across China. Agric. For. Meteorol. 2011, 151, 1193–1201. [Google Scholar] [CrossRef]
  50. Akinbile, C.O.; Akinlade, G.M.; Abolude, A.T. Trend analysis in climatic variables and impacts on rice yield in Nigeria. J. Water Clim. Chang. 2015, 6, 534–543. [Google Scholar] [CrossRef]
  51. Wassmann, R.; Jagadish, S.V.K.; Sumfleth, K.; Pathak, H.; Howell, G.; Ismail, A.; Heuer, S. Regional vulnerability of climate change impacts on Asian rice production and scope for adaptation. Adv. Agron. 2009, 102, 91–133. [Google Scholar]
  52. Matsui, T.; Kobayasi, K.; Yoshimoto, M.; Hasegawa, T. Stability of rice pollination in the field under hot and dry conditions in the Riverina region of New South Wales, Australia. Plant Prod. Sci. 2007, 101, 57–63. [Google Scholar] [CrossRef]
  53. Yang, L.; Qin, Z.; Tu, L. Responses of rice yields in different rice-cropping systems to climate variables in the middle and lower reaches of the Yangtze River, China. Food Secur. 2015, 7, 951–963. [Google Scholar] [CrossRef]
  54. Henryson, K.; Sundberg, C.; Kätterer, T.; Hansson, P.A. Accounting for long-term soil fertility effects when assessing the climate impact of crop cultivation. Agric. Syst. 2018, 164, 185–192. [Google Scholar] [CrossRef]
  55. Taylor, C.B.; Cullen, M.; Occhio, D.; Rickards, L.; Eckard, R. Trends in wheat yields under representative climate futures: Implications for climate adaptation. Agric. Syst. 2018, 164, 1–10. [Google Scholar] [CrossRef]
  56. Rivera, X.C.S.; Bacenetti, J.; Fusi, A.; Niero, M. The influence of fertiliser and pesticide emissions model on life cycle assessment of agricultural products: The case of Danish and Italian barley. Sci. Total Environ. 2017, 59, 745–757. [Google Scholar] [CrossRef]
  57. Schewe, J.; Otto, C.; Frieler, K. The role of storage dynamics in annual wheat prices. Environ. Res. Lett. 2017, 12, 054005. [Google Scholar] [CrossRef]
  58. Frieler, K.; Schauberger, B.; Arneth, A.; Balkovič, J.; Chryssanthacopoulos, J.; Deryng, D.; Levermann, A. Understanding the weather signal in national crop-yield variability. Earth’s Future 2017, 5, 605–616. [Google Scholar] [CrossRef] [PubMed]
  59. Kitchen, N.R.; Sudduth, K.A.; Drummond, S.T. Electrical conductivity as a crop productivity measure for claypan soils. J. Prod. Agric. 1999, 12, 607–617. [Google Scholar] [CrossRef]
  60. Rumelhart, D.E.; McClelland, J.L. Parallel Distributed Processing; MIT Press: Boston, MA, USA, 1986; Volume 1. [Google Scholar]
  61. Ge, J.; Zhao, L.; Gong, X.; Lai, Z.; Traore, S.; Li, Y.; Long, H.; Zhang, L. Combined effects of ventilation and irrigation on temperature, humidity, tomato yield, and quality in the greenhouse. HortScience 2021, 56, 1080–1088. [Google Scholar] [CrossRef]
  62. Menzal, A.; Fabian, P. Growing season extended in Europe. Nature 1999, 397, 659–663. [Google Scholar] [CrossRef]
  63. Traore, S.; Zhang, L.; Guven, A.; Fipps, G. Rice yield response forecasting tool (YIELDCAST) for supporting climate change adaptation decision in Sahel. Agric. Water Manag. 2020, 239, 106242. [Google Scholar] [CrossRef]
  64. Rafi, Z.; Rehan, A. Wheat crop model based on water balance for Agrometeorological crop monitoring. Pak. J. Meteorol. 2005, 2, 23–33. [Google Scholar]
  65. Sridhara, S.; Ramesh, N.; Gopakkali, P.; Das, B.; Venkatappa, S.D.; Sanjivaiah, S.H.; Kumar Singh, K.; Singh, P.; El-Ansary, D.O.; Mahmoud, E.A.; et al. Weather-Based Neural Network, Stepwise Linear and Sparse Regression Approach for Rabi Sorghum Yield Forecasting of Karnataka, India. Agronomy 2020, 10, 1645. [Google Scholar] [CrossRef]
  66. López-García, P.; Intrigliolo, D.; Moreno, M.A.; Martínez-Moreno, A.; Ortega, J.F.; Pérez-Álvarez, E.P.; Ballesteros, R. Machine Learning-Based Processing of Multispectral and RGB UAV Imagery for the Multitemporal Monitoring of Vineyard Water Status. Agronomy 2022, 12, 2122. [Google Scholar] [CrossRef]
  67. Haghverdi, A.; Washington-Allen, R.A.; Leib, B.G. Prediction of cotton lint yield from phenology of crop indices using artificial neural networks. Comput. Electron. Agric. 2018, 152, 186–197. [Google Scholar] [CrossRef]
  68. Abrouguia, K.; Gabsib, K.; Mercatorisc, B.; Khemisa, C.; Amamia, R.; Chehaibia, S. Prediction of organic potato yield using tillage systems and soil properties by artificial neural network (ANN) and multiple linear regressions (MLR). Soil Tillage Res. 2019, 190, 202–208. [Google Scholar] [CrossRef]
  69. Ji, B.; Sun, Y.; Yang, S.; Wan, J. Artificial neural network for rice yield prediction in mountainous re gions. J. Agric. Sci. 2007, 145, 249–261. [Google Scholar] [CrossRef]
Figure 1. Percentage of rice area (a) and percent contribution to state’s rice production (b) by each district of the study area (Source: Rice area and production (2021), Directorate of Economics and Statistics, GoK).
Figure 1. Percentage of rice area (a) and percent contribution to state’s rice production (b) by each district of the study area (Source: Rice area and production (2021), Directorate of Economics and Statistics, GoK).
Agronomy 13 00704 g001
Figure 2. Flowchart representing the different stages of model calibration and validation.
Figure 2. Flowchart representing the different stages of model calibration and validation.
Agronomy 13 00704 g002
Figure 3. Diagrammatic representation of ANN.
Figure 3. Diagrammatic representation of ANN.
Agronomy 13 00704 g003
Figure 4. Spatial (a) and temporal (b) variability in rainfall in rice-growing districts during rice growing period, i.e., 24th to 39th SMW.
Figure 4. Spatial (a) and temporal (b) variability in rainfall in rice-growing districts during rice growing period, i.e., 24th to 39th SMW.
Agronomy 13 00704 g004
Figure 5. Normal Q–Q plot for kharif rice yields in 11 study districts of Karnataka.
Figure 5. Normal Q–Q plot for kharif rice yields in 11 study districts of Karnataka.
Agronomy 13 00704 g005
Figure 6. Graphical representation of the established neural network for rice yield forecasting Z indicates the weather indices used for the forecast; Ref. Table 2.
Figure 6. Graphical representation of the established neural network for rice yield forecasting Z indicates the weather indices used for the forecast; Ref. Table 2.
Agronomy 13 00704 g006
Figure 7. Boxplots of district-wise error percentages in Kharif rice yields (kg/ha) validated (2018 and 2019) using a stepwise multiple linear regression (a) and an artificial neural network (b).
Figure 7. Boxplots of district-wise error percentages in Kharif rice yields (kg/ha) validated (2018 and 2019) using a stepwise multiple linear regression (a) and an artificial neural network (b).
Agronomy 13 00704 g007aAgronomy 13 00704 g007b
Figure 8. District-wise yield prediction for Kharif rice (kg/ha) using an artificial neural network and a stepwise multiple linear regression during 2021, along with the average yield comparison. Numbers in parenthesis indicate the deviation percentages from the average yields.
Figure 8. District-wise yield prediction for Kharif rice (kg/ha) using an artificial neural network and a stepwise multiple linear regression during 2021, along with the average yield comparison. Numbers in parenthesis indicate the deviation percentages from the average yields.
Agronomy 13 00704 g008
Table 1. Area, production, and productivity of rice (2020).
Table 1. Area, production, and productivity of rice (2020).
DistrictsRice Area *Production **Productivity (kg/ha)
Ballari178.5723.04051.2
Belagavi60.7166.72744.3
Chikkamagaluru14.037.52668.3
Dakshina Kannada8.123.42883.4
Davanagere126.1444.23523.7
Hassan39.699.12502.9
Kodagu21.356.42643.1
Mysuru87.1263.13021.4
Shivamogga88.2245.12778.1
Udupi40.2115.32870.7
Uttara Kannada47.0101.02149.3
Karnataka1484.04717.53178.0
* Area in thousand hectares. ** Production in thousand tons.Source: Rice area and production (2021), Directorate of Economics and Statistics, GoK.
Table 2. Weather-derived indices used in models using composite weather variables.
Table 2. Weather-derived indices used in models using composite weather variables.
Weather VariablesUnweighted Weather Indices (0)Weighted Weather Indices (1)
TMAX (1)TMIN (2)RF (3)RHI (4)RHII (5)Tmax (1)Tmin (2)RF (3)RHI (4)RHII (5)
Maximum temperature (1)Z10 Z11
Minimum temperature (2)Z120Z20 Z121Z21
Rainfall (3)Z130Z230Z30 Z131Z231Z31
Morning relative humidity (4)Z140Z240Z340Z40 Z141Z241Z341Z41
Evening relative humidity (5)Z150Z250Z350Z450Z50Z151Z251Z351Z451Z51
Table 3. Statistics of rice yield variability in eleven study districts.
Table 3. Statistics of rice yield variability in eleven study districts.
DistrictMeanMaximumMinimumStdCV (%)Shapiro–Wilk Test
Statisticp Value
Ballari348545712406643.318.41.780.162
Belagavi19513096589618.931.71.450.269
Chikkamagaluru254730232062249.59.81.360.266
Dakshina Kannada252933752061328.412.92.30.091
Davanagere354141353114244.56.90.530.774
Hassan265334171602444.316.70.930.510
Kodagu265631262064245.59.22.020.117
Mysuru314736691968305.39.738.870.003
Shivamogga271035392024346.812.80.710.056
Udupi267432501904311.111.61.580.505
Uttara Kannada205527581462288.314.00.260.433
Table 4. Rice yield prediction equations using stepwise multiple linear regression for different districts of Karnataka during 2021 at preharvest (F3) stage.
Table 4. Rice yield prediction equations using stepwise multiple linear regression for different districts of Karnataka during 2021 at preharvest (F3) stage.
DistrictRegression EquationWeather Variables in the EquationFStd. Error
BallariY = −13.44 + 35.87 *Time + 8.61 *Z10 + 0.04 *Z341Time, Tmax, Rf *Rh1118.54201.3
BelagaviY = −145.69 + 0.25 *Z231 + 0.60 *Z251Tmin *Rf, Tmin *Rh219.39465.48
ChikkamagaluruY = −74.7731 + 22.23 *Time + 10.63 *Z10 + 0.26 *Z131Time, Tmax, Tmax *Rf50.52248.45
Dakshina KannadaY = 25.81 + 42.80 *Time-4.93 *Z51*0.40 *Z121Time, Rh2, Tmax *Tmin90.21181.93
DavanagereY = −61.0 + 30.14 *Time + 13.84 * Z10 + 0.16 *Z131Time, Tmax, Tmax *Rf115.9190.31
HassanY = −193.35 + 113.07 *Time + 0.030 *Z450Time, Rh1 *Rh2119.76213.9
KodaguY = −69.33 + 16.88 *Time + 6.29 *Z50-0.18 * Z250 + 0.01 *Z341Time, Rh2, Tmin *Rh2, Rf *Rh164.6175.83
MysuruY = −215.62 + 18.56 *Time + 9.71 *Z51 + 5.56 *Z121Time, Rh2, Tmax *Tmin58.53250.23
ShivamoggaY = −773.35 + 195.47*Z11Tmax169.45218.64
UdupiY = −162.82 + 49.15 *Time + 5.02 *Z121 + 0.27 *Z141 + 0.007*Z230Time, Tmax *Tmin, Tmax *Rh, Tmin *Rf217.1194.99
Uttara KannadaY = −1285.31 + 16.73 *Z20 + 199.08 *Z21Tmin42.2244.28
Table 5. District-wise error percentage of Kharif rice yield (Kg/ha) at preharvest (F3) stage, validated for 2018 and 2019 using a stepwise multiple linear regression.
Table 5. District-wise error percentage of Kharif rice yield (Kg/ha) at preharvest (F3) stage, validated for 2018 and 2019 using a stepwise multiple linear regression.
District20182019
Predicted Yield
(kg/ha)
Observed Yield
(kg/ha)
Error
(%)
Predicted Yield
(kg/ha)
Observed Yield
(kg/ha)
Error
(%)
Ballari39264256−8.440464571−13.0
Belagavi16082655−65.118532637−42.3
Chikkamagaluru2937253213.8266325324.9
Dakshina Kannada31173154−1.2315530224.2
Davanagere3815318716.5386335149.0
Hassan3252248223.73642310114.9
Kodagu21922948−34.526622695−1.2
Mysuru31003166−2.1352133145.9
Shivamogga278426694.128252893−2.4
Udupi30593101−1.4307130321.3
Uttara Kannada20652194−6.222142403−8.5
Table 6. Statistical evaluation of kharif rice yield using stepwise regression.
Table 6. Statistical evaluation of kharif rice yield using stepwise regression.
DistrictR2RMSECorrelation Coefficient (CC)
Ballari0.94188.600.90
Belagavi0.60447.250.70
Chikkamagaluru0.84236.120.80
Dakshina Kannada0.91171.520.89
Davanagere0.95175.440.70
Hassan0.94199.920.93
Kodagu0.91162.280.80
Mysuru0.87235.920.68
Shivamogga0.90212.480.51
Udupi0.9883.750.95
Uttara Kannada0.78234.300.65
Table 7. District-wise error percentage of Kharif rice yield (Kg/ha) forecasted using artificial neural network (ANN) during 2018 and 2019.
Table 7. District-wise error percentage of Kharif rice yield (Kg/ha) forecasted using artificial neural network (ANN) during 2018 and 2019.
District20182019
PredictedObservedError (%)PredictedObservedError (%)
Ballari41924256−1.543514571−5.1
Belagavi23032655−15.323792637−10.8
Chikkamagaluru264425324.2265226490.1
Dakshina Kannada315331540.0302930220.2
Davanagere320331870.534903514−0.7
Hassan24802482−0.130963101−0.2
Kodagu294929480.0326782695−0.6
Mysuru31593166−0.2348233144.8
Shivamogga267026690.0428892893−0.1
Udupi310131010.0303130320.0
Uttara Kannada21612194−1.5246924032.7
Table 8. Statistical evaluation of validated kharif rice yield using an artificial neural network (ANN).
Table 8. Statistical evaluation of validated kharif rice yield using an artificial neural network (ANN).
DistrictRMSEnRMSEnRMSE * MAEEF
Ballari88.612.70Excellent46.040.96
Belagavi281.0115.40Good187.720.80
Chikkamagaluru99.753.91Excellent53.230.94
Dakshina Kannada15.210.61Excellent6.931.00
Davanagere28.340.80Excellent20.790.99
Hassan6.880.25Excellent4.661.00
Kodagu71.992.72Excellent42.860.93
Mysuru44.451.40Excellent29.950.98
Shivamogga1.600.06Excellent1.221.00
Udupi3.830.15Excellent1.991.00
Uttara Kannada131.066.45Excellent72.770.82
* nRMSE classes: <10% = Excellent, 10–20% = Good, 20–30% = Fair, and >30% = Poor.
Table 9. Statistics of weather variables observed during the crop growth period.
Table 9. Statistics of weather variables observed during the crop growth period.
StatisticTmax (°c)Tmin (°c)Tmean (°c)RHI
(%)
RHII
(%)
Rainfall
(mm)
Mean27.920.824.387.179.085.6
Maximum30.622.526.494.091.5181.8
Minimum26.319.422.969.059.027.7
Standard deviation1.21.01.16.79.150.0
Coefficient of variation (%)4.45.04.57.711.558.4
Table 10. District-wise average Kharif rice yield and predicted yields using a stepwise multiple linear regression and an artificial neural network (ANN) during 2021 at the preharvest (F3) stage.
Table 10. District-wise average Kharif rice yield and predicted yields using a stepwise multiple linear regression and an artificial neural network (ANN) during 2021 at the preharvest (F3) stage.
DistrictAverage Yield *SMLRANN
Predicted Yield 2021
(kg/ha)
Deviation
%
Predicted Yield 2021
(kg/ha)
Deviation
%
Ballari33023223−2.5391915.7
Belagavi1847229419.5234821.3
Chikkamagaluru255527477.02447−4.4
Dakshina Kannada2492322322.727539.5
Davanagere355138347.42882−23.2
Hassan2746392930.1316613.3
Kodagu26482591−2.227854.9
Mysuru3177357911.22318−37.1
Shivamogga2872322510.9358619.9
Udupi2660409335.027854.5
Uttara Kannada2044231411.721725.9
Average2718323516.028334.1
* Observed yield (kg/ha) averaged from 1980 to 2019.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Thimmegowda, M.N.; Manjunatha, M.H.; Huggi, L.; Shivaramu, H.S.; Soumya, D.V.; Nagesha, L.; Padmashri, H.S. Weather-Based Statistical and Neural Network Tools for Forecasting Rice Yields in Major Growing Districts of Karnataka. Agronomy 2023, 13, 704. https://doi.org/10.3390/agronomy13030704

AMA Style

Thimmegowda MN, Manjunatha MH, Huggi L, Shivaramu HS, Soumya DV, Nagesha L, Padmashri HS. Weather-Based Statistical and Neural Network Tools for Forecasting Rice Yields in Major Growing Districts of Karnataka. Agronomy. 2023; 13(3):704. https://doi.org/10.3390/agronomy13030704

Chicago/Turabian Style

Thimmegowda, Mathadadoddi Nanjundegowda, Melekote Hanumanthaiah Manjunatha, Lingaraj Huggi, Huchahanumegowdanapalya Sanjeevaiah Shivaramu, Dadireddihalli Venkatappa Soumya, Lingegowda Nagesha, and Hejjaji Sreekanthamurthy Padmashri. 2023. "Weather-Based Statistical and Neural Network Tools for Forecasting Rice Yields in Major Growing Districts of Karnataka" Agronomy 13, no. 3: 704. https://doi.org/10.3390/agronomy13030704

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop