Next Article in Journal
Coffee as an Identifier of Inflation in Selected US Agglomerations
Next Article in Special Issue
A Day-Ahead Photovoltaic Power Prediction via Transfer Learning and Deep Neural Networks
Previous Article in Journal
Forecasting the Monkeypox Outbreak Using ARIMA, Prophet, NeuralProphet, and LSTM Models in the United States
Previous Article in Special Issue
Evaluating the Comparative Accuracy of COVID-19 Mortality Forecasts: An Analysis of the First-Wave Mortality Forecasts in the United States
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of ARIMA, SutteARIMA, and Holt-Winters, and NNAR Models to Predict Food Grain in India

1
Department of Statistics, Universitas Negeri Makassar, Makassar 90223, Indonesia
2
School of Humanities and Social Sciences, Thapar Institute of Engineering and Technology, Patiala 147004, India
3
Department of Economics, Lakshmibai College, University of Delhi, Delhi 110052, India
4
Centre for the Integrated and Rural Development, Banaras Hindu University, Varanasi 221005, India
5
RamManohar Lohia University, Faizabad 224001, India
*
Author to whom correspondence should be addressed.
Forecasting 2023, 5(1), 138-152; https://doi.org/10.3390/forecast5010006
Submission received: 2 December 2022 / Revised: 26 December 2022 / Accepted: 7 January 2023 / Published: 10 January 2023
(This article belongs to the Special Issue Feature Papers of Forecasting 2022)

Abstract

:
The agriculture sector plays an essential function within the Indian economic system. Foodgrains provide almost all the calories and proteins. This paper aims to compare ARIMA, SutteARIMA, Holt-Winters, and NNAR models to recommend an effective model to predict foodgrains production in India. The execution of the SutteARIMA predictive model used in this analysis was compared with the established ARIMA, Neural Network Auto-Regressive (NNAR), and Holt-Winters models, which have been widely applied for time series prediction. The findings of this study reveal that both the SutteARIMA model and the Holt-Winters model performed well with real-life problems and can effectively and profitably be engaged for food grain forecasting in India. The food grain forecasting approach with the SutteARIMA model indicated superior performance over the ARIMA, Holt-Winters, and NNAR models. Indeed, the actual and predicted values of the SutteARIMA and Holt-Winters forecasting models are quite close to predicting foodgrains production in India. This has been verified by MAPE and MSE values that are relatively low with the SutteARIMA model. Therefore, India’s SutteARIMA model was used to predict foodgrains production from 2021 to 2025. The forecasted amount of respective crops are as follows (in lakh tonnes) 1140.14 (wheat), 1232.27 (rice), 466.46 (coarse), 259.95 (pulses), and a total 3069.80 (foodgrains) by 2025.

1. Overview of the Study

India’s agriculture has the potential to increase productivity and overall production [1,2]. The present global governance architecture for nutrition, food, and agriculture is incapable of appropriately addressing the difficulties of ensuring progress toward food security [3]; with disproportionately concentrated malnutrition among poor children in India [4]. India’s population is expected to reach 1.7 billion by 2050, roughly equaling China and the United States combined. Can India adequately feed its 1.7 billion people? If food is scarce now [5], what will the circumstances be like in 2050, when India will have an additional 430 million people to feed? [6,7,8].
Food grain prediction is a significant activity for global decision-making in the agriculture sector. An accurate foodgrains prediction approach will help farmers decide when and how much to cultivate [9]. In this situation, it is even more essential to forecast the quantity of crops produced across India effectively. To accomplish this, multiple forecasting models were used to choose the most appropriate model and anticipate future crop output values. An accurate crop production estimate serves as the cornerstone for its agricultural commodities marketing policy decisions. Crop yield forecasting is essential in farm planning and management, domestic food production, international food trade, habitat preservation, and other fields [10]. As a result, the primary goal of this work is to provide an adequate model for forecasting food grain production in India. Several models were used for crop yield forecasting: machine learning algorithms and modeling techniques are often undervalued in defining agricultural produce. A land of a growing population undoubtedly requires optimized output from agriculture. Incorporating the latest comprehensive technology in agriculture is the only way to survive. Liakos et al. [11] demonstrated how agri-technology domain and machine learning programs benefit farmers’ decisions regarding crop, livestock, water, and soil management. Veenadhari et al. [12] have attempted to develop a crop advisor to consider the effect of the most influential climate change parameters on crop yields in Madhya Pradesh. Paul and Sinha [13] forecasted wheat yield concerning weather variables in the Kanpur district of Uttar Pradesh using parametric and non-parametric tests. Turmeric yield was similarly forecasted on climate parameters using a data set for 20 years [14]. In another study performed by Kandiannan et al. [14], turmeric yield was predicted using a multiple regression model where a significant correlation was found between yield and rainfall, evaporation wind speed, minimum temperature, and humidity. Gandhi and Armstrong [15] in their study examined the correlation between climate factors and rice crop yields for the Kharif season in India using a historical data set of agriculture. Likewise, Woodhouse [16] also found technological options for the productivity revolution in Africa. With technology, environmental impact is also an important dimension to consider. The research study by Chlingaryan et al. [17] talks about precision agriculture and improvement in yield production with proper nitrogen management. The future demands sustainable and optimized crop management while minimizing environmental harm. Machine learning algorithms and remote sensing technologies are of great use in this estimation. According to [18], machine learning is a reliable technology for analyzing crop price prediction considering annual rainfall, WPI to produce a logical conclusion. Random Forest Ensembled (RFE) learning and decision tree regressor were compared, and RFE was found to have a higher accuracy. Data mining techniques helped improve crop yield prediction as reflected by studies conducted by Gandhi and Armstrong [15] and Sujatha and Isakki [19].
The developing model that accurately forecasts growth and output can support policy makers by improving control for accurate production, equating market supply and demand with lower cost. According to Alhnaity et al. [20], development in machine learning, in particular, deep learning, can provide new analytical tools.
India has the most significant number of hungry people in the world, as the research work by Banik [21] demonstrates; it reviews India’s success in preventing famine and compares it to the country’s inability to provide food security to hundreds of millions of its citizens. A study conducted by Sherman [22] explores how the methods to get the country to “grow more food” as part of the drive towards nationwide self-sufficiency were marked by conflict between the dream of supplying the benefits of development to all Indians and the reality that the resources in independent India were minimal. Hazra and Bohra [23] suggested meeting growing food demand by ensuring sustainable intensification of crops, achieving high yield per unit production, and bringing more cropland area under cultivation. Thus, achieving food security with constrained resources is required. Agricultural policy makers in most countries rely on prediction models to influence food security policies.
The paper by Mittal et al. [24] stresses using modeling results to coordinate environmental and renewable energy source policies to reach the GHG emission lowering target efficiently and cost-effectively. The mitigation price in terms of economic development and social welfare loss can be reduced by improving the penetration of renewable energy. Farmer’s income is also considered an essential factor in determining agricultural yield. Farmers who lack access to resources, capital, and information lag in optimizing their produce. So, Birthal et al. [25] suggested diversification by small farmers towards high-value crops such as fruits and vegetables that significantly raise farm income. Choosing and producing a crop is a tedious task as various factors affect agriculture such as soil, temperature, humidity, and rainfall, and these differ across regions. According to Thomas [26], it would be of incredible help to agricultural farmers to determine on the storage and business side if it was possible to find the best crop before sowing it. Recommending appropriate crops to farmers can significantly increase productivity and reduce loss. Sujatha and Isakki [19] also demonstrate the crop yield estimation by selecting the most excellent crop, thereby improving the value and gain of the farming area using data mining.
Kang et al. [27] present an estimation of maize yield forecast in the U.S. using different machine learning algorithms [random forest, XGBoost, support vector regressor, Lasso, LSTM, and CNN]. Furthermore, the study found that the yield prediction method helps from the advanced algorithm and large composite of details. The XGBoost outperforms other algorithms both in exactness and strength. It also showed that incorporating the best algorithm and inputs enhances the prediction exactness by 5% compared to the baseline model (Lasso). Zheng et al. [28] added that crewless-aerial-vehicle (UAV)-based remote sensing (RS) possesses the significant advantage of collecting images for precision agricultural applications. Data visualization is vital for understanding the general trends of various factors influencing crop yield. The study by Gandhi and Armstrong [15] examined data visualization approaches to find a correlation between rice crop yield and climatic factors. In agriculture, crop yield production is becoming a significantly challenging task. Earlier, yield prediction was performed by considering farmers’ experience in particular. However, this demands accuracy and farmers’ involvement which is not always feasible. So competitive analysis was performed by Sivanandhini et al. [29] of the crop yield protection model for a district in Tamil Nadu. Then, empirical investigation suggested that SVM (support vector regression) implementation is better than KNN (k-nearest neighbor) decision tree models. The SVM model with two other enhanced variants, F-SVM (fuzzy-support vector regression) and BS-SVR (boosted-support vector regression), was used by Fung et al. [30] to predict the standardized precipitation and transpiration indices and minimize potential drought impact in oil palm plantations at the Langat river basin. F-SVR was found to be best with improved accuracy.

Research Gap

Studies have explained multiple views on the comparative performance of forecasting models. Hence, further investigation is required to help unify a coherent picture of an advanced methodology. This paper, therefore, seeks to compare and clarify different contradictory opinions reported in the literature on the superiority of ARIMA, SutteARIMA, Holt-Winters (H-W), and Neural Network Auto-Regressive (NNAR) models. Moreover, the study has compared ARIMA, SutteARIMA, H-W, and NNAR with their MAPE and MSE values to suggest an appropriate prediction model. The ARIMA method was chosen because it is used in agriculture crop research and foodgrains predictions; for example, ARIMA was used to forecast food crop price [31] and Chinese food grain price [32]; ARIMA along with LSTM-NN models was used to forecast rice cultivation in India [33]. Studies have also predicted the amount of monsoon rainfall in Andhra Pradesh, India by using MANNs (Modular Artificial Neural Network). Subsequently, the crop yield is predicted with rainfall data and the area allotted to that gave crop output by support vector regression. ANN and multiple linear regression were also used by Maya Gopal and Bhargavi [34]. A hybrid relationship of the Multiple Linear Regression-Artificial Neural Network (MLR-ANN) model has been proposed in research work for crop yield forecasts. This hybrid model was compared with Artificial Neural Network (ANN), Multiple Linear Regression (MLR), Support Vector Regression (SVR), k-Nearest Neighbor (KNN), and Random Forest models, and its accuracy outperforms all the other conventional models.
The SutteARIMA model also performed well in several other areas, including COVID-19 and stock prices in Spain [35], to predict COVID-19 in the U.S. [36], and to predict COVID-19 in the World [37]. Moreover, the Holt-Winters method is one of the most widely used predicting algorithms [38]. The method estimated the trend in overall emissions of organic water contaminants in Poland and Romania [39]. In one study, Kahforoushan et al. [40] applied ANN and H-W methods to predict the added value of agriculture subsections. In addition, Michel and Makowski [41] have compared different forecasting models, including Holt-Winters, to predict wheat yield in France.

2. Model

2.1. ARIMA

The Yt process is an ARIMA(p,q) or autoregressive-moving average model if it fulfilled as:
ϕ p B Y t = θ q B a t , a t ~ W N 0 , σ 2 , ϕ p , θ q , t .
with ϕp(B) is AR(p) and θq(B) is MA(q).
The ARIMA model becomes:
ϕ p B 1 B d Y t = θ q B a t , a t ~ W N 0 , σ 2 , ϕ p , θ q , t .
If there is a difference with ϕp(B) is AR(p), (1 − B)d is differencing non-seasonal, and θq(B) is MA(q).

2.2. α-Sutte Indicator

This model (α-Sutte indicator) are developed on the technique of forecasting on the previous values of the indicator set [42]. The ideal applies the adapted version of the moving average method of forecasting. α-Sutte indicator is essentially used to identify and forecast trends and fluctuations in a time-series database. It is based on four data sets of the previous period, that is, Yt−1, Yt−2, Yt−3, and Yt−4 [42]. The equations of the α-Sutte indicator method are [42]:
Y t = γ Δ x γ + δ 2 + β Δ y β + γ 2 + α Δ z α + β 2 3  
where:
δ = Y t 4 γ = Y t 3 β = Y t 2 α = Y t 1
Δ x = γ δ = Y t 3 Y t 4 Δ y = β γ = Y t 2 Y t 3 Δ z = α β = Y t 1 Y t 2
Y t = data at t time, Y t k = data at (t − k) time

2.3. SutteARIMA

SutteARIMA is a method of forecasting that incorporates ARIMA with the method of α-Sutte Indicator [42]. SutteARIMA using the average of results from ARIMA and α-Sutte indicator.
The ARIMA in Equation (1) can be described as:
1 ϕ 1 B ϕ 2 B 2 ϕ p B p Y t = 1 θ 1 B θ 2 B 2 θ q B q a t Y t ϕ 1 B Y t ϕ 2 B 2 Y t ϕ p B p Y t = a t θ 1 B a t θ 2 B 2 a t θ q B q a t
While Equation (3) can be reduced by using the backward shift operator (BpZt = Zt−p):
Y t ϕ 1 Y t 1 ϕ 2 Y t 2 ϕ p Y t p = a t θ 1 a t 1 θ 2 a t 2 θ q a t q Y t = ϕ 1 Y t 1 + ϕ 2 Y t 2 + + ϕ p Y t p + a t θ 1 a t 1 θ 2 a t 2 θ q a t q
If we define:
δ = Y t 4 γ = Y t 3 β = Y t 2 α = Y t 1
Equation (4):
Y t = ϕ 1 α + ϕ 2 β + ϕ 3 γ + ϕ 4 δ + + ϕ p Y t p + a t θ 1 a t 1 θ 2 a t 2 θ q a t q
Furthermore, Equation (2) can be simplified as:
Y t = γ Δ x γ + δ 2 + β Δ y β + γ 2 + α Δ z α + β 2 3
Y t = γ Δ x γ + δ 2 + β Δ y β + γ 2 + α Δ z α + β 2 3
Y t = γ Δ x 3 γ + 3 δ 2 + β Δ y 3 β + 3 γ 2 + α Δ z 3 α + 3 β 2
Y t = 2 γ Δ x 3 γ + 3 δ + 2 β Δ y 3 β + 3 γ + 2 α Δ z 3 α + 3 β
Y t = γ 2 Δ x 3 γ + 3 δ + β 2 Δ y 3 β + 3 γ + α 2 Δ z 3 α + 3 β
When Equation (4) is added to Equation (5), we find:
2 Y t = ϕ 1 α + ϕ 2 β + ϕ 3 γ + ϕ 4 δ + + ϕ p Y t p + a t θ 1 a t 1 θ 2 a t 2 θ q a t q + γ 2 Δ x 3 γ + 3 δ + β 2 Δ y 3 β + 3 γ + α 2 Δ z 3 α + 3 β
Z t = α ϕ 1 2 + Δ z 3 α + 3 β + β ϕ 3 2 + Δ y 3 β + 3 γ + γ ϕ 3 2 + Δ x 3 γ + 3 δ + ϕ 4 δ 2 + + ϕ p Y t p 2 + a t 2 θ 1 a t 1 2 θ 2 a t 2 2 θ q a t q 2
Therefore, Equation (6) is the formula of SutteARIMA.

2.4. Holt-Winters

Holt-Winters approach was divided into two categories: Additive Holt-Winters (AH-W) and Multiplicative Holt-Winters (MH-W). Theseparts are explained in following equations:
The component form for the multiplicative method is:
A t = α y t s t m + 1 α A t 1 + B t 1 ,
B t = β * A t A t 1 + 1 β * B t 1 ,
X t = γ y t t + 1 γ X t m ,
F t + m = A t + B t + m   X t s + m ,
The component form for the additive method is:
A t = α y t X t m + 1 α A t 1 + B t 1 ,
B t = β * A t A t 1 + 1 β * B t 1   ,
X t = γ y t A t 1 B t 1 + 1 γ X t m   ,
F t + m = A t + B t + m   X t s + m ,
where: yt = data on t time, s = the seasonal length in a certain time, and m = the amount of data to be predicated.

2.5. Neural Network Auto-Regressive (NNAR)

The NNAR is the widely used model for time series modeling and forecasting [43]. This model is a three-layer feedforward neural network involving linear combination and activation functions. The relationship between the model output (yt) and the inputs (at−1; …; yt−p) has the following mathematical representation:
y t = w 0 t = 0 k w j   . g w 0 j + i = 0 n w i , j   . y t 1 + E t ,
where, wi,j (i = 0, 1, 2, …, n, j = 1, 2, …, h) and wj (j = 0, 1, 2, …, h) are techniques parameters or association weights; n is number of input nodes; and h is number of hidden nodes. A sigmoid function used as the hidden layer transfer function shown in Equation (16). A linear function is the output layer’s most widely used activation function.
S i g x = 1 1 + exp x ,

3. Methodology

Agriculture food grain production data from 1950–1951 to 2019–2020 was collected from the open data source of Reserver Bank of India (RBI) in India [44]. Reserve Bank of India collects agriculture food grain data from the Ministry of Agriculture & Farmers Welfare, Government of India. Data for 2019–2020 is based on fourth advance estimates. This data was fragmented into five parts: wheat, rice, coarse, pulses, and total crops. Total crops are the sum of all four crops (wheat, rice, coarse, and pulses). The data was split into two categories: training data and test data. Data from 1951 to 2013 was considered for training on that basis test conducted from 2014 to 2020. Based on the fitted data, this study conducted short-term forecasts for the next five years (2021–2025). Ahmar [45] designed the SutteForecast R package in R software which compares the forecasting outcomes of other forecasting approaches. In diagnosing the predictions, we used the forecasting accuracy estimation MAPE (Mean Absolute Percentage Error) [46].
MAPE = 1 N t = 1 N   A t Y t A t ,
The mean square error is
MSE = t = 1 n A t Y t 2 N ,
where, At = actual values at data time t, and Y t =forecast value at data time t.

4. Experimental Results and Discussion

Due to extensive growth of the population, there is a need to make predictions on foodgrains. Past studies, which have made predictions on various aspects, employed limited approach/models. The present research study employs SutteForecast R-studio package to analyze models for ARIMA, SutteARIMA, H-W, and NNAR models, which would be quite helpful in making relatively accurate predictions. Hence, the usage of extensive models makes this study unique and valuable. The acquired findings are provided in the sections below. The first step for analysis is plot data. The plot of data can see in Figure 1.
From Figure 1, we can see that the data has increased from year to year in both data rice, wheat, coarse cereals, pulses, and total foodgrains.

4.1. Result of ARIMA Model

Although ARIMA has been working as a good model for forecasting [47], the current experiment with the ARIMA model gave relatively poor forecast results, as indicated in Table 1, Table 2, Table 3, Table 4 and Table 5. Therefore, ARIMA has not been considered the best model based on the criteria of MAPE and MSE. As shown in Table 6 the mean absolute percentage error (MAPE) and the mean square error (MSE) values for wheat (MAPE:3.71 and MSE:339.78), rice (MAPE:3.23 and MSE:260.77), coarse (MAPE:7.68 and MSE:201.08), pulses (MAPE:16.06 and MSE:272.83), and total (MAPE:2.83 and MSE:3082.6) with ARIMA is relatively higher as compared to some of the other models.

4.2. Result of SutteARIMA Model

In this study, the SutteARIMA model has performed as the best forecast model in predicting wheat, rice, coarse, pulses, and total crops, as indicated in Table 1, Table 2, Table 3, Table 4 and Table 5. On the other hand, the forecast error is relatively low and outstanding since the projected values are near to the actual values and move in the direction of the forecasted values in many instances, as demonstrated in Table 1, Table 2, Table 3, Table 4 and Table 5. In addition, MAPE and MSE values with the SutteARIMA model were recorded relatively lower compared to ARIMA, Holt-Winter (except coarse), and NNAR, as given in Table 6. The actual and predicted values are presented in Table 1, Table 2, Table 3, Table 4 and Table 5, comparing all the models mentioned in the study. Hence, this model has been selected as the best model to predict crop production for the next five years (2021–2025). Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6 provide the graph for the predicted values of crops.

4.3. Comparison of SutteARIMA and Holt-Winters (H-W)

From the observed outcomes presented in Table 1, Table 2, Table 3, Table 4 and Table 5 and Figure 2, Figure 3, Figure 4 and Figure 5, it is observed that the forecasting exactness level of the SutteARIMA model compared with that of the Holt-Winters (H-W) model is not quite significantly different from each other. It can be argued that both models achieved good forecast performance. They were judged on the prediction error (MAPE and MSE) for both models (SutteARIMA and Holt-Winters), which came out pretty low as compared to ARIMA and NNAR models. The SutteARIMA model was also found as the best method in several studies to forecast COVID-19 and the stock price in Spain [35], to predict COVID-19 in the U.S. [36], and to predict COVID-19 in world [37]. The Holt-Winters approach is one of the widely used forecasting techniques [38]. It was used to evaluate the movement in overall emissions of organic water contaminants as well as the exposure of the textile industry to pollution in Eastern Europe [39]. It has ensured the best predicting values for long-term and short-term thermal load forecasting [48]. However, the execution of the SutteARIMA model is better than the Holt-Winter model in terms of forecasting accuracy in test data in the current study. Table 6 shows that the SutteARIMA model is better than the Holt-Winters model for wheat, rice, pulses, and total food grain production, whereas Holt-Winters performed well to predict coarse production with lower MAPE (4.36) and MSE (85.20) values. Statistical tests were carried out, showing no significant difference between the actual and predicted values of the SutteARIMA and Holt-Winters models, as shown in Table 1, Table 2, Table 3, Table 4 and Table 5. Notwithstanding, SutteARIMA is still better. Hence, this study further clarifies the contrary opinions reported in the literature about the superiority of the SutteARIMA model over ARIMA, Holt-Winters, and NNAR models in time series prediction.

4.4. Result of NNAR Model

MAPE and MSE values with the NNAR model were significantly higher compared to ARIMA, SutteARIMA, and H-W. This finding agrees with some research studies; ARIMA was found more accurate in forecasting the horizon-daily values of CO2 than NNAR [49], in admission the NNAR model is considered inferior to H-W and ARIMA to predict tourist arrival data [50]. However, in several studies, it performed quite well, including unemployment rate prediction in Romania [51] and dengue epidemics [52].
Based on forecasting results, this study concludes that the SutteARIMA model is the most suitable forecasting method to predict foodgrains production in India. The value of MAPE, the least that of MSE value were verified. India’s SutteARIMA model was used to predict foodgrains production from 2021–2025. The result of the forecast of SutteARIMA is shown in Table 7. In addition, Table 7, presents the lower and upper value (99%) of forecasting.

5. Conclusions

The observed results combined with published food grain data on the performance of different prediction models SutteARIMA, ARIMA, Holt-Winters, and NNAR models were used in this study. The performance of the SutteARIMA predictive model applied in this study was compared with the conventional ARIMA, Holt-Winters, and NNAR models, which were widely used for time series forecasting. Our findings reveal that both the SutteARIMA model and the Holt-Winters model could achieve reliable forecasts when applied to real-life problems. This helps further to effectively and profitably engage food grain forecasting in India. The food grain prediction with the SutteARIMA approach revealed superior performance over ARIMA, Holt-Winters, and NNAR models. Indeed, the actual and predicted values of the SutteARIMA and Holt-Winters predictive model are quite close in predicting food grain production in India. This has been verified by MAPE and MSE values that are relatively low for the SutteARIMA model. India’s SutteARIMA model has been used to predict food grain production from 2021 to 2025. The forecasted amount of respective crops in lakh tonnes was predicted to be 1140.14 (wheat), 1232.27 (rice), 466.46 (coarse), 259.95 (pulses), and a total of 3069.80 (foodgrains) by 2025.

6. Policy Implication and Further Suggestion for Research

India enjoys the dubious distinction of being ranked the second most populated country on the planet. Speculations are made that in the near future, India can supposedly overtake China in the number of inhabitants. This presents a precarious condition as the rising population will eventually place demand on foodgrains for human survival, and it would necessarily be the responsibility of the State to provide the same. Hence, to feed the huge population, there has to be a realistic attempt to understand what would be the possible demand placed by the population. Predictions about foodgrains can assist in understanding whether the current rate at which foodgrains are produced would be adequate to meet the increasing demands. Accurate predictions of foodgrains is a herculean task in the Indian context considering the volatile environment and extremely diverse soil quality. The issues of predictions of foodgrains will be resolved if the appropriate method is used. The SutteARIMA model was found to be a suitable method for predictions of foodgrains in India. Moreover, it will assist in providing a baseline from which various policymakers can seek to resolve future predictions of foodgrains while designing the strategy. It is estimated that the world population will grow to 9 billion by the year 2050.
Consequently, there will be enormous challenges at the global level to produce a significant amount of food as compared to the present scenario. In the given scenario, the study would ideally help in accurately anticipating food grain production. This study would essentially assist the policy makers/decision makers in designing special policies in case of a serious mismatch between the demand placed for foodgrains by the ever-increasing population and the predictions about the production of foodgrains. By and large, these policy interventions would yield benefits for future studies and also for the larger society. In the future, research combining this model with other methods, such as the LSTM, RNN, decision tree, and reinforcement learning methods, can lead to better forecasting accuracy. This research work has been conducted for academic and research only, and the predictions for the future are based on the premise that the existing restrictive circumstances will remain.

Author Contributions

A.S.A.: conceptualization, methodology, software data curation, visualization, writing—original draft preparation, writing—reviewing and editing. P.K.S.: conceptualization, methodology, software data curation, visualization, writing—original draft preparation, writing—reviewing and editing. R.R.: software data curation, visualization, writing—reviewing. A.K.P.: investigation, supervision, validation, writing—reviewing and editing. S.G.: software data curation, visualization. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Huang, J.; Pray, C.; Rozelle, S. Enhancing the Crops to Feed the Poor. Nature 2002, 418, 678–684. [Google Scholar] [CrossRef] [PubMed]
  2. Ray, D.K.; Ramankutty, N.; Mueller, N.D.; West, P.C.; Foley, J.A. Recent Patterns of Crop Yield Growth and Stagnation. Nat. Commun. 2012, 3, 1293. [Google Scholar] [CrossRef] [Green Version]
  3. von Braun, J. Food Insecurity, Hunger and Malnutrition: Necessary Policy and Technology Changes. New Biotechnol. 2010, 27, 449–452. [Google Scholar] [CrossRef] [PubMed]
  4. Pathak, P.K.; Singh, A. Trends in Malnutrition among Children in India: Growing Inequalities across Different Economic Groups. Soc. Sci. Med. 2011, 73, 576–585. [Google Scholar] [CrossRef]
  5. Chatterjee, N.; Fernandes, G.; Hernandez, M. Food Insecurity in Urban Poor Households in Mumbai, India. Food Secur. 2012, 4, 619–632. [Google Scholar] [CrossRef]
  6. FAO. Food and Agriculture Organization of the United Nations. Available online: http://www.fao.org/home/en/ (accessed on 31 May 2021).
  7. Le Mouël, C.; Forslund, A. How Can We Feed the World in 2050? A Review of the Responses from Global Scenario Studies. Eur. Rev. Agric. Econ. 2017, 44, 541–591. [Google Scholar] [CrossRef]
  8. Pingali, P.; Aiyar, A.; Abraham, M.; Rahman, A. Indian Food Systems towards 2050: Challenges and Opportunities. In Transforming Food Systems for a Rising India; Palgrave Studies in Agricultural Economics and Food Policy; Pingali, P., Aiyar, A., Abraham, M., Rahman, A., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 1–14. ISBN 978-3-030-14409-8. [Google Scholar]
  9. van Klompenburg, T.; Kassahun, A.; Catal, C. Crop Yield Prediction Using Machine Learning: A Systematic Literature Review. Comput. Electron. Agric. 2020, 177, 105709. [Google Scholar] [CrossRef]
  10. Guo, W.W.; Xue, H. Crop Yield Forecasting Using Artificial Neural Networks: A Comparison between Spatial and Temporal Models. Math. Probl. Eng. 2014, 2014, 857865. [Google Scholar] [CrossRef]
  11. Liakos, K.G.; Busato, P.; Moshou, D.; Pearson, S.; Bochtis, D. Machine Learning in Agriculture: A Review. Sensors 2018, 18, 2674. [Google Scholar] [CrossRef] [Green Version]
  12. Veenadhari, S.; Misra, B.; Singh, C. Machine Learning Approach for Forecasting Crop Yield Based on Climatic Parameters. In Proceedings of the 2014 International Conference on Computer Communication and Informatics, Coimbatore, India, 3–5 January 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–5. [Google Scholar]
  13. Paul, R.K.; Sinha, K. Forecasting Crop Yield: A Comparitve Assessment of Arimax and Narx Model. RASHI J. Soc. Appl. Stat. Agric. Allied Sci. 2016, 1, 77–85. [Google Scholar]
  14. Kandiannan, K.; Chandaragiri, K.K.; Sankaran, N.; Balasubramanian, T.N.; Kailasam, C. Crop-Weather Model for Turmeric Yield Forecasting for Coimbatore District, Tamil Nadu, India. Agric. For. Meteorol. 2002, 112, 133–137. [Google Scholar] [CrossRef]
  15. Gandhi, N.; Armstrong, L.J. Rice Crop Yield Forecasting of Tropical Wet and Dry Climatic Zone of India Using Data Mining Techniques. In Proceedings of the 2016 IEEE International Conference on Advances in Computer Applications (ICACA), Coimbatore, India, 24 October 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 357–363. [Google Scholar]
  16. Woodhouse, P. Technology, Environment and the Productivity Problem in African Agriculture: Comment on the World Development Report 2008. J. Agrar. Chang. 2009, 9, 263–276. [Google Scholar] [CrossRef]
  17. Chlingaryan, A.; Sukkarieh, S.; Whelan, B. Machine Learning Approaches for Crop Yield Prediction and Nitrogen Status Estimation in Precision Agriculture: A Review. Comput. Electron. Agric. 2018, 151, 61–69. [Google Scholar] [CrossRef]
  18. Prassanna, A.S.; Harshanand, B.A.; Srishti, B.; Chaitanya, R.; Soundari, K.N.; Sriram, S.; Kumar, V.M.; Chennamsetti, V.; Venkateshwaran, G.; Maurya, P.K. Crop Value Forecasting Using Decision Tree Regressor and Models. Eur. J. Mol. Clin. Med. 2020, 7, 3702–32722. [Google Scholar]
  19. Sujatha, R.; Isakki, P. A Study on Crop Yield Forecasting Using Classification Techniques. In Proceedings of the 2016 International Conference on Computing Technologies and Intelligent Data Engineering (ICCTIDE’16), Kovilpatti, India, 7–9 January 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–4. [Google Scholar]
  20. Alhnaity, B.; Pearson, S.; Leontidis, G.; Kollias, S. Using Deep Learning to Predict Plant Growth and Yield in Greenhouse Environments. Acta Hortic. 2020, 1296, 425–432. [Google Scholar] [CrossRef]
  21. Banik, D. The Hungry Nation: Food Policy and Food Politics in India. Food Ethics 2016, 1, 29–45. [Google Scholar] [CrossRef] [Green Version]
  22. Sherman, T.C. From ‘Grow More Food’ to ‘Miss a Meal’: Hunger, Development and the Limits of Post-Colonial Nationalism in India, 1947–1957. South Asia J. South Asian Stud. 2013, 36, 571–588. [Google Scholar] [CrossRef] [Green Version]
  23. Hazra, K.K.; Bohra, A. Increasing Relevance of Pulse Crops to Sustainable Intensification of Indian Agriculture. Natl. Acad. Sci. Lett. 2021, 44, 1–3. [Google Scholar] [CrossRef]
  24. Mittal, S.; Dai, H.; Fujimori, S.; Masui, T. Bridging Greenhouse Gas Emissions and Renewable Energy Deployment Target: Comparative Assessment of China and India. Appl. Energy 2016, 166, 301–313. [Google Scholar] [CrossRef]
  25. Birthal, P.S.; Joshi, P.K.; Roy, D.; Thorat, A. Diversification in Indian Agriculture toward High-Value Crops: The Role of Small Farmers. Can. J. Agric. Econ. Can. Agroecon. 2013, 61, 61–91. [Google Scholar] [CrossRef]
  26. Thomas, K.T. Crop Prediction Using Machine Learning. Int. J. Future Gener. Commun. Netw. 2020, 13, 6. [Google Scholar]
  27. Kang, Y.; Ozdogan, M.; Zhu, X.; Ye, Z.; Hain, C.; Anderson, M. Comparative Assessment of Environmental Variables and Machine Learning Algorithms for Maize Yield Prediction in the US Midwest. Environ. Res. Lett. 2020, 15, 064005. [Google Scholar] [CrossRef]
  28. Zheng, H.; Li, W.; Jiang, J.; Liu, Y.; Cheng, T.; Tian, Y.; Zhu, Y.; Cao, W.; Zhang, Y.; Yao, X. A Comparative Assessment of Different Modeling Algorithms for Estimating Leaf Nitrogen Content in Winter Wheat Using Multispectral Images from an Unmanned Aerial Vehicle. Remote Sens. 2018, 10, 2026. [Google Scholar] [CrossRef]
  29. Sivanandhini, P.; Prakash, J. Crop Yield Prediction Analysis Using Feed Forward and Recurrent Neural Network. Int. J. Innov. Sci. Res. Technol. 2020, 5, 1092–1096. [Google Scholar]
  30. Fung, K.F.; Huang, Y.F.; Koo, C.H.; Mirzaei, M. Improved SVR Machine Learning Models for Agricultural Drought Prediction at Downstream of Langat River Basin, Malaysia. J. Water Clim. Chang. 2020, 11, 1383–1398. [Google Scholar] [CrossRef]
  31. Shao, Y.E.; Dai, J.-T. Integrated Feature Selection of ARIMA with Computational Intelligence Approaches for Food Crop Price Prediction. Complexity 2018, 2018, e1910520. [Google Scholar] [CrossRef]
  32. Zou, H.F.; Xia, G.P.; Yang, F.T.; Wang, H.Y. An Investigation and Comparison of Artificial Neural Network and Time Series Models for Chinese Food Grain Price Forecasting. Neurocomputing 2007, 70, 2913–2923. [Google Scholar] [CrossRef]
  33. Paidipati, K.; Banik, A. Forecasting of Rice Cultivation in India-A Comparative Analysis with ARIMA and LSTM-NN Models. ICST Trans. Scalable Inf. Syst. 2018, 7, 161409. [Google Scholar] [CrossRef]
  34. Maya Gopal, P.S.; Bhargavi, R. A Novel Approach for Efficient Crop Yield Prediction. Comput. Electron. Agric. 2019, 165, 104968. [Google Scholar] [CrossRef]
  35. Ahmar, A.S.; Boj, E. SutteARIMA: Short-Term Forecasting Method, a Case: Covid-19 and Stock Market in Spain. Sci. Total Environ. 2020, 729, 138883. [Google Scholar] [CrossRef]
  36. Ahmar, A.; Boj, E. Current Research in Behavioral Sciences Will COVID-19 Confirmed Cases in the USA Reach 3 Million? A Forecasting Approach by Using SutteARIMA Method. Curr. Res. Behav. Sci. 2020, 1, 100002. [Google Scholar] [CrossRef]
  37. Ahmar, A.S.; Rusli, R. Will Covid-19 Cases in the World Reach 4 Million? A Forecasting Approach Using SutteARIMA. JOIV Int. J. Inform. Vis. 2020, 4, 159–161. [Google Scholar] [CrossRef]
  38. Trull, O.; García-Díaz, J.C.; Troncoso, A. Initialization Methods for Multiple Seasonal Holt-Winters Forecasting Models. Mathematics 2020, 8, 268. [Google Scholar] [CrossRef] [Green Version]
  39. Paraschiv, D.; Tudor, C.; Petrariu, R. The Textile Industry and Sustainable Development: A Holt-Winters Forecasting Investigation for the Eastern European Area. Sustainability 2015, 7, 1280–1291. [Google Scholar] [CrossRef]
  40. Kahforoushan, E.; Zarif, M.; Mashahir, E.B. Prediction of Added Value of Agricultural Subsections Using Artificial Neural Networks: Box-Jenkins and Holt-Winters Methods. J. Dev. Agric. Econ. 2010, 2, 115–121. [Google Scholar] [CrossRef]
  41. Michel, L.; Makowski, D. Comparison of Statistical Models for Analyzing Wheat Yield Time Series. PLoS ONE 2013, 8, e78615. [Google Scholar] [CrossRef] [Green Version]
  42. Ahmar, A.S.; Rahman, A.; Mulbar, U. α-Sutte Indicator: A New Method for Time Series Forecasting. J. Phys. Conf. Ser. 2018, 1040, 012018. [Google Scholar] [CrossRef] [Green Version]
  43. Zhang, G.; Eddy Patuwo, B.; Hu, M.Y. Forecasting with Artificial Neural Networks: The State of the Art. Int. J. Forecast. 1998, 14, 35–62. [Google Scholar] [CrossRef]
  44. Reserver Bank of India. Dbie-Rbi: Database of Indian Economy. Available online: https://dbie.rbi.org.in/DBIE/dbie.rbi?site=statistics (accessed on 20 May 2021).
  45. Ahmar, A.S. RcmdrPlugin. SutteForecastR: An RcmdrPlugin for Forecasting Data. J. Phys. Conf. Ser. 2018, 1028, 012224. [Google Scholar] [CrossRef] [Green Version]
  46. Kim, S.; Kim, H. A New Metric of Absolute Percentage Error for Intermittent Demand Forecasts. Int. J. Forecast. 2016, 32, 669–679. [Google Scholar] [CrossRef]
  47. Adebiyi, A.A.; Adewumi, A.O.; Ayo, C.K. Comparison of ARIMA and Artificial Neural Networks Models for Stock Price Prediction. J. Appl. Math. 2014, 2014, 614342. [Google Scholar] [CrossRef] [Green Version]
  48. Tratar, L.F.; Strmčnik, E. The Comparison of Holt-Winters Method and Multiple Regression Method: A Case Study. Energy 2016, 109, 266–276. [Google Scholar] [CrossRef]
  49. Maleki, A.; Nasseri, S.; Aminabad, M.S.; Hadi, M. Comparison of ARIMA and NNAR Models for Forecasting Water Treatment Plant’s Influent Characteristics. KSCE J. Civ. Eng. 2018, 22, 3233–3245. [Google Scholar] [CrossRef]
  50. Thoplan, R. Simple v/s Sophisticated Methods of Forecasting for Mauritius Monthly Tourist Arrival Data. Int. J. Stat. Appl. 2014, 4, 217–223. [Google Scholar]
  51. Davidescu, A.A.; Apostu, S.-A.; Paul, A. Comparative Analysis of Different Univariate Forecasting Methods in Modelling and Predicting the Romanian Unemployment Rate for the Period 2021-2022. Entropy 2021, 23, 325. [Google Scholar] [CrossRef]
  52. Chakraborty, T.; Chattopadhyay, S.; Ghosh, I. Forecasting Dengue Epidemics Using a Hybrid Methodology. Phys. Stat. Mech. Its Appl. 2019, 527, 121266. [Google Scholar] [CrossRef]
Figure 1. Foodgrains production in India (1950–51 to 2019–20).
Figure 1. Foodgrains production in India (1950–51 to 2019–20).
Forecasting 05 00006 g001
Figure 2. Actual wheat versus predicted values using SutteARIMA and H-W.
Figure 2. Actual wheat versus predicted values using SutteARIMA and H-W.
Forecasting 05 00006 g002
Figure 3. Actual rice versus predicted values using SutteARIMA and H-W.
Figure 3. Actual rice versus predicted values using SutteARIMA and H-W.
Forecasting 05 00006 g003
Figure 4. Actual coarse versus predicted values using SutteARIMA and H-W.
Figure 4. Actual coarse versus predicted values using SutteARIMA and H-W.
Forecasting 05 00006 g004
Figure 5. Actual pulses versus predicted values using SutteARIMA and H-W.
Figure 5. Actual pulses versus predicted values using SutteARIMA and H-W.
Forecasting 05 00006 g005
Figure 6. Actual total crops versus predicted values using SutteARIMA and H-W.
Figure 6. Actual total crops versus predicted values using SutteARIMA and H-W.
Forecasting 05 00006 g006
Table 1. Results of the fitting value of wheat production in India.
Table 1. Results of the fitting value of wheat production in India.
YearActualARIMA (1,1,0) with DriftAPESutteARIMAAPEH-WAPENNAR(1,1)APE
2014959960.080.11969.681.11959.960.10934.122.59
2015865970.1512.16980.2313.32981.8413.51933.367.90
2016923985.996.82912.371.151003.738.75932.711.05
2017985999.611.48960.482.491025.624.12932.165.36
20189991014.081.511005.330.631047.504.85931.686.73
201910361028.220.751036.610.061069.393.22931.2610.10
202010761042.503.111058.551.621091.271.42930.9113.48
Table 2. Results of the fitting value of rice production in India.
Table 2. Results of the fitting value of rice production in India.
YearActualARIMA (0,1,1)
with Drift
APESutteARIMAAPEH-WAPENNAR (1,1)APE
201410671034.463.051071.210.391037.962.721011.005.24
201510551047.620.701075.881.981052.620.23985.606.57
201610441060.781.611058.251.371067.282.23968.777.20
201710971073.932.101057.673.591081.941.37957.1612.74
201811281087.093.631097.282.721096.592.78948.9215.87
201911651100.255.561126.593.301111.254.61942.9619.05
202011841113.405.961159.762.051125.914.91938.6120.72
Table 3. Results of the fitting value of coarse production in India.
Table 3. Results of the fitting value of coarse production in India.
YearActualARIMA (0,1,1)
with Drift
APESutteARIMAAPEH-WAPENNAR (2,2)APE
2014433398.647.93412.194.81416.323.85387.9810.39
2015429402.066.28417.702.63425.780.75383.0510.70
2016385405.485.32419.048.84435.2313.05380.021.29
2017438408.906.64395.079.80444.691.53378.5413.57
2018470412.3112.27426.969.16454.153.37377.6619.64
2019431415.733.54450.854.61463.607.56377.1912.48
2020475419.1511.76433.788.68473.060.41376.9220.64
Table 4. Results of the fitting value of pulses production in India.
Table 4. Results of the fitting value of pulses production in India.
YearActualARIMA (0,1,1)
with Drift
APESutteARIMAAPEH-WAPENNAR (2,2)APE
2014193173.3010.21184.904.20174.289.70178.347.59
2015172174.631.53185.828.03175.712.16178.533.80
2016164175.957.29174.466.38177.138.01178.348.74
2017231177.2823.26167.7527.38178.5622.70178.3322.80
2018254178.6129.68213.2616.04179.9929.14178.3229.80
2019221179.9318.58232.745.31181.4117.91178.3219.31
2020232181.2621.87213.098.15182.8421.19178.3223.14
Table 5. Results of the fitting value of total food grain production in India.
Table 5. Results of the fitting value of total food grain production in India.
YearActualARIMA (0,1,1)
with Drift
APESutteARIMAAPEH-WAPENNAR (1,1)APE
201426502536.404.292621.951.062550.903.742534.964.34
201525202568.491.922644.354.932591.822.852506.170.55
201625162600.573.362548.881.312632.744.642482.891.32
201727512632.664.302565.916.732673.662.812463.8810.44
201828502664.746.502727.004.322714.584.752448.2414.10
201928522696.835.442830.450.762755.503.382435.3014.61
202029662728.917.992848.493.962796.425.722424.5218.26
Table 6. Error values of models for wheat, rice, coarse, pulses, and total foodgrains.
Table 6. Error values of models for wheat, rice, coarse, pulses, and total foodgrains.
FoodgrainsForecasting ModelsMAPEMSE
WheatARIMA (1,1,0) with drift3.71339.78
SutteARIMA2.91294.93
Holt-Winters5.14520.84
NNAR (1,1)6.74912.86
RiceARIMA (0,1,1) with drift3.23260.77
SutteARIMA2.20106.32
Holt-Winters2.69180.96
NNAR (1,1)12.493566.22
CoarseARIMA (0,1,1) with drift7.68201.08
SutteARIMA6.93153.24
Holt-Winters4.3685.20
NNAR (2,2)12.67586.46
PulsesARIMA (0,1,1) with drift16.06272.83
SutteARIMA10.78133.09
Holt-Winters15.83260.16
NNAR (2,2)16.45278.95
Total FoodgrainsARIMA (0,1,1) with drift4.833082.16
SutteARIMA3.291652.90
Holt-Winters3.981857.13
NNAR (2,2)9.0914,800.21
Table 7. The SutteARIMA forecast for foodgrains (in lakh tonnes) from 2021 to 2025 in India.
Table 7. The SutteARIMA forecast for foodgrains (in lakh tonnes) from 2021 to 2025 in India.
YearWheatRiceCoarsePulsesTotal Food Grain
F *ULF *ULF *ULF *ULF *UL
20211072.74979.801165.691174.261124.201174.26446.74411.37482.10229.08180.67277.502894.252698.613089.90
20221089.59973.351205.831188.761118.001188.76451.67416.30487.03236.803186.89286.722938.142674.803201.48
20231106.44970.811242.081203.271116.581203.27456.60421.23491.96244.521193.14295.912982.032665.033299.03
20241123.29970.661275.921217.771117.641217.77461.53426.16496.89252.239199.41305.073025.912663.013388.83
20251140.14972.191308.091232.271120.281232.27466.46431.09501.82259.95205.72314.203069.802666.103473.52
* F-Forecasted Value, L-Lower value at 99%, U-Upper Value at 99%.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ahmar, A.S.; Singh, P.K.; Ruliana, R.; Pandey, A.K.; Gupta, S. Comparison of ARIMA, SutteARIMA, and Holt-Winters, and NNAR Models to Predict Food Grain in India. Forecasting 2023, 5, 138-152. https://doi.org/10.3390/forecast5010006

AMA Style

Ahmar AS, Singh PK, Ruliana R, Pandey AK, Gupta S. Comparison of ARIMA, SutteARIMA, and Holt-Winters, and NNAR Models to Predict Food Grain in India. Forecasting. 2023; 5(1):138-152. https://doi.org/10.3390/forecast5010006

Chicago/Turabian Style

Ahmar, Ansari Saleh, Pawan Kumar Singh, R. Ruliana, Alok Kumar Pandey, and Stuti Gupta. 2023. "Comparison of ARIMA, SutteARIMA, and Holt-Winters, and NNAR Models to Predict Food Grain in India" Forecasting 5, no. 1: 138-152. https://doi.org/10.3390/forecast5010006

Article Metrics

Back to TopTop