Next Article in Journal
SCM Enables Improved Single-Cell Clustering by Scoring Consensus Matrices
Next Article in Special Issue
Dynamically Meaningful Latent Representations of Dynamical Systems
Previous Article in Journal
Inverses and Determinants of n × n Block Matrices
Previous Article in Special Issue
Anomaly Detection in the Molecular Structure of Gallium Arsenide Using Convolutional Neural Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Hybrid Modeling of Spatial Autoregressive Exogenous Using Casetti’s Model Approach for the Prediction of Rainfall

by
Annisa Nur Falah
1,*,
Budi Nurani Ruchjana
2,
Atje Setiawan Abdullah
3 and
Juli Rejito
3
1
Doctoral Program of Mathematics, Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran, Sumedang 45363, Indonesia
2
Department of Mathematics, Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran, Sumedang 45363, Indonesia
3
Department of Computer Science, Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran, Sumedang 45363, Indonesia
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(17), 3783; https://doi.org/10.3390/math11173783
Submission received: 22 July 2023 / Revised: 30 August 2023 / Accepted: 1 September 2023 / Published: 3 September 2023
(This article belongs to the Special Issue Applied Mathematics and Machine Learning)

Abstract

:
Spatial Autoregressive (SAR) models are used to model the relationship between variables within a specific region or location, considering the influence of neighboring variables, and have received considerable attention in recent years. However, when the impact of exogenous variables becomes notably pronounced, an alternative approach is warranted. Spatial Expansion, coupled with the Casetti model approach, serves as an extension of the SAR model, accommodating the influence of these exogenous variables. This modeling technique finds application in the realm of rainfall prediction, where exogenous factors, such as air temperature, humidity, solar irradiation, wind speed, and surface pressure, play pivotal roles. Consequently, this research aimed to combine the SAR and Spatial Expansion models through the Casetti model approach, leading to the creation of the Spatial Autoregressive Exogenous (SAR-X) model. The SAR-X was employed to forecast the rainfall patterns in the West Java region, utilizing data obtained from the National Aeronautics and Space Administration Prediction of Worldwide Energy Resources (NASA POWER) dataset. The practical execution of this research capitalized on the computational capabilities of the RStudio software version 2022.12.0. Within the framework of this investigation, a comprehensive and integrated RStudio script, seamlessly incorporated into the RShiny web application, was developed so that it is easy to use.
MSC:
9004; 62M30

1. Introduction

Water is very important for all living organisms, including humans, thereby ranking as one of the most indispensable resources in the environment [1]. The phenomenon of climate change and global warming has significantly reshaped numerous environmental aspects across multiple countries. This transformation carries the potential for profound consequences, imperiling the populace, agriculture, ecosystems, economy, and industry. Alterations in precipitation patterns exert a direct impact on the management of water resources, agricultural practices, hydrological systems, and ecological balance [2]. It is crucial to acknowledge that the entirety of life-sustaining water on Earth originates from rainfall. Consequently, comprehending the spatial distribution and fluctuations in this rainfall is very important. This understanding is pivotal for efficient water resource management and also for devising strategies for addressing challenges such as anticipating natural hazards triggered by intense rainfall events [1,3,4]. Delving into the spatial arrangement and variances of rainfall can play a pivotal role in shaping perspective notions concerning water resources. Furthermore, it aids in formulating measures for upholding stable environmental circumstances. It is imperative to acknowledge that trends in rainfall constitute a pivotal climate determinant with far-reaching implications, encompassing future factors such as population growth, economic expansion, and enduring climatic shifts. These factors collectively impact both the temporal and spatial availability of water [1,5,6].
In various spheres of life, the anticipation of rainfall yields indispensable advantages, encompassing agriculture, water resource management, disaster preparedness, infrastructure planning, renewable energy, and transportation. Forecasts of rainfall assist farmers in coordinating their planting and harvesting schedules, allowing for effective irrigation planning, suitable crop selection according to weather conditions, and a decreased risk of crop failure [7,8,9]. Moreover, these predictions contribute to the proficient handling of water resources, encompassing the regulation of river flow and reservoir management [10,11]. The availability of accurate rainfall information empowers optimal water utilization and mitigates flood and drought hazards, as well as plays a pivotal role in the management of natural calamities such as floods and landslides [12,13,14]. Informed by projected rainfall data, authorities can implement preventive measures such as early evacuation, timely alerts, and the construction of robust infrastructure. Additionally, rainfall prediction has proved vital in the realm of infrastructure planning, guiding the construction of structures including dams, drainage networks, and irrigation channels [15,16,17]. By leveraging rainfall insights, engineers and planners are equipped to devise designs resilient to extreme weather conditions. Rainfall profoundly affects renewable energy industries, namely hydropower and solar energy [18]. Forecasts of rainfall facilitate the strategic planning and management of renewable energy production, thereby optimizing energy potential [19]. The influence of rainfall extends to transportation planning as well [20]. Intense rainfall impacts road traffic, transportation speeds, and driver safety. By leveraging rainfall predictions, transportation schedules can be organized more efficiently, ultimately reducing accident risks. The availability of accurate rainfall information empowers sound decision making and aids in mitigating the risks and repercussions arising from unforeseen weather fluctuations [21]. Therefore, the precision and accuracy of rainfall prediction are said to be important.
Based on previous research, diverse rainfall predictions have been undertaken, including projections of rainfall variability, particularly focusing on the winter and pre-monsoon rainfall across Pakistan [22]. In Turkey, monthly and seasonal rainfall trend predictions were calculated through a comparison of three interpolation methods, namely Inverse Distance Weighted, Completely Regularized Spline, and Ordinary Kriging [23]. A Spatial Autoregressive model was employed to predict seasonal legume yields for non-irrigated croplands in the semi-arid region of Mexico, categorized by regular rainy and dry seasons [24]. The rainfall and groundwater patterns along the northeastern coast of Brazil were predicted, and the monthly rainfall patterns were characterized to mitigate their impacts during heavy rainfall periods and significant floods, utilizing the Kriging method and a clustering analysis [25]. Additionally, short-term rainfall forecasts were derived using a Hierarchical Bayesian model for spatiotemporal data [26]. It is important to note that the scope of these predictions exclusively pertains to rainfall variability and does not encompass other climate variables.
One of the outcomes of climate phenomena is La Niña, which triggers heightened rainfall in the western Pacific region. Based on empirical data from BMKG, La Niña can amplify the rainfall in West Java by anywhere from 20% to 70% [27]. Rainfall is intertwined with other climatic factors such as air temperature, humidity, solar irradiation, wind speed, and surface pressure, all of which fluctuate across different areas [28]. The climate variables pertinent to specific regions or locations can be effectively modeled using a spatial analysis approach. Among the spatial models commonly employed, the Spatial Autoregressive (SAR) model stands out. This is frequently referred to as a mixed regressive spatial autoregressive model or a spatial lag model [29]. The SAR model, a statistical tool, is harnessed to depict the relationships between variables within a given region or location, considering the impacts of surrounding variables. However, the model falls short in capturing the spatial disparities between locations. To address this gap, Casetti [30] developed the Spatial Expansion model, which extends the SAR to encompass exogenous variables through a linear regression approach. The amalgamation of the SAR and spatial expansion models, in line with the approach of Casetti, culminates in the Spatial Autoregressive Exogenous (SAR-X) model. The SAR-X introduces spatial heterogeneity, thereby enabling the description of distinct parameter values for each spatial observation, determined by the geographical distances between locations. A notable application of the SAR-X is evident in gauging the repercussions of climate change on the ecosystem of a region. For example, when analyzing flood risk prediction within the SAR-X framework, it becomes imperative to incorporate climate variable factors. In this scenario, the model serves as a valuable tool, facilitating an elevated spatial perspective of climate effects and accounting for the spatial influences that shape relationships. In essence, SAR-X modeling has emerged as a valuable instrument for comprehending the ramifications of climate change and supporting decision making to tackle these consequences.
The current research on rainfall prediction entails observations across 13 climate-monitoring stations in the West Java region, utilizing data sourced from the Meteorology, Climatology, and Geophysics Agency (BMKG) with location indexing. The research variables comprised rainfall as the responsive parameter, coupled with exogenous variables encompassing air temperature, humidity, solar irradiation, wind speed, and surface pressure. These climate data were procured from the National Aeronautics and Space Administration Prediction of Worldwide Energy Resources (NASA POWER) dataset. The application study in this research is supported by computing using the R programming language. R is a statistical and graphical programming language; currently, R is widely known as one of the programming languages used for data analysis and data science. R has advantages, including being open source, having many packages available, finding it easy to transform and process data, and being able to create interactive applications/web-based dashboards. In this research, the RStudio software is used, which is an Integrated Development Environment (IDE) for R [31]. The custom-built RStudio script can be further evolved into a web application employing RShiny version 1.7.4, one of the packages of software specifically designed for creating interactive web interfaces [32]. These web applications, once deployed, offer a user-friendly platform for data processing. Users can conveniently upload data for analysis, and the ensuing results are promptly presented on the RShiny web application interface. In this research endeavor, integrated commands within R script enable the utilization of the SAR-X model through the RShiny web application, rendering the process effortlessly accessible.

2. Materials and Methods

2.1. Inverse Distance Weight Matrix

The spatial weight matrix is a matrix that states the proximity relationship between locations. In this study, the inverse distance weight matrix, also called the distance weight, is used, which describes the actual distance between locations. The distance between locations is calculated using the latitude and longitude coordinates of the center point of the observed location. It is known that a location x i j ( u i j , v i j ) with x i j is the symbol of the location i and j , with i = 1 , 2 , 3 , N and j = 1 , 2 , 3 , N , while u and v indicate the latitude and longitude coordinates. If d i j is the distance between location i and location j , and Wij is the inverse value of d i j , we use the Euclidean distance and the equation is as follows [33]:
d i j = ( x i ( u i ) x j ( u j ) ) 2 + ( x i ( v i ) x j ( v j ) ) 2
w i j = { 1 d i j , i j 0 , i = j
Next, if the sum of the distance weights of a row in the inverse distance weight matrix is not equal to 1, then the distance weights must be standardized, so as to obtain j = 1 N w i j = 1 ,     i = 1 , 2 , 3 , , N , where:
w i j * = w i j j = 1 N w i j ,     i = 1 , 2 , 3 , , N .
In detail:
x i j : the symbol of location i and j , with i = 1 , 2 , 3 , N and j = 1 , 2 , 3 , N
u i : latitude i th for i = 1 , 2 , 3 , , N
v i : longitude i th for i = 1 , 2 , 3 , , N
u j : latitude j th for j = 1 , 2 , 3 , , N
v j : longitude j th for j = 1 , 2 , 3 , , N
d i j : the distance between location i and location j
In this investigation, the inverse distance weight matrix was employed to calculate the climate variable data across 13 climate observation stations in the West Java region. The input coordinates for each location, expressed as latitude and longitude, served to establish the proximity relationship between these locations.

2.2. Moran Index

One approach to assessing the spatial dependencies among locations is through a spatial autocorrelation test using the Moran Index statistic. Spatial autocorrelation gauges the correlation between observation values concerning the location of the same variable. When a systematic pattern emerges in the distribution of a variable, spatial autocorrelation is present. This phenomenon underscores the interdependence of the spatial data between different locations, influenced by their proximity or intersection [29].
I = n i = 1 n j = 1 n w i j ( x i x ¯ ) ( x j x ¯ ) i = 1 n j = 1 n w i j i = 1 n ( x i x ¯ ) 2
The hypothesis formulation in this test is as follows:
H 0 : I = 0 There is no spatial autocorrelation between locations.
H 1 : I 0 There is spatial autocorrelation between locations.
The test statistic employed is expressed below:
Z ( I ) = I E ( I ) V a r ( I ) N ( 0 , 1 )
with,
E ( I ) = 1 n 1
V a r ( I ) = n 2 S 1 n S 2 + 3 S 0 2 ( n 2 1 ) S 0 2 [ E ( I ) ] 2
S 0 = i = 1 n j = 1 n w i j S 1 = 1 2 1 j n ( w i j + w j i ) 2 S 2 = i j n ( i = 1 n w i j + j = 1 n w j i ) 2
where:
I : the Moran Index value
n : the number of observation locations
x i : the value of the observation variable at location i th
x j : the value of the observation variable at location j th
x ¯ : the average of the number of variables
w i j : the elements of the standardized weight matrix between regions i and j
Decision:
Reject H 0 at the significance level α if Z s c o r e Z α 2 or Z s c o r e Z α 2 .
The Moran Index was in the range of [−1, 1]. A negative value signified a negative spatial autocorrelation, while a positive value implied a positive spatial autocorrelation [34].

2.3. Spatial Autoregressive Exogenous (SAR-X)

Spatial regression is a method tailored for data with spatial effects, encompassing spatial dependency and heterogeneity. Spatial dependence pertains to observations at one location being influenced by those at another, while spatial heterogeneity arises due to random location effects. The foundation of spatial regression lies in the classical linear regression method, evolving to consider the influence of space on the data under scrutiny. The first Law of Geography by Tobler emphasizes interconnectedness and the greater influence of nearby entities [29]. The general model of spatial regression can be written as follows:
y = ρ W y + X β + u u = λ W u + ε   with   ε ~ i i d N ( 0 , σ 2 I )
In [29], an alternative model derived from Equation (8) was developed. When ρ 0 and λ = 0 , then the Spatial Autoregressive (SAR) model can be formed as follows:
y = ρ W y + X β + ε   with   ε ~ i i d N ( 0 , σ 2 I )
The SAR model, an area-based spatial model, encompasses the impact of spatial lag on the dependent variable [29].
The introduction of the Spatial Expansion model (Casetti, 1972) [30] addresses spatial heterogeneity, which characterizes diverse parameter values for each spatial observation based on the distance between locations. The Euclidean distance, such as location coordinates, quantifies the separation between two locations. The Spatial Expansion model, adopting a linear regression approach, can be formulated as follows:
y = X β + ε   with   ε ~ i i d N ( 0 , σ 2 I ) β = Z J β 0
In this research, a fusion of the SAR and Spatial Expansion models was executed, utilizing the approach of Casetti on exogenous variables, called the Spatial Autoregressive Exogenous (SAR-X) model. The SAR-X model could be said to be an extension of SAR, which encompasses exogenous variables. Referring to [35], the SAR-X model served to describe and predict independent variables, considering location effects and exogenous variables. Mathematically, the SAR-X model prediction with the Casetti’s model approach was derived by substituting Equation (10) into (9), in order to produce the following:
y = ρ W y + X β + ε   with   β = Z J β 0
Equation (11) is subsequently written as:
y = ρ W y + X Z J β 0 + ε   with   ε ~ i i d N ( 0 , σ 2 I )
with:
y : the vector of the dependent variables of size ( n × 1 )
ρ : the spatial lag parameter coefficient of the independent variable
W : a spatial weight matrix of size ( n × n )
X : the matrix of the independent variables of size ( n × n k )
Z : location information that contains elements Z x i , Z y i   with   i = 1 , , n , representing the latitude and longitude of each observation, of size ( n k × 2 n k )
J : the expansion of the identity matrix of size ( 2 n k × 2 k )
β : a matrix of size ( n k × 1 ) , containing parameter estimators for all the explanatory k variables at each observation
β 0 : parameter expressed by β l a t i t u d e , β l o n g i t u d e of size ( 2 k × 1 )
: the Kronecker product
ε : an error vector of size ( n × 1 )
s i : the location matrix with i = 1 , , n
The matrix form in the model can be written as follows:
y = ( y ( s 1 ) y ( s 2 ) y ( s n ) ) ,   W = ( 0 w 12 w 1 n w 21 0 w 2 n w n 1 w n 2 0 ) ,   X = ( x 11 x 1 k 0 0 0 0 0 0 x 21 x 2 k 0 0 0 0 0 0 0 0 0 0 x n 1 x n k )
β = ( β 1 ( s 1 ) β 1 ( s 2 ) β k ( s n ) ) ,   ε = ( ε ( s 1 ) ε ( s 2 ) ε ( s n ) ) ,   β 0 = ( β l a t i t u d e β l o n g i t u d e )
Z = ( Z x 1 I k Z y 1 I k 0 0 0 0 0 0 0 0 0 0 0 0 Z x n I k Z y n I k ) , J = ( I k 0 0 I k 0 I k )

2.4. Estimation

The random error variable in the SAR-X model assumed a normal distribution. Therefore, the parameter estimation in this model followed the SAR parameter estimation method, employing the Maximum Likelihood Estimation (MLE) technique. The observed sources for this approach included Ord [36], Smirnov and Anselin [37], Robinson and Rossi [38], and Feng [39]. Equation (12) can be written as the following equation:
y = ρ W y + A β 0 + ε   with   A = X Z J
y = ρ W y + A β 0 + ε   with   ε ~ i i d N ( 0 , σ 2 I )
The probability density function used is expressed below:
f ( y ) = ( 1 2 π σ 2 ) n 2 exp [ ( y ρ W y A β 0 ) T ( y ρ W y A β 0 ) 2 σ 2 ]
The likelihood function of the dependent variable y is formulated as follows:
L ( ρ , β 0 | y ) = f ( y | ρ , β 0 ) = ( 1 2 π σ 2 ) n 2 exp [ ( y ρ W y A β 0 ) T ( y ρ W y A β 0 ) 2 σ 2 ]
Furthermore, the log-likelihood function is obtained as:
ln L ( ρ , β 0 | ε ) = ln ( 1 2 π σ 2 ) n 2 exp [ ( y ρ W y A β 0 ) T ( y ρ W y A β 0 ) 2 σ 2 ] = n 2 ln ( 2 π ) n 2 ln σ 2 ( y ρ W y A β 0 ) T ( y ρ W y A β 0 ) 2 σ 2
A parameter estimation ρ ,   β 0 is obtained by maximizing the log-likelihood function.
To obtain the MLE estimation of the parameters ρ ^ , the first derivative of Equation (17) for the parameters ρ was expressed as:
ln L ( ρ , β 0 | ε ) β 0 = ( y ρ W y A β 0 ) T ( W y ) 2 σ 2 = ( y ρ W y A β 0 ) T ( W y ) 2 σ 2
ln L ( ρ , β 0 | ε ) ρ | ρ = ρ ^ = 0 , this is further expressed as follows:
( y ρ W y A β 0 ) T ( W y ) 2 σ 2 = 0
In Equation (19), multiply by 2 σ 2 ( W y ) T to obtain:
( y ρ W y A β 0 ) T ( W y ( W y ) T ) = 0
Multiply Equation (20) by ( W y ( W y ) T ) 1 to obtain:
( y ρ W y A β 0 ) T = 0
Transpose Equation (21) to obtain:
y A β 0 = ρ W y
In Equation (22), multiply both segments by ( W y ) T , thus obtaining:
( y A β 0 ) ( W y ) T = ρ W y ( W y ) T
From Equation (23), multiply both segments by ( W y ( W y ) T ) 1 to further obtain:
( y A β 0 ) ( W y ) T ( W y ( W y ) T ) 1 = ρ ( W y ( W y ) T ) ( W y ( W y ) T ) 1
ρ ^ = ( y A β 0 ) ( W y ) T ( W y ( W y ) T ) 1
Substitute the equation A = X Z J into Equation (13), in order to determine ρ ^ the following:
ρ ^ = ( y ( X Z J ) β 0 ) ( W y ) T ( W y ( W y ) T ) 1
To obtain the MLE estimation of the parameters β ^ 0 , first determine the first derivative of Equation (17) for the parameters β 0 as follows:
ln L ( ρ , β 0 | ε ) β 0 = ( y ρ W y A β 0 ) T ( A ) 2 σ 2 = ( y ρ W y A β 0 ) T ( A ) 2 σ 2
ln L ( ρ , β 0 | ε ) β 0 | β 0 = β ^ 0 = 0 . This leads to the equation below:
( y ρ W y A β 0 ) T A 2 σ 2 = 0
In Equation (28), multiply with 2 σ 2 A T , in order to obtain:
( y ρ W y A β 0 ) T A A T = 0
Multiply Equation (29) by ( A A T ) 1 to obtain:
( y ρ W y A β 0 ) T = 0
Transpose Equation (30), in order to obtain the expression below:
A β 0 = y ρ W y
In Equation (31), multiply both segments by A T to obtain:
( A β 0 ) A T = ( y ρ W y ) A T
From Equation (32), multiply both segments by ( A A T ) 1 to obtain the following expression:
β 0 ( A A T ) ( A A T ) 1 = ( y ρ W y ) A T ( A A T ) 1
β ^ 0 = ( y ρ W y ) A T ( A A T ) 1
Substitute the equation A = X Z J into Equation (13) and derive β ^ 0 as follows:
β ^ 0 = ( y ρ W y ) ( X Z J ) T ( ( X Z J ) ( X Z J ) T ) 1
Based on the estimated parameter β ^ 0 in Equation (35), the value β can be obtained by substituting into Equation (11):
β = Z J ( ( y ρ W y ) ( X Z J ) T ( ( X Z J ) ( X Z J ) T ) 1 )

3. Real Data Application

3.1. Data Description

In this research, secondary data from 13 climate observation stations in West Java were utilized. These data were sourced from the Meteorology, Climatology, and Geophysics Agency (BMKG), while climate variable data were extracted from the National Aeronautics and Space Administration Prediction of Worldwide Energy Resources (NASA POWER) satellite observations. The climate variable dataset included rainfall, air temperature, humidity, solar irradiation, wind speed, and surface pressure shown on Table 1.

3.2. RShiny Web Application for SAR-X Model

At this stage, an R script was developed to process the data using the SAR-X model through the RShiny web application.
A.
The process of building an R script for the SAR-X model
The SAR-X modeling process, employing the Casetti model approach, entailed the following steps:
  • Importing the climate data used for the SAR-X modeling, encompassing 13 climate observation stations in the West Java region.
  • Constructing vectors and matrices based on the climate data, including:
    • Vector y : defines the rainfall variable at each location.
    • Matrix X : represents the exogenous variables, such as air temperature, humidity, solar irradiation, wind speed, and surface pressure.
    • Matrix Z : consists of location coordinate entries in latitude and longitude.
    • Matrix J : identity matrix with the size of as many as five exogenous variables, according to matrix X .
    • Matrix W : the result of calculating the inverse distance weight matrix using the equation with input location coordinates (latitude and longitude).
    • Kronecker Z : the expression obtained from the multiplication of the kronecker with the identity matrix of the five exogenous variables ( Z I k )
    • Matrix A : the product of matrix X , Z , and J
  • The Moran Index calculation using function “moran.test” and Moran Scatterplot using function “moran.plot”.
  • The calculation of parameter estimation ρ ^ and β ^ 0 , obtaining the prediction results y ^ , absolute error, and MAPE of the SAR-X model prediction with the Casetti model approach.
  • The SAR-X model prediction data with the Casetti model approach in the form of y ^
B.
Creating and publishing the RShiny web application with the developed script.
At this stage, the R script explicitly built for the SAR-X model was transformed into an RShiny web application following these steps:
  • Installing packages and call libraries, specifically “shiny” and “shinythemes”, to set up RShiny.
  • Creating a User Interface (UI) and Server for the Web Application
    UI scripts managed the appearance of the web, incorporating headers, images, panel tabs, and more. Furthermore, the previously crafted R script for the SAR-X model was integrated into the server script. Running the application involved clicking “Run App.” Successful execution prompted progression, while errors in the console necessitated troubleshooting.
  • Publishing the RShiny Web Application
At this stage, the designed application was published on the https://www.shinyapps.io platform, accessed on 5 July 2023. The account creation on the site was followed by token activation. The provided token was copied and inserted into RStudio. The application could be published by clicking the designated icon.

3.3. Calculation Result of Inverse Distance Weight Matrix

The inverse distance weight was computed by taking into account the actual distances between each location. The standardized inverse distance weight matrix for the SAR-X model was then obtained using Equation (2), which produced the following:
W = ( 0 0.049832 0.046317 0.054288 0.019748 0.055151 0.093068 0.090447 0.146109 0.200602 0.06479 0.083328 0.096319 0.015477 0 0.008455 0.005307 0.044687 0.010736 0.214639 0.232858 0.025472 0.024137 0.14286 0.267645 0.007726 0.016001 0.009405 0 0.039857 0.005352 0.736974 0.01185 0.012007 0.044065 0.039526 0.007747 0.010561 0.066655 0.050697 0.015956 0.107736 0 0.009246 0.085519 0.021146 0.021139 0.05277 0.056539 0.015629 0.019388 0.544235 0.034948 0.254618 0.027416 0.017521 0 0.032666 0.12011 0.123126 0.051796 0.048721 0.135189 0.130788 0.023102 0.018078 0.01133 0.699231 0.030017 0.00605 0 0.014586 0.014824 0.068909 0.057761 0.009028 0.012802 0.057385 0.000609 0.004519 0.000224 0.000148 0.000444 0.000291 0 0.924044 0.000933 0.000937 0.003895 0.063727 0.000229 0.0006 0.00497 0.00023 0.00015 0.000461 0.0003 0.936838 0 0.000964 0.000959 0.003656 0.05064 0.000232 0.018523 0.010397 0.01617 0.007164 0.003711 0.026651 0.018093 0.018428 0 0.843116 0.008339 0.014411 0.014999 0.025362 0.009825 0.014464 0.007654 0.003481 0.022278 0.018121 0.018288 0.84081 0 0.008556 0.014497 0.016663 0.021581 0.153208 0.007469 0.005574 0.025445 0.009174 0.198412 0.1837 0.02191 0.022542 0 0.342974 0.008011 0.004212 0.043556 0.001545 0.001049 0.003735 0.001974 0.492634 0.386119 0.005746 0.005796 0.052045 0 0.001589 0.065939 0.01703 0.132081 0.398967 0.008937 0.119851 0.023988 0.023994 0.080996 0.09023 0.016465 0.021523 0 )

3.4. Calculation Result of Moran’s Index and Scatterplot Moran

The calculation of the Moran Index was employed to assess the existence of spatial autocorrelation among the observation locations. The Moran Index for each climate variable was calculated using Equation (4). The results of the Moran index calculations are shown on Table 2.
Aside from the computation of the Moran Index on Table 2, a Moran Scatterplot was also utilized to visualize the overall clustering tendency and distinctive attributes of each region. This visual representation took the form of a four-quadrant graph for each unit of analysis. The four quadrants showed potential groupings, bounded by mean and average lines. Areas were considered to have high attributes when their values exceeded the average, while values below the average indicated low characteristics. The complete results of the Moran Scatterplot for each climate variable can be seen in Figure 1.

3.5. Prediction Result of SAR Model

The estimation of the prediction parameters ρ ^   and β ^ in the SAR model was conducted using R version 4.2.2. An estimated ρ ^ value of 0.744 was obtained, indicating spatial lag dependence. This signified the influence of adjacent locations within the West Java region on the rainfall prediction data. The results of the calculation of the parameter estimate for β ^ are shown in Table 3.
The SAR model equation for each location is presented and can be found in Appendix A. The predictions of rainfall and absolute errors at each location are shown in Table 4.
Based on the results of the absolute error calculation in Table 4, the MAPE (Mean Absolute Percentage Error) value for prediction using the SAR model amounted to 16.85%. According to Lewis (1982) in [40], the prediction accuracy level is accurate, as the MAPE was less than 20%.

3.6. Prediction Result of SAR-X Model

The estimation of the prediction parameters ρ ^ and β ^ 0 in the SAR-X model was conducted using R software through a web application powered by RShiny. An estimated ρ ^ value of 1.001973 was obtained, indicating spatial lag dependence. This signified the influence of adjacent locations within the West Java region on the rainfall prediction data. The results of the parameter estimate calculation of β ^ 0 for each β ^ l a t i t u d e and β ^ l o n g i t u d e are shown in Table 5.
Based on the estimated value of β ^ 0 in Table 5, the value of β ^ was determined using Equation (38). The calculated values of β ^ are shown in Table 6.
The SAR-X model equation for each location can be found in Appendix B. The predictions of the rainfall and corresponding errors at each location are shown in Table 7.
Based on the results of the absolute error calculation in Table 7, the MAPE value for prediction using the SAR-X model amounted to 1.95%. According to Lewis (1982) in [40], the prediction accuracy level is very accurate, as the MAPE value was less than 10%.

3.7. Cross-Validation

In this research, for a cross-validation study, the data collection stage was carried out through a parameter selection process in the form of climate variable data. Those used were rainfall, solar irradiance, and wind speed in the West Java region, consisting of 11 regencies/cities shown on Table 8.
An estimated ρ ^ value of 0.8055492 was obtained, indicating spatial lag dependence. This signified the influence of adjacent locations within the West Java region on the rainfall prediction data. The results of the parameter estimate calculations of β ^ 0 for each β ^ l a t i t u d e and β ^ l o n g i t u d e are shown in Table 9.
The results of calculating the estimated value of β ^ are shown in Table 10.
The results of predictions of the rainfall and absolute errors at each location are shown in Table 11.
Based on the results of the absolute error calculation in Table 11, the MAPE value for prediction with the SAR-X model in the cross-validation dataset was 7.09% and the prediction accuracy level was very accurate, as the MAPE value was less than 10%.

4. Discussion

The procedure for predicting rainfall in West Java, utilizing the SAR-X model with the Casetti Model approach, was conducted based on data from 13 climate observation stations. The entire process was carried out within the RStudio software environment, leveraging the capabilities of the RShiny web application. The R script constructed for this purpose encompassed various stages of the SAR-X modeling procedure, ranging from data import, vector, matrix construction, Moran index, Moran scatterplot calculation, parameter estimation, prediction result calculation, and MAPE assessment to the publication of the RShiny web application. It should be noted that the execution time of the script ranged from 1 to 5 min. The source code is stored in the GitHub directory and the link is included in Appendix C, which could be valuable for facilitating the replication and verification of the model and its results by other researchers [41,42].
The result of the Moran index calculation, presented in Table 2, showed a positive spatial autocorrelation across the 13 observation stations within the West Java region. In addition, it concurred with the descriptive analysis, indicating that the areas exhibiting high rainfall tended to be situated in proximity to other high-rainfall areas, and vice versa. The results of the identification, parameter estimation, and prediction of rainfall, involving five other climate variable factors in Table 4 and Table 7, showed that the MAPE value of the SAR-X model prediction results was smaller than the MAPE of the SAR model, with the calculation of the rainfall prediction providing very high results based on the MAPE value of 1.95%. This shows that the prediction of rainfall using NASA POWER data from 13 observation stations in the West Java region using the SAR-X model was better than prediction with the SAR model, because the SAR-X model obtained β ^ parameters that varied for each exogenous variable at each location. This research aligned with the discovery of Hermawan [28], highlighting the influential role of five climate variable factors in shaping rainfall patterns, namely air temperature, humidity, solar irradiation, wind speed, and surface pressure.

5. Conclusions

In conclusion, the SAR-X model using Casetti’s approach was applied to analyze the rainfall data and climate variables in West Java, utilizing information from 13 climate observation stations. These data showed a positive spatial autocorrelation among different regions. The results highlighted the remarkable accuracy of the prediction outcomes obtained through the SAR-X model at each observation location. This accuracy was attributed to the distinct estimated parameter values assigned to each climate variable, contributing to the high precision of the prediction results.
The summary of this research underscored that the level of rainfall in each region, based on the data from 13 climate observation stations in West Java, was significantly influenced by five other climate variable factors. It should be noted that these factors collectively exhibited a positive spatial autocorrelation across various areas. Therefore, the recommendation for relevant institutions engaged in rainfall prediction is to thoroughly consider the diverse aspects of climate variables. This approach enables a comprehensive understanding of how each climate variable exerts its influence within distinct locations.

6. Patents

Granted Copy Right: Copy Right for Computer Program, number 000484465.
Entitled “Application of RShiny Program for SAR-X with Casetti Model Approach to Climate Data in West Java”, Ministry of Law and Human Rights of the Republic of Indonesia (Falah, A. N., Ruchjana, B. N., Abdullah, A. S., Rejito, J.), 2023. https://annisanurfalah.shinyapps.io/SAR-X-Model/, accessed on 5 July 2023.

Author Contributions

Conceptualization, A.N.F., B.N.R. and A.S.A.; methodology, A.N.F.; software, A.N.F.; validation, A.N.F. and B.N.R.; formal analysis, A.N.F., B.N.R. and A.S.A.; investigation, A.S.A. and J.R.; resources, B.N.R.; data curation A.N.F.; writing—original draft preparation, A.N.F.; writing—review and editing, A.N.F., B.N.R., A.S.A. and J.R.; supervision, B.N.R., A.S.A. and J.R.; project administration A.N.F. and B.N.R.; funding acquisition, B.N.R. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was funded by Academic Leadership Grant Universitas Padjadjaran with contract number: 1549/UN6.3.1/PT.00/2023.

Data Availability Statement

Not applicable.

Acknowledgments

The authors thank to the Rector, Directorate of Research and Community Service (DRPM), and Studies Center of Modeling and Computation Faculty of Mathematics and Natural Sciences, Universitas Padjadjaran. The authors thank to the Commission Developing Country (CDC)—International Mathematical Union (IMU) Breakout Graduate Fellowship 2020–2023, Academic Leadership Grant (ALG) with contract number 1549/UN6.3.1/PT.00/2023, and International Consortium RISE_SMA project 2019–2024.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

  • SAR model for predicting rainfall in Balitpa Sukamandi:
y ^ s 1 = 0.744 i = 1 13 w 1 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in Lanud Atang Sanjaya Semplak:
y ^ s 2 = 0.744 i = 1 13 w 2 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in LPHP Tasikmalaya:
y ^ s 3 = 0.744 i = 1 13 w 3 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in SMPK Cirebon:
y ^ s 4 = 0.744 i = 1 13 w 4 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in SMPK Maranginan Sukabumi:
y ^ s 5 = 0.744 i = 1 13 w 5 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in SMPK Nariewattie:
y ^ s 6 = 0.744 i = 1 13 w 6 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in SMPK Pacet Cianjur:
y ^ 7 = 0.744 i = 1 13 w 7 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in SMPK Pasir Sarongge:
y ^ s 8 = 0.744 i = 1 13 w 8 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in Stage of Bandung:
y ^ s 9 = 0.744 i = 1 13 w 9 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in Stage of Lembang:
y ^ 10 = 0.744     i = 1 13 w 10 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in Staklim Darmaga:
y ^ s 11 = 0.744     i = 1 13 w 11 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in Stamet Citeko Bogor:
y ^ s 12 = 0.744 i = 1 13 w 12 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5
  • SAR model for predicting rainfall in Stamet Jatiwangi:
y ^ s 13 = 0.744 i = 1 13 w 13 i y s i 89.56 X 1 12.21 X 2 0.63 X 3 + 27.50 X 4 + 36.49 X 5

Appendix B

  • SAR-X model for predicting rainfall in Balitpa Sukamandi:
y ^ s 1 = 1.001973 i = 1 13 w 1 i y s i 215.93 X 1 45.51 X 2 + 58.56 X 3 21.52 X 4 0.73 X 5
  • SAR-X model for predicting rainfall in Lanud Atang Sanjaya Semplak:
y ^ s 2 = 1.001973 i = 1 13 w 2 i y s i 31.14 X 1 116.44 X 2 2.56 X 3 + 120.40 X 4 + 43.66 X 5
  • SAR-X model for predicting rainfall in LPHP Tasikmalaya:
y ^ s 3 = 1.001973 i = 1 13 w 3 i y s i + 0.16 X 1 20.48 X 2 42.76 X 3 + 29.06 X 4 + 16.71 X 5
  • SAR-X model for predicting rainfall in SMPK Cirebon:
y ^ s 4 = 1.001973 i = 1 13 w 4 i y s i + 86.59 X 1 + 3.91 X 2 112.28 X 3 1.67 X 4 + 24.16 X 5
  • SAR-X model for predicting rainfall in SMPK Maranginan Sukabumi:
y ^ s 5 = 1.001973 i = 1 13 w 5 i y s i + 80.12 X 1 0.91 X 2 + 38.00 X 3 0.32 X 4 + 11.79 X 5
  • SAR-X model for predicting rainfall in SMPK Nariewattie:
y ^ s 6 = 1.001973 i = 1 13 w 6 i y s i + 137.49 X 1 + 36.36 X 2 + 14.27 X 3 49.03 X 4 1.14 X 5
  • SAR-X model for predicting rainfall in SMPK Pacet Cianjur:
y ^ s 7 = 1.001973 i = 1 13 w 7 i y s i + 32.06 X 1 + 5.04 X 2 1.21 X 3 + 71.15 X 4 + 24.35 X 5
  • SAR-X model for predicting rainfall in SMPK Pasir Sarongge:
y ^ s 8 = 1.001973 i = 1 13 w 8 i y s i 1.75 X 1 + 343.87 X 2 + 20.79 X 3 + 20.24 X 4 12.06 X 5
  • SAR-X model for predicting rainfall in Stage of Bandung:
y ^ s 9 = 1.001973 i = 1 13 w 9 i y s i 4.98 X 1 + 68.96 X 2 17.38 X 3 1.40 X 4 + 6.13 X 5
  • SAR-X model for predicting rainfall in Stage of Lembang:
y ^ s 10 = 1.001973 i = 1 13 w 10 i y s i 55.57 X 1 2.87 X 2 + 48.79 X 3 + 12.41 X 4 + 8.65 X 5
  • SAR-X model for predicting rainfall in Staklim Darmaga:
y ^ s 11 = 1.001973 i = 1 13 w 11 i y s i + 295.90 X 1 58.45 X 2 + 16.20 X 3 30.12 X 4 1.05 X 5
  • SAR-X model for predicting rainfall in Stamet Citeko Bogor:
y ^ s 12 = 1.001973 i = 1 13 w 12 i y s i + 60.51 X 1 134.81 X 2 1.27 X 3 50.76 X 4 + 29.39 X 5
  • SAR-X model for predicting rainfall in Stamet Jatiwangi:
y ^ s 13 = 1.001973 i = 1 13 w 13 i y s i 2.62 X 1 + 285.08 X 2 + 18.01 X 3 1.64 X 4 5.17 X 5

Appendix C

The source code is stored in the GitHub directory and can be accessed at the following link: https://github.com/annisanurfalah02/SAR-X-Model, accessed on 15 August 2023.

References

  1. Shoji, T.; Kitaura, H. Statistical and geostatistical analysis of rainfall in central Japan. Comput. Geosci. 2006, 32, 1007–1024. [Google Scholar] [CrossRef]
  2. Bostan, P.A.; Zuhal, A. Exploring The Mean Annual Precipitation and Temperature Values over Turkey by Using Environmental Variables; University of Applied Sciences: Stuttgart, Germany, 2006; pp. 1–6. [Google Scholar]
  3. Cannarozzo, M.; Noto, L.V.; Viola, F. Spatial distribution of rainfall trends in Sicily (1921–2000). Phys. Chem. Earth 2006, 31, 1201–1211. [Google Scholar] [CrossRef]
  4. Diodato, N.; Tartari, G.; Bellocchi, G. Geospatial Rainfall Modelling at Eastern Nepalese Highland from Ground Environmental Data. Water Resour. Manag. 2010, 24, 2703–2720. [Google Scholar] [CrossRef]
  5. Basistha, A.; Arya, D.S.; Goel, N.K. Spatial distribution of rainfall in Indian Himalayas—A case study of Uttarakhand Region. Water Resour. Manag. 2008, 22, 1325–1346. [Google Scholar] [CrossRef]
  6. Yilmaz, B.; Harmancioglu, N.B. An Indicator Based Assessment for Water Resources Management in Gediz River Basin, Turkey. Water Resour. Manag. 2010, 24, 4359–4379. [Google Scholar] [CrossRef]
  7. Zimit, A.Y.; Jibril, M.M.; Azimi, M.S.; Abba, S.I. Hybrid predictive based control of precipitation in a water-scarce region: A focus on the application of intelligent learning for green irrigation in agriculture sector. J. Saudi Soc. Agric. Sci. 2023. [Google Scholar] [CrossRef]
  8. Hussain, A.; Jadoon, K.Z.; Rahman, K.U.; Shang, S.; Shahid, M.; Ejaz, N.; Khan, H. Analyzing the impact of drought on agriculture: Evidence from Pakistan using standardized precipitation evapotranspiration index. Nat. Hazards 2023, 115, 389–408. [Google Scholar] [CrossRef]
  9. Ali, U.; Jing, W.; Zhu, J.; Omarkhanova, Z.; Fahad, S.; Nurgazina, Z.; Khan, Z.A. Climate change impacts on agriculture sector: A case study of Pakistan. Cienc. Rural 2021, 51, e20200110. [Google Scholar] [CrossRef]
  10. Duncan, J.M.A.; Biggs, E.M.; Dash, J.; Atkinson, P.M. Spatio-temporal trends in precipitation and their implications for water resources management in climate-sensitive Nepal. Appl. Geogr. 2013, 43, 138–146. [Google Scholar] [CrossRef]
  11. Hartmann, H.; Snow, J.A.; Su, B.; Jiang, T. Seasonal predictions of precipitation in the Aksu-Tarim River basin for improved water resources management. Glob. Planet. Chang. 2016, 147, 86–96. [Google Scholar] [CrossRef]
  12. Li, L.; Hong, Y.; Wang, J.; Adler, R.F.; Policelli, F.S.; Habib, S.; Irwn, D.; Korme, T.; Okello, L. Evaluation of the real-time TRMM-based multi-satellite precipitation analysis for an operational flood prediction system in Nzoia Basin, Lake Victoria, Africa. Nat. Hazards 2009, 50, 109–123. [Google Scholar] [CrossRef]
  13. Sharma, S.; Khadka, N.; Hamal, K.; Baniya, B.; Luintel, N.; Joshi, B.B. Spatial and Temporal Analysis of Precipitation and Its Extremities in Seven Provinces of Nepal. Appl. Ecol. Environ. Sci. 2020, 8, 64–73. [Google Scholar]
  14. Wei, N.; Sun, X.; Bi, X.; Wang, J.M.; Li, X. The spatial characteristics of precipitation and water-logging disaster during rainy season for urban planning in Xi’an. Indoor Built Environ. 2019, 28, 1263–1271. [Google Scholar] [CrossRef]
  15. Cheng, L.; Aghakouchak, A. Nonstationary precipitation intensity-duration-frequency curves for infrastructure design in a changing climate. Sci. Rep. 2014, 4, 7093. [Google Scholar] [CrossRef] [PubMed]
  16. Rosenberg, E.A.; Keys, P.W.; Booth, D.B.; Hartley, D.; Burkey, J.; Steinemann, A.C.; Lettenmaier, D.P. Precipitation extremes and the impacts of climate change on stormwater infrastructure in Washington State. Clim. Chang. 2010, 102, 319–349. [Google Scholar] [CrossRef]
  17. Nissen, K.M.; Ulbrich, U. Increasing frequencies and changing characteristics of heavy precipitation events threatening infrastructure in Europe under climate change. Nat. Hazards Earth Syst. Sci. 2017, 17, 1177–1190. [Google Scholar] [CrossRef]
  18. Ibrahim, N.A.; Alwi, S.R.W.; Manan, Z.A.; Mustaffa, A.A.; Kidam, K. Risk matrix approach of extreme temperature and precipitation for renewable energy systems in Malaysia. Energy 2022, 254, 124471. [Google Scholar] [CrossRef]
  19. Bezirgiannidis, A.; Chatzopoulos, P.; Tsakali, A.; Ntougias, S.; Melidis, P. Renewable energy recovery from sewage sludge derived from chemically enhanced precipitation. Renew. Energy 2020, 162, 1811–1818. [Google Scholar] [CrossRef]
  20. Black, A.W.; Mote, T.L. Characteristics of winter-precipitation-related transportation fatalities in the United States. Weather Clim. Soc. 2015, 7, 133–145. [Google Scholar] [CrossRef]
  21. Bucar, R.C.B.; Hayeri, Y.M. Quantitative assessment of the impacts of disruptive precipitation on surface transportation. Reliab. Eng. Syst. Saf. 2020, 203, 107105. [Google Scholar] [CrossRef]
  22. Safdar, F.; Khokhar, M.F.; Mahmood, F.; Khan, M.Z.A.; Arshad, M. Observed and predicted precipitation variability across Pakistan with special focus on winter and pre-monsoon precipitation. Environ. Sci. Pollut. Res. 2023, 30, 4510–4530. [Google Scholar] [CrossRef] [PubMed]
  23. Yavuz, H.; Erdoğan, S. Spatial Analysis of Monthly and Annual Precipitation Trends in Turkey. Water Resour. Manag. 2012, 26, 609–621. [Google Scholar] [CrossRef]
  24. Gonzalez-Gonzalez, M.A.; Guertin, D.P. Seasonal bean yield forecast for non-irrigated croplands through climate and vegetation index data: Geospatial effects. Int. J. Appl. Earth Obs. Geoinf. 2021, 105, 102623. [Google Scholar] [CrossRef]
  25. da Silva, M.V.; Pandorfi, H.; da Rosa Ferraz Jardim, A.M.; de Oliveira-Júnior, J.F.; da Divincula, J.S.; Giongo, P.R.; da Silva, T.G.F.; de Almeida, G.L.P.; de Albuquerque Moura, G.B.; Lopes, P.M.O. Spatial modeling of rainfall patterns and groundwater on the coast of northeastern Brazil. Urban Clim. 2021, 38, 100911. [Google Scholar] [CrossRef]
  26. Sigrist, F.; Künsch, H.R.; Stahel, W.A. A dynamic nonstationary spatio-temporal model for short term prediction of precipitation. Ann. Appl. Stat. 2012, 6, 1452–1477. [Google Scholar] [CrossRef]
  27. BMKG. Analisis Dinamika Atmosfer Dasarian III Mei 2022. 2022. Available online: https://www.bmkg.go.id/iklim/dinamika-atmosfir.bmkg (accessed on 1 May 2022).
  28. Hermawan, E.; Lubis, S.W.; Harjana, T.; Purwaningsih, A.; Risyanto; Ridho, A.; Andarini, D.F.; Ratri, D.N.; Widyaningsih, R. Large-Scale Meteorological Drivers of the Extreme Precipitation Event and Devastating Floods of Early-February 2021 in Semarang, Central Java, Indonesia. Atmosphere 2022, 13, 1092. [Google Scholar] [CrossRef]
  29. Anselin, L. Spatial Econometrics: Methods and Models; Springerl: Berlin/Heidelberg, Germany, 1988; Volume 85. [Google Scholar]
  30. LeSage, J. Spatial Econometrics Toolbox. In Introduction to Spatial Econometrics; CRC Press: Boca Raton, FL, USA, 1999; p. 273. Available online: http://www.spatial-econometrics.com/ (accessed on 1 May 2022).
  31. Pebesma, E.; Graler, B. Introduction to Spatio-Temporal Variography; ifgi: Munster, Germany, 2017. [Google Scholar]
  32. Gio, P.U.; Effendie, A.R. Belajar Bahasa Pemrograman R; USU Press: Denver, CO, USA, 2018. [Google Scholar]
  33. Lu, G.Y.; Wong, D.W. An adaptive inverse-distance weighting spatial interpolation technique. Comput. Geosci. 2008, 34, 1044–1055. [Google Scholar] [CrossRef]
  34. Kopczewska, K. Applied Spatial Statistics and Econometric; Routledge: Abingdon-on-Thames, UK, 2021. [Google Scholar]
  35. Abdullah, A.S.; Ruchjana, B.N.; Toharudin, T.; Rosadi, R. Model SAR, Ekspansi SAR dan Plot Moran untuk Pemetaan Hasil Akreditasi Sekolah di Provinsi Jawa Barat. In Prosiding Seminar Nasional Matematika dan Pendidikan Matematika; UMS: Kota Kinabalu, Malaysia, 2015; pp. 935–943. [Google Scholar]
  36. Ord, K. Estimation methods for models of spatial interaction. J. Am. Stat. Assoc. 1975, 70, 120–126. [Google Scholar] [CrossRef]
  37. Smirnov, O.; Anselin, L. Fast maximum likelihood estimation of very large spatial autoregressive models: A characteristic polynomial approach. Comput. Stat. Data Anal. 2001, 35, 301–319. [Google Scholar] [CrossRef]
  38. Robinson, P.M.; Rossi, F. Refinements in maximum likelihood inference on spatial autocorrelation in panel data. J. Econom. 2015, 189, 447–456. [Google Scholar] [CrossRef]
  39. Qiu, F.; Ding, H.; Hu, J. Asymptotic Properties of Quasi-Maximum Likelihood Estimators for Heterogeneous Spatial Autoregressive Models. Symmetry 2022, 14, 1894. [Google Scholar] [CrossRef]
  40. Lawrence, K.D.; Klimberg, R.K.; Lawrence, S.M. Fundamentals of Forecasting Using Excel; Industrial Press Inc.: New York, NY, USA, 2009. [Google Scholar]
  41. Zhang, C.; Hu, C.; Wu, T.; Zhu, L.; Liu, X. Achieving Efficient and Privacy-Preserving Neural Network Training and Prediction in Cloud Environments. IEEE Trans. Dependable Secur. Comput. 2022, 20, 4245–4257. [Google Scholar] [CrossRef]
  42. Hu, C.; Zhang, C.; Lei, D.; Wu, T.; Liu, X.; Zhu, L. Achieving Privacy-Preserving and Verifiable Support Vector Machine Training in the Cloud. IEEE Trans. Inf. Forensics Secur. 2023, 18, 3476–3491. [Google Scholar] [CrossRef]
Figure 1. Moran’s scatterplot of climate variables: (a) rainfall, (b) air temperature, (c) humidity, (d) solar ir-radiation, (e) wind speed, and (f) surface pressure.
Figure 1. Moran’s scatterplot of climate variables: (a) rainfall, (b) air temperature, (c) humidity, (d) solar ir-radiation, (e) wind speed, and (f) surface pressure.
Mathematics 11 03783 g001aMathematics 11 03783 g001b
Table 1. Climate variable data at 13 climate observation stations in West Java.
Table 1. Climate variable data at 13 climate observation stations in West Java.
No.LocationsLatitudeLongitudeRainfall
(mm)
Air Temperature
(°C)
Humidity
(%)
Solar Irradiation
(W/m2)
Wind Speed (m/s)Surface Pressure (kPa)
1Balitpa Sukamandi−6.3107.65186.3625.383.45415.21.7697.86
2Lanud Atang Sanjaya Semplak−6.9106.77200.7423.7287.25411.231.7195.52
3LPHP Tasikmalaya−7.28108.16228.223.6889.15398.292.196.46
4SMPK Cirebon−6.72108.58189.7127.778.58418.723.67100.86
5SMPK Maranginan Sukabumi−7.25106.25191.4225.7984.61415.463.1298.84
6SMPK Nariewattie−7.25108194.5722.5987.85398.291.5793.62
7SMPK Pacet Cianjur−6.73107176.2824.785.19415.21.4696.82
8SMPK Pasir Sarongge−6.75107176.2824.785.19415.21.4696.82
9Stage of Bandung−6.92107.6203.2921.4188.29415.21.5190.93
10Stage of Lembang−6.83107.62203.2921.4188.29415.21.5190.93
11Staklim Darmaga−6.55106.75176.2824.785.19411.231.4696.82
12Stamet Citeko Bogor−6.7106.93176.2824.785.19411.231.4696.82
13Stamet Jatiwangi−6.75108.27181.0826.4580.97418.722.4999.66
Table 2. Spatial autocorrelation test using Moran’s Index.
Table 2. Spatial autocorrelation test using Moran’s Index.
No.Climate Variablep-ValueDescription
1 y (Rainfall)0.0168There is spatial autocorrelation
2 X 1 (Air Temperature)0.0061There is spatial autocorrelation
3 X 2 (Humidity)0.0163There is spatial autocorrelation
4 X 3 (Solar Irradiation)0.0013There is spatial autocorrelation
5 X 4 (Wind Speed)0.0487There is spatial autocorrelation
6 X 5 (Surface Pressure)0.0044There is spatial autocorrelation
Table 3. Parameter estimated value β ^ .
Table 3. Parameter estimated value β ^ .
No. β ^ Parameter Estimated Value
1β1−89.56
2β2−12.21
3β3−0.63
4β427.50
5β536.49
Table 4. Prediction results and error prediction of SAR model.
Table 4. Prediction results and error prediction of SAR model.
No.Locationsy y ^ Absolute Error
1Balitpa Sukamandi186.36214.4915.09
2Lanud Atang Sanjaya Semplak200.74215.987.59
3LPHP Tasikmalaya228.20260.0913.98
4SMPK Cirebon189.71217.4914.64
5SMPK Maranginan Sukabumi191.42226.5818.37
6SMPK Nariewattie194.57273.1640.39
7SMPK Pacet Cianjur176.28189.407.44
8SMPK Pasir Sarongge176.28189.417.45
9Stage of Bandung203.29250.9323.44
10Stage of Lembang203.29250.8123.38
11Staklim Darmaga176.28196.3911.41
12Stamet Citeko Bogor176.28192.969.46
13Stamet Jatiwangi181.08228.8526.38
Table 5. Parameter estimated value β ^ 0 for each β ^ l a t i t u d e and β ^ l o n g i t u d e .
Table 5. Parameter estimated value β ^ 0 for each β ^ l a t i t u d e and β ^ l o n g i t u d e .
No. β ^ 0 Parameter Estimated Value
1 β ^ latitude ( x 1 ) −539.33
2 β ^ longitude ( x 1 ) −96.53
3 β ^ latitude ( x 2 ) 2.93
4 β ^ longitude ( x 2 ) 139.34
5 β ^ latitude ( x 3 ) 207.11
6 β ^ longitude ( x 3 ) −33.57
7 β ^ latitude ( x 4 ) −5.94
8 β ^ longitude ( x 4 ) 0.17
9 β ^ latitude ( x 5 ) 8.96
10 β ^ longitude ( x 5 ) 12.86
Table 6. Parameter estimated value of β ^ .
Table 6. Parameter estimated value of β ^ .
No.LocationsParameter Estimated Value of β ^
β ^ 1 β ^ 2 β ^ 3 β ^ 4 β ^ 5
1Balitpa Sukamandi−215.93−45.5158.56−21.52−0.73
2Lanud Atang Sanjaya Semplak−31.14−116.44−2.56120.4043.66
3LPHP Tasikmalaya0.16−20.48−42.7629.0616.71
4SMPK Cirebon86.593.91−112.28−1.6724.16
5SMPK Maranginan Sukabumi80.12−0.9138.00−0.3211.79
6SMPK Nariewattie137.4936.3614.27−49.03−1.14
7SMPK Pacet Cianjur32.065.04−1.2171.1524.35
8SMPK Pasir Sarongge−1.75343.8720.7920.24−12.06
9Stage of Bandung−4.9868.96−17.38−1.406.13
10Stage of Lembang−55.57−2.8748.7912.418.65
11Staklim Darmaga295.90−58.4516.20−30.12−1.05
12Stamet Citeko Bogor60.51−134.81−1.27−50.7629.39
13Stamet Jatiwangi−2.62285.0818.01−1.64−5.17
Table 7. Prediction results and absolute error prediction of SAR-X Model.
Table 7. Prediction results and absolute error prediction of SAR-X Model.
No.Locationsy y ^ Absolute Error
1Balitpa Sukamandi186.36189.491.68
2Lanud Atang Sanjaya Semplak200.74200.780.02
3LPHP Tasikmalaya228.20222.302.59
4SMPK Cirebon189.71190.800.57
5SMPK Maranginan Sukabumi191.42193.361.01
6SMPK Nariewattie194.57202.013.82
7SMPK Pacet Cianjur176.28175.130.65
8SMPK Pasir Sarongge176.28176.620.19
9Stage of Bandung203.29193.784.68
10Stage of Lembang203.29210.813.70
11Staklim Darmaga176.28171.902.48
12Stamet Citeko Bogor176.28179.171.64
13Stamet Jatiwangi181.08176.872.32
Table 8. Climate variable data from 11 regencies/cities in West Java.
Table 8. Climate variable data from 11 regencies/cities in West Java.
No.LocationsLatitudeLongitudeRainfall
(mm)
Solar Irradiation
(W/m2)
Wind Speed (m/s)
1Bandung City−6.91486107.6082203.29415.21.511
2Bekasi City−6.24159106.9924172.32411.22.492
3Cirebon City−6.73725108.5507189.71418.73.674
4Sukabumi City−6.9237106.9287200.74411.21.706
5Tasikmalaya City−7.31956108.203228.2398.32.099
6Pangandaran Regency−7.61506108.4988210.141.816398.3
7Bogor Regency−6.59504106.8166176.28411.21.456
8Majalengka Regency−6.83638108.2274194.57418.71.575
9Indramayu Regency−6.32758108.3249181.08418.72.495
10Purwakarta Regency−6.53868107.4499186.36415.21.76
11Kuningan Regency−7.01381108.5701184.64398.31.781
Table 9. Parameter estimated value β ^ 0 for cross-validation dataset.
Table 9. Parameter estimated value β ^ 0 for cross-validation dataset.
No. β ^ 0 Parameter Estimated Value
1 β ^ latitude ( x 1 ) −0.08
2 β ^ longitude ( x 1 ) 4.04
3 β ^ latitude ( x 2 ) −0.01
4 β ^ longitude ( x 2 ) 0.35
Table 10. Parameter estimated value of β ^ for cross-validation dataset.
Table 10. Parameter estimated value of β ^ for cross-validation dataset.
No.LocationsParameter Estimated Value of β ^
β ^ 1 β ^ 2
1Bandung City0.11−0.06
2Bekasi City2.720.08
3Cirebon City0.033.88
4Sukabumi City5.480.10
5Tasikmalaya City0.083.26
6Pangandaran Regency3.790.03
7Bogor Regency0.125.51
8Majalengka Regency2.480.07
9Indramayu Regency0.164.32
10Purwakarta Regency1.140.12
11Kuningan Regency0.202.59
Table 11. Prediction results and MAPE Prediction of SAR-X Model for cross-validation dataset.
Table 11. Prediction results and MAPE Prediction of SAR-X Model for cross-validation dataset.
No.Locationsy y ^ Absolute Error
1Bandung City203.29206.082.79
2Bekasi City172.32177.635.31
3Cirebon City189.71203.2313.52
4Sukabumi City200.74201.400.66
5Tasikmalaya City228.20223.604.60
6Pangandaran Regency210.14140.2169.93
7Bogor Regency176.28191.0114.73
8Majalengka Regency194.57203.288.71
9Indramayu Regency181.08183.862.78
10Purwakarta Regency186.36190.223.86
11Kuningan Regency184.64211.1126.47
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Falah, A.N.; Ruchjana, B.N.; Abdullah, A.S.; Rejito, J. The Hybrid Modeling of Spatial Autoregressive Exogenous Using Casetti’s Model Approach for the Prediction of Rainfall. Mathematics 2023, 11, 3783. https://doi.org/10.3390/math11173783

AMA Style

Falah AN, Ruchjana BN, Abdullah AS, Rejito J. The Hybrid Modeling of Spatial Autoregressive Exogenous Using Casetti’s Model Approach for the Prediction of Rainfall. Mathematics. 2023; 11(17):3783. https://doi.org/10.3390/math11173783

Chicago/Turabian Style

Falah, Annisa Nur, Budi Nurani Ruchjana, Atje Setiawan Abdullah, and Juli Rejito. 2023. "The Hybrid Modeling of Spatial Autoregressive Exogenous Using Casetti’s Model Approach for the Prediction of Rainfall" Mathematics 11, no. 17: 3783. https://doi.org/10.3390/math11173783

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop