Next Article in Journal / Special Issue
A Trajectory Big Data Storage Model Incorporating Partitioning and Spatio-Temporal Multidimensional Hierarchical Organization
Previous Article in Journal
Attention-Based Multiscale Spatiotemporal Network for Traffic Forecast with Fusion of External Factors
Previous Article in Special Issue
A Map Tile Data Access Model Based on the Jump Consistent Hash Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial and Attribute Neural Network Weighted Regression for the Accurate Estimation of Spatial Non-Stationarity

1
Faculty of Information Science and Engineering, Ocean University of China, 238 Songling Road, Qingdao 266100, China
2
Huawei Technologies, Zhangheng Road, Shenzhen 518129, China
3
Ocean Academy, Zhejiang University, 1 Zheda Road, Zhoushan 316021, China
4
Zhejiang Provincial Key Laboratory of Geographic Information Science, 148 Tianmushan Road, Hangzhou 310028, China
5
College of Science and Technology, Ningbo University, No. 521 Wenwei Rd. Baisha Road St. Cixi, Ningbo 315300, China
6
Ningbo Bay Area Development Research Base, No. 521 Wenwei Rd. Baisha Road St. Cixi, Ningbo 315300, China
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2022, 11(12), 620; https://doi.org/10.3390/ijgi11120620
Submission received: 17 September 2022 / Revised: 25 November 2022 / Accepted: 12 December 2022 / Published: 13 December 2022
(This article belongs to the Special Issue GIS Software and Engineering for Big Data)

Abstract

:
Geographically neural network weighted regression is an improved model of GWR combined with a neural network. It has a stronger ability to fit nonlinear functions, and complex geographical processes can be modeled more fully. GNNWR uses the distance metric of Euclidean space to express the relationship between sample points. However, except for spatial location features, geographic entities also have many diverse attribute features. Incorporating attribute features into the modeling process can make the model more suitable for the real geographical process. Therefore, we proposed a spatial-attribute proximities deep neural network to aggregate data from the spatial feature and attribute feature, so that one unified distance metric can be used to express the spatial and attribute relationships between sample points at the same time. Based on GNNWR, we designed a spatial and attribute neural network weighted regression (SANNWR) model to adapt to this new unified distance metric. We developed one case study to examine the effectiveness of SANNWR. We used PM2.5 concentration data in China as the research object and compared the prediction accuracy between GWR, GNNWR and SANNWR. The results showed that the “spatial-attribute” unified distance metric is useful, and that the SANNWR model showed the best performance.

1. Introduction

Space is the basis of human activities and various physical processes. Spatial modeling has always been the focus of research in the field of geographic information science [1,2,3]. In today’s big data era, enhancing geospatial modeling and spatial analysis capabilities remains a major challenge [4].
Solving the complex nonlinear problems of geographical elements is the key problem restricting the construction of spatial regression analysis models. To solve this problem, the geographically weighted regression (GWR) model was proposed by Brunsdon et al. (1996) [5]. GWR uses the distance metric of spatial feature to characterize the relationship between sample points and uses spatial kernel functions to calculate the spatial weight matrix. The OLR model is used in GWR to estimate the parameters of each sample point in the space. As a geostatistical technique, GWR can express the non-stationarity of spatial relationships intuitively, and can also perform statistical tests on parameter estimates. GWR is widely applied in diverse fields, including the analysis of CO2 emissions from agriculture in China (Xu and Lin 2017) [6], the structure model of violent crime in Portland (Cahill and Mulligan 2007) [7], the role of Chinese afforestation in reducing global greenhouse gas emissions (Sheng et al., 2017) [8] and housing price analysis (Huang et al., 2010, Wu et al., 2014, Fotheringham et al., 2015, Yao and Fotheringham 2016) [9,10,11,12].
The key component of the GWR is the spatial weighted kernel function. The deconstruction level of the spatial kernel function determines the detection accuracy of spatial non-stationarity. Kernel functions are mainly composed of two types: bandwidth-type kernel functions and function-type kernel functions. The bandwidth type kernel function consists of an adaptive kernel and a fixed-distance kernel. The function type kernel function consists of a Gaussian kernel, Bi-square kernel, Tri-cube kernel, and exponential kernel (Brunsdon et al., 1998, Fotheringham et al., 2002, Gollini et al., 2014) [13,14,15]. Considering that there may be a mixture of weights with a constant invariance of 1 and local weights in the real spatial relationship, a geographically weighted regression model was combined with a general linear regression model, and MGWR was developed(Fotheringham et al., 2017, Li et al., 2020, Yu et al., 2020) [16,17,18]. However, the structure of the kernel function is relatively simple, and it is difficult to fully express the influence of the complex geographical environment on the spatial weight. At the same time, the complex kernel function has high requirements for computing resources and the modeling process is too complicated. These make it very difficult for GWR to express spatial non-stationarity accurately.
Over the past decade, the EO data managed and processed by information systems have increased from the terabyte level to petabyte and exabyte levels [19]. With more data available, artificial intelligence machine learning (AI/ML), in particular deep learning (DL), is reorienting and transforming earth observation (EO) [20]. To remedy the above problems of GWR, a neural network model has been introduced to detect the complex nonlinear mapping of spatial non-stationarity. Based on the similar concepts of GWR, GNNWR was proposed (Du et al., 2020; Wu et al., 2018 [21,22]). This model combines ordinary linear regression (OLR) and neural networks to estimate spatial non-stationarity. The key of GNNWR is the construction of spatial weight neural network (SWNN). SWNN uses the super fitting ability of the neural network model to achieve the purpose of constructing the spatial non-stationary matrix accurately. Du et al. used 100 simulated data and observed data in Zhejiang Province to compare the GNNWR model, GWR model, and OLR model. The result proved that, compared to GWR, GNNWR can capture the relationship of spatial non-stationarity better and achieve better fitting performance and generalization performance (Du et al., 2020) [22].
All the above models use the spatial feature of geographic entities for modeling and use the distance metric in Euclidean space to represent the relationship between geographic entities. However, the discovery of complex geospatial features is still rather understudied [23]. In the real world, in addition to spatial features, geographic entities also have diverse attribute features, such as tree diameter, temperature, etc. To a certain extent, the distance metric in the attribute features space can also reflect the mutual relationship between geographic entities. Therefore, on the basis of traditional spatial distance, a unified distance metric can be obtained by merging attribute distances, which can more fully represent the interrelationships between geographical entities. In this paper, we proposed SAPDNN to merge the spatial distance and attribute distance to obtain a unified distance metric. Then, on the basis of GNNWR, we proposed SANNWR to enable the model to apply the unified distance metric of “spatial-attribute”.
The remainder of this paper is organized as follows. Section 2 introduces the data set we used. Section 3 presents the SANNWR model, followed by the definition of “spatial-attribute” unified distance metric and the SAPDNN model. Section 4 contains experiments and comparisons of the GWR, GNNWR, SANNWR models for one study case. The conclusions are given in the last section.

2. Study Area and Data

In order to ensure the consistency of the dataset at the time scale, the dataset used in this paper was obtained from the annual average data of PM2.5 concentration monitoring stations in China in 2018, including a total of 1465 monitoring stations (Figure 1).
The original dataset fields included aerosol (AOD), 2 m temperature (TEMP), relatively humidity(r), precipitation (TP), 10 m wind speed (WS), wind direction (WD), and elevation (DEM). These fields belong to the attribute features of the monitoring stations. In the model training, ten-fold cross-validation is adopted, 85% of the total data set is randomly used as the cross-validation set and 15% as the test set. The geographical distribution of the dataset in China after division is shown in Figure 1. The overall distribution is relatively uniform.
The basic statistical information of the observation parameters of original dataset, the cross-validation dataset, and the test dataset are shown in Table 1, including the maximum value, the minimum value, the mean value, and the standard deviation. Due to the inconsistent value range of each variable, the weight of variables may be affected during model training. We used min-max normalization in the data preprocessing stage to normalize the values of each variable to [0, 1].

3. Methods

In this section, first, we provide a definition of attribute features and the measurement method of attribute distance. Then, we use the neural network to merge the spatial distance and attribute distance of geographic entities into a new unified distance metric that contains more semantic information. After defining the data representation, we give the definition of the SANNWR model. This model integrates attribute distance based on GNNWR and uses the “spatial-attribute” unified distance metric as input, which enhances the ability to estimate the spatial non-stationarity.

3.1. ”Spatial-Attribute” Unified Distance Metirc

3.1.1. Attribute Features

Attribute features refer to some inherent attributes of a geographic entity, such as temperature, tree diameter, and wind direction at the sample point. Traditional GWR and GNNWR only use spatial feature to characterize sample points in the modeling process and mine the relationship between sample points from the spatial distance in Euclidean space. The attribute features of the sample points are ignored.
There are differences in the geographic spatial distribution of attribute features, to a certain extent, these differences can reflect the relationship between sample points. Combining attribute features with spatial features can express the relationship between geographic entities more fully. In an analysis of the spatial pattern of trees, Shi et al. (2006) extended the spatial weighted kernel function of GWR to spatial-attribute weighted kernel function, so that the GWR model can take the tree diameter into account in the calculation. The result showed that this method can better explain the spatial distribution of geographic entities [24].

3.1.2. The Definition of “Spatial-Attribute” Unified Distance Metric

The relationship between the spatial features is often measured by the distance in Euclidean space, and the mathematical expression is defined as:
d i j S = θ i θ j 2 + ϕ i ϕ j 2  
Similarly, we believe that in the vector space of attribute features, sample points that are closer or similar to the regression point have a greater impact on the regression point, and vice versa. Therefore, extending the distance metric between sample points from the perspective of attribute feature can more effectively evaluate the influence of adjacent sample points on regression points. The attribute distance is defined as the absolute difference of the attribute value or the weighted difference of multiple attribute values in the vector space of the attribute features. The attribute distance is defined as:
d i j A = p i p j  
In this formula, d i j A represents the attribute distance between sample points 𝑖 and 𝑗, the superscriptA is the identifier of the attribute feature.
In order to eliminate the difference of the measurement scale between spatial distance and attribute distance in the vector space, refer to Wu et al. (2014) for the fusion of temporal and spatial distances [10], we introduced the scale-weighted parameter to aggregate spatial distance and attribute distance and construct a “spatial-attribute” unified distance metric; the expression is as follows:
d i j S A = λ d i j S + φ d i j A  
λ is the weighted parameter of spatial distance and φ is the weighted parameter of attribute distance. We input d i j S and d i j A into a neural network (SAPNN), the input data is nonlinearly mapped by each node of the neural network to obtain a d i j S A . Therefore, the weight of d i j S and d i j A is the value of each node of the neural network and the value of the neural network node comes from the gradient update during model training.

3.1.3. The Nonlinear Fusion of “Spatial-Attribute” Unified Distance Metric

For the two sample points 𝑖 and 𝑗 in the space, it is assumed that there is a nonlinear fusion function d i j S A , that expresses a unified distance metric that takes into account both spatial distance and attribute distance; the expression is as follows:
d i j S A = f p r o x i m i t y S A d i j S ,   d i j A  
We proposed the spatial-attribute proximities neural network (SAPNN) to fit this function d i j S A . The network structure is shown in Figure 2:
SAPNN uses spatial distance and attribute distance as input and uses several dense layers for computation. Finally, we obtained the “spatial-attribute” unified distance between the two sample points i and j; the expression is as follows:
d i j S A = S A P N N d i j S , d i j A  
For any sample point 𝑖, the spatial distance vector d i 1 S , d i 2 S , , d i n S and the attribute distance vector d i 1 A , d i 2 A , , d i n A between 𝑖 and any other sample point can be obtained, n is the total number of sample points. For simplicity, the above two vectors can be abbreviated as d i S and d i A .
Furthermore, considering the interaction of “spatial-attribute” unified distance metric between any two sample points in the point set, we proposed the spatial-attribute proximities deep neural network (SAPDNN) to obtain a new unified distance metric of the relationship between one point and all other points, the structure of SAPDNN is shown as Figure 3. First, we used SAPNN to obtain the unified distance between 𝑖 and any one point, the vector we obtained is d i 1 S A , d i 2 S A , , d i n S A . Then, the vector undergoes several dense layers for nonlinear fusion. Finally, we obtained the unified distance metric d i S A , which can characterize the unified distance metric of spatial feature and attribute feature between the sample point 𝑖 and all other sample points in the space. The expression is as follows:
d i S A = S A P D N N   d i S , d i A  
It is worth pointing out that SAPDNN can adjust the model input, model structure and number of neurons according to the research situation. It can input two or more kinds of neighbor relationships for neural network fusion operation, and it can easily extend the fusion of neighbor relationships to other new feature scales. In this section, the neighbor relationships we aggregated are the spatial relationship and attribute relationships.

3.2. SANNWR Model

3.2.1. The Definition of SANNWR Model

In order to estimate spatial non-stationarity accurately, the GNNWR model was modified by using the “spatial-attribute” unified distance metric which takes attribute features into account. We proposed the spatial and attribute neural network-weighted regression (SANNWR) model, the mathematical expression is as follows:
y s i , a i = β 0 s i , a i + k = 1 p β k s i , a i x i k + ε i       i = 1 , 2 , , n  
The regression coefficient β k is a function of spatial feature and attribute feature, s i is the spatial feature at coordinate point u i , v i , a i is the attribute feature at coordinate point u i , v i . We can use w k s i , a i to indicate the weight of the coefficient of the ith sample point, β k s i , a i in GWR can be expressed as β k s i , a i = w k s i , a i × β ^ k O L R . Substitute this formula, y s i , a i can be updated as:
y ^ s i , a i = k = 0 p β ^ k s i , a i x i k = k = 0 p w k s i , a i × β ^ k O L R x i k  
The solution of the model can be expressed as the following matrix form:
y ^ s i , a i = x i T β ^ s i , a i = x i T W s i , a i ( X T X ) 1 X T y  
W s i , a i is a unified distance weight matrix which reflects spatial non-stationarity in geographical relations, the weight is related to the space distance s i of the ith point and the attribute distance a i of the ith point. The mathematical expression is as follows:
W s i , a i = w 0 s i , a i 0 0 0 0 w 1 s i , a i 0 0 0 0 0 0 0 0 w p s i , a i
The measure of spatial non-stationarity is actually determined by W s i , a i , and W s i , a i   is determined by the spatial distance and the attribute distance. We used SAPDNN to merge spatial distance and attribute distance, so that a unified metric of “spatial-attribute” distance could be obtained. Therefore, W s i , a i is a function of d i S A and can be expressed as:
W s i , a i = f k e r n e l d i S A = f k e r n e l S A P D N N d i S , d i A  
In this formula, f k e r n e l represents a weight kernel function in a generalized sense, d i S can be obtained by using Euclidean or non-Euclidean spatial distance metrics, d i A can be obtained according to the attribute distance measurement method defined in 3.1.2 above.
The SANNWR model was based on the GNNWR model and integrated SAPDNN to realize the fusion of spatial distance and attribute distance. In contrast to GNNWR, SANNWR uses a unified distance metric in the modeling process. It can more accurately reflect the relationship between each sample point and realize the exact solution of spatial non-stationarity. The implementation process is shown as follows:(Figure 4).
First, we used SAPDNN to obtain the unified distance vector d i S A , then took this vector as input to the SWNN to solve the weight matrix W s i , a i ; the solution process of this matrix can be expressed as:
W s i , a i = S W N N d i S A = S W N N S T A P D N N d i S , d i A  
The fit value y ^ can be expressed as:
y ^ = s 1 s 2 s n = x 1 T W s 1 , a 1 X T X 1 X T x 2 T W s 2 , a 2 X T X 1 X T x n T W s n , a n X T X 1 X T y = S S A N N W R y  
In this formula, S S A N N W R is the hat matrix of the SANNWR model.

3.2.2. Model Structure and Parameters

SANNWR consists of SAPDNN and SWNN. SAPDNN is composed of three layers: input layer, a hidden layer, and an output layer, which are connected in full connection mode. The SWNN has four layers of network structure, including two hidden layers, one for input and one for output. The unified distance is input into SWNN, and the weight matrix of the independent variables of fitting points is output [25]. All the layers in SWNN are connected in full connection mode. The neural network adopts a fully connected layer and dropout technologies to enhance the generalization capability, as suggested by Srivastava et al. (2014) [26].
The structure of SANNWR model is shown as Figure 5:
The super parameter settings of SANNWR model is listed in Table 2 as follows:

3.2.3. Model Optimization Training

The accuracy of the fitting value y ^ i depends on the degree of fitting the proximity relationship between sample points by SAPDNN network and the degree of fitting the nonlinear relationship by SWNN network. The following figure shows the training process from the initial spatial distance and attribute distance to the final weight matrix.
In order to obtain a better training effect, dropout technology was used to enhance the generalization ability of the model (Figure 6). The dropout technique randomly invalidates some nodes in the neural network during the training process to reduce the complexity of the model. This technique can reduce the overfitting and improve the generalization of the model. He-parameter initialization and PReLU activation function are used to improve the optimization efficiency of the model. At the same time, the batch normalization technique and the gradient descent method with variable learning rate are adopted to further improve the computing ability of the model. In addition, early stop method is used to prevent deep neural network overfitting. The training process is listed in Table 3.

4. Results

4.1. Design of Experiment

The dataset used in this paper was obtained from the annual average data of PM2.5 concentration monitoring stations in China in 2018, which includes a total of 1465 monitoring stations. PM2.5 concentration monitoring stations have many attribute features, including aerosol (AOD), 2 m temperature (TEMP), relative humidity (r), precipitation (TP), 10 m wind speed (WS), and elevation (DEM). In this paper, wind direction (WD) data were selected as the measure of attribute distance. In terms of model selection, this paper chose the GWR, GNNWR and SANNWR models for comparison. In order to verify the effectiveness of the “spatial-attribute” unified distance metric, GWR and GSNNWR models compared the spatial distance and attribute distance, respectively, while the SANNWR model used the unified distance metric as the measurement method.
Kernel function is a kind of mapping function for the nonlinear mapping of data, and is often used in statistics. According to the different spatial kernel functions, the GWR model can be combined in many ways. The fixed kernel function uses a fixed mapping mode for all the inputs, and the adaptive kernel function adjusts the weight of the mapping according to the distribution of the input data. To fully compare the performance differences between the models, we chose the fixed and adaptive kernel functions combined with Gaussian and Bi-square kernel functions, and select the more commonly used AICc criterion in the optimization criterion. It should be noted that the spatial distribution of stations in China is dense in the east and sparse in the west, which will lead to the lower reliability of the kernel function of the combination of fixed and bi-square (Nakaya, 2014) [27]. Therefore, this chapter does not consider the use of this kernel function combination. The specific kernel function combination settings of the GWR model are shown in Table 4.
Table 5 lists the modeling environment. The GWR modeling process is implemented based on MATLAB 2016a software. The optimal GWR model is solved by fminbnd function. GNNWR and SANNWR are implemented based on Python 3.6, CUDA9.0, and TensorFlow 1.7.

4.2. Analysis of Experimental Results

The GWR, GNNWR, and SANNWR models were cross-validated with ten-fold cross-validations. In addition to SANNWR, GWR and GNNWR were modeled using spatial distance (represented by letter S) and attribute distance (represented by letter A), respectively. Finally, six GWR modeling methods, two GNNWR modeling methods and the SANNWR model were compared. The evaluation indexes included R2, RMSE, MAE, MAPE and AICc. The results of the training set and the verification set are the average values of the cross-validation results. The F1 test results of each model show that the significance level is 0.01, which proves that there is significant spatial non-stationarity among atmospheric environmental pollutants in China.
The experimental results are shown in Table 6. The GWR model and GNNWR model take spatial distance and attribute distance as inputs, respectively. The performance of the model using attribute distance is lower than the model using spatial distance, which proves that the attribute distance alone does not improve the fitting performance of the model. Comparing three GWR models using spatial distance metric, we can see that the fitting ability of the GWR-AFG-S model is slightly worse, the GWR-AAB-S model is slightly better than the GWR-AAG-S model in terms of R2, RMSE, MAPE, and AICc, but slightly worse in terms of MAE. The results show that the spatial modeling capability of the two methods is equivalent. Comparing the GNNWR-S model with the GWR-AAB-S model, the indicators are as follows: R2 (0.841 vs. 0.711), RMSE (5.270 vs. 7.175), MAE (3.866 vs. 5.437), MAPE (10.773% vs. 15.210%), AICc (6912.113 vs. 7656.975). According to the above five indexes, the GNNWR-S model is better than the GWR-AAB-S model, which indicates that the GNNWR model has a better fitting ability than the GWR model. Furthermore, the SANNWR model outperforms the GNNWR model in terms of various indicators and becomes the model with the best fit performance among the nine modeling methods. It also proves that the “spatial-attribute” unified distance metric can better reflect the interrelationship between sample points and improve the analytical capability of the model for spatial non-stationarity.
In addition, we used a test set to examine the predictive effect and generalization ability of these models. As shown in Table 6, on the test set, the R2, RMSE, MAE and MAPE evaluation indexes of SANNWR model are better than those of the GWR and GNNWR models, indicating that the SANNWR model has better generalization capability.
In order to further reveal the fitting ability of these models, the scatter plots of the fitted values and true values of the GWR, GNNWR and SANNWR models are presented. In the scatter plots, the abscissa is the observed value, the ordinate is the estimated value, and the color of point indicates the number of data. The fitted lines for all points are drawn and the angle between the diagonal line and the fitted line reflects the similarity of the fitting. In the fitting effect presentation part, the cross-validated validation set of each model are used to draw these charts, the data volume is the same as that of the cross-validation set, so it is more convincing. The result is shown as follows:
As can be seen from Figure 7, the point distribution of the GWR model is rather scattered and the distribution of GNNWR and SANNWR points is relatively centralized, it means that the GWR model shows the worst fitting ability. Then, comparing the GNNWR and SANNWR, the scatter points of SANNWR are more concentrated, the angle between the scatter points fitting line and the 1:1 line is the smallest, which indicates that the SANNWR model has a better fitting ability than the GNNWR model. We can obtain the same conclusion from the values of MAE, MAPE, and RMSE.
Figure 8 shows the scatter chart of each model in the test set to show the prediction capability of these models. The result is similar to the scatter chart of the training set. The GWR model has the worst prediction accuracy. The GNNWR model has similar results to SANNWR model, but slightly weaker.

5. Discussion

More and more researchers use attribute features in spatial prediction, including the prediction of element concentrations, loss on ignition, and pH in the soils of south west England using high resolution remote sensing and geophysical survey data (Kirkwood C, Cave M and Beamish D, 2016) [28], and a study of multiple machine learning models on Walker Lake datasets (Ahmed W, Muhammad K, Glass H J, 2022) [29]. These studies have chosen to use machine learning to process data. In the big data era, the fine granularity and precision of geographical data are greatly improved, and the interrelationship detection between geographical elements becomes more refined. The original expression of proximity is difficult to meet the need of spatial non-stationarity analysis, and the complexity of spatial proximity expression needs to be improved urgently. So, the neural network is introduced to fit the complex spatial non-stationarity process. In addition to dealing with massive data in a single dimension, the neural network can calculate the data in different dimensions and obtain the optimal weight of the data in different dimensions independently during the training process. We introduced data from the attribute dimension based on the spatial dimension, which improved the fitting accuracy compared with the single dimension. In the future, we can extend the method to the time dimension, and we believe that the method can achieve better results.
Our experiment only combined spatial distance and a single attribute, this method has many research directions, such as multi-attribute distance using different measurement formulas, multi-attribute fusion.

6. Conclusions

In this study, we combined a spatial feature with a single attribute feature, defined a unified distance metric form of spatial distance and attribute distance, and proposed SAPDNN for nonlinear fusion of spatial distance and attribute distance. Then, we upgraded the GNNWR with the “spatial -attribute” unified distance metric and proposed the spatial and attribute neural network-weighted regression (SANNWR) model. On this basis, we conducted a case study. In our case, the spatial non-stationarity model was conducted with a PM2.5 concentration and other atmospheric environmental factors in China. The fitting effect and prediction ability of SANNWR were compared and analyzed with GWR and GNNWR models, with the results summarized as follows: in terms of the choice of the distance metric, single attribute distance is less effective than single spatial distance and unified distance can represent more semantic information. From the results of the comparison of the models, the SANNWR model has a significantly better fitting ability and generalization ability than the GWR model, and has some advantages over the GNNWR model. In Table 6, the GNNWR-A, the GNNWR-S, and the SANNWR can be considered as the same regression fitting network (SWNN + OLR) receiving three different inputs: a single attribute distance, a single spatial distance, and a fusion distance. For the regression fitting network (SWNN + OLR), the model structure and training method were the same. Therefore, the precision difference was caused by the information carried in the input data. The result of fusion distance was better than that of single spatial distance or single attribute distance. Results showed that the fusion distance has more effective information, and this fusion distance is obtained by the SAPDNN, which combined the space distance and the attribute distance. Our comparison results on the test dataset were consistent with the training set and the validation set, which indicates that the model did not overfit.
The results showed that the “spatial-attribute” unified distance metric can not only reveal the relationship between geographical elements more precisely, but also improve the fitting effect and generalization ability of the model to the spatial non-stationary elements. It also proved that the SANNWR model has strong fitting ability and generalization ability.

Author Contributions

Conceptualization, Zhongyi Wang; Methodology, Sihan Ni and Zhongyi Wang; Fromal analysis, Zhongyi Wang; Funding acquisition, Yuanyuan Wang; Investigation, Minghao Wang; Project administration, Sihan Ni; Resources, Nan Wang; Software Nan Wang; Validation, Sihan Ni and Shuqi Li; Supervision, Minghao Wang; Visualization, Yuanyuan Wang; Writing—original draft, Shuqi Li and Zhongyi Wang; Writing—review & editing, Sihan Ni and Yuanyuan Wang. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China [No. 42050103]; Provincial Key Research and De-velopment Program of Zhejiang [No. 2021C01031]; Key project of Soft Science Research of Zhejiang Province in the year 2022 [No.2022C25021].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cressie, N. Statistics for spatial data. Terra Nova 1992, 4, 613–617. [Google Scholar] [CrossRef]
  2. Cressie, N.; Wikle, C.K. Statistics for Spatio-Temporal Data; John Wiley & Sons: New York, NY, USA, 2015; ISBN 978-047-169-274-4. [Google Scholar]
  3. Fotheringham, A.S.; Crespo, R.; Yao, J. Geographical and temporal weighted regression (GTWR). Geogr. Anal. 2015, 47, 431–452. [Google Scholar] [CrossRef] [Green Version]
  4. Goodchild, M.F. Prospects for a space–time GIS: Space–time integration in geography and GIScience. Ann. Assoc. Am. Geogr. 2013, 103, 1072–1077. [Google Scholar] [CrossRef]
  5. Brunsdon, C.; Fotheringham, A.S.; Charlton, M.E. Geographically weighted regression: A method for exploring spatial nonstationarity. Geogr. Anal. 1996, 28, 281–298. [Google Scholar] [CrossRef]
  6. Xu, B.; Lin, B. Factors affecting CO2 emissions in China’s agriculture sector: Evidence from geographically weighted regression model. Energy Policy 2017, 104, 404–414. [Google Scholar] [CrossRef]
  7. Cahill, M.; Mulligan, G. Using geographically weighted regression to explore local crime patterns. Soc. Sci. Comput. Rev. 2007, 25, 174–193. [Google Scholar] [CrossRef]
  8. Sheng, J.; Han, X.; Zhou, H. Spatially varying patterns of afforestation/reforestation and socio-economic factors in China: A geographically weighted regression approach. J. Clean. Prod. 2017, 153, 362–371. [Google Scholar] [CrossRef]
  9. Huang, B.; Wu, B.; Barry, M. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. Int. J. Geogr. Inf. Sci. 2010, 24, 383–401. [Google Scholar] [CrossRef]
  10. Wu, B.; Li, R.R.; Huang, B. A geographically and temporally weighted autoregressive model with application to housing prices. Int. J. Geogr. Inf. Sci. 2014, 28, 1186–1204. [Google Scholar] [CrossRef]
  11. Fotheringham, A.S.; Crespo, R.; Yao, J. Exploring, modelling and predicting spatiotemporal variations in house prices. Ann. Reg. Sci. 2015, 54, 417–436. [Google Scholar] [CrossRef]
  12. Yao, J.; Fotheringham, A.S. Local spatiotemporal modeling of house prices: A mixed model approach. Prof. Geogr. 2016, 68, 189–201. [Google Scholar] [CrossRef]
  13. Brunsdon, C.; Fotheringham, S.; Charlton, M. Geographically weighted regression. J. R. Stat. Soc. Ser. D (Stat.) 1998, 47, 431–443. [Google Scholar] [CrossRef]
  14. Fotheringham, A.S.; Brunsdon, C.; Charlton, M. Geographically Weighted Regression: The Analysis of Spatially Varying Relationships; John Wiley & Sons: Oxford, UK; New York, NY, USA, 2002; ISBN 978-661-027-017-0. [Google Scholar]
  15. Gollini, I.; Lu, B.; Charlton, M. GWmodel: An R Package for Exploring Spatial Heterogeneity using Geographically Weighted Models. arXiv 2014. [Google Scholar] [CrossRef] [Green Version]
  16. Fotheringham, A.S.; Yang, W.; Kang, W. Multiscale geographically weighted regression (MGWR). Ann. Am. Assoc. Geogr. 2017, 107, 1247–1265. [Google Scholar] [CrossRef]
  17. Li, Z. Measuring bandwidth uncertainty in multiscale geographically weighted regression using akaike weights. Ann. Am. Assoc. Geogr. 2020, 110, 1500–1520. [Google Scholar] [CrossRef]
  18. Yu, H. Inference in multiscale geographically weighted regression. Geogr. Anal. 2020, 52, 87–106. [Google Scholar] [CrossRef]
  19. Yue, P.; Ramachandran, R.; Baumann, P. Recent activities in Earth data science [technical committees]. IEEE Geosci. Remote Sens. Mag. 2016, 4, 84–89. [Google Scholar] [CrossRef]
  20. Yue, P.; Shangguan, B.; Hu, L. Towards a training data model for artificial intelligence in earth observation. Int. J. Geogr. Inf. Sci. 2022, 36, 2113–2137. [Google Scholar] [CrossRef]
  21. Wu, S. The Theory and Method of Geographically and Temporally Neural Network Weighted Regression. Ph.D. Thesis, Zhejiang University, Hangzhou, China, 2018. [Google Scholar]
  22. Wu, S. Modeling spatially anisotropic nonstationary processes in coastal environments based on a directional geographically neural network weighted regression. Sci. Total Environ. 2020, 709, 136097. [Google Scholar] [CrossRef]
  23. Yue, P.; Di, L.; Wei, Y. Intelligent services for discovery of complex geospatial features from remote sensing imagery. ISPRS J. Photogramm. Remote Sens. 2013, 83, 151–164. [Google Scholar] [CrossRef]
  24. Shi, H.; Zhang, L.; Liu, J. A new spatial-attribute weighting function for geographically weighted regression. Can. J. For. Res. 2006, 36, 996–1005. [Google Scholar] [CrossRef] [Green Version]
  25. Du, Z.; Wang, Z.; Wu, S. Geographically neural network weighted regression for the accurate estimation of spatial non-stationarity. Int. J. Geogr. Inf. Sci. 2020, 34, 1353–1377. [Google Scholar] [CrossRef]
  26. Srivastava, N. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar] [CrossRef]
  27. Nakaya, T. GWR4 User Manual. WWW Document. 2014. Available online: http://www. st-andrews.ac.uk/geoinformatics/wp-content/uploads/GWR4manual_201311.pdf (accessed on 4 November 2013).
  28. Kirkwood, C.; Cave, M.; Beamish, D.; Grebby, S.; Ferreira, A. A machine learning approach to geochemical mapping. J. Geochem. Explor. 2016, 167, 49–61. [Google Scholar] [CrossRef] [Green Version]
  29. Ahmed, W.; Muhammad, K.; Glass, H.J.; Chatterjee, S.; Khan, A.; Hussain, A. Novel MLR-RF-Based Geospatial Techniques: A Comparison with OK. ISPRS Int. J. Geo-Inf. 2022, 11, 371. [Google Scholar] [CrossRef]
Figure 1. Study area.
Figure 1. Study area.
Ijgi 11 00620 g001
Figure 2. The structure of SAPNN.
Figure 2. The structure of SAPNN.
Ijgi 11 00620 g002
Figure 3. The structure of SAPDNN.
Figure 3. The structure of SAPDNN.
Ijgi 11 00620 g003
Figure 4. The modeling process of SANNWR.
Figure 4. The modeling process of SANNWR.
Ijgi 11 00620 g004
Figure 5. The structure of SANNWR.
Figure 5. The structure of SANNWR.
Ijgi 11 00620 g005
Figure 6. The training process of SANNWR.
Figure 6. The training process of SANNWR.
Ijgi 11 00620 g006
Figure 7. Scatter plots of predicted and actual values in the validation set.
Figure 7. Scatter plots of predicted and actual values in the validation set.
Ijgi 11 00620 g007
Figure 8. Scatter plots of predicted and actual values in the test set.
Figure 8. Scatter plots of predicted and actual values in the test set.
Ijgi 11 00620 g008
Table 1. Basic statistical information of the data set.
Table 1. Basic statistical information of the data set.
Data SetVariableUnitMaximumMinimumMeanStandard Deviation
total data set
(1445)
PM2.5 μ g / m 3 129.1098.03740.97413.704
WD°243.18181.689150.801644.472
AOD/1200.8264.858529.988186.749
TEMP K 299.735271.645287.7385.305
r % 86.68524.96162.21112.495
TP m 2.1 × 10−41.56 × 10−69.46 × 10−54.74 × 10−5
WS m / s 16.2283.54 × 10−71.5412.139
DEM m 4525.00−6.000397.375667.306
cross-validation set (1245)PM2.5 μ g / m 3 110.3998.50740.75213.306
WD°243.15581.689150.48544.178
AOD/1200.8264.858527.093186.289
TEMP K 299.735271.645287.6475.309
r % 88.68524.96162.08112.472
TP m 2.1 × 10−41.56 × 10−69.5 × 10−54.78 × 10−5
WS m / s 16.2283.54 × 10−71.5412.136
DEM m 4525.00−6.000405.364669.400
test set
(220)
PM2.5 μ g / m 3 129.1098.03742.19215.691
WD°243.18187.459152.57845.961
AOD/1033.5430.768545.992188.187
TEMP K 297.562272.602288.2625.238
r % 84.64425.64162.94612.568
TP m 1.96 × 10−43.96 × 10−69.23 × 10−54.46 × 10−5
WS m / s 12.8553.25 × 10−41.5422.152
DEM m 4520.002.000352.275651.946
Table 2. The super parameter settings of SANNWR.
Table 2. The super parameter settings of SANNWR.
SANNWR Model Structure Setup
HierarchySettingsInput Data DimensionOutput Data Dimension
SAPDNN
Input layer-22
Hidden layer323
Output layer131
SWNN
Input layer-11201120
Hidden layer-a50112050
Hidden layer-b205020
Output layer7207
SANNWR Model hyper-parameter setting
Initial Learning RateMax Learning RateMaximum value of epochBatch sizeDropout
0.020.67200,000160.7
Table 3. The training process of SANNWR.
Table 3. The training process of SANNWR.
SANNWR Model Training and Optimization Process
1: Neural network model training starts;
2: Divide dataset. The data set is divided into a cross-validation set and a test set. Then, the cross-validation set is divided into N equal parts. When we train the model, one part of the cross-validation set is used as the verification set and the rest are used as the training set.
3: Initialize the neural network hyper-parameters according to the preset related parameters, including initial and maximum learning rate, iteration times, etc.
4: Divide the training set into multiple mini batches.
5: Use mini-batch as input in sequence
6: Check whether the current epoch is complete. If not, switch to the next Mini Batch for training.
7: Calculate the overfitting index for the completed epoch.
8: View training indicators of the current epoch. If it is better than the previous optimal model, record the current neural network parameters and zero the tolerance value. Otherwise, judging whether the tolerance value has reached the maximum value; If yes, end training and restore parameters of the optimal model; Otherwise, increasing the tolerance value to continue training;
9: Check whether the number of epoch times has reached the upper limit. If yes, end training and restore parameters of the optimal model; If not, scramble the data set involved in the training, and continue the training.
10: Calculate and verify the generalization capability of the model.
11: End the neural network training process.
Table 4. Specific Kernel Function Combination Settings for GWR Models.
Table 4. Specific Kernel Function Combination Settings for GWR Models.
Combination NameKernel Function Typekernel Function Structure
GWR-AFGFixedGaussian
GWR-AAGadaptiveGaussian
GWR-AABadaptiveBi-square
Table 5. Software and hardware environment of these models.
Table 5. Software and hardware environment of these models.
Model NameGWRGNNWR and SANNWR
Software environmentMATLAB
2016a
TensorFlow 1.7/CUDA9.0/
Python 3.6
Number of machines1
Hardware
environment
CPU: Inter(R) XEON E3-1231V3 3.4 GHz
Memory: 32 GB
Graphics card: NVIDIA Quadro RTX4000
OS: Windows 10 ×64
Table 6. Average value of the ten cross-validation results of the training set.
Table 6. Average value of the ten cross-validation results of the training set.
ModelTen Cross-Validation Results (Fitness)Evaluation Result (Forecast)
Train SetValidation SetTest Set
R2RMSEMAEMAPEAICcF1p-ValueR2RMSEMAEMAPER2RMSEMAEMAPE
GWR-AAB-A0.6937.3755.62515.568%7769.6410.6660.010.6657.7556.04915.719%0.7248.4565.70414.373%
GWR-AFG-A0.6927.3785.60715.475%7779.4510.6710.010.6877.4905.86915.381%0.7218.4945.80914.566%
GWR-AAG-A0.6997.2995.58315.511%7764.8190.6540.010.6487.9346.21216.364%0.7118.4386.52617.640%
GWR-AAB-S0.7117.1755.43715.210%7656.9750.6150.010.7376.8715.34214.256%0.7398.1905.74115.098%
GWR-AFG-S0.7087.2215.60315.326%7678.8010.6260.010.7197.1525.59814.624%0.7817.6245.90015.555%
GWR-AAG-S0.7107.1915.36415.239%7666.5070.6160.010.6428.0056.26816.533%0.7427.9795.98215.929%
GNNWR-A0.7147.0645.36214.698%7588.5250.1370.010.7067.6975.54115.994%0.7398.2375.48713.703%
GNNWR-S0.8415.2703.86610.773%6912.1130.2780.010.7686.3874.66812.565%0.8446.3774.30711.213%
SANNWR0.8604.9813.65110.142%6792.5440.2480.010.8245.7474.26110.845%0.8576.2514.20510.949%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ni, S.; Wang, Z.; Wang, Y.; Wang, M.; Li, S.; Wang, N. Spatial and Attribute Neural Network Weighted Regression for the Accurate Estimation of Spatial Non-Stationarity. ISPRS Int. J. Geo-Inf. 2022, 11, 620. https://doi.org/10.3390/ijgi11120620

AMA Style

Ni S, Wang Z, Wang Y, Wang M, Li S, Wang N. Spatial and Attribute Neural Network Weighted Regression for the Accurate Estimation of Spatial Non-Stationarity. ISPRS International Journal of Geo-Information. 2022; 11(12):620. https://doi.org/10.3390/ijgi11120620

Chicago/Turabian Style

Ni, Sihan, Zhongyi Wang, Yuanyuan Wang, Minghao Wang, Shuqi Li, and Nan Wang. 2022. "Spatial and Attribute Neural Network Weighted Regression for the Accurate Estimation of Spatial Non-Stationarity" ISPRS International Journal of Geo-Information 11, no. 12: 620. https://doi.org/10.3390/ijgi11120620

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop