Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models

Mahmoud, Hamdy F. F.; Kim, Inyoung

doi:10.3390/math11224629

Open AccessFeature PaperArticle

Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models

by

Hamdy F. F. Mahmoud

^1,2

and

Inyoung Kim

^1,*

¹

Department of Statistics, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA

²

Department of Statistics, Mathematics, and Insurance, Faculty of Commerce, Assiut University, Assiut 71515, Egypt

^*

Author to whom correspondence should be addressed.

Mathematics 2023, 11(22), 4629; https://doi.org/10.3390/math11224629

Submission received: 27 September 2023 / Revised: 31 October 2023 / Accepted: 5 November 2023 / Published: 13 November 2023

(This article belongs to the Special Issue Nonparametric Regression Models: Theory and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we introduce two semiparametric single-index models for spatially and temporally correlated data. Our first model has spatially and temporally correlated random effects that are additive to the nonparametric function, which we refer to as the “semiparametric spatio-temporal single-index model (ST-SIM)”. The second model integrates the spatially correlated effects into the nonparametric function, and the time random effects are additive to the single-index function. We refer to our second model as the “semiparametric integrated spatio-temporal single-index model (IST-SIM)”. Two algorithms based on a Markov chain expectation maximization are introduced to simultaneously estimate the model parameters, spatial effects, and time effects of the two models. We compare the performance of our models using several simulation studies. The proposed models are then applied to mortality data from six major cities in South Korea. Our results suggest that IST-SIM (1) is more flexible than ST-SIM because the former can estimate various nonparametric functions for different locations, while ST-SIM enforces the mortality functions having the same shape over locations; (2) provides better estimation and prediction, and (3) does not need restrictions for the single-index coefficients to fix the identifiability problem.

Keywords:

Markov chain expectation maximization; semiparametric regression models; nonparametric regression models; single-index model; spatio-temporal correlated data

MSC:

62P12; 62G05

1. Introduction

Epidemiology has a long history of studying factors that affect the variability of mortality. These factors include geographical or spatial variations, which play a crucial role in evaluating healthcare distribution. Spatio-temporal analysis offers additional benefits over spatial analysis by enabling researchers to simultaneously study patterns over time. Due to recent advancements in computational methods for analyzing spatio-temporal data, spatio-temporal data analysis has emerged as a prominent research area.

Numerous studies [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16] have introduced parametric statistical methods for modeling spatio-temporal data, such as the generalized linear mixed model, generalized linear additive model, and the spatio-temporal auto-regressive model. On the other hand, mixed-effects models and spatio-temporal models differ in terms of the structure of the variance-covariance matrix, which defines the type of correlation between time random effects and spatial dependencies. In spatio-temporal data, the covariance matrix structure of spatial effects depends on the distance between any two locations (or cities) and follows a parametric function form. The estimation of covariance functions and the prediction of spatial effects have also been studied [9,11,12,15,17]. Both parametric and nonparametric models have been developed. Parametric models provide convenient and interpretable results; however, they lack flexibility due to strong parametric assumptions. These assumptions are often not satisfied in real data applications. Nonparametric and semiparametric models relax these assumptions, making them more suitable for analyzing spatio-temporal data.

Hence, in this article, we focus on semiparametric modeling for spatio-temporal data and introduce two semiparametric models. These two models are built based on the single-index model (SIM), which incorporates spatial and time effects into the model.

The authors of Ichimura [18] introduced the single-index model, and many articles have extensively studied the estimation and inference of SIM [19,20,21,22,23]. SIM offers several advantages over parametric models: (1) it assumes that the function describing the relationship between the response variable and the explanatory variables is unknown, thereby avoiding the misleading results of misspecifying the link function [24]; (2) it does not assume a specific type of error distribution, and (3) it mitigates the curse of dimensionality problem by reducing the p-dimensional explanatory variables to a single dimension using the single-index linear combination.

In SIM estimation, the parameters require restrictions to address the identifiability problem. One possible solution to this problem is to set the norm of the parameters equal to 1 [25,26], while another solution is to set the coefficient of the first explanatory variable equal to 1 [2,18]. This article employs the second approach.

In Pang and Xue [27], random effects were considered as additive effects to the single-index function; however, these random effects were assumed to be independent. In contrast, in this article, we introduce correlated spatial and temporal effects and incorporate these random effects into the single-index function. We develop two models: the spatio-temporal single-index model (ST-SIM), in which correlated spatial and temporal effects are additive to the single-index function, and the integrated spatio-temporal single-index model (IST-SIM), in which the spatial effect is integrated into the single-index function, and spatial effects are added to the unknown function. To the best of our knowledge, the IST-SIM model is not reported in the statistical literature. This article primarily focuses on IST-SIM.

The organization of this article is as follows: Section 2 presents the proposed models, ST-SIM and IST-SIM. Section 3 describes the two estimation algorithms. Section 4 contains the simulation studies we conducted. Section 5 describes the real data application and presents the results of applying the two models to the motivating data. Conclusions are provided in Section 6.

2. Semiparametric Models

In this section, we introduce two semiparametric models: ST-SIM and IST-SIM. In ST-SIM, the spatial effects are additive to the single-index function, while in IST-SIM, the spatial effects are non-additive to the single-index function.

Let

Y_{s, t}

be the observed value of the response variable at location s and time point t,

u_{s}

(

s = 1, \dots, r

) be a spatial random effect following Gaussian process (

G P

),

w_{t}

(

t = 1, \dots, n

) be the time effect, and

{x_{1}}_{(s, t)}, {x_{2}}_{(s, t)}, \dots, {x_{p}}_{(s, t)}

be the p observed values of the explanatory variables at location s and time point t.

2.1. Semiparametric Spatio-Temporal Single-Index Model

The additive ST-SIM model is defined as follows:

\begin{matrix} Y_{s, t} | μ_{s, t} & \sim P_{d} (μ_{s, t} | u_{s}, ω_{t}), \\ μ_{s, t} | u_{s}, ω_{t} & = g (X_{s, t} β) + u_{s} + ω_{t}, \\ u_{s} & \sim G P (0, σ_{u}^{2} Σ (s, s^{'})), s \neq s^{'} \\ ω_{t} & \sim P_{d} (0, σ_{ω}^{2} Ω), \end{matrix}

(1)

where

g (\cdot)

is a smoothed unknown function,

β

are the single-index coefficient parameters,

Σ (s, s^{'})

is the

(s, s^{'})

th component of the covariance function

Σ

of the spatial effects,

Ω

is the covariance function of a probability distribution (

P_{d}

) of the time effects, and

σ_{u}^{2}

and

σ_{ω}^{2}

are the variances of the spatial and time effects, respectively. Given the effects

u_{s}

and

ω_{t}

,

Y_{s, t}

follows a probability distribution (

P_{d}

) with mean

μ_{s, t}

.

This model assumes that the effects

u_{s}

and

ω_{t}

are additive to the single-index function

g (\cdot)

. The single-index coefficient parameters (

β

), spatial and time effects (

u_{s}

,

ω_{t}

), and the unknown function (

g (\cdot)

) must be estimated simultaneously. A restriction on

β

is needed to resolve the identifiability problem. Two possible approaches include setting one of the parameters of

β

equal to 1 [2,18] or to use

| | β | | = 1

[25,26,28]. The first approach is employed in this article.

2.2. Semiparametric Integrated Spatio-Temporal Single-Index Model

The IST-SIM is defined as follows:

\begin{matrix} Y_{s, t} | μ_{s, t} & \sim P_{d} (μ_{s, t} | u_{s}, ω_{t}), \\ μ_{s, t} | u_{s}, ω_{t} & = g (u_{s} + X_{s, t} β) + ω_{t}, \\ u_{s} & \sim G P (0, σ_{u}^{2} Σ (s, s^{'})), s \neq s^{'} \\ ω_{t} & \sim P_{d} (0, σ_{ω}^{2} Ω), \end{matrix}

(2)

where

g (\cdot)

is a smoothed unknown function,

β

represents the single-index coefficient parameters,

Σ (s, s^{'})

is the

(s, s^{'})

th component of the covariance function

Σ

of the spatial effects,

Ω

is the covariance function of a probability function (

P_{d}

) of the time effect, and

σ_{u}^{2}

and

σ_{ω}^{2}

are the variances of the spatial and time effects, respectively. Given

u_{s}

and

ω_{t}

,

Y_{s, t}

follows some probability distribution (

P_{d}

) with a mean of

μ_{s, t}

.

One of the advantages of this model is that it does not have an identifiability problem, unlike the additive model (1). Consequently, all of the parameters can be estimated. This is because the spatial effect

u_{s}

has a coefficient equal to 1, allowing all of the single-index coefficient parameters to be estimated without requiring additional assumptions for the unknown function. This model assumes that the spatial effect

u_{s}

is integrated into the single-index function, while the time effect

ω_{t}

is additive to the mean of the unknown function

g (\cdot)

. Additionally,

β

,

u_{s}

,

ω_{t}

, and

g (\cdot)

must be estimated simultaneously.

2.3. Covariance Functions of Spatial and Temporal Effects

The spatial covariance function that describes the spatial correlation between any two locations,

u_{s}

and

u_{s^{'}}

, can be described as

\begin{matrix} u & \sim M N (0, σ_{u}^{2} Σ_{ρ}), \\ C o v (u_{s}, u_{s}^{'}) & = σ_{u}^{2} Σ_{ρ} (s, s^{'}), \end{matrix}

(3)

where

M N

represents a multivariate normal distribution,

σ_{u}^{2}

is the variance of the spatial effects,

Σ_{ρ} (s, s^{'})

is the

(s, s^{'})

th component of the spatial covariance matrix

Σ_{ρ}

, which is assumed to have a known parametric covariance function to guarantee positive definite, and

ρ

is the dependence range that can be estimated by the semivariogram. The dependence range,

ρ

, represents the distance between two locations such that the spatial effects within that range are correlated and, outside of that range, the correlation is assumed to be negligible.

In time series, it is common to use the autoregressive moving average, autoregressive integrated moving average, or a random walk to model the time effects. In random walk, the relationship between two consecutive time points, say

ω_{t}

, and

ω_{t - 1}

, takes the form:

ω_{t} = ω_{t - 1} + ϵ_{t},

(4)

where

ϵ_{t}

is the random noise term accounting for the difference between two consecutive time points within the location s. When

ω_{t}

follows a normal distribution [29], where n locations exist, the vector of the temporal effects

ω = {(ω_{1}, ω_{2}, \dots, ω_{n})}^{T}

takes the following joint probability distribution:

f (ω | σ_{ω}^{2}) \propto e x p (- \frac{σ_{ω}^{2}}{2} ω^{T} Ω^{- 1} ω),

(5)

where

Ω^{- 1}

is the inverse of the temporal covariance matrix. For example, when we have four-time points, the covariance matrix,

Ω

, based on the description above, takes the form

Ω = [\begin{matrix} 1 & - 1 & 0 & 0 \\ - 1 & 2 & - 1 & 0 \\ 0 & - 1 & 2 & - 1 \\ 0 & 0 & - 1 & 1 \end{matrix}] .

This precision matrix is singular, so the covariance matrix

Ω^{- 1}

, cannot be obtained. Hence, we propose a modified version to overcome this problem

Ω = [\begin{matrix} 2 & - 1 & 0 & 0 \\ - 1 & 2 & - 1 & 0 \\ 0 & - 1 & 2 & - 1 \\ 0 & 0 & - 1 & 2 \end{matrix}] .

In this form, all of the diagonal elements are equal to 2, which means

ω_{1}

and

ω_{4}

are also random values and the other time points follow the first-order random walk. Other possible functions can be used, such as the Gaussian function with

ρ = 2

[30], which means the time effect

ω_{t}

depends only on the following and previous time effects,

ω_{t + 1}

and

ω_{t - 1}

. The Gaussian process of the temporal effects, in this case, can be described as

\begin{matrix} ω & \sim M N (0, σ_{ω}^{2} Ω_{ρ_{=} 2}), \\ C o v (ω_{t}, ω_{t^{'}}) & = σ_{ω}^{2} Ω_{ρ = 2} (t, t^{'}), \end{matrix}

(6)

where

ω_{t}

and

ω_{t^{'}}

are two temporal effects at two different time points, t and

t^{'}

, at location s.

3. Model Estimation

In this section, two estimation algorithms are introduced to estimate the ST-SIM and IST-SIM. The Monte Carlo expectation maximization (MCEM) algorithms are implemented.

3.1. Estimating Spatial and Time Effects

The expectation maximization (EM) algorithm consists of two steps: expectation (the E-step) and maximization (the M-step). Estimates are obtained by iteratively performing these until convergence is achieved. This algorithm is commonly used for estimating generalized linear mixed models [31,32,33,34,35]. However, for our proposed models, there is no closed form that is available for the expectation part. Therefore, we incorporated Markov chain Monte Carlo (MCMC) to generate random samples from the full conditional distributions of

u

and

ω

. In the E-step, we employed the Metropolis–Hastings (M-H) algorithm. As a result, we developed an MCEM algorithm for estimating the proposed models.

The Metropolis–Hastings (M-H) algorithm employs the conditional multivariate normal distribution to generate random samples from the spatial effects

u

and time effects

ω

, as dictated by the following complete-data log-likelihood function:

log f (Y, u, ω | μ, σ_{u}^{2} Σ, σ_{ω}^{2} Ω) & = & log f_{Y | u, ω} [Y | μ, u, ω] + log f_{u} [u | σ_{u}^{2} Σ] + log f_{ω} [ω | σ_{ω}^{2} Ω],

(7)

where

Y \sim Pois [μ | u, ω]

,

μ | u, ω = g (X β) + Z_{1} u + Z_{2} ω

for the additive model and

μ | u, ω = g (Z_{1} u + X β) + Z_{2} ω

for the integrated spatial effects model;

u \sim G P (0, σ_{u}^{2} Σ)

,

ω \sim G P (0, σ_{ω}^{2} Ω)

,

σ_{ω}^{2} Ω = C o v (ω_{t}, ω_{t + δ}) = σ_{ω}^{2} {e x p (| | δ | |}^{2} / ρ_{ω} = 2)

for all

t, δ \in R

; and

σ_{u}^{2} Σ = C o v (u_{s}, u_{s + d}) = σ_{u}^{2} {e x p (| | d | |}^{2} / ρ_{u})

for all

s, d \in R^{2}

.

The single-component Metropolis–Hastings algorithm is employed, where, at each iteration, only a single component of the spatial or time effects is updated. The proposed conditional distribution of the time effect

ω_{t}

and its parameters can be derived as follows:

Let

ω

=

(ω_{1}, ω_{2}, \dots, ω_{n}) = [\begin{matrix} ω_{1} & ω_{2} \end{matrix}]^{T}

have a multivariate normal distribution with mean

0

and a variance-covariance matrix

σ_{0}^{2} Ω

, where

Ω = [\begin{matrix} Ω_{11} & Ω_{12} \\ Ω_{21} & Ω_{22} \end{matrix}] .

Then, the proposal distribution of

ω_{1}

conditioned on

ω_{2} = a

is a normal

(ω_{1} | ω_{2} = a) \sim N ({\bar{μ}}_{ω_{1}}, σ_{0}^{2} \bar{Ω})

, where

σ_{0}^{2}

is the proposed variance of the time effects,

{\bar{μ}}_{ω_{1}} = Ω_{12} Ω_{22}^{- 1} a

, and the covariance matrix

\bar{Ω} = Ω_{11} - Ω_{12} Ω_{22}^{- 1} Ω_{21}

.

With this proposed conditional distribution, and given a vector of spatial effects

u

, and assuming that the spatial and time effects are independent, the acceptance probability rule takes the form

\begin{matrix} min \{\frac{f (Y | μ, u, ω_{t}^{*}) f_{ω} (ω_{t}^{*} | {\bar{μ}}_{ω_{t}^{*}}, σ_{ω}^{2} \bar{Ω}) f_{u} (u | σ_{u}^{2} Σ)}{f (Y | μ, u, ω_{t}) f_{ω} (ω_{t} | {\bar{μ}}_{ω_{t}}, σ_{ω}^{2} \bar{Ω}) f_{u} (u | σ_{u}^{2} Σ)}, 1\}, \end{matrix}

(8)

where

f_{ω} (ω_{t} | {\bar{μ}}_{ω_{t}}, σ_{ω}^{2} \bar{Ω})

is the conditional distribution of

ω_{t}

given the other time effects,

ω_{1}, ω_{2}, \dots, ω_{t - 1}, ω_{t + 1}, \dots, ω_{n}

. Similarly, one can drive the proposed conditional distribution of the spatial effect

u_{t}

given the other spatial effects as

(u_{1} | u_{2} = a) \sim N ({\bar{μ}}_{u_{1}}, σ_{0}^{2} \bar{Σ})

, where

σ_{0}^{2}

is the proposed variance of the spatial effects,

{\bar{μ}}_{u_{1}} = Σ_{12} Σ_{22}^{- 1} a

, and the covariance matrix

\bar{Σ} = Σ_{11} - Σ_{12} Σ_{22}^{- 1} Σ_{21}

. With this proposed conditional distribution and a given vector of time effects

ω

, the acceptance probability rule takes the form

\begin{matrix} min \{\frac{f (Y | μ, u_{s}^{*}, ω) f_{ω} (ω | σ_{ω}^{2} Ω) f_{u} (u_{s}^{*} | {\bar{μ}}_{u_{s}^{*}}, σ_{u}^{2} \bar{Σ})}{f (Y | μ, u_{s}, ω) f_{ω} (ω | σ_{ω}^{2} Ω) f_{u} (u_{s} | {\bar{μ}}_{u_{s}}, σ_{u}^{2} \bar{Σ})}, 1\} . \end{matrix}

The following steps make up a subroutine for simulating random samples from the spatial and time effects. This subroutine is used within the algorithms to estimate the model parameters of the proposed models.

Step 0:: Initialize $u^{(0)}$ and $ω^{(0)}$ , and, given the estimates of $σ_{u}^{2}$ , $σ_{ω}^{2}$ , and $μ_{s, t}$ , set $s = 1$ , $t = 1$ , and $m = 1$ .
Step 1:: Let $ω$ = mean( $ω^{[0 : (m - 1)]}$ ), generate a value from the spatial effect proposal distribution at location s, $u_{s}^{*}$ , and generate a value from uniform(0,1), U.

$\begin{matrix} If U < min \{\frac{f (Y | μ, u_{s}^{*}, ω) f_{ω} (ω | σ_{ω}^{2} Ω) f_{u} (u_{s}^{*} | {\bar{μ}}_{u_{s}^{*}}, σ_{u}^{2} \bar{Σ})}{f (Y | μ, u_{s}, ω) f_{ω} (ω | σ_{ω}^{2} Ω) f_{u} (u_{s} | {\bar{μ}}_{u_{s}}, σ_{u}^{2} \bar{Σ})}, 1\}, set u^{(m)} = (u_{s}^{*}, u_{2}, \dots, u_{S}); \end{matrix}$

otherwise, $u^{(m)} = u$ . Set $s = s + 1$ and repeat this step until all locations are visited.
Step 2:: Given $u$ = mean( $u^{[0 : (m - 1)]}$ ), generate a value from the proposal time effects distribution of time t, $ω_{t}^{*},$ and generate a value from uniform (0,1), U.

$\begin{matrix} If U < min \{\frac{f (Y | μ, u, ω_{t}^{*}) f_{ω} (ω_{t}^{*} | {\bar{μ}}_{ω_{t}^{*}}, σ_{ω}^{2} \bar{Ω}) f_{u} (u | σ_{u}^{2} Σ)}{f (Y | μ, u, ω_{t}) f_{ω} (ω_{t} | {\bar{μ}}_{ω_{t}}, σ_{ω}^{2} \bar{Ω}) f_{u} (u | σ_{u}^{2} Σ)}, 1\}, set ω^{(m)} = (ω_{t}^{*}, ω_{2}, \dots, ω_{n}); \end{matrix}$

otherwise, $u^{(i)} = u$ . Set $t = t + 1$ and repeat this step until all time points are visited.
Step 3:: Repeat Steps 1–2, a large number of times, M-, and, based on Geyer [36], discard a percentage of burn-in between 1% and 2% of the total number of iterations, M, and use the rest, $N_{0}$ , to estimate the spatial and time effects.

3.2. ST-SIM Estimation Algorithm

The following steps comprised the proposed algorithm for estimating the additive ST-SIM’s parameters and the spatial and time effects:

I-step

Initialize parameters:

(a): Obtain initial values for $u$ , $ω$ , $σ_{u}^{2}$ , and $σ_{ω}^{2}$ ( $u^{(0)}$ , $ω^{(0)}$ , $σ_{u}^{2 (0)}$ , $σ_{ω}^{2 (0)}$ ).
(b): Calculate $Y^{*} = Y - Z_{1} u^{(0)} - Z_{2} ω^{(0)}$ , obtain the single-index coefficient estimate $β^{(0)}$ , and estimate the unknown function using a smoothing method to obtain $\hat{g} {(\cdot)}^{(0)}$ .

E-step

Given the initial values from the I-step, simulate random samples for each spatial effect

u_{s}

, (

u_{s}^{1}, u_{s}^{2}, \dots, u_{s}^{N}

), and for each time effect

ω_{t}

, (

ω_{t}^{1}, ω_{t}^{2}, \dots, ω_{t}^{N}

), from f(

Y, u, ω | μ,

σ_{u}^{2} Σ, σ_{ω}^{2} Ω

), where

μ = g (X_{s, t} β) + Z_{1} u + Z_{2} ω

via the subroutine described in Section 3.1.

M-step

Maximize

\sum_{k} log f (u^{k} | σ_{u}^{2} Σ)

and

\sum_{k} log f (ω^{k} | σ_{ω}^{2} Ω)

,

(a): Obtain $σ_{u}^{2 (1)}$ and $σ_{ω}^{2 (1)}$ , and calculate $u^{(1)} = \frac{1}{N_{0}} \sum_{k = 1}^{N_{0}} u^{k}$ , $ω^{(1)} = \frac{1}{N_{0}} \sum_{k = 1}^{N_{0}} ω^{k}$ and $Y^{*} = Y - Z_{1} u^{(1)} - Z_{2} ω^{(1)}$ .
(b): Using the Ichimura method, estimate $β$ $β^{(1)}$ , and smooth the unknown function $g (\cdot)$ to obtain $\hat{g} {(\cdot)}^{(1)}$ .

Do

iteration of the E-step and M-step until convergence is achieved. The stopping rule for the EM algorithm is

| \frac{L o g L i k e l i h o o d^{(t)} - L o g L i k e l i h o o d^{(t - 1)}}{L o g L i k e l i h o o d^{(t - 1)}} | < 0.001,

where

L o g L i k e l i h o o d^{(t)}

is the log of the likelihood function at the iterated step t, and

L o g L i k e l i h o o d^{(t - 1)}

is the log of the likelihood function at the iterated step

t - 1

.

3.3. IST-SIM Estimation Algorithm

The algorithm described in Section 3.2 does not work for the IST-SIM due to two main issues: (1) the long computation time resulting from the intensive calculations (specifically, when using the Metropolis–Hastings algorithm with only 1000 iterations, the single-index function needs to be estimated 36,000 times to run the MCEM algorithm just once); and (2) the inability to separate the impact of spatial random effects on the acceptance ratio from the effect of the single-index coefficient parameters when comparing the current and previous spatial effect values. In the following, we provide a detailed explanation of these issues.

The acceptance ratio takes the form

\begin{matrix} min \{\frac{f (Y | u^{*}, ω, μ = {\hat{g}}^{*} [X \hat{β^{*}} + Z_{1} u^{*}] + Z_{2} ω) f_{u} (u^{*} | σ_{u}^{2} Σ) f_{ω} (ω | σ_{ω}^{2} Ω)}{f (Y | u, ω, μ = \hat{g} [X \hat{β^{*}} + Z_{1} u] + Z_{2} ω) f_{u} (u | σ_{u}^{2} Σ) f_{ω} (ω | σ_{ω}^{2} Ω)}, 1\} . \end{matrix}

If this ratio is greater than 1, it is unclear whether this is because

u^{*}

is better than

u

,

{\hat{g}}^{*}

is superior to

\hat{g}

, or

{\hat{β}}^{*}

is more accurate than

\hat{β}

. In such cases, determining which of the two proposed spatial random effects represents a generated value from the true spatial effect distribution becomes challenging. One possible solution to this problem is to employ a Taylor series approximation for the unknown function at a specific value

(X β^{(0)} + Z u^{(0)})

to isolate the spatial effect from the model-parameter effect on the ratio, as follows:

\begin{matrix} μ = g (X β + Z_{1} u) + Z_{2} ω = \hat{g} (\cdot) + {\hat{g}}^{'} (\cdot) (X β + Z_{1} u - X β^{(0)} - Z_{1} u^{(0)}) + Z_{2} ω . \end{matrix}

Here,

\hat{g} (\cdot)

represents the unknown function, estimated using a smoothing method, like p-spline [37] or kernel smoothing [38], and

{\hat{g}}^{'} (\cdot)

is the estimate of the first derivative of this unknown function. In this paper, we employ the local linear kernel method to estimate both the unknown function and its derivative.

The following steps comprise the proposed algorithm for estimating the IST-SIM’s parameters and the spatial and time effects:

I-step

Initialize the parameters:

(a): Obtain initial values for $u$ , $ω$ , $σ_{u}^{2}$ , and $σ_{ω}^{2}$ ( $u^{(0)}$ , $ω^{(0)}$ , $σ_{u}^{2 (0)}$ , $σ_{ω}^{2 (0)}$ );
(b): Calculate $Y^{*} = Y - Z_{2} ω^{(0)}$ , obtain the single-index coefficient estimates $β^{(0)}$ , and estimate the unknown function using a smoothing method to obtain $\hat{g} {(\cdot)}^{(0)}$ and its derivative $\hat{g^{'}} {(\cdot)}^{(0)}$ .

E-step

Given the initials from the I-step and the Taylor approximation of the unknown function, simulate random samples for each spatial effect

u_{s}

, (

u_{s}^{1}, u_{s}^{2}, \dots, u_{s}^{N}

), and for each time effect

ω_{t}

, (

ω_{t}^{1}, ω_{t}^{2}, \dots, ω_{t}^{N}

), from

f (Y, u, ω | μ, σ_{u}^{2} Σ, σ_{ω}^{2} Ω)

, where

μ \approx \hat{g} + {\hat{g}}^{'} (X β + Z_{1} u - X β^{(0)} - Z_{1} u^{(0)}) + Z_{2} ω

, via the subroutine described in Section 3.1

M-step

Maximize

\sum_{k} log f (u^{k} | σ_{u}^{2} Σ)

and

\sum_{k} log f (ω^{k} | σ_{ω}^{2} Ω)

,

(a): Obtain $σ_{u}^{2 (1)}$ and $σ_{ω}^{2 (1)}$ , and calculate $u^{(1)} = \frac{1}{N_{0}} \sum_{k = 1}^{N_{0}} u^{k}$ , $ω^{(1)} = \frac{1}{N_{0}} \sum_{k = 1}^{N_{0}} ω^{k}$ and $Y^{*} = Y - Z_{2} ω^{(1)}$ .
(b): Using the Ichimura method, estimate $β$ $β^{(1)}$ , and smooth the unknown function $g (\cdot)$ to obtain $\hat{g} {(\cdot)}^{(1)}$ .

Do

iteration of the E-step and M-step until convergence is achieved. The stopping rule for the EM algorithm is

| \frac{L o g L i k e l i h o o d^{(t)} - L o g L i k e l i h o o d^{(t - 1)}}{L o g L i k e l i h o o d^{(t - 1)}} | < 0.001,

where

L o g L i k e l i h o o d^{(t)}

is the log of the likelihood function at iteration step t, and

L o g L i k e l i h o o d^{(t - 1)}

is the log of the likelihood function at iteration step

t - 1

.

4. Simulation

In this section, we evaluate the performance of the two proposed model algorithms in terms of estimating the model parameters, fitting the data, and predicting under correct and misspecified model specifications. The performance of the IST-SIM estimation algorithm is assessed through the simulation of 100 data sets based on the following integrated model:

\begin{matrix} y_{i s} | μ_{s}, u_{s}, ω_{t} & \sim p_{d} (y_{s} | μ_{s}, u_{s}), \\ μ_{s} | u_{s}, ω_{t} & = g (β_{1} x_{1 i s} + β_{2} x_{2 i s} + u_{s}) + ω_{t}, \\ u_{s} & \sim G P (0, σ_{u}^{2} Σ (s, s^{'})) and ω_{t} \sim G P (0, σ_{ω}^{2} Ω (t, t^{'})), \end{matrix}

where

t = 1, 2, \dots, n

and

s = 1, 2, \dots, r

, with six locations (

r = 6

) and 12 time points at each location (

n = 12

). In addition,

x_{1}

is generated from uniform (5, 20), and

x_{2}

is generated from a standard normal distribution. The true model parameters are (

β_{1}, β_{2}, σ_{u}^{2}, σ_{ω}^{2}

) = (

1, 1, 0.5, 0.5

), and two cases of the dependence range are considered (

ρ_{u} = 1

and

ρ_{u} = 3

) in the domain

[0, 3] \times

[0, 3]. In addition, the performance of the ST-SIM algorithm was studied by simulating 100 data sets from the same setting, with model mean equal to

μ_{s} | u_{s}, ω_{t} = g (β_{1} x_{1 i s} + β_{2} x_{2 i s}) + u_{s} + ω_{t}

. The IST-SIM and ST-SIM algorithms are used for estimating the model parameters. For the spatial effects and temporal effects, the initial values are generated from

N (0, 1)

, and to generate strictly positive starting points for

σ_{u}^{2}

and

σ^{2} ω

, an inverse-gamma distribution was used,

I G (1, 1)

. Table 1 shows that the two proposed algorithms worked fine in estimating the model parameters, with the mean of the 100 estimates being close to the true parameter values and small standard error values. The mean square error (MSE) of the IST-SIM was greater than that of the ST-SIM model. A possible reason is because of the Taylor approximation used for the unknown function. These findings were the same under both dependence ranges (

ρ_{u} = 1

and

ρ_{u} = 3

). In addition, all parameters or the single-index coefficients can be estimated for the IST-SIM, but for the ST-SIM, one of the parameters is set to 1 to address the identifiability problem,

β_{1} = 1

.

We conducted another simulation study to assess the performance of the proposed models in fitting and predicting data under both correct and misspecified model settings. For each model, we used the mean square error (MSE) to evaluate the fitting quality and the predicted residual sum of squares (PRESS) to assess the prediction accuracy. We evaluated each model’s performance in two scenarios: when the data were generated from the true model and when the data were not generated from the true model.

For instance, in the context of the IST-SIM described earlier, we generated 100 data sets from the model and calculated the MSE and PRESS for both models. We followed the same approach for the ST-SIM. The results in Table 2 demonstrate that when the true model was IST-SIM (i.e., when the data were simulated from the IST-SIM), the IST-SIM significantly outperformed the ST-SIM in terms of the mean MSE (123.8 vs. 266.4) and the mean PRESS (371.8 vs. 587.8), with smaller standard errors compared to the ST-SIM. Conversely, when the true model was the ST-SIM (i.e., when the data were simulated from the ST-SIM), the IST-SIM still performed better in terms of the mean MSE (124.5 vs. 135.7) and exhibited smaller standard errors, with their mean PRESS values being comparable (388.6 vs. 372.78). The code used for generating the simulation results can be provided upon receiving a reasonable request.

5. Application

These two models are applied to real data from South Korea, covering the period from 2000 to 2007. The data set includes multiple daily recorded variables, such as mortality, temperature, humidity, pressure, and time, for six major cities in South Korea (Busan, Seoul, Daejeon, Incheon, Gwangju, and Daegu). Figure 1 shows the locations of these six cities in South Korea.

5.1. Data and Models

The non-accidental mortality (the number of deaths excluding deaths due to accidents), mean temperature, mean humidity, mean pressure, and time were recorded daily for six major cities in South Korea: Busan, Seoul, Daejeon, Incheon, Gwangju, and Daegu. However, we utilized monthly data by calculating the means of the weather variables per month. We opted for monthly data instead of daily data to avoid the ‘big-N’ problem [39], which could have significantly increased the computing time due to the rank of the variance-covariance matrix. Additionally, using monthly data allowed us to study the patterns of the mortality functions throughout each year.

The time component (consisting of 12 time points) and the city locations (six in total) represent the temporal and spatial effects, respectively. It is important to note that the population sizes of the six cities differ, potentially influencing the relative mortality rates among them. To account for this, we calculated the number of deaths per one hundred thousand people for each city.

In this paper, we applied the two spatio-temporal models to the South Korea data set to determine which model was more suitable for describing the data, based on the model selection criteria. For each model, we needed to simultaneously estimate the single-index function, model parameters, and spatial and time effects. The dependence range was estimated using the variograms of the models. In both models, the response variable was mortality (Y), and the explanatory variables included the temperature (

x_{1}

), pressure (

x_{2}

), and mean humidity (

x_{3}

).

Figure 2a–c depict the relationships between mortality and temperature (negative), humidity (negative), and pressure (positive). Figure 2d reveals that mortality was highest at the beginning and end of the year, with a minimum in the middle of each year. The highest mortality was observed in Busan, while Seoul, Daejeon, and Gwangju had the lowest mortality rates.

In addition to applying the two models to the mortality data, we evaluated which model was most appropriate for the data based on fitting and prediction criteria. The following are the two proposed models for our motivating data:

IST-SIM

$\begin{matrix} y | μ & \sim Pois (μ | u, ω), \\ μ & = g (Z_{1} u + x_{1} β_{1} + x_{2} β_{2} + x_{3} β_{3}) + Z_{2} ω, \\ u & \sim M N (0, σ_{u}^{2} Σ), and ω \sim M N (0, σ_{ω}^{2} Ω) . \end{matrix}$
ST-SIM

$\begin{matrix} y | μ & \sim Pois (μ | u, ω), \\ μ & = g (x_{1} β_{1} + x_{2} β_{2} + x_{3} β_{3}) + Z_{1} u + Z_{2} ω, \\ u & \sim M N (0, σ_{u}^{2} Σ), and ω \sim M N (0, σ_{ω}^{2} Ω) . \end{matrix}$

5.2. Models Estimation

The proposed algorithms were employed to estimate the parameters and spatial and temporal effects of both the ST-SIM and IST-SIM models. The Metropolis–Hastings (M-H) algorithm was utilized to obtain 10,000 samples from the spatial and temporal effects, with the first 2% of the MCMC samples discarded. The Markov chain Monte Carlo expectation maximization (MCEM) algorithm was executed until convergence was achieved. Using the variogram, the dependence range was estimated and found to be equal to 1.8, as illustrated in Figure 3.

Table 3 presents the two model parameters along with their 95% confidence intervals and fitting criteria values, including the MSE,

R^{2}

, and log-likelihood. The IST-SIM outperformed the ST-SIM in several aspects. Specifically, the IST-SIM exhibited a smaller MSE (131.7 versus 131.7) compared to the ST-SIM, a higher

R^{2}

(0.87 versus 0.41), and a superior log-likelihood value (−297.3 versus −363.2). In the ST-SIM, the coefficient parameter of the pressure variable,

x_{2}

, was set to 1 to address the identifiability problem. In contrast, the IST-SIM did not encounter this issue and successfully estimated all the parameters. Table 4 highlights that both models identified Busan as having the highest mortality rate and Seoul as having the lowest mortality rate.

Figure 3 shows that the mortality functions of the six cities had the same form when using the ST-SIM but not with the IST-SIM. This highlights one of the advantages of the IST-SIM over the ST-SIM: the former is more flexible, allowing mortality functions to vary by location.

5.3. Model Selection

Multiple criteria were employed to evaluate the fitting and prediction performance of both ST-SIM and IST-SIM. These included the MSE,

R^{2}

, and log-likelihood to assess the models’ suitability for describing the data. Additionally, the means and medians of the predicted mean square error (PMSE) and predicted log-likelihood were utilized. These evaluation criteria were calculated using the following steps:

(1): Select n observations randomly from each city, consider the selected observations as the evaluation data, and consider the remaining observations as the training data.
(2): For the training data, fit the model and compute the $R^{2}$ , MSE, and log-likelihood fits. For the evaluation data, calculate the log-likelihood prediction and PMSE $= \sum_{i = 1}^{6 n} {(y_{i} - \hat{y_{i}})}^{2} / 6 n$ , where $y_{i}$ and $\hat{y_{i}}$ are the actual and predicted response values, respectively.
(3): Repeat Steps 1–2 for a large number of times; compute the mean and median of the estimated PMSE values and calculate the means of the $R^{2}$ , MSE, log-likelihood predictions, and log-likelihood fits.

These steps were repeated for different values of n (

n = 2, 4, and 6

) for the two proposed models. Table 5 demonstrates that the IST-SIM outperformed the ST-SIM consistently, exhibiting higher

R^{2}

, lower MSE, and higher log-likelihood at all values of n. For example, at

n = 2

, the IST-SIM had an

R^{2}

of 0.88 compared to 0.45 for the ST-SIM, an MSE of 120.8 versus 600.3, and a log-likelihood of −248.6 versus −300.2.

In terms of prediction, once again, the IST-SIM outperformed the ST-SIM consistently, with smaller mean and median values for the PMSE and higher values for the log-likelihood prediction. For example, at

n = 2

, the mean PMSE was 274.2 versus 922.9, the median PMSE was 210.5 versus 884.0, and the log-likelihood prediction was −63.3 versus −75.6.

Figure 4 displays boxplots representing 500 estimates of the PMSE from each model at different data set sizes (

n = 2, 4, 6

). The boxplots indicate that the variability (interquartile range) of the PMSE estimates for the IST-SIM is lower than that of the ST-SIM at

n = 2

and

n = 4

. However, when

n = 6

, with half of the data used for training and the other half for evaluation, the PMSE estimates for the IST-SIM exhibit greater variability than those for the ST-SIM. Nevertheless, the median PMSE for the IST-SIM is lower than that for the ST-SIM. Overall, the IST-SIM outperforms the ST-SIM in terms of describing the mortality data and making predictions from the South Korean mortality data. Upon receiving a reasonable request, we are able to furnish the code utilized to analyze the real data application.

6. Conclusions

This article introduces two semiparametric models for modeling correlated spatio-temporal data. In the ST-SIM, the spatial effect is treated as an additive component to the single-index model, while in the IST-SIM, the spatial effect is integrated into the single-index function. For each model, we proposed an algorithm to simultaneously estimate the unknown function, single-index coefficient parameters, variance of the spatial effects, spatial effects, and time effects, using an MCEM algorithm that we developed. To the best of our knowledge, the IST-SIM has not yet been documented in the statistical literature.

We conducted several simulation studies and found that the IST-SIM outperformed the ST-SIM based on various criteria. Both models were also applied to South Korean mortality data for comparison. The IST-SIM demonstrated superior performance in fitting and prediction for the motivating data. The analysis revealed that Busan had the highest nonaccidental mortality among the six major cities, while Seoul had the lowest mortality, with the other cities falling in between.

The IST-SIM offers several advantages over the ST-SIM: (1) it does not necessitate restrictions on the coefficient parameters like the ST-SIM, allowing estimation of all parameters, and (2) it does not require mortality functions to have the same form across each location, a requirement of the ST-SIM. There exist several avenues for enhancing the IST-SIM model in future research. For instance, potential improvements may encompass: (1) incorporating temporal effects into the unknown nonparametric function, (2) exploring alternative estimation methods for the nonparametric function beyond the Ichimura method, and (3) exploring the utilization of the Bayesian approach for the model estimation, which has the potential to streamline parameter estimation and expedite the process in comparison to the EM algorithm.

Author Contributions

Conceptualization, H.F.F.M. and I.K.; methodology, H.F.F.M. and I.K.; software, H.F.F.M.; validation, H.F.F.M.; formal analysis, H.F.F.M.; writing—original draft preparation, H.F.F.M.; writing—review and editing, H.F.F.M. and I.K.; visualization, H.F.F.M.; supervision, I.K.; project administration, H.F.F.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data and computing code are available for replication from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

SIM	Single-index model
ST-SIM	Spatio-temporal single-index model
IST-SIM	Integrated spatio-temporal single-index model
MCEM	Monte Carlo expectation maximization
EM	Expectation maximization
MCMC	Markov chain Monte Carlo
M-H	Metropolis–Hastings
MSE	Mean square error
PRESS	Predicted residual sum of squares
SE	Standard error
PMSE	Predicted mean square error

References

Cressie, N.A.C. Statistics for Spatial Data; Wiley: New York, NY, USA, 1993. [Google Scholar]
Sherman, R.P. U-process in analysis of a generalized semi-parametric regression estimator. Econ. Theory 1994, 10, 372–395. [Google Scholar] [CrossRef]
Cressie, N.; Huang, H.C. Classes of nonseparable, spatio-temporal stationary covariance functions. J. Am. Stat. Assoc. 1999, 94, 1330–1340. [Google Scholar] [CrossRef]
Kanevski, M.; Maignan, M. Analysis and Modeling of Spatial Environmental Data; EPFL Press: Lausanne, Switzerland, 2004. [Google Scholar]
Genton, M.G.; Butry, D.T.; Gumpertz, M.L.; Prestemon, J.P. Spatio-temporal analysis of wildfire ignitions in the St. Johns River Water Management District, Florida. Int. J. Wildland Fire 2006, 15, 87–97. [Google Scholar] [CrossRef]
Landagan, E.B.; Barrios, O.Z. An estimation procedure for a spatial-temporal model. Stat. Probab. Lett. 2007, 77, 401–406. [Google Scholar] [CrossRef]
Li, B.; Genton, M.G.; Sherman, M. A nonparametric assessment of properties of space-time covariance functions. J. Am. Stat. Assoc. 2007, 102, 736–744. [Google Scholar] [CrossRef]
Nelson, T.A.; Duffus, D.; Robertson, C.; Laberfee, K.; Feyrer, L.J. Spatial-temporal analysis of marine wildlife. J. Coast. Res. 2009, 56, 1537–1541. [Google Scholar]
Hayn, M.; Beirle, S.; Hamprecht, F.A.; Platt, U.; Menze, B.H.; Wagner, T. Analysing spatio-temporal patterns of the global NO2-distribution retrieved from GOME satellite observations using a generalized additive model. Atmos. Chem. Phys. 2009, 9, 6459–6477. [Google Scholar] [CrossRef]
Sherman, M. Spatial Statistics and Spatio-Temporal Data; Wiley: New York, NY, USA, 2011. [Google Scholar]
Arcuti, S.; Calculli, C.; Pollice, A.; D’Onghia, G.; Maiorano, P.; Tursi, A. Spatio-temporal modeling of zero-inflated deep-sea shrimp data by Tweedie generalized additive. Statistica 2013, 73, 103–122. [Google Scholar]
Lekdee, K.; Ingsrisawang, L. Generalized linear mixed models with spatial random effects for spatio-temporal data: An application to dengue fever mapping. J. Math. Stat. 2013, 9, 137–143. [Google Scholar] [CrossRef]
Barzegar, Z.; Rivaz, F. A scalable Bayesian nonparametric model for large spatio-temporal data. Comput. Stat. 2021, 35, 153–173. [Google Scholar] [CrossRef]
Harper, A.; Baker, P.N.; Xia, Y.; Kuang, T.; Zhang, H.; Chen, Y.; Han, T.-L.; Gulliver, J. Development of spatiotemporal land use regression models for PM2.5 and NO2 in Chongqing, China, and exposure assessment for the CLIMB study. Atmos. Pollut. Res. 2021, 12, 101096. [Google Scholar] [CrossRef]
Ibañez, M.V.; Martínez-Garcia, M.; Simó, A. A Review of Spatiotemporal Models for Count Data in R Packages. A Case Study of COVID-19 Data. Mathematics 2021, 9, 1538. [Google Scholar] [CrossRef]
Feng, C. Spatial-temporal generalized additive model for modeling COVID-19 mortality risk in Toronto, Canada. Stat. Sci. 2022, 49, 100526. [Google Scholar] [CrossRef]
Cressie, N.; Hawkins, D.M. Robust estimation of the variogram: I. J. Int. Assoc. Math. Geol. 1980, 12, 115–125. [Google Scholar] [CrossRef]
Ichimura, H. Semiparametric least squares (SLS) and weighted SLS estimation of single-index models. J. Econom. 1993, 58, 71–120. [Google Scholar] [CrossRef]
Hridtache, M.; Juditski, A.; Spokoiny, V. Direct estimation of the single coefficients in a single-index model. Ann. Stat. 2001, 29, 595–623. [Google Scholar]
Wang, J.L.; Xue, L.G.; Zhu, L.X.; Chong, Y.S. Extension for a partial-linear single-index model. Ann. Stat. 2010, 38, 246–274. [Google Scholar]
Chang, Z.Q.; Xue, L.G.; Zhu, L.X. On asymptotically more efficient estimation of the single-index model. J. Multivar. Anal. 2010, 101, 1898–1901. [Google Scholar] [CrossRef]
Mahmoud, H.F.; Kim, I.; Kim, H. Semiparametric single index multi change points model with an application of environmental health study on mortality and temperature. Environmetrics 2016, 27, 49–506. [Google Scholar] [CrossRef]
Mahmoud, H.F.; Kim, I. Semiparametric spatial mixed effects single index models. Comput. Stat. Data Anal. 2019, 136, 108–122. [Google Scholar] [CrossRef]
Horowitz, J.L.; Hardle, W. Direct semiparametric estimation of single-index models with discrete covariates. J. Am. Stat. Assoc. 1996, 91, 1623–1629. [Google Scholar] [CrossRef]
Xia, Y.; Tong, H.; Li, W.K.; Zhu, L.X. An adaptive estimation of dimension reduction space. J. R. Stat. Soc. Ser. B 2002, 64, 363–410. [Google Scholar] [CrossRef]
Lin, W.; Kulasekera, K.B. Identifiability of single index models and additive index models. Biometrika 2007, 94, 496–501. [Google Scholar] [CrossRef]
Pang, Z.; Xue, L. Estimation of the single-index models with random effects. Comput. Stat. Data Anal. 2012, 56, 1837–1853. [Google Scholar] [CrossRef]
Hardle, W.; Hall, P.; Ichimura, H. Optimal smoothing in single-index models. Ann. Stat. 1993, 21, 157–178. [Google Scholar] [CrossRef]
Knorr-Held, L. Bayesian modeling of inseparable space-time variation in disease risk. Stat. Med. 2000, 19, 2555–2567. [Google Scholar] [CrossRef]
Liu, H.; Davidson, R.A.; Apanasovich, T.V. Spatial generalized linear mixed models of electric power outages due to hurricanes and ice storms. Reliab. Eng. Syst. Saf. 2008, 93, 875–890. [Google Scholar] [CrossRef]
McCulloch, C.E. Maximum likelihood variance components estimation for binary data. J. Am. Stat. Assoc. 1994, 89, 330–335. [Google Scholar] [CrossRef]
Booth, J.G.; Hobert, J.P. Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo em algorithm. J. R. Stat. Soc. Ser. B 1999, 61, 265–285. [Google Scholar] [CrossRef]
Caffo, B.S.; Jank, W.; Jones, G.L. Ascent-based Monte Carlo expectation maximization. J. R. Stat. Soc. Ser. B 2005, 67, 235–251. [Google Scholar] [CrossRef]
Tan, M.; Tian, G.-L.; Fang, H.-B. An efficient MCEM algorithm for fitting generalized linear mixed models for correlated binary data. J. Stat. Comput. Simul. 2007, 77, 929–943. [Google Scholar] [CrossRef]
An, X.; Bentler, P.M. Efficient direct sampling MCEM algorithm for latent variable models with binary responses. Comput. Stat. Data Anal. 2012, 56, 231–244. [Google Scholar] [CrossRef]
Geyer, C.J. Practical Markov chain Monte Carlo. Stat. Sci. 1922, 7, 473–483. [Google Scholar] [CrossRef]
Ruppert, D.; Wand, M.P.; Carroll, R.J. Semiparametric Regression; Cambridge Press: New York, NY, USA, 2003. [Google Scholar]
Wand, M.P.; Jones, M.C. Kernel Smoothing; Chapman and Hall: London, UK, 1995. [Google Scholar]
Banerjee, S.; Carlin, C.P.; Gelfand, A.E. Hierarchical Modeling and Analysis for Spatial; Chapman and Hall: London, UK, 2004. [Google Scholar]

Figure 1. The 6 major cities locations in South Korea: Seoul, Busan, Daegu, Incheon, Gwangju, and Daejeon.

Figure 2. The relationship between temperature and mortality (a), the relationship between humidity and mortality (b), the relationship between pressure and mortality (c), and the relationship between month and mortality (d) for the six cities in South Korea.

Figure 3. Smoothed mortality functions from the ST-SIM (left) and smoothed mortality functions from the IST-SIM along with 95% confidence intervals (right) of the six major cities in South Korea.

Figure 4. Boxplots of the prediction mean square error (PMSE) of the proposed two models (ST-SIM and IST-SIM) at different evaluation data set sizes (

n = 2, 4, 6

).

Figure 4. Boxplots of the prediction mean square error (PMSE) of the proposed two models (ST-SIM and IST-SIM) at different evaluation data set sizes (

n = 2, 4, 6

).

Table 1. Results of 100 simulated data sets: mean ± standard error (SE), median, and interquartile range (IQR) of the 100 model parameters estimates and MSE of the two proposed models (IST-SIM and ST-SIM) at different values of the dependence range (

ρ = 1

and

ρ = 3

).

Table 1. Results of 100 simulated data sets: mean ± standard error (SE), median, and interquartile range (IQR) of the 100 model parameters estimates and MSE of the two proposed models (IST-SIM and ST-SIM) at different values of the dependence range (

ρ = 1

and

ρ = 3

).

	Model		True	Mean ± SE	MSE	Median	IQR
$ρ = 1$	ST-SIM	$β_{2}$	1	0.993 ± 0.007	0.006	0.998	0.071
		$σ_{u}^{2}$	0.5	0.451 ± 0.005	0.006	0.439	0.066
		$σ_{t}^{2}$	0.5	0.455 ± 0.001	0.006	0.541	0.017
		$R^{2}$	-	0.989 ± 0.000	-	0.989	0.003
	IST-SIM	$β_{1}$	1	1.002 ± 0.043	0.119	0.923	0.286
		$β_{2}$	1	1.015 ± 0.044	0.123	0.937	0.325
		$σ_{u}^{2}$	0.5	0.486 ± 0.077	0.127	0.310	0.370
		$σ_{t}^{2}$	0.5	0.457 ± 0.007	0.006	0.452	0.047
		$R^{2}$	-	0.989 ± 0.000	-	0.989	0.003
$ρ = 3$	ST-SIM	$β_{2}$	1	0.989 ± 0.006	0.004	0.988	0.088
		$σ_{u}^{2}$	0.5	0.437 ± 0.004	0.006	0.428	0.061
		$σ_{t}^{2}$	0.5	0.457 ± 0.001	0.002	0.452	0.021
		$R^{2}$	-	0.9888 ± 0.000	-	0.988	0.003
	IST-SIM	$β_{1}$	1	0.994 $\pm 0.069$	0.089	0.866	0.233
		$β_{2}$	1	0.989 ± 0.066	0.101	0.856	0.252
		$σ_{u}^{2}$	0.5	0.407 ± 0.042	0.088	0.298	0.027
		$σ_{t}^{2}$	0.5	0.476 ± 0.005	0.004	0.461	0.029
		$R^{2}$	-	0.989 ± 0.000	-	0.989	0.004

Table 2. The mean square error (MSE) and the predicted residual sum of squares (PRESS) results of the proposed models of 100 simulated data sets from each true model.

		True Model
		ST-SIM		IST-SIM
Criterion	Fitted Model	Mean ± SE	Median	Mean ± SE	Median
MSE	ST-SIM	135.7 ± 5.11	135.0	266.4 ± 16.6	236.2
	IST-SIM	124.5 ± 4.69	125.5	123.8 ± 6.17	130.1
PRESS	ST-SIM	372.7 ± 36.2	275.8	587.8 ± 45.9	446.8
	IST-SIM	388.6± 40.7	267.4	371.8 ± 25.0	312.9

Table 3. The ST-SIM =

g (Z_{1} u + X β) + Z_{2} ω

and IST-SIM =

g (X β) + Z_{1} u + Z_{2} ω

parameter estimates along with the 95% confidence intervals,

R^{2}

values, and the log-likelihood values.

Table 3. The ST-SIM =

g (Z_{1} u + X β) + Z_{2} ω

and IST-SIM =

g (X β) + Z_{1} u + Z_{2} ω

parameter estimates along with the 95% confidence intervals,

R^{2}

values, and the log-likelihood values.

	ST-SIM		IST-SIM
	Estimate	95% CI	Estimate	95% CI
$x_{1}$	−0.05	(−0.049, −0.052)	0.66	(0.60, 0.72)
$x_{2}$	0.06	(0.055, 0.066)	-	-
$x_{3}$	0.02	(0.011, 0.032)	−0.49	(−0.41, −0.52)
$σ_{u}^{2}$	2.13		0.97
$σ_{ω}^{2}$	0.47		0.52
log Likelihood	−297.3		−363.2
$R^{2}$	0.87		0.41
MSE	637.1		131.7

Table 4. Spatial random effects estimates of the two introduced models [ST-SIM =

g (Z_{1} u + X β) + Z_{2} ω

and IST-SIM =

g (X β) + Z_{1} u + Z_{2} ω

].

Table 4. Spatial random effects estimates of the two introduced models [ST-SIM =

g (Z_{1} u + X β) + Z_{2} ω

and IST-SIM =

g (X β) + Z_{1} u + Z_{2} ω

].

	ST-SIM	IST-SIM
Busan	1.254	2.377
Daegu	0.715	1.092
Gwangju	−0.422	−0.637
Daejeon	−0.697	−1.258
Incheon	−0.876	−1.479
Seoul	−0.905	−1.577

Table 5. Predicted mean square error (PMSE), mean square error (MSE), log likelihood prediction and fits, and

R^{2}

of ST-SIM =

g (Z_{1} u + X β) + Z_{2} ω

and IST-SIM =

g (X β) + Z_{1} u + Z_{2} ω

summary results of 500 estimates of the fitting and prediction criteria for ST-SIM and IST-SIM at different sizes of the evaluating data sets (

n = 2, 4, 6

).

Table 5. Predicted mean square error (PMSE), mean square error (MSE), log likelihood prediction and fits, and

R^{2}

of ST-SIM =

g (Z_{1} u + X β) + Z_{2} ω

and IST-SIM =

g (X β) + Z_{1} u + Z_{2} ω

summary results of 500 estimates of the fitting and prediction criteria for ST-SIM and IST-SIM at different sizes of the evaluating data sets (

n = 2, 4, 6

).

		PMSE		Log Likelihood
	Model	Mean	Median	Prediction	Fits	MSE	$R^{2}$
n = 2	ST-SIM	922.9	884.0	−75.6	−300.2	600.3	0.45
	IST-SIM	274.2	210.5	−63.3	−248.6	120.8	0.88
n = 4	ST-SIM	967.0	930.1	−235.3	−235.3	531.0	0.51
	IST-SIM	352.3	219.3	−134.3	−201.6	122.59	0.88
n = 6	ST-SIM	1094.8	985.7	−212.6	−174.7	467.3	0.56
	IST-SIM	633.4	250.2	−192.7	−153.8	117.9	0.89

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mahmoud, H.F.F.; Kim, I. Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models. Mathematics 2023, 11, 4629. https://doi.org/10.3390/math11224629

AMA Style

Mahmoud HFF, Kim I. Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models. Mathematics. 2023; 11(22):4629. https://doi.org/10.3390/math11224629

Chicago/Turabian Style

Mahmoud, Hamdy F. F., and Inyoung Kim. 2023. "Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models" Mathematics 11, no. 22: 4629. https://doi.org/10.3390/math11224629

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models

Abstract

1. Introduction

2. Semiparametric Models

2.1. Semiparametric Spatio-Temporal Single-Index Model

2.2. Semiparametric Integrated Spatio-Temporal Single-Index Model

2.3. Covariance Functions of Spatial and Temporal Effects

3. Model Estimation

3.1. Estimating Spatial and Time Effects

3.2. ST-SIM Estimation Algorithm

3.3. IST-SIM Estimation Algorithm

4. Simulation

5. Application

5.1. Data and Models

5.2. Models Estimation

5.3. Model Selection

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI