Next Article in Journal
Some Double q-Series by Telescoping
Previous Article in Journal
Artificial Intelligence in Business-to-Customer Fashion Retail: A Literature Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Profile Maximum Likelihood Estimation of Single-Index Spatial Dynamic Panel Data Model

Department of Mathematics, Harbin Institute of Technology, Harbin 150001, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(13), 2947; https://doi.org/10.3390/math11132947
Submission received: 6 June 2023 / Revised: 22 June 2023 / Accepted: 26 June 2023 / Published: 1 July 2023

Abstract

:
In this paper, the spatial dynamic panel data (SDPD) model is extended to the single-index spatial dynamic panel data (Si-SDPD) model by introducing a nonlinear connection function to reflect the interaction between explanatory variables. The Si-SDPD model not only retains the advantages of the parametric SDPD model in dealing with spatial and temporal interaction effects and spatio-temporal dependencies, but also solves the limitations of the parametric SDPD model that may lead to missed bias. It reduces the data dimension of non-parametric models and enhances the practicability and explanatory power of parametric models. Since the parts of the model to be estimated contain unknown functions, we propose a new estimation method, a profile maximum likelihood (PML) method, to solve the problem of incidental parameters in the estimation. Under the assumption that the spatial coefficients are known, we preliminarily estimate the unknown function by carrying out local polynomial estimation, so as to transform the model into the parametric form for solving purposes. We then solve the dynamic panel parametric model via quasi-maximum likelihood (QML) estimation. We derive the asymptotic properties of profile maximum likelihood estimators (PMLEs) and find that, under certain regularity conditions, both parametric and non-parametric estimators are consistent. Monte Carlo results show that PMLEs have good finite sample performance.

1. Introduction

Research into spatial panel data is always a hot topic in econometrics. Scholars have gradually applied the spatial panel data model to analyze social and economic issues, such as the effective allocation of resources, factors influencing economic growth, energy consumption, environmental pollution, foreign direct investments, and so on. Compared with time series or cross-section data, panel data can increase the degree of freedom and reduce collinearity, thereby improving the accuracy of parameter estimation. Spatial panel data models can be divided into parametric and non-parametric models according to different assumptions about the explanatory variables. The parametric panel data model can simply and clearly describe the relationship between dependent and independent variables. Refs. [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15] and others have made outstanding contributions to parametric panel data modelling and its inference of economic phenomena. However, in many practical problems, the relationship between variables is usually complex, and it is often difficult to accurately describe this relationship using the preset form of the parametic model. In order to solve this problem, some non-parametric panel data models have been widely studied. Because the function form of a non-parametric model can be arbitrary, the model has great adaptability. When there are many explanatory variables in non-parametric models, the so-called “curse of dimensionality” problem often occurs, leading to the reduced reliability of estimates. Therefore, some new models have been proposed, and one of the effective methods is to build a single-index model.
The basic mathematical form of the single-index model is Y = g x β + ε (Ref. [16]), where β is the unknown parameter and g · is the unknown connection function g · . Its advantages are that it achieves dimension reduction by connecting functions, effectively avoids the problem of “curse of dimensionality”, and (at the same time) better reflects the relationship between variables. In recent years, the research focus on various extended single-index models has been to seek effective estimates of β and g · . For the single-index model of cross-section data, Ref. [17] proposed a semi-parametric maximum likelihood estimation method, while Ref. [18] proposed non-iterative methods for estimating β and g · based on the parameter constraint assumption, which improves the operation speed. Ref. [19] proposed root-n-consistent estimators for the variance components and a local linear smoother for the connection function. Ref. [20] constructed a semi-parametric minimum average variance estimation method in a partially linear single-index panel data model with fixed effects, and demonstrated the asymptotic properties of the estimation. Compared with the cross-sectional data model, there are very few studies on the single-index model for panel data, and even fewer on the single-index spatial panel data model because of its greater estimation complexity.
According to these research trends of spatial econometric models, we design a Si-SDPD model by introducing a nonlinear connection function to reflect the interaction between explanatory variables. The Si-SDPD model not only retains the advantages of the parametric SDPD model in dealing with spatial and temporal interaction effects and spatio-temporal dependencies, but also solves the limitations of the parametric SDPD model that may lead to missed bias. It reduces the data dimension of non-parametric models and enhances the practicability and explanatory power of parametric models. It is worth noting that due to the presence of a spatial lag term and the multi-directionality of the spatial correlation, the traditional estimation of the single-index model may not be directly applied to estimate the Si-SDPD model. In order to overcome the above difficulties, we propose a new estimation method, namely PML, which can first estimate the unknown functions via local polynomial estimation under the assumption that the spatial coefficients are known, and then solves the dynamic panel parametric model by conducting QML estimation. Then, we derive the asymptotic properties of the PMLEs of the Si-SDPD model and find that under certain regularity conditions, both the parametric and non-parametric estimators are consistent. Finally, we present a rigorous theoretical analysis of the asymptotic properties of PMLEs and verify some of their finite-sample properties by carrying out Monte Carlo experiments.
This paper is organized as follows. In Section 2, we introduce the Si-SDPD model and explain our PML estimation method. With the law of large numbers and central limit theorem for our settings developed in the Appendix A, Section 3 establishes the consistency of the parametric and nonparametric parts of the PMLE. We then present a Monte Carlo simulation to verify that the estimators have good finite sample performance. Section 5 concludes the paper. Some useful lemmas and results are provided in the Appendix.

2. The Model and Profile Maximum Likelihood Estimators

2.1. The Model

The model considered in this paper refers to the single-index spatial dynamic panel data (Si-SDPD) model:
Y n t = ρ 0 W n Y n t + γ 0 Y n , t 1 + τ 0 W n Y n , t 1 + G n t + V n t   ,   t = 1 ,   2 ,   ,   T
where Y n t = y 1 t ,   y 2 t ,   ,   y n t and V n t = v 1 t ,   v 2 t ,   ,   v n t are n × 1 column vectors; v n t is independent and identically distributed across i and t with a zero mean and variance σ 0 2 ; W n is the n × n spatial weight matrix, which is predetermined and generates spatial dependence between cross-sectional units y i t ; G n t = g x 1 β ,   g x 2 β ,   ,   g x n β , g · is an unary unknown function, β = 1 ; and the first component of β is positive and is an identifiable condition.
Define Z n t = Y n , t 1 , W n Y n , t 1 , A n ρ = I ρ W n , θ = β ,   δ ,   ρ ,   σ 2 and ζ = β ,   δ ,   ρ , where δ = γ , τ . At the true value, A n = A n ρ 0 = I ρ 0 W n , θ 0 = β 0 , δ 0 ,   ρ 0 ,   σ 0 2 , ζ 0 = β 0 ,   δ 0 ,   ρ 0 , where δ 0 = γ 0 , τ 0 . Then, presuming A n is invertible and (1) can be rewritten as Y n t = A n 1 Z n t δ 0 + G n t + V n t . The likelihood function of (1) is
l n L n , T θ ,   G n t = n T 2 l n 2 π σ 2 + T   l n A n ρ 1 2 σ 2 t = 1 T V n t ζ V n t ζ
where V n t ζ = A n ρ Y n t Z n t δ G n t and V n t ζ = v 1 ,   v 2 ,   ,   v n . Thus, V n t = V n t ζ 0 .
The QMLE θ ^ n T is the extreme estimator derived from the maximization of (2). When the V n t s are normally distributed, θ ^ n T is the MLE; when the V n t s are not normally distributed, θ ^ n T is the QMLE.

2.2. The Profile Maximum Likelihood Estimation

For the likelihood function shown in (2), the parameter estimation method is not feasible because g x i β ,   i = 1 ,   ,   n is unknown. In order to obtain a feasible estimate, we propose adopting the PML method. First, we consider the parameter θ = β ,   δ ,   ρ ,   σ 2 as known, and then (1) becomes a general spatial nonparametric model. The initial estimate g ^ I N x i β of g · can be obtained using the kernel estimation. Obviously, g ^ I N x i β is a function of the parameter θ . By replacing g · in (2) with g ^ I N x i β , we obtain the likelihood function with parameter θ . Then, by maximizing the likelihood function, we obtain the estimator θ ^ = β ^ ,   δ ^ , ρ ^ , σ 2 ^ of θ . Finally, the final estimate of g · , g ^ x i β , is obtained by replacing θ in g ^ I N x i β with θ ^ .
The specific steps of the profile maximum likelihood method are as follows.
  • Step 1: Considering θ = β ,   δ ,   ρ ,   σ 2 as known, we obtain g ^ I N x i β that is an initial estimate of g · using local polynomial estimation.
We denote Y t = A n ρ Y n t Z n t δ = G n t + V n t ζ and Y t = y 1 ,   y 2 ,   ,   y n , and (1) can be written as y i = g x i β + v i = g u i + v i ,   i = 1 ,   ,   n . At u , the p-order Taylor expansion of g u i is:
g u i g u + g u u i u + + 1 p ! g p u i u p λ 0 + λ 1 u i u + + λ p u i u p
Therefore, we can use the samples near u to perform weighted regression to estimate g u i and its higher-order derivatives, i.e., solving the following minimization problem:
min λ 0 , λ 1 1 n i = 1 n y i λ 0 λ 1 u i u 2 k h u i u
where k h u i u = h p k u i u h , k u i u h ,   i = 1 ,   ,   n is the multivariate kernel function and h is the bandwidth. In order to simplify the theoretical derivation, all variables in this chapter have the same window width, and the conclusion is also valid under the assumption of different window widths.
For the convenience of the following matrix operations, we denote p = n 1 . Then, we denote M u , β = 1 1 u 1 u h u n u h ,   λ = λ 0 ,   λ 1 and κ u , β = d i a g k u 1 u h ,   ,   k u n u h , so M u , β is a 2 × n   matrix. We can rewrite the objective function (4) as:
m i n λ 1 n Y t M u , β λ κ u , β Y t M u , β λ
After minimizing, the estimator of λ is:
λ ^ = M u , β κ u , β M u , β 1 M u , β κ u , β Y t
The first component of λ ^ is an estimate of g x i β . We denote M u , β κ u , β M u , β 1 M u , β κ u , β = S u , β , s u ,   β = e 1 S u ,   β , and s u ,   β = e 1 S u ,   β , and the initial estimate of g x i β is
g ^ I N u = e 1 λ ^ = e 1 S u ,   β Y = s u ,   β Y t
where e 1 = 1 ,   0 ,   ,   0 . Then, the initial estimate of G n t is
G ^ I N = g ^ I N u 1 ,   g ^ I N u 2 ,   ,   g ^ I N u n = s u 1 Y ,   s u 2 Y ,   ,   s u n Y t = s u 1 ,   s u 2 ,   ,   s u n Y t = S β Y t
where S β = s x 1 β ,   s x 2 β ,   ,   s x n β .
Step 2: By substituting G ^ I N for G n t in (1), we obtain the approximate value of the logarithmic likelihood function as:
l n   L n ,   T θ = n T 2 l n 2 π σ 2 + T   l n A n ρ 1 2 σ 2 t = 1 T V n t ζ V n t ζ = n T 2 l n 2 π σ 2 + T   l n A n ρ 1 2 σ 2 t = 1 T Y t G ^ I N Y t G ^ I N = n T 2 l n 2 π σ 2 + T   l n A n ρ 1 2 σ 2 t = 1 T I n S β Y t I n S β Y t
The θ ^ n T that can maximize the above formula is the estimate of θ , i.e.,
θ ^ n T = a r g   max θ 1 n T   l n   L n ,   T θ
Computationally and analytically, it is convenient to work with the concentrated log-likelihood by concentrating out the σ 2 . Based on the log-likelihood function, the initial estimate of σ 2 is
σ 2 ^ I N = 1 n T t = 1 T I n S β Y t I n S β Y t = 1 n T t = 1 T I n S β A n ρ Y n t Z n t δ I n S β A n ρ Y n t Z n t δ = 1 n T t = 1 T I n S β G ^ I N + V n t ζ I n S β G ^ I N + V n t ζ
The concentrated log-likelihood function of ζ is
l n   L n ,   T ζ = n T 2 l n 2 π + 1 n T 2 l n σ 2 ^ I N + T   l n A n ρ
The estimate of ζ can be defined as:
ζ ^ n T = a r g   max θ 1 n T   l n   L n ,   T ζ
Equation (10) displays a nonlinear optimization problem, which can be solved using an iterative method in the actual estimation. After ζ ^ n T is obtained, the final estimate of σ 2 can be obtained by replacing ζ in Equation (8) with ζ ^ n T :
σ 2 ^ = 1 n T t = 1 T I n S β ^ Y t I n S β ^ Y t = 1 n T t = 1 T I n S β ^ G ^ I N + V n t ζ ^ n T I n S β ^ G ^ I N + V n t ζ ^ n T
Step 3: By using θ ^ = β ^ ,   δ ^ ,   ρ ^ ,   σ 2 ^ obtained in Step 2 to replace the parameters in the model, we describe the final estimate of the non-parametric part g x i β as:
g ^ u = s u ,   β ^ A n ρ ^ Y n t Z n t δ ^ = s u ,   β ^ Y ^ t

3. Profile Likelihood Estimators and Their Asymptotic Properties

To analyze the asymptotic properties of the estimators, we need the following assumptions:
Assumption 1.
W n  is a constant spatial weight matrix and its diagonal elements satisfy  w n ,   i i = 0  for  i = 1 ,   2 ,   ,   n In addition,  W n  is uniformly bounded in the row and column sums in an absolute value (for short, UB).
Assumption 2.
The disturbances  v i t ,   i = 1 ,   2 ,   ,   n  and  t = 1 ,   2 ,   ,   T , are    i . i . d .  across  i  and  t  with a zero mean, a variance of  σ 2 , and  E v i t 4 + η <  for some  η > 0 .
Assumption 3.
A n ρ  is invertible for all  ρ Λ ρ , where   Λ ρ  is compact and  ρ 0  is in the interior of  Λ ρ . Furthermore,  A n 1 ρ  is UB.
Assumption 4.
x i i = 1 n  is an independent and identically distributed random sequence and  E v i t x i < ,   i = 1 ,   ,   n .  x i β  has marginal density functions  f u i , where  f u i  is continuously differentiable near  x i β Λ u  and  Λ u  is the support set of  k · .
Assumption 5.
g ·  has continuous derivatives  g p · ,   p = 1 ,   ,   n  and  g · 𝜛 g , where  𝜛 g  is a positive constant.
Assumption 6.
When  n ,  h 0  and  n h p ,   p = 1 ,   ,   n .
Assumption 7.
The kernel function  k ·  is a bounded continuous non-negative function whose support set is bounded and closed:  𝜛 k ,   𝜛 k R p , where  𝜛 k > 0  is a constant, i.e.,  k m > 0  only if  m 𝜛 k . In addition,  M u ,   β ,  κ u ,   β , and  M u ,   β κ u ,   β M u ,   β 1  are UB.
Assumption 8.
k ·  is an even function and  k m m l d m = 𝜛 l ,   k 2 m m l d m = ς l . For any positive odd number  l , 𝜛 l = ς l = 0 ; also,  𝜛 1 = 0 ,   𝜛 2 0 .
Assumption 9.
β 0  is the inner point of  D , where  D R p  is a convex compact set.  β 0 = 1  and the first component of vector quantity  β 0  is positive, where  ·  is the Euclidean norm.
Assumption 1 is a standard normalization assumption in spatial econometrics. Here, the matrix A = A i j is UB, meaning that there is a non-negative constant 𝜛 A such that i = 1 n A i j 𝜛 A and j = 1 n A i j 𝜛 A . The constant 𝜛 A is defined differently in the following sections depending on the matrices. Assumption 2 provides regularity assumptions for v i t . The reversibility and compactness of A n ρ in Assumption 3 were derived from Kelejian and Prucha (1998, 2001) and have also been used in many articles about spatial correlation. When exogenous variables u i are included in the model, it is convenient to assume that the exogenous regressors are uniformly bounded, as in Assumption 4. Assumption 5 is a necessary condition for (3). Assumptions 6–8 demonstrate the condition of kernel density estimation. The bandwidth of the kernel function, h , is an important parameter that affects the estimation result of the kernel function. Kernel functions that satisfy Assumptions 7–8 exist, such as the product kernel, k m = Π i = 1 p k m i , where k m i is a symmetric kernel of one variable on the closed interval 𝜛 k ,   𝜛 k . Assumption 9 makes the model (1) recognizable.
For the concentrated log-likelihood Function (4) divided by the sample size n T , the corresponding expected value function is Q n ,   T θ = max σ 2 E 1 n T l n   L n ,   T θ , which is
Q n ,   T θ = 1 n T E l n   L n ,   T θ = 1 2 l n 2 π + 1 + 1 n   l n A n ρ 1 2 σ 2 E 1 n T t = 1 T V n t ζ V n t ζ
To show the consistency of θ ^ n T , we need the following uniform convergence results.
Lemma 1.
Under Assumptions 1–9, for an  n × n  nonstochastic UB matrix  B n ,
1 n T t = 1 T V n t B n V n t E 1 n T t = 1 T V n t B n V n t = O p 1 n T 1 n T t = 1 T Z n t B n Z n t E 1 n T t = 1 T Z n t B n Z n t = O p 1 n T 1 n T t = 1 T Z n t B n V n t E 1 n T t = 1 T Z n t B n V n t = O p 1 n T
where  E 1 n T t = 1 T V n t B n V n t = 1 n σ 0 2 t r B n = O 1 ,  E 1 n T t = 1 T Z n t B n Z n t = O 1 , and  E 1 n T t = 1 T Z n t B n V n t = O 1 T .
Lemma 2.
Let Θ be any compact parameter space. Then, under Assumptions 1–9,  1 n T l n   L n ,   T θ Q n ,   T θ P 0  is uniform in    θ Θ .
Lemma 3.
Let Θ be any compact parameter space. Then, under Assumptions 1–8,  Q n ,   T ζ  is uniformly equicontinuous for  θ Θ .
Before obtaining the information matrix, we need to compute the first and second derivatives of the logarithmic likelihood function. The asymptotic distribution of the QMLE θ ^ n T can be derived from the Taylor expansion of l n   L n ,   T θ ^ n T θ around θ 0 . We define θ β = θ β ^ = δ ,   ρ ,   σ 2 and ζ β = ζ β ^ = δ ,   ρ at the true value θ 0 β = θ 0 β ^ = δ 0 ,   ρ 0 ,   σ 0 2 and ζ 0 β = ζ 0 β ^ = δ 0 ,   ρ 0 . The first-order derivative of the concentrated likelihood function involves both linear and quadratic functions of V n t , which are as follows:
1 n T l n   L n ,   T θ 0 β θ β = 1 n T l n   L n ,   T θ 0 β δ l n   L n ,   T θ 0 β ρ l n   L n ,   T θ 0 β σ 2 = 1 n T 1 σ 0 2 t = 1 T Z n s V n t 1 σ 0 2 t = 1 T F n s Z n t δ 0 V n t t = 1 T V n t F n V n t σ 0 2 t r F n 1 2 σ 0 4 t = 1 T V n t V n t n T σ 0 2 T
where Z n s = I n S β ^ Z n t and Y n s = I n S β ^ W n Y n t . Then, the second-order derivatives are:
1 n T 2 l n   L n ,   T θ β θ β θ β = 1 n T × 1 σ 2 t = 1 T Z n s Z n s 1 σ 2 t = 1 T Z n s   Y ns 1 σ 4 t = 1 T Z n s V n t ζ β * 1 σ 2 t = 1 T Y ns Y ns T t r F n 2 1 σ 4 t = 1 T Y ns V n t ζ β * * n T 2 σ 4 + 1 σ 6 t = 1 T V n t ζ β V n t ζ β
The information matrix is as follows:
Σ θ 0 β ,   n T = E 1 n T 2 l n   L n ,   T θ 0 β θ β θ β = 1 n T 1 σ 0 2 t = 1 T Z n s Z n s 1 n T 1 σ 0 2 t = 1 T Z n s R n s 0 * 1 n T 1 σ 0 2 E t = 1 T R n s R n s + 1 n t r F n 2 + t r F n s F n s 1 n σ 0 2 t r F n s * * 1 2 σ 0 4
where F n s = I n S β ^ F n and R n s = F n s Z n t δ 0 + G ^ I N .
Assumption 10.
lim n 1 n t r F n 2 + t r F n s F n s 2 n t r F n s 2 0 .
Assumption 10 is an important condition for the non-singularity of the limiting information matrix Σ θ 0 β ,   n T in addition to the global identification in Lemma 4 and Theorem 1.
Lemma 4.
The information matrix  Σ θ 0 β ,   n T  is non-singular.
Theorem 1.
Under Assumptions 1–10,  θ 0  is globally identifiable and  θ ^  is a consistent estimator of  θ 0  (similar to Yu (2008)).
Theorem 2.
Under Assumptions 1–10,  θ 0  is globally identifiable and  θ ^ n T P θ 0  if  l i m n 1 n l n σ 0 2 A n 1 A n 1 1 n l n σ n 2   ρ A n 1 ρ A n 1 ρ 0  for  ρ ρ 0  (similar to Yu (2008)).
Lemma 5.
Under Assumptions 1–10,  1 n M u ,   β κ u ,   β M u ,   β P f x 1 0 0 𝜛 2 I n .
Lemma 6.
Under Assumptions 1–10,  S β V n t = o p 1 .
Lemma 7.
Under Assumptions 1–9,  I n S G n t = o p 1 .
Theorem 3.
Under Assumptions 1–9,  G ^ n t P G n t .

4. Monte Carlo Results

In this section, the Monte Carlo experiment is carried out on the estimation method pertaining to the previously constructed Si-SDPD model, and the simulation results are evaluated to test the performance of the PML method under limited samples. Then, the practical application value of the PMLEs is evaluated. All experiments are compiled using R language and plotted using the ‘ggplot2′ package.
For the parametric part, we generate samples from (1) and use θ 1 β = 0.3 ,   0.4 ,   1 , 0.2 and θ 2 β = 0.25   , 0.6 ,   0.5 ,   0.1 , where θ 0 β = γ 0 ,   ρ 0 ,   σ 0 2 ,   τ 0 . The component of the two-dimensional random variable X = x 11 ,   x 12 ,   ,   x 21 ,   x 22 ,   ,     x 1 n   x 2 n = X 1 X 2   and the random error term V n t = v 1 t ,   v 2 t ,   ,   v n t are generated from the uniform distribution U 3 ,   3 and the independent normal distribution N 0 ,   σ 0 2 , respectively. The spatial weight matrix W n that we use is the Rook matrix, which is one of the main types of spatial weight matrix in spatial econometrics. For the non-parametric part, we use the commonly used Gaussian kernel function, g X = 2 e β 1 X 1 + β 2 X 2 2 , where β 1 ,   β 2 = 1 / 3 ,   2 / 3 0.5774 ,   0.8165 . As it is difficult to select the optimal window width, we simply select the window width using the rule-of-thumb method. Finally, we use n = 10 ,   49 ,   100 as the sample size and T = 4 ,   10 as the number of periods. For each set of n and T , the sampling observations are generated with the Metropolis–Hastings sampling algorithm.
The evaluation of simulation results should also be divided into parametric and non-parametric parts. In the parametric part, for each estimator, we calculate the standard deviation (Std) and the root-mean-squared error (RMSE), where R M S E = 1 n i = 1 r e p s θ i ^ θ 0 2 1 / 2 ,   r e p s is the number of simulations and θ i ^ ,   i = 1 ,   ,   r e p s are the parameter estimates obtained from each simulation. In order to accurately estimate the parameter values, according to Su (2012), we take the window width h = s t d x · n 1 / 5 here, where s t d x represents the standard deviation of sequence x . In the non-parametric part, we refer to Chen (2012) when choosing the mean absolute deviation error (MADE) as the evaluation standard, which is M A D E = 1 M j = 1 M g j ^ x m g j x m ,   j = 1 ,   ,   r e p s ,   where x m m = 1 M is the M fixed grid points selected within the support set of x . We select 20 fixed lattice points in 2 ,   2 , namely M = 20 . When estimating the non-parametric part, we use the method of leave-one-out cross-validation to select the window width, i.e., the window width minimizes 1 n T t = 1 T i = 1 n y ^ i g ^ i 2 , where y ^ i is the ith element of the Y ^ t = A n ρ ^ Y n t Z n t δ ^ after the estimated value θ ^ and g ^ i is the g x i estimate obtained with the observation value other than the ith observation.
For different cases of n and T , 100 simulations are carried out with R Language. In each simulation, the Metropolis–Hastings sampling algorithm is used to conduct 1000 samples in the PML function. In order to obtain the distribution of samples close to reality and in order to achieve stability in the state, the first 200 sampling results are discarded. With two different values of θ 0 for each n and T , finite sample properties of both estimators are summarized in Table 1 and Table 2, in which we report the means, variances (Vars), root mean square error (RMSE), and coverage probability (CP).
For each case, the estimated value of the parameter, i.e., the mean, is relatively close to the real value, and we can see that for each given n , when T is larger, the variance of estimators will be smaller; for each given T , when n is larger, the biases between the real value and the estimators will be nearly the same, but the variance will be smaller. When both n and T are maximized, i.e., n = 100 ,   T = 10 , the variance and RMSEs of the parameter estimators are the smallest in all cases, which indicates that the parameter estimators will converge with the increase in the n value, which is consistent with the large sample property, as demonstrated. For different values of θ 0 , the variances are almost all less than 0.01, which indicates that the fitting error is small and the fitting results are good. In addition, the variance and root mean square error have little changes and will not change with an increase in the simulation time and the sample size, which further indicates that the variance of parameters is stable. Due to the small estimation variance, the mean hardly fluctuates around the true value, and the range of the confidence interval is also relatively stable. In only a few cases, the confidence intervals do not cover the true value, and with an increase in n and T , the coverage degree becomes higher and higher, i.e., the CP gradually approaches 1.
Table 3 shows the average absolute error and variance of unknown function estimators g X = 2 e β 1 X 1 + β 2 X 2 2 under different samples. As shown in Table 3, with an increase in the sample size and total number of periods under the same parameter setting, both the value and estimation error of the unknown function at 20 fixed lattice points decrease, indicating that the estimation of the non-parametric part is convergent. This is consistent with the theoretical results of Theorem 3.

5. Conclusions

In this paper, we propose a Si-SDPD model that can overcome the “Curse of Dimensionality” problem and deal with spatial and temporal correlation. With this model, we construct a PML method when the traditional maximum likelihood estimation method is not applicable. The theoretical results show that, under certain regularity conditions, both parametric and nonparametric estimators are consistent. Numerical simulation results show that PMLEs have good small sample characteristics and estimation accuracy increases with an increase in the sample size and time periods. Our research results will enrich and improve the estimation methods of single-index panel data models in spatial econometrics, and provide a new research tool and perspective for the applied research of related disciplines.
It is a complicated problem to prove the asymptotic normality of PML estimation because the number of parameters and unknown variables in the Si-SDPD model is too large, and the representation of the covariance matrix will be very complicated. The cross-sectional heteroskedasticity (space-varying error variances) in the Si-SDPD model is another interesting extension to consider. Most estimators are generally inconsistent in the presence of an unknown form of heteroskedasticity in the disturbance term in the SDPD model, much less in the Si-SDPD model with more unknown parameters. These works would be much more challenging than the already quite challenging works presented in this paper, and will be the topics of our future research.

Author Contributions

Conceptualization, M.Z. and B.T.; methodology, M.Z. and B.T.; software, M.Z.; validation, M.Z. and B.T.; formal analysis, M.Z. and B.T.; investigation, M.Z. and B.T.; resources, M.Z.; data curation, M.Z.; writing—original draft preparation, M.Z.; writing—review and editing, M.Z. and B.T.; visualization, M.Z. and B.T.; supervision, B.T.; project administration, B.T.; funding acquisition, B.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number 91646106).

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to the requirements of related projects supported by the National Natural Science Foundation of China.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Some Basic Lemmas

Proof of Lemma 2.
Based on  I n S β ^ Y t = V n t ζ = A n ρ Y n t Z n t δ G ^ I N , we have  V n t ζ = V n t Z n t δ δ 0 ρ ρ 0 W n Y n t . Hence,
V n t ζ V n t ζ = V n t V n t 2 δ δ 0 Z n t V n t 2 ρ ρ 0 W n Y n t V n t + ρ ρ 0 2 W n Y n t W n Y n t + δ δ 0 Z n t Z n t δ δ 0 + 2 ρ ρ 0 W n Y n t Z n t δ δ 0
where, using  W n Y n t = W n A n 1 Z n t δ + I n S β ^ 1 V n t  and  F n = W n A n 1 ,
W n Y n t W n Y n t = δ 0 Z n t F n F n Z n t δ 0 + F n I n S β ^ 1 V n t F n I n S β ^ 1 V n t + 2 δ 0 Z n t F n F n I n S β ^ 1 V n t
Using Lemma 1,
1 n T t = 1 T V n t V n t E 1 n T t = 1 T V n t V n t P 0 1 n T t = 1 T Z n t Z n t E 1 n T t = 1 T Z n t Z n t P 0 1 n T t = 1 T Z n t V n t E 1 n T t = 1 T Z n t V n t P 0 1 n T t = 1 T W n Y n t V n t E 1 n T t = 1 T W n Y n t V n t P 0 1 n T t = 1 T W n Y n t Z n t E 1 n T t = 1 T W n Y n t Z n t P 0 1 n T t = 1 T W n Y n t W n Y n t E 1 n T t = 1 T W n Y n t W n Y n t P 0
As  δ  and  ρ  are bounded in Θ, we have  1 n T t = 1 T V n t ζ V n t ζ 1 n T E t = 1 T V n t ζ V n t ζ P 0  that is uniform in θ and Θ. Using the fact that  σ 2  is bounded away from zero in Θ and  Q n ,   T θ = 1 n T E l n   L n ,   T θ ,
1 n T l n   L n ,   T θ Q n ,   T θ = 1 2 σ 2 t = 1 T V n t ζ V n t ζ 1 n T E t = 1 T V n t ζ V n t ζ P 0
uniformly in  θ . □
Proof of Lemma 3
(similar to Ref. [3]). Given  G ^ I N = S β ^ Y t  and  V n t ζ = I n S β ^ Y t , we have
V n t ζ = A n ρ A n 1 Z n t δ 0 + G ^ I N + V n t Z n t δ G ^ I N = A n ρ A n 1 Z n t δ 0 Z n t δ + A n ρ A n 1 V n t + A n ρ A n 1 G ^ I N G ^ I N = A n ρ A n 1 Z n t δ 0 Z n t δ + A n ρ A n 1 S β ^ I n S β ^ 1 V n t
And  Q n ,   T θ = 1 n T E l n   L n ,   T θ = 1 2 l n 2 π + 1 + 1 n   l n A n ρ 1 2 σ 2 E 1 n T t = 1 T V n t ζ V n t ζ , where
E 1 n T t = 1 T V n t ζ V n t ζ = 1 n T E t = 1 T A n ρ A n 1 Z n t δ 0 Z n t δ A n ρ A n 1 Z n t δ 0 Z n t δ + 1 n σ 0 2   t r A n ρ A n 1 S β ^ I n S β ^ 1 A n ρ A n 1 S β ^ I n S β ^ 1 + 2 n T t = 1 T A n ρ A n 1 Z n t δ 0 Z n t δ A n ρ A n 1 S β ^ I n S β ^ 1 V n t
According to Lemma 1, the third term is  2 n T t = 1 T A n ρ A n 1 Z n t δ 0 Z n t δ A n ρ A n 1 S β ^ I n S β ^ 1 V n t = O 1 T , and  O 1 T  is uniform in  θ  and Θ because it is a polynomial function in  θ and Θ is a bounded set. The second term is equal to σ n 2   ρ , where  σ n 2   ρ = 1 n σ 0 2   t r A n ρ A n 1 S β ^ I n S β ^ 1 A n ρ A n 1 S β ^ I n S β ^ 1 , which are polynomial functions of  θ  and are uniform in Θ. Using  A n ρ A n 1 = I n ρ ρ 0 F n  in the first term, we have
1 n T E t = 1 T A n ρ A n 1 Z n t δ 0 Z n t δ A n ρ A n 1 Z n t δ 0 Z n t δ = 1 n T E t = 1 T Z n t δ δ 0 + ρ ρ 0 F n Z n t δ 0 Z n t δ δ 0 + ρ ρ 0 F n Z n t δ 0 = δ δ 0 ,   ρ ρ 0 E Λ n T δ δ 0 ,   ρ ρ 0
where
Λ n T = 1 n T t = 1 T Z n t Z n t 1 n T t = 1 T F n Z n t δ 0 Z n t 1 n T t = 1 T F n Z n t δ 0 Z n t 1 n T t = 1 T F n Z n t δ 0 F n Z n t δ 0
To prove that  Q n ,   T ζ  is uniformly equicontinuous for  θ Θ , the following four conditions must be true. (1)  l n σ 2  is uniformly continuous;  σ 2  is bounded away from zero in  Θ , so (1) is obvious. (2)  1 n l n A n ρ  is uniformly equicontinuous; we know  1 n l n A n ρ 2 1 n l n A n ρ 1 = 1 n t r W n A n 1 ρ ¯ ρ 2 ρ 1 , where  ρ ¯  falls between  ρ 1  and  ρ 2 . Because  A n 1 ρ  is UB, and  θ is uniform in Θ , 1 n t r W n A n 1 ρ ¯  is bounded. Hence, (2) is true. (3) The first term, i.e.,  δ δ 0 ,   ρ ρ 0 E Λ n T δ δ 0 ,   ρ ρ 0 , is uniformly equicontinuous; both  ρ and δ  are bounded, with  E Λ n T = O 1 , meaning (3) is true. (4)  σ n 2   ρ  is uniformly equicontinuous given that
σ n 2   ρ = 1 n σ 0 2   t r A n ρ A n 1 S β ^ I n S β ^ 1 A n ρ A n 1 S β ^ I n S β ^ 1
and  A n ρ A n 1 = I n ρ ρ 0 F n , we have
σ n 2   ρ 2 σ n 2   ρ 1 = σ 0 2 ρ 2 ρ 1   1 n ρ 2 + ρ 1 2 ρ 0 t r F n F n t r F n + F n
And because  F n F n  and  F n  are UB, (4) is true. □
Proof of Lemma 4.
In line with Lee’s study (2004), we can use a contradiction to prove the result. Firstly, we assume  α = α 1 ,   α 2 ,   α 3 ,    where  α 1 ,   α 2  and  α 3  are scalars. Next, for  Σ θ 0 β l i m T Σ θ 0 β ,   n T , we need to prove that  Σ θ 0 β α = 0  implies  α = 0 . If this is true, then, columns of  Σ θ 0 β  would be linear-independent and  Σ θ 0 β  would be nonsingular.
From (21),
Σ θ 0 β = 1 σ 0 2 × E Φ Z Z E Φ Z R 0 E Φ R Z E Φ R R + l i m n σ 0 2 n t r F n 2 + t r F n s F n s l i m n 1 n t r F n s 0 l i m n 1 n t r F n s 1 2 σ 0 2
where Φ Z Z = lim T 1 n T t = 1 T I n S β ^ Z n t I n S β ^ Z n t Φ R Z = Φ Z R Φ Z R = lim T 1 n T t = 1 T I n S β ^ Z n t R n , and  Φ R R = l i m T 1 n T t = 1 T R n R n . Hence,  Σ θ 0 β α = 0 implies
1 σ 0 2 E Φ Z Z × α 1 + 1 σ 0 2 E Φ Z R × α 2 = 0 1 σ 0 2 E Φ R Z × α 1 + 1 σ 0 2 E Φ R R + l i m n 1 n t r F n 2 + t r F n s F n s × α 2 + l i m n 1 n σ 0 2 t r F n s × α 3 = 0 l i m n 1 n σ 0 2 t r F n s × α 2 + 1 2 σ 0 4 × α 3 = 0
The first and third equations imply, respectively,  α 1 = E Φ Z Z 1 E Φ Z R × α 2 and  α 3 = 2 l i m n σ 0 2 n t r F n s × α 2 . By eliminating  α 1 and  α 3 , the second equation becomes  1 σ 0 2 E Φ R R E Φ R Z E Φ Z Z 1 E Φ Z R + l i m n 1 n t r F n 2 + t r F n s F n s + 2 n t r F n s 2 × α 2 = 0 .
Under Assumption 10, we assume that  Φ n T = 1 σ 0 2 Φ Z Z Φ Z R Φ R Z Φ R R  and that  E Φ n T  is nonsingular, meaning that the above formula is only true if  α 2 = 0 , i.e.,  α = 0 . The information matrix  Σ θ 0 β ,   n T  is nonsingular. □
Proof of Lemma 5.
As known from the previous definition,
1 n M u ,   β κ u ,   β M u ,   β = 1 n i = 1 n k h u i u 1 n i = 1 n k h u i u u i u h 1 n i = 1 n k h u i u u i u h 1 n i = 1 n k h u i u u i u h 2 Ψ 11 1 Ψ 12 1 Ψ 21 1 Ψ 22 1 = Ψ 1
From Assumption 4,  x i i = 1 n  is an independent, identically distributed random sequence, so  u i i = 1 n  is an independent, identically distributed sequence.
E Ψ 11 1 = h 1 k u 1 u h f u 1 u d u 1 = k m f u + h m d m = k m f u + h f u m + h 2 f ξ m 2 d m = f u + O h 2
where m = u 1 u h  is a variable substitution, and  ξ  falls between  u 1  and  u .
Similarly, E Ψ 21 1 = E k h u 1 u u 1 u h = h 1 k u 1 u h f u i u 1 u h d u 1 = k m f u + h m m d m = k m f u + h f u m + h 2 f ξ m 2 m d m = O h  and  E Ψ 12 1 = E Ψ 21 1 = O h .
Moreover,  E Ψ 22 1 = E k h u 1 u u 1 u h 2 = h 1 k u 1 u h f u 1 u 1 u h 2 d u 1 = k m f u + h m m 2 d m = k m f u + h f u m + h 2 f ξ m 2 m 2 d m = f u k m m 2 d m + O h 2 .
According to the Khintchine law of large numbers,  Ψ i j 1 P E Ψ i j 1 ,   i = 1 ,   2  and
Ψ 1 P E Ψ 11 1 E Ψ 12 1 E Ψ 21 1 E Ψ 22 1 = f u + O h 2 O h O h f u k m m 2 d m · I n + O h 2
Based on Assumption 6, when  n h 0 , meaning that
Ψ 1 P f u 1 0 0 f u k m m 2 d m · I n .
Proof of Lemma 6.
As  s u ,   β V n t = e 1 M u ,   β κ u ,   β M u ,   β 1 M u ,   β κ u ,   β V n t = e 1 1 n M u ,   β κ u ,   β M u ,   β 1 1 n M u ,   β κ u ,   β V n t , we describe  1 n M u ,   β κ u ,   β V n t = 1 n i = 1 n k h u i u v i t 1 n i = 1 n k h u i u u i u h v i t Ψ 1 2 Ψ 2 2 = Ψ 2 . Based on Assumptions 2 and 4, since  x i i = 1 n  and  v i i = 1 n  are independent,  u i i = 1 n  and  v i i = 1 n  are independent, and,  E Ψ 1 2 = 0 ,   E Ψ 2 2 = 0 . Furthermore,  u i i = 1 n  and  v i i = 1 n  denote an independent, identically distributed sequence, so
V a r Ψ 1 2 = 1 n E k h 2 u i u v i 2 = 1 n E k h 2 u i u E v i 2 = σ 0 2 n E k h 2 u i u = σ 0 2 n h 1 k 2 m f u + h m d m = O n h 1 ; V a r Ψ 2 2 = 1 n E k h 2 u i u u i u h 2 v i 2 = 1 n E k h 2 u i u u i u h 2 E v i 2 = σ 0 2 n E k h 2 u i u u i u h 2 = σ 0 2 n h 1 k 2 m f u + h m m 2 d m = O n h 1 .
Based on Assumption 5,  Ψ 1 2 = O p n h 1 / 2 = o p 1  and  Ψ 2 2 = O p n h 1 / 2 = o p 1 , so  1 n M u ,   β κ u ,   β V n t = O p n h p 1 / 2 = o p 1 . Then, from Lemma 5,  S β V n t = o p 1 . □
Proof of Lemma 7.
As (6),
s u ,   β G n t = e 1 1 n M u ,   β κ u ,   β M u ,   β 1 1 n M u ,   β κ u ,   β G n t , 1 n M u ,   β κ u ,   β G n t = 1 n i = 1 n k h u i u g u i 1 n i = 1 n k h u i u u i u h g u i Ψ 1 3 Ψ 2 3 = Ψ 3 .
Based on Assumptions 2 and 4,
E Ψ 1 3 = E k h u i u g u i = h p k u i u h g u i f u i d u i = k m f u + h m g u + h m d m = k m g u + h g u m + h 2 g ξ 1 m f u + h f x m + h 2 f ξ 2 m 2 d m = g u f m + O h 2 ,   V a r Ψ 1 3 = 1 n V a r k h u i u g u i = 1 n E k h 2 u i u g 2 u i E k h u i u g u i 2 ,  
where ξ i   i = 1 ,   2 falls between u i   and u . And then,
E k h 2 u i u g 2 u i = h 1 k 2 m g u + h m f u + h m d m = O h 1 ,   V a r Ψ 1 3 = 1 n O h 1 + g u f u + O h 2 2 = O n h 1 , Ψ 1 3 = g u f u + O h 2 + O p n h 1 / 2 = g u f u + O h 2 + n h 1 / 2 .
Similarly,
E Ψ 2 3 = E k h u i u u i u h g u i = k m f u + h m g u + h m m d m = k m g u + h g ξ 1 m f u + h f ξ 2 m m d m = O h ,   E Ψ 2 3 2 = 1 n E k h 2 u i u u i u h 2 g 2 u i = n h 1 k 2 m f u + h m g 2 u + h m m 2 d m = O n h 1 .
In that way, V a r Ψ 2 3 = E Ψ 2 3 2 E Ψ 2 3 2 = O h 2 + n h 1 and Ψ 2 3 = O p h + n h 1 / 2 = o p 1 . This suggests that
1 n M u ,   β κ u ,   β G n t = g u f u + O h 2 + n h 1 / 2 O p h + n h 1 / 2 P g u f u 0 .
And then, according to Assumption 4,
s u ,   β G n t = e 1 1 n M u ,   β κ u ,   β M u ,   β 1 1 n M u ,   β κ u ,   β G n t = g u + o p 1 ,   i . e . ,   s u i ,   β G n t = g u i + o p 1 ,   i = 1 ,   2 ,   ,   n S β G n t = s x 1 β G n t ,   s x 2 β G n t ,   ,   s x n β G n t = G n t + o p 1 .   Hence ,   I n S β G n t = o p 1 .

Appendix B. Proof of Theoretical Results

Proof of Theorem 1.
As  E t = 1 T V n t V n t = n T σ 0 2 , at  θ 0 , (10) implies  E l n   L n ,   T θ 0 = n T 2 l n 2 π n T 2 l n σ 0 2 + T   l n A n n T 2 . Then, we have
1 n T E l n   L n ,   T θ 1 n T E l n   L n ,   T θ 0 = 1 2 l n σ 0 2 l n σ 2 + 1 n l n A n ρ 1 n l n A n 1 n T 1 2 σ 2 E t = 1 T V n t ζ V n t ζ 1 2 = T 1 ,   n ρ ,   σ 2 1 2 σ 2 T 2 ,   n ρ ,   δ + o 1 ,   where   T 1 ,   n ρ ,   σ 2 = 1 2 l n σ 0 2 l n σ 2 + 1 n l n A n ρ 1 n l n A n 1 2 σ 2 σ n 2   ρ σ 2   and   T 2 ,   n ρ ,   δ = 1 n T E t = 1 T Z n t δ δ 0 + ρ ρ 0 F n Z n t δ 0 Z n t δ δ 0 + ρ ρ 0 F n Z n t δ 0
We consider the process  Y n t = ρ 0 W n Y n t + V n t , for a period t, the log-likelihood function of which is
l n L p ,   n ρ ,   σ 2 = n 2 l n 2 π σ 2 + l n A n ρ 1 2 σ 2 t = 1 T A n ρ Y n t A n ρ Y n t
By letting  E p ·  be the expectation operator for  Y n t , we have
E p 1 n l n L p ,   n ρ ,   σ 2 E p 1 n l n L p ,   n ρ 0 ,   σ 0 2 = 1 2 l n σ 2 l n σ 0 2 + 1 n l n A n ρ 1 n l n A n 1 2 σ 2 σ n 2   ρ σ 2 = T 1 ,   n ρ ,   σ 2
Based on information inequality,  l n L p ,   n ρ ,   σ 2 l n L p ,   n ρ 0 ,   σ 0 2 0 . Thus,  T 1 ,   n ρ ,   σ 2 0  for any  ρ ,   σ 2 . Also,  T 2 ,   n ρ ,   δ  is a quadratic function of  ρ  and  δ . Under the condition that  E Φ n T  is nonsingular,  T 2 ,   n ρ ,   δ > 0  whenever  ρ ,   δ ρ 0 ,   δ 0 , so  ρ ,   δ  is globally identified, given that  ρ 0 ,   σ 0 2  is a unique maximizer of  T 1 ,   n ρ ,   σ 2 . Hence,  ρ ,   δ ,   σ 2  is globally identified. Combined with uniform convergence and equicontinuity in Lemmas 3 and 4, the consistency follows. □
Proof of Theorem 2.
From the proof of Theorem 1,  1 n T E l n   L n ,   T θ 1 n T E l n   L n ,   T θ 0 = T 1 ,   n ρ ,   σ 2 1 2 σ 2 T 2 ,   n ρ ,   δ + o 1 . When  E Φ n T  is singular,  δ 0  and  ρ 0  cannot be identified from  T 2 ,   n ρ ,   δ  . Global identification requires that the limit of  T 1 ,   n ρ ,   σ 2  is strictly less than zero. As  T 1 ,   n ρ ,   σ 2 0  based on information inequality,  T 1 ,   n ρ ,   σ 2 0  is equivalent to
l i m n 1 n l n σ 0 2 A n 1 A n 1 1 n l n σ n 2   ρ A n 1 ρ A n 1 ρ 0
(see Lee (2004)). After  ρ 0  and  σ 0 2  are identified, given  ρ 0 δ 0  can be identified from  T 2 ,   n ρ ,   δ . Combined with uniform convergence and equicontinuity in Lemmas 3 and 4, the consistency follows. □
Proof of Theorem 3.
As (6), we have  G ^ n t = S β ^ Y ^ t = S β ^ Y n t ρ ^ W n Y n t Z n t δ ^ = S β ^ Y n t ρ ^ W n Y n t + ρ 0 W n Y n t ρ 0 W n Y n t Z n t δ ^ + Z n t δ 0 Z n t δ 0 = S β ^ Y n t ρ 0 W n Y n t Z n t δ 0 ρ ^ ρ 0 W n Y n t Z n t δ ^ δ 0 = S β ^ G n t + V n t ρ ^ ρ 0 W n Y n t Z n t δ ^ δ 0 , so that,  G ^ n t G n t = S β ^ V n t I n S β ^ G n t ρ ^ ρ 0 S β ^ W n Y n t + S β ^ Z n t δ ^ δ 0 .
From Theorems 1 and 2,  ρ ^ ρ 0 = o p 1 ,   δ ^ δ 0 = o p 1 . And from Lemmas 6 and 7,  I n S β ^ G n t = o p 1 ,   S β ^ V n t = o p 1 . Otherwise determined by the boundedness in the assumptions,  G ^ n t G n t = o p 1 . □

References

  1. Ai, C.; Chen, X. Efficient estimation of models with conditional moment restrictions containing unknown functions. Econometrica 2003, 71, 1795–1843. [Google Scholar] [CrossRef]
  2. Baltagi, B.H.; Song, S.H.; Koh, W. Testing panel data regression models with spatial error correlation. Econometrics 2003, 117, 123–150. [Google Scholar] [CrossRef]
  3. Chen, J.; Gao, J.T.; Li, D.G. Estimation in partially linear single-index panel data models with fixed effects. J. Bus. Econ. Stat. 2013, 31, 315–330. [Google Scholar] [CrossRef] [Green Version]
  4. Elhorst, J.P. Specification and estimation of spatial panel data models. Int. Reg. Sci. Rev. 2003, 26, 244–268. [Google Scholar] [CrossRef]
  5. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016. [Google Scholar]
  6. Jin, B.; Wu, Y.; Rao, C.R.; Hou, L. Estimation and model selection in general spatial dynamic panel data models. Proc. Natl. Acad. Sci. USA 2020, 117, 5235–5241. [Google Scholar] [CrossRef] [PubMed]
  7. Lee, L.F.; Yu, J. Estimation of spatial panel model with fixed effects. Econometrics 2010, 154, 165–185. [Google Scholar] [CrossRef]
  8. Lee, L.F.; Yu, J. A spatial dynamic panel data model with both time and individual fixed effects. Econom. Theory 2010, 26, 564–597. [Google Scholar] [CrossRef] [Green Version]
  9. Lee, L.F.; Yu, J. Some recent developments in spatial panel data models. Reg. Sci. Urban Econ. 2010, 40, 255–271. [Google Scholar] [CrossRef]
  10. Pang, Z.; Xue, L.G. Estimation for the single-index models with random effects. Comput. Stat. Data Anal. 2012, 56, 1837–1853. [Google Scholar] [CrossRef]
  11. Parent, O.; LeSage, J.P. A space-time filter for panel data models containing random effects. Comput. Stat. Data Anal. 2011, 55, 475–490. [Google Scholar] [CrossRef]
  12. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
  13. Su, L.; Yang, Z. QML Estimation of Dynamic Panel Data Models with Spatial Errors; Singapore Management University: Singapore, 2007. [Google Scholar]
  14. Su, L.; Jin, S.N. Profile quasi-maximum likelihood estimation of partially linear spatial autoregressive models. J. Econom. 2010, 157, 18–33. [Google Scholar] [CrossRef]
  15. Wang, J.L.; Xue, L.G.; Zhu, L.X.; Chong, Y.S. Estimation for a partial-linear single-index model. Ann. Stat. 2010, 38, 246–274. [Google Scholar] [CrossRef]
  16. Yu, J.; de Jong, R.; Lee, L.F. Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both n and T are large. J. Econom. 2008, 146, 118–134. [Google Scholar] [CrossRef] [Green Version]
  17. Yu, J.; Lee, J.F. Efficient GMM estimation of spatial dynamic panel data models. J. Econom. 2010, 180, 174–197. [Google Scholar]
  18. Yu, J.; Lee, J.F. Estimation of unit root spatial dynamic panel data models. Econom. Theory 2010, 26, 1332–1362. [Google Scholar] [CrossRef]
  19. Yu, J.; de Jong, R.; Lee, L.F. Estimation for spatial dynamic panel data with fixed effects: The case of spatial cointegration. J. Econom. 2012, 167, 16–37. [Google Scholar] [CrossRef]
  20. Zhang, Y.Q.; Shen, D.M. Estimation of semiparametric varying-coefficient spatial panel data models with random effects. J. Stat. Plan. Inference 2015, 159, 64–80. [Google Scholar] [CrossRef]
Table 1. The performance of spatial coefficients estimators with θ 1 β .
Table 1. The performance of spatial coefficients estimators with θ 1 β .
n T = 4 T = 10
RMSEMeansVarsCPRMSEMeansVarsCP
10 β 1 0.03370.5620.00740.820.03230.5840.00660.84
β 2 0.03410.8320.00720.810.03190.7940.00690.85
γ 0.0282−0.3190.00580.840.0254−0.3130.00520.86
ρ 0.03170.4190.00640.820.03010.4160.00540.84
σ 2 0.10881.0210.00690.850.08671.0190.00620.88
τ 0.0246−0.2240.00570.840.0238−0.2180.00510.87
49 β 1 0.03120.5860.00680.850.02980.5710.00520.88
β 2 0.03250.8280.00670.830.02960.8090.00530.89
γ 0.0234−0.3150.00540.860.0222−0.2880.00490.89
ρ 0.02410.4130.00570.840.02250.3890.00480.88
σ 2 0.08340.9880.00590.870.06761.0080.00540.90
τ 0.0235−0.1790.00530.870.0226−0.2110.00470.91
100 β 1 0.02980.5690.00530.870.02820.5720.00450.90
β 2 0.03020.8070.00550.870.02830.8110.00490.91
γ 0.0217−0.3090.00490.880.0205−0.2970.00430.92
ρ 0.02250.3910.00470.890.02080.4050.00390.91
σ 2 0.06491.0090.00480.900.05210.9940.00410.93
τ 0.0221−0.1850.00470.890.0213−0.1920.00420.92
Table 2. The performance of spatial coefficients estimators with θ 2 β .
Table 2. The performance of spatial coefficients estimators with θ 2 β .
n T = 4T = 10
RMSEMeansVarsCPRMSEMeansVarsCP
10 β 1 0.04060.5640.00880.790.03950.5690.00770.82
β 2 0.05120.8290.00910.810.04940.8060.00860.83
γ 0.03320.2260.00760.830.03180.2340.00690.85
ρ 0.0343−0.6200.00810.820.0335−0.6170.00770.83
σ 2 0.10390.5310.00840.820.10210.5260.00780.84
τ 0.03420.1220.00780.840.03290.1150.00710.86
49 β 1 0.03720.5860.00710.820.03680.5710.00560.85
β 2 0.04330.8020.00740.840.04190.8120.00630.85
γ 0.03250.2390.00680.870.02990.2580.00570.90
ρ 0.0301−0.6120.00680.840.0298−0.5920.00550.88
σ 2 0.09140.5180.00690.850.08670.4890.00620.90
τ 0.03290.1140.00640.880.03140.0890.00540.92
100 β 1 0.03550.5810.00610.840.03160.5730.00560.89
β 2 0.03670.8130.00570.870.03450.8100.00510.91
γ 0.03070.2430.00520.900.02810.2530.00490.92
ρ 0.0281−0.5920.00530.870.0282−0.6040.00460.90
σ 2 0.08230.5120.00610.890.06060.4940.00570.92
τ 0.02980.0920.00580.910.02890.1050.00510.94
Table 3. The performance of unknown function estimators G ^ n t .
Table 3. The performance of unknown function estimators G ^ n t .
n θ 0 a = 0.3 ,   0.4 ,   1 , 0.2 θ 0 b = 0.25 , 0.6 ,   0.5 ,   0.1
T = 4T = 10T = 4T = 10
MADE Std.dev MADE Std.dev MADE Std.dev MADE Std.dev
100.04060.02160.03440.01870.03970.01870.03210.0145
490.03140.01250.02290.01120.02520.01380.01760.0098
1000.01980.00920.01060.00760.01420.00960.00940.0071
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, M.; Tian, B. Profile Maximum Likelihood Estimation of Single-Index Spatial Dynamic Panel Data Model. Mathematics 2023, 11, 2947. https://doi.org/10.3390/math11132947

AMA Style

Zhang M, Tian B. Profile Maximum Likelihood Estimation of Single-Index Spatial Dynamic Panel Data Model. Mathematics. 2023; 11(13):2947. https://doi.org/10.3390/math11132947

Chicago/Turabian Style

Zhang, Mengqi, and Boping Tian. 2023. "Profile Maximum Likelihood Estimation of Single-Index Spatial Dynamic Panel Data Model" Mathematics 11, no. 13: 2947. https://doi.org/10.3390/math11132947

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop