A New Overdispersed Integer-Valued Moving Average Model with Dependent Counting Series

Yu, Kaizhi; Wang, Huiqiao

doi:10.3390/e23060706

Open AccessArticle

A New Overdispersed Integer-Valued Moving Average Model with Dependent Counting Series

by

Kaizhi Yu

and

Huiqiao Wang

^*

School of Statistics, Southwestern University of Finance and Economics, Chengdu 611130, China

^*

Author to whom correspondence should be addressed.

Entropy 2021, 23(6), 706; https://doi.org/10.3390/e23060706

Submission received: 30 April 2021 / Revised: 28 May 2021 / Accepted: 31 May 2021 / Published: 2 June 2021

(This article belongs to the Special Issue Time Series Modelling)

Download

Browse Figures

Versions Notes

Abstract

:

A new integer-valued moving average model is introduced. The assumption of independent counting series in the model is relaxed to allow dependence between them, leading to the overdispersion in the model. Statistical properties were established for this new integer-valued moving average model with dependent counting series. The Yule–Walker method was applied to estimate the model parameters. The estimator’s performance was evaluated using simulations, and the overdispersion test of the INMA(1) process was applied to examine the dependence between counting series.

Keywords:

integer-valued moving average model; counting series; dispersion test

1. Introduction

Integer-valued time series can be encountered in numerous fields, such as epidemiology, insurance, and intraday stock transitions. The most widely used model is the integer-valued autoregressive (INAR) model, a recursive model first introduced by Alzaid and Al-Osh [1] and is similar to the traditional autoregressive (AR) model. Du and Li [2] generalized the model to the p-th order (which was called INAR(p) model) and proved the ergodic and Markov properties of the model. Similar to the continuous-valued moving average model, the q-th order integer-valued moving average model INMA(q) was introduced by Al-Osh and Alzaid [3], which is a slightly different form proposed by McKenzie [4].

Many researchers generalize the INAR model to deal with different real-life situations. Weiß [5] presented a new INAR(p) model showing possible marginal distributions of the DSD family. This model overcomes the difficulty of choosing the appropriate marginal distribution. Monteiro and Scotto [6] defined the periodic integer-valued autoregressive model, driven by a periodic sequence of independent Poisson-distributed random variables. Weiß [7] proposed the extended Poisson INAR(1) model, where the innovations are assumed to be serially dependent. Zhu [8] introduced a negative binomial INGARCH model to handle integer-valued time series with overdispersion and potential extreme observations. The study by Weiß [9] discussed threshold models for integer-valued time series with infinite range and briefly discussed new models for counting data time series with a finite range. Kang and Wang [10] generalized the mixture INAR(1) model based on mixing Pegram and binomial thinning operator. Li and Wang [11] proposed the first-order mixed integer-valued autoregressive process with zero-inflated generalized power series innovations, which contains the commonly used zero-inflated Poisson and geometric distributions. To handle the non-stationary integer-valued time series with a large dispersion, Kim and Park [12] introduced an integer-valued autoregressive process with a signed binomial thinning operator (INARS(p)).

Various modified thinning operators have been proposed to capture the specificity of real data, and many new INAR-type models have been defined. For example, Zheng and Basawa [13] introduced the random coefficient thinning operator while Ristić and Bakouch [14] proposed the negative binomial thinning operator. The pth-order integer-valued autoregressive process with signed generalized power series thinning operator was proposed by Zhang et al. [15].

The most significant generalization of the thinning operator was made by Ristić et al. [16] which they called the dependent Bernoulli thinning operator. They constructed the new sequence of the Bernoulli random variable allowing correlation between counting series. Based on this, Miletić Ilić et al. [17] proposed the new model, based on the mix of regular binomial thinning and dependent thinning operator. For more details of thinning operators, refer to Weiß [18].

MA-type models are very important in time series analysis. The method of moving average is generally popular in statistical and mathematical analyses. Some researchers study forecasting using the moving average method. Winters [19] analyzed the exponentially weighted moving average method for forecasting sales. Cox [20] proposed the weighted moving average method to predict the Markov series. Landauskas et al. [21] introduced an algebraic approach to select the appropriate weight coefficients for weight moving averages performing better than classical moving average predictors. Wind speed prediction using combined time series model and neural network prediction was studied by Nan et al. [22]. Other applied weight moving average methods in control charts. Alevizakos et al. [23] proposed the triple exponentially weighted moving average control chart (TEWMA), which improves the detection ability of the classical control chart. Capizzi and Masarotto [24] proposed the adaptive exponentially weighted moving average control chart (AEWMA), which weights past observations of the monitored process using a suitable function of the current error. Adegoke et al. [25] studied the multivariate homogeneously weighted moving average (MHWMA) control chart for monitoring a process mean vector.

Some researchers constructed MA-type models from the perspective of count time series. Brännäs and Quoreshi [26] used the INMA process to model the number of transactions for intraday stocks and extended the model to include explanatory variables. Brännäs and Hall [27] mainly focused on the estimation in the INMA model. The construction of the thinning operation from these studies is based on independent counting series, which is a strong assumption. Thus, to make the model more flexible in capturing the specificity of different data types, this assumption should be relaxed.

For small counts of intraday transactions in stocks per minute, the decision of buyer or seller could be affected by the public news, which means decisions from different individuals may not be uncorrelated. The change in inventories can sometimes be described as an INMA process. However, during a certain period, the change can be influenced by the same external factor. For the INMA model applying a discrete risk model, the number of claims for certain insurance will be affected by the same factor, such as natural disasters. Thus, these claims are no longer independent. The independence between counting series should be relaxed. Allowing the correlation between counting series is natural, and therefore the INMA model based on dependent counting series can be derived to handle different real data situations.

The rest paper is organized as follows. Section 2 presents the model construction and discussions on some relevant statistical properties, and Section 3 discussed the estimation of unknown parameters. Section 4 shows the numerical simulation results and give dispersion test for dependence between counting series, while the conclusions are given in Section 5.

2. The Model and Basic Properties

2.1. The Model Construction

The counting series

{U}_{i \in N}

of the integer-valued model is defined as:

\begin{matrix} U_{i} = (1 - V_{i}) W_{i} + V_{i} Z \end{matrix}

{W}_{i \in N}

is a sequence of

i . i . d

random variable with

W_{i} \sim B (1, β)

,

β \in [0, 1]

.

{V}_{i \in N}

is a sequence of

i . i . d

random variable with

V_{i} \sim B (1, θ)

,

θ \in [0, 1]

. Z is a random variable with

Z \sim B (1, β)

. The operator

\circ_{θ}

is defined by

β \circ_{θ} ε_{t} = \sum_{i = 1}^{ε_{t}} U_{i}

, it is the dependent Bernoulli thinning operator, where

ε_{t}

is a non-negative integer-valued random variable. Based on this construction, we can easily verify that

E (U_{i}) = β

,

V a r (U_{i}) = β (1 - β)

,

c o r r (U_{i}, U_{j}) = θ^{2}

, which has promising dependence between the counting series. Now we generalize the dependent count series to the INMA(q) process, For convenience, we use ∘ instead of

\circ_{θ}

to simplify the notation.

Definition 1

(Dependent Counting series Integer-valued Moving Average Model (DCINMA)). The DCINMA(q) model is defined as:

X_{t} = ε_{t} + β_{1} \circ ε_{t - 1} + \dots + β_{q} \circ ε_{t - q}

β_{j} \circ ε_{t - j}, j = 1, 2, \dots, q

is the dependent Bernoulli thinning operator, and the following conditions should be satisfied.

A1.

{ε_{t}}_{t \in N}

is a sequence of i. i. d non-negative random variables.

A2. The counting variable

{U_{i}}_{i \in N}

is independent of

ε_{t}

for any

i, t

.

A3.

β_{j} \circ ε_{t - j}

, for any

j = 1, 2, \dots, q

are mutually independent.

2.2. The Numerical Properties for DCINMA(q) Model

We denote

μ_{ε}

and

σ_{ε}^{2}

as the mean and variance of term

ε_{t}

.

Theorem 1.

The numerical characteristics of

{X_{t}}

in Definition 1 are as follows:

(i): $E (X_{t}) = μ_{ε} (1 + \sum_{i = 1}^{q} β_{i})$
(ii): $V a r (X_{t}) = σ_{ε}^{2} + \sum_{i = 1}^{q} [μ_{ε} β_{i} + μ_{ε}^{2} θ^{2} β_{i} (1 - β_{i}) - μ_{ε} (θ^{2} β_{i} - θ^{2} β_{i}^{2} + β_{i}^{2}) + β_{i}^{2} σ_{ε}^{2}]$
(iii): $c o v (X_{t}, X_{t - k}) = \{\begin{matrix} σ_{ε}^{2} \sum_{i = 0}^{q - k} β_{i} β_{i + k} & k = 1, \dots, q \\ 0 & k \geq q + 1 \end{matrix}$

Proof.

See Appendix A. ☐

Theorem 2.

{X_{t}}

is the process defined in Definition 1, then

{X_{t}}

is a covariance stationary process.

Proof.

It can be seen from the Theorem 1 that the unconditional mean and unconditional variance of

X_{t}

is a finite constant given the distribution of

ε_{t}

. Thus,

X_{t}

is a stationary process. ☐

Theorem 3.

{X_{t}}

is the process defined in Definition 1, then

{X_{t}}

is ergodic in mean and autocovariance function.

Proof.

See Appendix A. ☐

2.3. The Probability Generating Functions for DCINMA Model

Ristić et al. [16] derived the probability generating function of the

\sum_{i = 1}^{n} U_{i}

as follows:

\begin{matrix} Φ_{U} & = E [s^{(U_{1} + U_{2} + \dots + U_{n})}] \\ = (1 - β) {(1 - β (1 - θ) (1 - s))}^{n} + β {(1 - (β + θ - β θ) (1 - s))}^{n} \end{matrix}

The above equation implies that the term

\sum_{i = 1}^{n} U_{i}

has a distribution of:

U_{1} + U_{2} + \dots + U_{n} = \{\begin{matrix} B i n (n, β (1 - θ)) & w . p 1 - β \\ B i n (n, β + θ - β θ) & w . p β \end{matrix}

Then the probability generating function (PGF) of

{X_{t}}

is:

ϕ_{X_{n}} (s) = [(1 - β) \cdot ϕ_{ε} (1 - β (1 - θ) (1 - s)) + β \cdot ϕ_{ε} (1 - (β + θ - β θ) (1 - s))] \cdot ϕ_{ε} (s)

ϕ_{ε} (s)

is the probability generating function of the

ε_{t}

. Given the distribution of

ε_{t}

, the explicit expression of the probability generating function can be derived. Suppose Poisson distribution of

ε_{t}

, then

ϕ_{X_{n}} (s) = [(1 - β) \cdot e^{- λ β (1 - θ) (1 - s)} + β e^{- λ (β + θ - β θ) (1 - s)}] \cdot e^{(λ (s - 1))}

The probability generating function is defined by the probabilities. The uniqueness of a power series expansion implies that the probability generating function in turn defines probabilities.

Therefore, we can derive the probability of

{X_{t}}

. For

X_{t} = j

, the probability mass function of

X_{t}

is:

P (X_{t} = j) = {[\frac{1}{j!} \frac{d^{j} ϕ (s)}{d s^{j}}]}_{s = 0}

The bivariate probability generating function of

{X_{t}}

is

Φ_{X_{t}, X_{t - 1}} (s_{1}, s_{2})

. Thus, deriving the explicit expression of bivariate probability generating function with Poisson innovation is easy for DCINMA(1) process.

\begin{matrix} E (s_{1}^{X_{t}} s_{2}^{X_{t - 1}}) & = E (s_{1}^{β \circ ε_{t - 1} + ε_{t}} \cdot s_{2}^{β \circ ε_{t - 2} + ε_{t - 1}}) \\ = E (s_{1}^{β \circ ε_{t - 1}} \cdot s_{1}^{ε_{t}} \cdot s_{2}^{β \circ ε_{t - 2}} \cdot s_{2}^{ε_{t - 1}}) \\ = E (s_{1}^{ε_{t}}) \cdot E (s_{1}^{β \circ ε_{t - 2}}) \cdot E (s_{1}^{β \circ ε_{t - 2}} \cdot s_{2}^{ε_{t - 1}}) \end{matrix}

Given the Poisson distribution of the innovation term, we can obtain:

E (s_{1}^{ε_{t}}) = e^{λ (s_{1} - 1)}, E (s_{2}^{β \circ ε_{t - 1}}) = (1 - β) \cdot e^{- λ β (1 - θ) (1 - s_{2})} + β \cdot e^{- λ (β + θ - β θ) (1 - s_{2})}

E (s_{1}^{β \circ ε_{t - 1}} \cdot s_{2}^{ε_{t - 1}}) = (1 - β) \cdot e^{λ s_{2} [1 - β (1 - θ) (1 - s_{2})]} + β \cdot e^{λ s_{2} [(1 - β - θ - β θ) (1 - s_{1})]} .

2.4. Compare with the INMA(q) Model

The mean and the covariance of the q-th order integer-valued moving average model has the same expression, and the variance of the INMA(q) process is:

V_{i n m a} = σ_{ε}^{2} + \sum_{i = 1}^{q} [σ_{ε}^{2} β_{i} + μ_{ε} β_{i} (1 - β_{i})]

From Theorem 1, the variance of the DCINMA process presents a more complicated expression than the INMA(q) process due to the correlation between the counting series of parameter

θ

. For the Poisson innovation, the overdispersion index of the INMA and DCINMA model is as follows:

I_{i n m a} = 1, I_{d i n m a} = \frac{1 + β + β λ θ^{2} (1 - β)}{1 + β}

Since the value of

λ

,

β

and

θ

are all non-negative, the term

1 + β λ θ^{2} (1 - β) > 1

. When two models (INMA(1) and DCINMA(1)) share the same

λ

and

β

, the

θ

will determine whether there is dependence between counting series (

θ \neq 0

). Thus, if we want to test

θ = 0

, it is equivalent to evaluating whether the model is overdispersed. If the value of

θ

is 0, the model degenerates to INMA model.

2.5. Compare the Entropy with INMA(1) Model

Entropy is an important concept in physics, but it can also be applied to other disciplines, including cosmology and economics. Entropy is a measure of the randomness or disorder of a system. In our case, entropy can be seen as a dispersion measure for the model. Thus, we evaluate the model from the perspective of entropy. The definition of Shannon entropy as follows:

H (Y) = - \sum_{i = 1}^{n} p (y_{i}) \cdot l n p (y_{i})

where Y is a discrete random variable with probability mass function taking values on

y_{1}, \dots, y_{n}

. We denote the

ϕ_{d i n m a} (s)

and

ϕ_{i n m a} (s)

as probability generating functions of the DCINMA(1) and INMA(1) process, respectively. Suppose the same innovation term for the two models follows the Poisson distribution. We can rewrite the probability generating function of them as:

\begin{matrix} ϕ_{d i n m a} (s) & = (1 - β) \cdot e^{(s - 1) \cdot [λ β (1 - θ) + λ]} + β \cdot e^{(s - 1) \cdot [λ (β + θ - β θ + λ)]} \\ ϕ_{i n m a} (s) & = e^{(s - 1) \cdot (λ β + λ)} \end{matrix}

Thus, we can conclude the distribution of DCINMA(1) and INMA(1) process based on the definition of the probability generating function.

X_{t}^{i n m a (1)}

and

X_{t}^{d i n m a (1)}

denote the sample from INMA(1) model and DCINMA(1) model.

\begin{matrix} X_{t}^{i n m a (1)} & \sim P o i (λ β + λ) \\ X_{t}^{d i n m a (1)} & \sim \{\begin{matrix} P o i (λ β (1 - θ) + λ) & w . p 1 - β \\ P o i (λ (β + θ - β θ) + λ) & w . p β \end{matrix} \end{matrix}

The Shannon entropy for both models can be derived as follows:

\begin{matrix} H (X_{t}^{i n m a (1)}) & = (λ β + λ) [1 - l o g ((λ β + λ))] + e^{(λ β + λ)} \cdot \sum_{k = 0}^{\infty} \frac{{(λ β + λ)}^{k} l o g (k!)}{k!} \\ H (X_{t}^{d i n m a (1)}) & = (1 - β) \cdot {(λ β (1 - θ) + λ) [1 - l o g ((λ β (1 - θ) + λ))] \\ + e^{(λ β (1 - θ) + λ)} \cdot \sum_{k = 0}^{\infty} \frac{{(λ β (1 - θ) + λ)}^{k} l o g (k!)}{k!}} \\ + β \cdot {(λ (β + θ - β θ) + λ) [1 - l o g ((λ (β + θ - β θ) + λ))] \\ + e^{(λ (β + θ - β θ) + λ)} \cdot \sum_{k = 0}^{\infty} \frac{{(λ (β + θ - β θ) + λ)}^{k} l o g (k!)}{k!}} \end{matrix}

The expression of entropy for the DCINMA(1) model is more complicated than the INMA(1) model due to the additional parameter

θ

.

3. Parameter Estimation

The estimation of the INMA model is complicated. Brännäs and Hall [27] discussed the Yule–Walker estimator, generalized moment method (GMM) based on the probability generating function (PGF) function, and the conditional least square method. Here, we did not attempt to use maximum likelihood estimation, which requires density functions that are generally not easily obtained in the INMA model, especially for this dependent situation. The results of generalized moments method-based probability generating function (PGF) function estimator are highly correlated with the values of

z_{1}

and

z_{2}

in

Φ_{k} (z_{1}, z_{2})

, which are not stable. On the other hand, for the conditional least square method, the number of estimation equations are less than the number of parameters. This means that there is an additional parameter

θ

to be estimated.

Therefore, we derive the Yule–Walker estimator to obtain the unknown parameters for the Poisson DCINMA(q) model. We denote the following symbols:

μ_{X}

is the sample mean of

{X_{t}}

,

μ_{ε}

is the mean of innovation term

ε_{t}

.

γ_{0}

is the sample variance of

{X_{t}}

,

γ_{1}, γ_{2}, \dots, γ_{q}

are the sample covariance of 1-th, 2-th,…, q-th order. The

β_{0}

for the equations below is 1.

Then, the unknown parameters in the DCINMA(q) model can be solved by equations through the sample moments function, which is as follows:

\begin{matrix} \{\begin{matrix} μ_{X} = μ_{ε} (1 + β_{1} + β_{2} + \dots + β_{q}) \\ γ_{0} = σ_{ε}^{2} + \sum_{i = 1}^{q} [μ_{ε} β_{i} + μ_{ε}^{2} θ^{2} β_{i} (1 - β_{i}) - μ_{ε} (θ^{2} β_{i} - θ^{2} β_{i}^{2} + β_{i}^{2}) + β_{i}^{2} σ_{ε}^{2}] \\ γ_{1} = λ (β_{0} β_{1} + β_{1} β_{2} + \dots + β_{q - 1} β_{q}) \\ γ_{2} = λ (β_{0} β_{2} + β_{1} β_{3} + \dots + β_{q - 2} β_{q}) \\ ⋮ \\ γ_{q - 1} = λ (β_{0} β_{q - 1} + β_{1} β_{q}) \\ γ_{q} = λ (β_{0} β_{q}) \end{matrix} \end{matrix}

4. Simulation Study

4.1. Estimation of the Model Parameters

In this section, we present some simulation results to show the performance of the estimator using different sample sizes. (

n = 100, 300, 700, 1000

). We focused on the Poisson DCINMA(1) model. The three group parameter values considered in this model are as follows:

\begin{matrix} Model A : (λ = 1, θ = 0.6, β = 0.2) \\ Model B : (λ = 4, θ = 0.7, β = 0.1) \\ Model C : (λ = 5, θ = 0.5, β = 0.1) \end{matrix}

We used the above parameter groups to generate the data and applied Yule–Walker method, then computed the bias and standard error based on 10,000 replications for each parameter group. The estimation results and their performance are reported in Table 1.

As shown in Table 1, we obtained nearly convergent estimators in all cases. In all cases, as the sample size increased, the bias decreased.

4.2. Testing for Dependence between Counting Series

From Section 2.5, when the two processes have the same

λ

and

β

, the DCINMA model always presents overdispersion due to

θ

. We tested whether the value of

θ

is 0, which is equivalent to assessing whether the DCINMA process presents overdispersion. Aleksandrov and Weiß [28] proposed the diagnostic test for the INMA process. In this section, only the overdispersion test was applied. Under the null hypothesis

H_{0} : θ = 0

(the process does not present overdispersion), the distribution of index dispersion has the following form:

{\hat{I}}_{d i s p} \overset{d}{⟶} N (1 - \frac{1}{T} \frac{1 + 3 β}{1 + β}, \frac{1}{T} (2 + 4 {(\frac{β}{1 + β})}^{2}))

We then analyzed the simulation results to assess the performance of the overdispersion test. The nominal level

α = 0.05

was employed, for sample sizes,

n = 100, 150, 250

. We did 1000 replications to calculate the size and power of the test. The following parameter groups were considered:

\begin{matrix} Model D : (λ = 3, θ = 0.4, β = 0.5) \\ Model E : (λ = 5, θ = 0.4, β = 0.3) \\ Model F : (λ = 6, θ = 0.3, β = 0.4) \\ Model G : (λ = 7, θ = 0.7, β = 0.8) \end{matrix}

From Table 2, given the same values for

λ

and

β

, under the null hypothesis,

θ = 0

, the size of

{\hat{I}}_{d i s p}

are close to nominal level. Under the alternative situation,

θ \neq 0

, for all cases, as n increase the power quickly increases to 1.

5. Real Data Example

We then applied the proposed model to a real dataset. Unlike the integer-valued autoregressive model, since the density function of this DCINMA process is hard to obtain, the Akaike information criterion (AIC) is difficult to use in measuring the fitness of such an INMA-type model. To assess the performance of the model, we adapted the parametric resampling method by Jung et al. [29].

We used crime data available from the Forecasting Principles site. The data consist of 144 observations of monthly larceny counts for the City of Pittsburgh from January of 1990 to December of 2001. It would be highly improbable for criminals to remain in the same place for a long time. They probably would flee in various directions to commit offenses, so that the INMA-type model is appropriate in this case.

The calculated of sample mean

(4.73)

and variance

(6.40)

suggest that the dataset is overdispersed. In addition, the sample autocorrelation function (ACF) in Figure 1 shows that the order of the model should be set to one. We fitted the data set into the DCINMA(1) model and INMA(1) model to evaluate the model performance for overdispersed data. The estimation results for the two models are presented in Table 3.

The parametric resampling method can be performed in several steps:

Step 1: Generate samples with length equal to the original dataset from the fitted model for R times.

Step 2: Compute the autocorrelation function (ACF) for each sample series. Then derive the empirical sample autocorrelation function (ACF) for the different lag orders.

Step 3: Using the results from Step 2, compute the

100 (1 - α / 2)

and

100 (α / 2)

quantiles.

For the given example, R was set to 5000, and

α

was set to

0.05

. We plotted the acceptance envelope for DCINMA(1) model and INMA(1) model, and the results are shown in Figure 2 and Figure 3. The sample autocorrelations for the real dataset lie within the acceptance envelopes of the DCINMA(1) model except for only one point, while the important second order of the autocorrelation function (ACF) exceed the upper bound of the acceptance envelopes of the INMA(1) model. It is clear that the acceptance envelopes of the DCINMA(1) model performs better than the acceptance envelopes of the INMA(1) model. Thus, the proposed model is suitable for this dataset, adequately representing the autocorrelation for real dataset.

6. Conclusions

In this paper, we constructed a new integer-valued moving average model with dependent counting series. The statistical properties of the proposed model were discussed and evaluated. The parameter estimation of the proposed model is based on the Yule–Walker method. The new model presents overdispersion due to the dependence parameter

θ

, which means the dependence between counting series can be verified by an overdispersion test. Numerical simulation results were used to evaluate the performance of the estimation and overdispersion test.

Future extensions for this study are as follows. First, we only focused on the stationary DCINMA(1) model in this paper. However, cases with non-stationary series are more common. The switched system is an important model in studying hybrid systems, particularly from the perspective of control science and engineering. Refer to ([30,31,32]) for more detailed discussion. The mechanism conducts the transformation between different subsystems, providing an approach to generalize the DCINMA process into a non-stationary case. A function can be introduced to control the change for different parameter values of the stationary case, which characterize non-stationary in series. Second, the parameter

θ

in our model provides a probability for overdispersion. In a switched system, the switching signal is a piecewise constant function, depending on time, external signal, output, and its own past value. Thus, the weighted switching signal function with weights

θ

and

(1 - θ)

can be considered, where the weight

θ

gives the probability for switching to different subsystems.

Author Contributions

Conceptualization, K.Y. and H.W.; methodology, K.Y.; software, H.W.; validation, K.Y. and H.W.; formal analysis, K.Y.; investigation, H.W.; resources, K.Y.; data curation, K.Y.; writing—original draft preparation, H.W.; writing—review and editing, K.Y.; visualization, H.W.; supervision, K.Y.; project administration, K.Y.; funding acquisition, K.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data used to support the findings of this study are included within the article.

Acknowledgments

This research was supported by the National Social Science Fund of China (No. 18BTJ039) and Fundamental Research Funds for the Central Universities (No. JBK2102021). In addition, the authors will be grateful for the editor and reviewers’ comments and suggestions, which led to improvements of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Theorem 1.

we give the detailed derivation process for 1-th order, the q-th order can be derived in the same manner. term

(i)

and

(i i)

of the Theorem 1 are easy to verify, so we focus on the term

(i i i)

.

\begin{matrix} V a r (X_{t}) & = V a r (β \circ ε_{t - 1}) + V a r (ε_{t}) \\ = V [E [β \circ ε_{t - 1} | ε_{t - 1}]] + E [V [β \circ ε_{t - 1} | ε_{t - 1}]] + V a r (ε_{t}) \end{matrix}

Then, we focus on the term

V [β \circ ε_{t - 1} | ε_{t - 1}]

.

\begin{matrix} V [β \circ ε_{t - 1} | ε_{t - 1}] & = E [{(\sum_{i = 1}^{ε_{t - 1}} U_{i})}^{2} | ε_{t - 1}] - {[E [\sum_{i = 1}^{ε_{t - 1}} U_{i} | ε_{t - 1}]]}^{2} \\ = ε_{t - 1} \cdot E [U_{1}^{2}] + (ε_{t - 1} - 1) \cdot ε_{t - 1} \cdot E [U_{1} \cdot U_{2}] - {[ε_{t - 1} \cdot β]}^{2} \\ = ε_{t - 1} \cdot (β \cdot (1 - β) + β^{2}) + ε_{t - 1} \cdot (ε_{t - 1} - 1) \cdot (θ^{2} \cdot β \cdot (1 - β) + β^{2}) \\ - β^{2} \cdot ε_{t - 1}^{2} \end{matrix}

\begin{matrix} E [V [β \circ ε_{t - 1} | ε_{t - 1}]] & = (β \cdot (1 - β) + β^{2}) \cdot E [ε_{t - 1}] - E [β^{2} \cdot ε_{t - 1}^{2}] \\ + E [ε_{t - 1} \cdot (ε_{t - 1} - 1)] \cdot (θ^{2} \cdot β \cdot (1 - β) + β^{2}) \\ = λ \cdot (β \cdot (1 - β) - β^{2}) - β^{2} \cdot (λ + λ^{2}) \\ + (λ + λ^{2} - λ) \cdot (θ^{2} \cdot β \cdot (1 - β) + β^{2}) \\ = λ^{2} \cdot β \cdot (1 - β) \cdot θ^{2} + β \cdot λ - β^{2} \cdot λ \end{matrix}

\begin{matrix} V a r (X_{t}) & = V [β \cdot ε_{t - 1}] λ^{2} \cdot β \cdot (1 - β) \cdot θ^{2} + β \cdot λ - β^{2} \cdot λ + λ \\ = β \cdot λ + λ^{2} \cdot β \cdot (1 - β) \cdot θ^{2} + λ \end{matrix}

☐

Proof of Theorem 3.

let

Y_{t} = X_{t} - μ_{X}

,

{X_{t}}

is the process defined in Section 2. To prove the

Y_{t}

is ergodic in autocovariance function, it is sufficient to show that:

\begin{matrix} l i m_{T \to \infty} \frac{1}{T} \sum_{l = 0}^{T} {E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) - R^{2} (v)} = 0 \end{matrix}

It is obvious

E (Y_{t}^{i}) < \infty

. For

i = 1, 2, 3, 4

, let

E (Y_{t}^{2}) = c_{2}, E (Y_{t}^{4}) = c_{4}

.

The case $v = 0$ :

\begin{matrix} E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) = E (Y_{t}^{2} Y_{t + l}^{2}) \end{matrix}

If

l = 1

,

\begin{matrix} E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) = E (Y_{t}^{2} Y_{t + l}^{2}) \leq (E (| Y_{t}^{2} |^{2} | Y_{t + l}^{2} |^{2} {))}^{1 / 2} = c_{4} \end{matrix}

If

l > 1

,

Y_{t}

and

Y_{t + l}

are irrelevant, then

\begin{matrix} E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) = E (Y_{t}^{2} Y_{t + l}^{2}) = c_{2}^{2} = γ_{Y}^{2} (0) \\ \frac{1}{T} \sum_{l = 0}^{T} {E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) - R^{2} (v)} \leq \frac{1}{T} {c_{4} - γ_{Y}^{2} (0)} \to 0 \end{matrix}

The case $v \geq 1$ :

If

l \leq v + 1

:

\begin{matrix} E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) \leq [E (| Y_{t} Y_{t + v} |^{2} | Y_{t + l} Y_{t + l + v} |^{2}]^{\frac{1}{2}} \leq [E (| Y_{t} |^{4} | Y_{t + v} |^{4} | Y_{t + l} |^{4} | Y_{t + l + v} {|^{4}]}^{\frac{1}{4}} = c_{4} \end{matrix}

If

l > v + 1

,

Y_{t} Y_{t + l}

and

Y_{t + l} Y_{t + l + v}

are irrelevant, then

\begin{matrix} E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) = E (Y_{t} Y_{t + v}) E (Y_{t + l} Y_{t + l + v}) = R^{2} (v) \\ \frac{1}{T} \sum_{l = 0}^{T} {E (Y_{t} Y_{t + v} Y_{t + l} Y_{t + v + l}) - R_{v}^{2}} \leq \frac{1}{T} {(c_{4} - R^{2} (v)) (v + 1)} \to 0 \end{matrix}

We can obtain:

\begin{matrix} l i m_{T \to \infty} E [{({\hat{R}}_{n}^{2} (v) - R^{2} (v))}^{2}] = 0 \end{matrix}

Which implies:

\begin{matrix} {\hat{R}}^{2} (v) \overset{p}{\to} R^{2} (v) \end{matrix}

The above equation is holding for variable

Y_{t}

:

Y_{t} = X_{t} - μ_{X}, {\hat{R}}_{Y}^{2} (v) \overset{p}{\to} R_{Y}^{2} (v)

We need to prove that

\begin{matrix} P (| {\hat{R}}_{X}^{2} (v) - R_{X}^{2} (v) | \geq ϵ) \to 0 \end{matrix}

Because of

R_{X}^{2} (v) = R_{Y}^{2} (v)

, rewriting the above equation as

\begin{matrix} P (| {\hat{R}}_{X}^{2} (v) - R_{X}^{2} (v) | \geq ϵ) & = P (| {\hat{R}}_{X}^{2} (v) - {\hat{R}}_{Y}^{2} (v) + {\hat{R}}_{Y}^{2} (v) - R_{X}^{2} (v) | \geq ϵ) \\ \leq P (| {\hat{R}}_{X}^{2} (v) - {\hat{R}}_{Y}^{2} (v) | \geq ϵ / 2) + P (| {\hat{R}}_{Y}^{2} (v) - R_{Y}^{2} (v) | \geq ϵ / 2) \end{matrix}

{\hat{R}}_{Y}^{2} (v)

and

{\hat{R}}_{X}^{2} (v)

expressing as

\begin{matrix} {\hat{R}}_{X}^{2} (v) = \frac{1}{T} \sum_{t = 1}^{T - k} (X_{t + k} - \bar{X}) (X_{t} - \bar{X}), {\hat{R}}_{Y}^{2} (v) = \frac{1}{T} \sum_{t = 1}^{T - k} Y_{t + k} Y_{t} = \frac{1}{T} \sum_{t = 1}^{T - k} (X_{t + k} - μ_{X}) (X_{t} - μ_{X}) \end{matrix}

Then the following expression can be derived

\begin{matrix} P (| {\hat{R}}_{X}^{2} (v) - {\hat{R}}_{Y}^{2} (v) | \geq ϵ / 2) & = P (({\bar{X}}^{2} - μ_{X}^{2}) + (μ_{X} - \bar{X}) ({\bar{X}}_{t + k} + {\bar{X}}_{T}) \geq ϵ / 2) \\ \leq P ({\bar{X}}^{2} - μ_{X}^{2} \geq ε / 4) + P ((μ_{X} - \bar{X}) ({\bar{X}}_{t + k} + {\bar{X}}_{T}) \\ \geq ε / 4) \end{matrix}

Since

\bar{X} \overset{p}{\to} μ_{X}

, by Slutsky’s theorem

\begin{matrix} {\bar{X}}^{2} - μ_{X}^{2} \overset{p}{\to} 0, (μ_{X} - \bar{X}) ({\bar{X}}_{t + k} + {\bar{X}}_{T}) \overset{p}{\to} 0 \end{matrix}

Consequently,

\begin{matrix} P (| {\hat{R}}_{X}^{2} (v) - R_{X}^{2} (v) | \geq ϵ) \to 0, T \to \infty \end{matrix}

☐

References

Alzaid, A.A.; Al-Osh, M.A. An integer-valued pth-order autoregressive structure (INAR(p)) process. J. Appl. Probab. 1990, 27, 314–324. [Google Scholar] [CrossRef]
Du, J.; Li, Y. The integer-valued autoregressive (INAR(p)) model. J. Time Ser. Anal. 1991, 12, 129–142. [Google Scholar]
Al-Osh, M.A.; Alzaid, A.A. Integer-valued moving average (INMA) process. Stat. Pap. 1988, 29, 281–300. [Google Scholar] [CrossRef]
McKenzie, E. Some ARMA models for dependent sequences of Poisson count. Adv. Appl. Probab. 1988, 20, 822–835. [Google Scholar] [CrossRef]
Weiß, C.H. The combined INAR(p) models for time series of counts. Stat. Probab. Lett. 2008, 78, 1817–1822. [Google Scholar] [CrossRef]
Monteiro, M.; Scotto, M.G.; Pereira, I. Integer-valued autoregressive processes with periodic structure. J. Stat. Plan. Infer. 2010, 140, 1529–1541. [Google Scholar] [CrossRef] [Green Version]
Weiß, C.H. A Poisson INAR(1) model with serially dependent innovations. Metrika 2015, 78, 829–851. [Google Scholar] [CrossRef]
Zhu, F. Modeling time series of counts with com-poisson INGARCH models. Math. Comput. Model. 2012, 56, 191–203. [Google Scholar] [CrossRef]
Möller, T.; Weiß, C.H. Threshold models for integer-valued time series with infinite or finite range. In Stochastic Models, Statistics and Their Applications; Steland, A., Rafajłowicz, E., Szajowski, K., Eds.; Springer: Geneva, Switzerland, 2015; pp. 327–334. [Google Scholar]
Yao, K.; Wang, D. A new INAR(1) process with bounded support for counts showing equidispersion, underdispersion and overdispersion. Stat. Pap. 2019, 62, 745–767. [Google Scholar]
Li, C.; Wang, D. First-order mixed integer-valued autoregressive processes with zero-inflated generalized power series innovations. J. Korean Stat. Soc. 2015, 44, 232–246. [Google Scholar] [CrossRef]
Kim, H.Y.; Park, Y. A non-stationary integer-valued autoregressive model. Stat. Pap. 2008, 49, 485. [Google Scholar] [CrossRef]
Zheng, H.; Basawa, I.V. Inference for pth-order random coefficient integer-valued autoregressive processes. J. Time Ser. Anal. 2006, 27, 411–440. [Google Scholar] [CrossRef]
Ristić, M.M.; Bakouch, H.S. A new geometric first-order integer-valued autoregressive (NGINAR(1)) process. J. Stat. Plan. Infer. 2009, 139, 2218–2226. [Google Scholar] [CrossRef]
Zhang, H.; Wang, D.; Zhu, F. Inference for INAR(p) processes with signed generalized power series thinning operator. J. Stat. Plan. Infer. 2010, 140, 667–683. [Google Scholar] [CrossRef]
Ristić, M.M.; Nastić, A.S.; Miletić Ilić, A.V. A geometric time series model with dependent Bernoulli counting series. J. Time Ser. Anal. 2013, 34, 466–476. [Google Scholar] [CrossRef]
Miletić Ilić, A.V.; Ristić, M.M.; Nastić, A.S.; Bakouch, H.S. An INAR(1) model based on a mixed dependent and independent counting series. J. Stat. Comput. Simul. 2018, 88, 290–304. [Google Scholar] [CrossRef]
Weiß, C.H. Thinning operations for modeling time series of counts—A survey. Adv. Appl. Probab. 2008, 92, 319. [Google Scholar] [CrossRef]
Winters, P.R. Forecasting sales by exponentially weighted moving averages. Manag. Sci. 1960, 6, 231–362. [Google Scholar] [CrossRef]
Cox, D.R. Prediction by exponentially weighted moving averages and related methods. J. R. Stat. Soc. Ser. B-Stat. Methodol. 1961, 23, 414–422. [Google Scholar] [CrossRef]
Landauskas, M.; Navickas, Z.; Vainoras, A.; Ragulskis, M. Weighted moving averaging revisited—An algebraic approach. Comput. Appl. Math. 2017, 36, 1545–1558. [Google Scholar] [CrossRef]
Nan, X.; Li, Q.; Qiu, D.; Zhao, Y.; Guo, X. Short-term wind speed syntheses correcting forecasting model and its application. Int. J. Electr. Power Energy Syst. 2013, 49, 264–268. [Google Scholar] [CrossRef]
Alevizakos, V.; Chatterjee, K.; Koukouvinos, C. The triple exponentially weighted moving average control chart. Qual. Technol. Quant. Manag. 2021, 18, 326–354. [Google Scholar] [CrossRef]
Capizzi, G.; Masarotto, G. An adaptive exponentially weighted moving average control chart. Technometrics 2003, 45, 199–207. [Google Scholar] [CrossRef] [Green Version]
Adegoke, N.A.; Abbasi, S.A.; Smith, A.N.H.; Anderson, M.J.; Pawley, M.D.M. A multivariate homogeneously weighted moving average control chart. IEEE Access 2019, 7, 9586–9597. [Google Scholar] [CrossRef]
Brännäs, K.; Quoreshi, A.M.M.S. Integer-valued moving average modelling of the number of transactions in stocks. Appl. Financ. Econ. 2010, 20, 1429–1440. [Google Scholar] [CrossRef]
Brännäs, K.; Hall, A. Estimation in integer-valued moving average models. Appl. Stoch. Models. Bus. Ind. 2001, 17, 277–291. [Google Scholar] [CrossRef]
Aleksandrov, B.; Weiß, C.H. Parameter estimation and diagnostic tests for INMA(1) processes. Test 2019, 29, 196–232. [Google Scholar] [CrossRef]
Jung, R.C.; McCabe, P.M.; Tremayne, A.R. Model validation and diagnostics. In Handbook of Discrete-Valued Time; Davis, R.A., Holan, S.H., Lund, R., Ravishanker, N., Eds.; Chapman & Hall/CRC Press: Boca Raton, FL, USA, 2015; pp. 189–218. [Google Scholar]
Liberzon, D.; Morse, A.S. Basic problems in stability and design of switched systems. IEEE Control Syst. Mag. 1999, 19, 59–70. [Google Scholar]
Blanchini, F.; Miani, S. Set-Theoretic Methods in Control, 2nd ed.; Birkhäuser: Basle, Switzerland, 2015. [Google Scholar]
Jin, C.; Wang, R.; Wang, Q. Stabilization of switched systems with time-dependent switching signal. J. Frankl. Inst.-Eng. Appl. Math. 2020, 357, 13552–13568. [Google Scholar] [CrossRef]

Figure 1. Time series plot (top figure) and autocorrelation function (below figure panel) for actual Larceny dataset.

Figure 2. Acceptance envelope of the DCINMA(1) model for the autocorrelation function for the larceny dataset.

Figure 3. Acceptance envelope of the INMA(1) model for the autocorrelation function for the larceny dataset.

Table 1. Some numerical results of the estimates for true values of the parameters

λ

,

θ

and

β

.

Table 1. Some numerical results of the estimates for true values of the parameters

λ

,

θ

and

β

.

Sample Size	$\hat{λ}$	$\hat{θ}$	$\hat{β}$
(a) True values: $λ$ = 1, $θ$ = 0.6, $β$ = 0.2
100	0.9875	0.5999	0.2641
Bias	0.0124	0.0001	−0.0641
Standard Error	0.1755	0.2065	0.1705
300	0.9858	0.6192	0.2627
Bias	0.0141	−0.0192	−0.0627
Standard Error	0.1082	0.2049	0.1085
700	0.9831	0.6489	0.2663
Bias	0.0168	−0.0489	−0.0663
Standard Error	0.0699	0.1922	0.0717
1000	0.9817	0.6718	0.2684
Bias	0.0182	−0.0718	−0.0684
Standard Error	0.0597	0.1829	0.0600
(b) True values: $λ$ = 4, $θ$ = 0.7, $β$ = 0.1
100	3.7816	0.6173	0.1690
Bias	0.2183	0.0826	−0.0690
Standard Error	0.5296	0.2031	0.1353
300	3.8872	0.6611	0.1394
Bias	0.1127	0.0388	−0.0394
Standard Error	0.3364	0.1767	0.0855
700	3.9123	0.7208	0.1324
Bias	0.0876	−0.0208	−0.0324
Standard Error	0.2337	0.1353	0.0599
1000	3.9144	0.7430	0.1322
Bias	0.0855	−0.0430	−0.0322
Standard Error	0.2012	0.1142	0.0514
(c) True values: $λ$ = 5, $θ$ = 0.5, $β$ = 0.1
100	4.7799	0.5709	0.1556
Bias	0.2200	−0.0709	−0.0556
Standard Error	0.5777	0.2059	0.1142
300	4.9129	0.5703	0.1279
Bias	0.0870	−0.0703	−0.0279
Standard Error	0.3715	0.1852	0.0730
700	4.9343	0.5688	0.1243
Bias	0.0656	−0.0688	−0.0243
Standard Error	0.2615	0.1529	0.0520
1000	4.9340	0.5738	0.1245
Bias	0.0659	−0.0738	−0.0245
Standard Error	0.2211	0.1330	0.0442

Table 2. Size and power of DCINMA(1) and INMA(1) model.

$λ$	$β$	$θ$	n
$λ$	$β$	$θ$	100	250	500
3	0.4	0	0.054	0.057	0.058
3	0.4	0.5	0.431	0.877	0.937
5	0.4	0	0.053	0.057	0.047
5	0.4	0.3	0.447	0.824	0.901
6	0.3	0	0.052	0.055	0.047
6	0.3	0.4	0.63	0.8	0.99
7	0.7	0	0.052	0.046	0.053
7	0.7	0.8	0.998	1	1

Table 3. Estimation results of DCINMA(1) and INMA(1) model.

	$λ$	$β$	$θ$
DCINMA(1)	3.21	0.47	0.80
INMA(1)	3.82	0.24	0

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yu, K.; Wang, H. A New Overdispersed Integer-Valued Moving Average Model with Dependent Counting Series. Entropy 2021, 23, 706. https://doi.org/10.3390/e23060706

AMA Style

Yu K, Wang H. A New Overdispersed Integer-Valued Moving Average Model with Dependent Counting Series. Entropy. 2021; 23(6):706. https://doi.org/10.3390/e23060706

Chicago/Turabian Style

Yu, Kaizhi, and Huiqiao Wang. 2021. "A New Overdispersed Integer-Valued Moving Average Model with Dependent Counting Series" Entropy 23, no. 6: 706. https://doi.org/10.3390/e23060706

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A New Overdispersed Integer-Valued Moving Average Model with Dependent Counting Series

Abstract

1. Introduction

2. The Model and Basic Properties

2.1. The Model Construction

2.2. The Numerical Properties for DCINMA(q) Model

2.3. The Probability Generating Functions for DCINMA Model

2.4. Compare with the INMA(q) Model

2.5. Compare the Entropy with INMA(1) Model

3. Parameter Estimation

4. Simulation Study

4.1. Estimation of the Model Parameters

4.2. Testing for Dependence between Counting Series

5. Real Data Example

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI