A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection

Chen, Huaping

doi:10.3390/stats6010018

Open AccessArticle

A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection

by

Huaping Chen

School of Mathematics and Statistics, Henan University, Kaifeng 475004, China

Stats 2023, 6(1), 293-311; https://doi.org/10.3390/stats6010018

Submission received: 17 January 2023 / Revised: 29 January 2023 / Accepted: 8 February 2023 / Published: 9 February 2023

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

In this paper, we develop a novel soft-clipping discrete beta GARCH (ScDBGARCH) model that provides an available method to model bounded time series with under-dispersion, equi-dispersion or over-dispersion. The new model not only allows positive dependence, but also negative dependence. The stochastic properties of the models are established, and these results are, in turn, used in the analysis of the asymptotic properties of the conditional maximum likelihood (CML) estimator of the new model. In addition, we apply the new model to measles infection to show its improved performance.

Keywords:

discrete beta distribution; bounded time series; ScDBGARCH model; negative dependence; stochastic property

1. Introduction

More and more authors have underlined the importance and the common occurrence of the bounded integer-valued time series over more than two decades. McKenzie [1] proposed the binomial AR (BAR) model based on the binomial thinning operator to analyze the bounded integer-valued time series. To monitor bounded data (which increase at a certain point and then slowly decrease to the initial level), Weiß and Testik [2] further discussed the BAR model by constructing positive additive outliers; see Möller et al. [3] for its some extensions of zero inflation and Chen et al. [4] for its two types of innovative outliers. Kang et al. [5] proposed an extended binomial AR(1) model based on the generalized binomial thinning operator, which relaxes the independence assumption of the binomial thinning operator. To analyze bounded data with under-dispersion, equi-dispersion and over-dispersion, Chen et al. [6] first constructed the Conway–Maxwell–Poisson–binomial thinning operator based on the Conway–Maxwell–Poisson–binomial distribution [7], and then proposed the Conway–Maxwell–Poisson–binomial AR model. To accurately and flexibly capture the correlation structure between two random coefficients in the BAR process, Zhang et al. [8] proposed a new version of the BAR model by using the Farlie–Gumbel–Morgenstern copula, which allows both positive and negative correlations.

In fact, the volatility (especially the heteroscedasticity) is a reality for many important processes and cannot be described by the above models. Here, we take the number of districts with new cases of measles infection per week in the year 2016–2017 reported in

n = 38

Germany’s districts as an example and present its path in Figure 1, which shows that there seems to be more variation at the median of the time series, where the level also appears to be higher.

For this purpose, Weiß and Pollett [9] considered a linear binomial ARCH(1) model, which is generalized to the pth-order case by Ristić et al. [10]. Lee and Lee [11] further discussed a version of the linear binomial ARCH(1) model with a feedback mechanism. Chen et al. [12] proposed two classes of dynamic binomial ARCH models to model time series with a finite range. Chen et al. [13] generalized the binomial ARCH model to the beta-binomial GARCH model, which allows both the conditional and marginal binomial indices of dispersion to be greater than one, i.e., data with extra-binomial variation can be more adequately captured than binomial GARCH-type models. See Liu et al. [14] for the bounded Poisson AR process and Liu et al. [15] for the novel category AR process.

However, the negative ACF cannot be achieved by the above models. To resolve this dilemma, Weiß and Jahn [16], inspired by the softplus INGARCH model [17], proposed the soft-clipping BGARCH model based on the soft-clipping function [18], i.e.,

\begin{matrix} S c_{c} (x) = c \log (\frac{1 + \exp (x / c)}{1 + \exp ((x - 1) / c)}), x \in (0, 1), c > 0 . \end{matrix}

(1)

To further investigate the soft-clipping function, we give some example of the plot of

S c_{c} (x)

in Figure 2, when c takes the value in

{0.02, 0.05, 0.1, 0.3, 0.5, 0.7, 1, 1.5}

,

\forall x \in (0, 1) .

From Figure 2,

S c_{c} (x)

tends to a linear function when

c \to 0

.

“Although the beta-binomial distribution is very flexible with respect to its shape, it is, to a large extent focused on dealing with data sets which appear, in some way, to arise from binomial distributions but which are in fact overdispersed”, which was discussed by Turner [19]. Hence, another concern arises because beta-binomial distribution focuses on over-dispersion such that under-dispersed pseudo-binomial data sets (which are rare but do exist) cannot be analyzed by the beta-binomial GARCH-type models. To fill this gap, we proposed a new soft-clipping discrete beta GARCH (ScDBGARCH) model based on a re-scaled discrete beta binomial distribution. What is remarkable about the ScDBGARCH model is that it not only can be fitted to under-dispersed data (besides over-dispersed bounded data), but also allows negative dependence (besides positive dependence).

It is worth mentioning that the realization of negative dependence for the ScDBGARCH model is mainly due to the incorporated soft-clipping function. Another main contribution of this paper is that we establish the stochastic order of the discrete beta binomial distribution, and then discuss the stability property of the new model. In addition, we discuss the CML estimators and establish their asymptotic normality. Last but not least, we illustrate the availability and superiority in analyzing the count of districts with new cases of measles infection per week in the period of the year 2016–2017 reported in

n = 38

of Germany’s districts.

The paper is organized as follows. Section 2 first gives a brief review of the discrete beta distribution, then gives the definition of the soft-clipping discrete beta GARCH model and its stability properties. Conditional maximum likelihood estimation and their asymptotic properties are established in Section 3. Section 4 provides real data to show the effectiveness of the new model. Conclusions are made in Section 5. Appendix A presents some auxiliary results.

2. Model Formulation and Stability Properties

2.1. Discrete Beta Distribution

For the readers’ convenience, we first give a brief review of the discrete beta distribution, which is introduced by Turner [19].

A random variable X taking values in

{n_{bot}, n_{bot} + 1, n_{bot} + 2, \dots, n_{top}}

is said to follow a discrete beta distribution with parameters

(α, β)

if its probability mass function of X takes the form

\begin{matrix} P (X = x | α, β, n) = \frac{1}{Z (α, β)} f (\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2}), \forall x = n_{bot}, n_{bot} + 1, \dots, n_{top}, \end{matrix}

(2)

where

f (x) = \frac{1}{B (α, β)} x^{α - 1} {(1 - x)}^{β - 1}, B (α, β) = \frac{Γ (α) Γ (β)}{Γ (α + β)}, Z (α, β) = \sum_{x = n_{bot}}^{n_{top}} f (\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2}),

where

n_{top} \in N

is the predetermined upper limit of the range and

n_{bot} = 0

or 1 is the predetermined lower limit of the range. For simplicity, we denote

X \sim {DB}^{1} (n_{bot}, n_{top}, α, β)

.

Furthermore, the probability mass function (given in (2)) of X can be rewritten as the exponential family form, i.e.,

\begin{matrix} P (X = x | α, β) = h (x) \exp (α T_{1} (x) + β T_{2} (x) - A (α, β)), \end{matrix}

(3)

where

A (α, β) = \log (\sum_{x = n_{bot}}^{n_{top}} h (x) \exp (α T_{1} (x) + β T_{2} (x)))

,

T_{1} (x) = \log (\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})

,

T_{2} (x) = \log (\frac{n_{top} - x + 1}{n_{top} - n_{bot} + 2}),

h (x) = \frac{{(n_{top} - n_{bot} + 2)}^{2}}{(x - n_{bot} + 1) (n_{top} - x + 1)}

.

In fact,

f (\cdot)

involving in (2) is the probability density function of the beta distribution with parameters

α

and

β

. By Lemma A2 in Appendix A, one can obtain the mean, variance and BID of the

{DB}^{1} (n_{bot}, n_{top}, α, β)

, if

n_{bot} = 0

and

n_{top} \to \infty

. Similarly, the moments of

{DB}^{1} (1, n_{top}, α, β)

can be obtained if

n_{top} \to \infty

. It is worth mentioning that

μ_{b} = \frac{α}{α + β}

and

σ_{b}^{2} = \frac{α β}{{(α + β)}^{2} (1 + α + β)}

(given in Lemma A2) are precisely the mean and variance of the beta distribution with parameters

α

and

β

. Hence, we consider a reparameterization of the discrete beta distribution given in (2) by setting

p = α / (α + β)

and

τ = α + β

. For simplicity, we rewrite

{DB}^{1} (n_{bot}, n_{top}, α, β)

as

{DB}^{2} (n_{bot}, n_{top}, p, τ)

.

Unfortunately, the specific range of BID for the DB

^{2} (n_{bot}, n_{top}, p, ϕ)

distribution cannot be obtained, except the case for

n_{top} \to + \infty

. To solve this dilemma, we give an example of the BID in Figure 3 with

n : = n_{top} \in (2, 4, 6, 8)

and

n_{bot} = 0

, when p and

ϕ

are varying from 0.1 to 0.9 with increment 0.1 and 0.1 to 8.1 with increment 0.1, respectively. See Figure 4 for

n_{bot} = 1

.

From Figure 3, we have the following observations. First, for given

τ

, when

p \to 1

, the BID is decreasing but greater than 1, except for that of

n_{top} = 2

(the BID is less than 1, if

p \to

1). Second, when p takes a small value, the BID takes the maximum if

τ

takes the boundary value. Third, for the given p and

τ

, the BID tends to a greater value when

n_{top}

is increasing.

From Figure 4, we have the following observations. First, for

n_{top} = 2

, the BID seems to be increasing but lower than 1 when

p \to 1

,

\forall τ

. Second, for

n_{top} > 2

, if

τ

takes the non-boundary value, the BID seems to be increasing and then decreasing when

p \to 1

; otherwise, the BID seems to be decreasing. Third, for the given p and

τ

, the BID tends to a greater value when

n_{top}

is increasing.

To sum up, the discrete beta distribution allows to model bounded data with under-dispersion, equi-dispersion and over-dispersion.

Similar to the statistical-order property of the one-parameter exponential family in [20], Proposition 1 illustrates that it does hold for the

{DB}^{1}

distribution.

Proposition 1.

Suppose

X_{i} \sim {DB}^{1} (n_{bot}, n_{top}, α_{i}, β_{i}), \forall i = 1, 2

. If

α_{1} \leq α_{2}

and

β_{1} \geq β_{2}

, then the following conclusions hold and are equivalent:

(1): $X_{1} \leq_{l r} X_{2},$
(2): $p_{1} \leq p_{2}$ ,

where

p_{i} = α_{i} / (α_{i} + β_{i})

,

\forall i = 1, 2 .

Proof.

(1) It is easy to see that

X_{i}

exhibits the following probability density function

f_{X_{i}} (x) = \frac{1}{Z (α_{i}, β_{i})} \frac{1}{B (α_{i}, β_{i})} {(\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{α_{i} - 1} {(1 - \frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{β_{i} - 1},

where

Z (α_{i}, β_{i}) = \sum_{x = 0}^{n} \frac{1}{B (α_{i}, β_{i})} {(\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{α_{i} - 1} {(1 - \frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{β_{i} - 1}

. Hence,

\begin{matrix} l (x) : = \frac{f_{X_{1}} (x)}{f_{X_{2}} (x)} & = \frac{Z (α_{2}, β_{2}) B (α_{2}, β_{2})}{Z (α_{1}, β_{1}) B (α_{1}, β_{1})} {(\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{α_{1} - α_{2}} {(1 - \frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{β_{1} - β_{2}} \\ \propto {(\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{α_{1} - α_{2}} {(1 - \frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{β_{1} - β_{2}} \end{matrix}

and

\begin{matrix} l^{'} (x) \propto (α_{1} - α_{2}) {(\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{α_{1} - α_{2} - 1} {(1 - \frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{β_{1} - β_{2}} \\ - (β_{1} - β_{2}) {(\frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{α_{1} - α_{2}} {(1 - \frac{x - n_{bot} + 1}{n_{top} - n_{bot} + 2})}^{β_{1} - β_{2} - 1} \leq 0 \end{matrix}

with equality only if

α_{1} = α_{2}

and

β_{1} = β_{2}

. Hence,

X_{1} \leq_{l r} X_{2} .

Furthermore,

p_{1} \leq p_{2}

by Theorem 4.2 in Wang [21].

(2) Note that if

α_{1} \leq α_{2}

and

β_{1} \geq β_{2}

,

\begin{matrix} \frac{1}{p_{1}} - \frac{1}{p_{2}} = \frac{α_{1} + β_{1}}{α_{1}} - \frac{α_{2} + β_{2}}{α_{2}} = \frac{β_{1}}{α_{1}} - \frac{β_{2}}{α_{2}} \geq 0 . \end{matrix}

Hence,

p_{1} \leq p_{2}

, if

α_{1} \leq α_{2}

and

β_{1} \geq β_{2}

, and vice versa. Therefore,

X_{1} \leq_{l r} X_{2} .

The proof is end. □

2.2. Discrete Beta GARCH(1,1) Model with a Nearly Linear Structure

Inspired by Weiß and Jahn [16] and

{DB}^{2}

distribution, we give the definition of the ScDBGARCH(1,1) model by

\begin{matrix} \{\begin{matrix} Z_{t} | F_{t - 1} \sim {DB}^{2} (n_{bot}, n_{top}, p_{t}, τ), \\ p_{t} = S c_{c} (w + α_{1} p_{t - 1} + β_{1} Z_{t - 1} / n_{top}), \end{matrix} \end{matrix}

(4)

where

F_{t}

is the

σ

-field generated by

{Z_{t}, p_{t}, t \in Z}

,

τ > 0

,

| α_{1} | < 1

,

| β_{1} | < 1

and

| α_{1} | + | β_{1} | < 1

,

S c_{c} (x) = c \log ((1 + \exp (x / c)) / (1 + \exp ((x - 1) / c))), \forall x \in (0, 1),

c > 0

,

n_{bot} = 0

or 1 and

n_{top} \in N

is the predetermined upper limit of the range.

By (3) and (4), the conditional probability mass function of

{Z_{t}}

takes the form

\begin{matrix} P (Z_{t} = z_{t} | F_{t - 1}) = h (z_{t}) \exp (p_{t} τ T_{1} (z_{t}) + (1 - τ) p_{t} T_{2} (z_{t}) - A (p_{t} τ, (1 - τ) p_{t})), \end{matrix}

(5)

where

h (z_{t}) = \frac{{(n_{top} - n_{bot} + 2)}^{2}}{(z_{t} - n_{bot} + 1) (n_{top} - z_{t} + 1)}

,

T_{1} (z_{t}) = \log (\frac{z_{t} - n_{bot} + 1}{n_{top} - n_{bot} + 2}),

T_{2} (z_{t}) = \log (\frac{n_{top} - z_{t} + 1}{n_{top} - n_{bot} + 2}),

A (p_{t} τ, (1 - τ) p_{t}) = \log (\sum_{i = n_{bot}}^{n_{top}} h (i) \exp (p_{t} τ T_{1} (i) + (1 - τ) p_{t} T_{2} (i)))

.

Note that Proposition 1 presents that the new discrete beta distribution exhibits a statistical-order property, which is similar to the one-parameter exponential family in Davis and Liu [20]. Hence, a natural idea of the stability of the ScDBGARCH model is using the theory of the iterated random function approach [22] to construct the stability properties of the ScDBGARCH model. For this purpose, we first illustrate the stochastic order of the coupling process

{Z_{t}, λ_{t}, t \in Z}

given in (4), and then account for the moment property of

| Z_{i} - Z_{j} |

(

\forall i \neq j

), which is essential to derive the stability of the proposed model.

Proposition 2.

If

{Z_{t}, p_{t}, t \in Z}

satisfies (4), then

Z_{1} \leq_{l r} Z_{2}

, if

p_{1} \leq_{l r} p_{2}

, where “lr” denotes the likelihood ratio.

The result of Proposition 2 can be obtained by Proposition 1,

i = 1, 2 .

We omit it.

Proposition 3.

For all

i = 1, 2

, if

Z_{i} \sim {DB}^{2} (n_{bot}, n_{top}, p_{t}, τ)

and

F_{λ_{i}}

is the cumulative distribution function of

{DB}^{2} (n_{bot}, n_{top}, p_{t}, τ)

with

μ_{i} = \sum_{z = n_{bot}}^{n_{top}} z P (Z_{i} = z)

and

F_{λ_{i}}^{- 1} (u) : = \inf {t \geq 0, F_{μ_{i}} (t) \geq u}

, then

E | Z_{1} - Z_{2} | = | λ_{1} - λ_{2} |

, where u is a uniform random variable in

(0, 1)

and

Z_{i} = F_{λ_{i}}^{- 1} (u)

.

Proof.

Denote

λ_{i} = \sum_{x = n_{bot}}^{n_{top}} x P (X_{i} = x)

with

X_{i} \sim {DB}^{1} (n_{bot}, n_{top}, α_{i}, β_{i})

. Similar to the first item of Proposition 1,

λ_{1} \leq_{l r} λ_{2},

if

α_{1} \leq α_{2}

,

β_{1} \geq β_{2}

. Therefore,

Z_{1} \leq_{l r} Z_{2}

by Proposition 2, i.e.,

Z_{1} \leq_{s t} Z_{2}

and

F_{λ_{1}}^{- 1} (t) \leq F_{λ_{2}}^{- 1} (t), \forall t \in (0, 1)

. Hence

E | Z_{1} - Z_{2} | = E (Z_{2} - Z_{1}) = λ_{2} - λ_{1} = | λ_{1} - λ_{2} |

. Similarly,

E | Z_{1} - Z_{2} | = E (Z_{1} - Z_{2}) = | λ_{1} - λ_{2} |

, if

λ_{1} \geq λ_{2}

. Thus,

E | Z_{1} - Z_{2} | = | λ_{1} - λ_{2} |

. The proof is complete. □

In the following, we demonstrate that

S c_{c} (\cdot)

satisfies the contraction condition by using Lemma A1 in Appendix A, i.e.,

\forall z_{1}, z_{2} \geq 0 (z_{1} \neq z_{2})

,

p_{1}, p_{2} \geq 0 (p_{1} \neq p_{2})

, there exist

α

and

β

such that

\begin{matrix} \{\begin{matrix} | S c_{c} (w + α_{1} p_{1} + β_{1} \frac{z_{1}}{n_{top}}) - S c_{c} (w + α_{1} p_{2} + β_{1} \frac{z_{2}}{n_{top}}) | < | α | | p_{1} - p_{2} | + | β | | z_{1} - z_{2} |, \\ S c_{c} (w + α_{1} p_{1} + β_{1} z_{1}) \leq α p_{1} + β z_{1} + S c_{c} (w), \\ | S c_{c} (w + α_{1} p_{1} + β_{1} z_{1} / n_{top}) - S c_{c} (w + α_{1} p_{2} + β_{1} z_{1} / n_{top}) | \leq | α | | p_{1} - p_{2} | . \end{matrix} \end{matrix}

(6)

where

| α_{1} | < 1

and

| β_{1} | < 1

.

Assumption 1.

The parametric space

Θ = {θ = {(w, α_{1}, β_{1}, ϕ)}^{⊤}}

is compact with

w \in R

,

0 < ϕ < 1

,

| α_{1} | < 1

,

| β_{1} | < 1

and

| α_{1} | + | β_{1} | < 1

.

Theorem 1.

Let

{Z_{t}, t \in Z}

satisfy (4). If Assumption 1 and the contraction condition (6) hold, then the following results hold:

(1): If π is a stationary distribution and $p_{0} \sim π$ is independent of $p_{0}^{^{'}} \sim π$ , then ${p_{t}, t \in Z}$ is geometric-moment contracting with unique stationary distribution π and $E_{π} p_{1} < \infty$ .
(2): There exists a measurable function $G_{\infty} : D^{\infty} = {(n_{1}, n_{2}, \dots), n_{i} \in D} ⟶ D$ such that $p_{t} \overset{a . s .}{=}$ $G_{\infty} (Z_{t - 1}, Z_{t - 2}, \dots)$ , i.e., $p_{t}$ is $F_{t - 1}$ -measurable, where $D = [0, n]$ .
(3): If ${p_{t}}$ starts from π, i.e., $p_{0} \sim π$ , then ${Z_{t}}$ is a stationary time series. Furthermore, ${Z_{t}, p_{t}}$ is strictly stationary and ergodic.

By Propositions 3 and (6), Theorem 1 can be proved in a similar way in Davis and Liu [20] and Chen et al. [13], and we omit it here.

It is worth mentioning that the incorporated soft-clipping function results in negative auto-regression, besides the positive auto-regression and over-dispersion. Unfortunately, because of the complexity of the discrete beta distribution, we have the closed forms of the auto-regressive coefficient. To get an idea about the abilities of the ScDBGARCH(1,1) model with

c = 0.01

for explaining different autocorrelation structures, we present some ACF(2)-ACF(1) plots for the ScDBGARCH(1,1) model in Figure 5 and Figure 6. To be precise, for given

n_{top} = 10

and

n_{bot} = 0

or 1, sample size

T = 200

and

τ = 1

, we let

β_{1} = 0.05

and

w = 0.5 (1 - | α_{1} | - | β_{1} |)

with

α_{1}

, varying from

- 0.9

to

0.9

with an increment of 0.1, and we compute the values of ACF(1), ACF(2) and plot them against each other.

From Figure 5 and Figure 6, both negative ACF and non-negative ACF are allowed by the novel ScDBGARCH model, while negative ACF is rejected by the binomial GARCH-type models [10,12], i.e., the novel ScDBGARCH model is much more flexible than the classical binomial GARCH models with respect to the auto-regressive structure.

To be honest, the merit of the model ScDBGARCH goes beyond allowing negative auto-regression, and also allowing under-dispersion. To account for the dispersion, we present the plots of the BID (in Figure 7 and Figure 8) for the ScDBGARCH(1,1) model, for given

n_{top} = 10

or 2 and

n_{bot} = 0

or 1, sample size

T = 200

and

τ = 1

when

α_{1}

is varying from

- 0.9

to

0.9

with an increment 0.1,

β_{1} = 0.05

and

w = 0.5 (1 - | α_{1} | - | β_{1} |)

.

From Figure 7 and Figure 8, under-dispersion (besides over-dispersion) is allowed, especially for the ScDBGARCH model with a smaller

n_{top}

. Hence, the ScDBGARCH model provides an available way to analyze bounded integer-valued time series counts.

Remark 1.

Similar to the BGARCH(1,1) model [11] and the BBGARCH(1,1) model [13], we can define the following two models:

Soft-clipping beta-binomial GARCH(1,1) model with

$\begin{matrix} Z_{t} | F_{t - 1} \sim BB (n, p_{t}, ϕ), p_{t} = S c_{c} (w + α_{1} p_{t - 1} + β_{1} Z_{t - 1} / n), \end{matrix}$

(7)

where $w \in R$ , $0 < ϕ < 1$ , $| α_{1} | < 1$ , $| β_{1} | < 1$ and $| α_{1} | + | β_{1} | < 1$ .
Obviously, this model, given in (7), is an example of the BBGARCH(1,1) model in [13]. For convenience, we recall it as the ScBBGARCH(1,1) model.
Soft-clipping binomial GARCH(1,1) model [16] with

$\begin{matrix} Z_{t} | F_{t - 1} \sim Bin (n, p_{t}), p_{t} = S c_{c} (w + α_{1} p_{t - 1} + β_{1} Z_{t - 1} / n), \end{matrix}$

(8)

where $w \in R$ , $0 < ϕ < 1$ , $| α_{1} | < 1$ , $| β_{1} | < 1$ and $| α_{1} | + | β_{1} | < 1$ .
Obviously, this model, given in (8), can be regarded as a further generation of the BARCH-type model; see [10,11,12]. For convenience, we recall it as the ScBGARCH(1,1) model.

3. Parameter Estimation

In this section, we use the conditional maximum likelihood method to estimate the parameters involved in the ScDBGARCH(1,1) model and study their asymptotic behavior. Let

θ = {(w, α, β, τ)}^{⊤}

. Denote

n_{top}

and

n_{bot}

as the upper and lower ranges, and

T \in N

represents the size of the sample.

{Z_{0}, Z_{1}, \dots, Z_{T}}

is a realization of

{Z_{t}}

, which can be obtained by the following steps: First, we let

p_{0} = S c_{c} (w)

and set a pre-run = 500, then generate

{Z_{0}, p_{1}, Z_{1}, \dots, p_{500}, Z_{500}}

, where

p_{t}

is obtained by (4) and

Z_{t}

is generated by using

rdb ()

function in the ddb package; see Turner [19] for more details. Second, we use

p_{500}

as a new initial value of

p_{t}

and rewrite it as

p_{0}

, then generate

{Z_{0}, Z_{1}, Z_{2}, \cdot \cdot \cdot, Z_{T}}

.

By (5), the conditional log-likelihood function of (4) can be written as

\begin{matrix} \log L (θ) = \sum_{t = 1}^{T} \log P (Z_{t} = z_{t} | F_{t - 1}) \\ = \sum_{t = 1}^{T} \log (h (z_{t})) + \sum_{t = 1}^{T} (η_{1} T_{1} (z_{t}) + η_{2} T_{2} (z_{t}) - A (η_{1}, η_{2})), \end{matrix}

(9)

where

h (z_{t}) = \frac{{(n_{top} - n_{bot} + 2)}^{2}}{(z_{t} - n_{bot} + 1) (n_{top} - z_{t} + 1)}

,

T_{1} (z_{t}) = \log (\frac{z_{t} - n_{bot} + 1}{n_{top} - n_{bot} + 2})

,

η_{1} = τ p_{t}

,

T_{2} (z_{t}) = \log (\frac{n_{top} - z_{t} + 1}{n_{top} - n_{bot} + 2})

,

A (η_{1}, η_{2}) = \log (\sum_{i = n_{bot}}^{n_{top}} h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i)))

and

η_{2} = (1 - τ) p_{t}

. Then the CML estimator

{\hat{θ}}^{c m l}

is obtained by maximizing (9).

Note that

\sum_{t = 1}^{T} \log (h (z_{t}))

in (9) is a constant for a given sample. Hence, the conditional log-likelihood function given in (9) can be simplified and denoted as

\begin{matrix} ℓ (θ) = \sum_{t = 1}^{T} l_{t} (θ) = \sum_{t = 1}^{T} (η_{1} T_{1} (z_{t}) + η_{2} T_{2} (z_{t}) - A (η_{1}, η_{2})) \end{matrix}

(10)

and

{\hat{θ}}^{c m l}

can be obtained by maximizing (10), i.e.,

{\hat{θ}}^{c m l}

is a solution of the score equation

\begin{matrix} 0 = \sum_{t = 1}^{T} \frac{\partial l_{t} (θ)}{\partial θ} & = \sum_{t = 1}^{T} (T_{1} (z_{t}) \frac{\partial η_{1}}{\partial θ} - T_{2} (z_{t}) \frac{\partial η_{2}}{\partial θ} - (\frac{\partial A (η_{1}, η_{2})}{\partial η_{1}} \frac{\partial η_{1}}{\partial θ} + \frac{\partial A (η_{1}, η_{2})}{\partial η_{2}} \frac{\partial η_{2}}{\partial θ})) \\ = \sum_{t = 1}^{T} (T_{1} (z_{t}) - \frac{\partial A (η_{1}, η_{2})}{\partial η_{1}}) \frac{\partial η_{1}}{\partial θ} + \sum_{t = 1}^{T} (T_{2} (z_{t}) - \frac{\partial A (η_{1}, η_{2})}{\partial η_{2}}) \frac{\partial η_{2}}{\partial θ} \\ = \sum_{t = 1}^{T} (T_{1} (z_{t}) - A_{1}^{^{'}} (η_{1}, η_{2})) \frac{\partial η_{1}}{\partial θ} + \sum_{t = 1}^{T} (T_{2} (z_{t}) - A_{2}^{^{'}} (η_{1}, η_{2})) \frac{\partial η_{2}}{\partial θ}, \end{matrix}

(11)

where

η_{1} : = η_{1} (θ) = τ p_{t}

,

η_{2} : = η_{2} (θ) = p_{t} (1 - τ)

,

p_{t} = S_{c} (u_{t})

,

\begin{matrix} \partial A (η_{1}, η_{2}) / \partial η_{1} : = A_{1}^{^{'}} (η_{1}, η_{2}) = \sum_{i = n_{bot}}^{n_{top}} (h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i)) T_{1} (i)) / B (η_{1}, η_{2}), \\ \partial A (η_{1}, η_{2}) / \partial η_{2} : = A_{2}^{^{'}} (η_{1}, η_{2}) = \sum_{i = n_{bot}}^{n_{top}} (h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i)) T_{2} (i)) / B (η_{1}, η_{2}), \end{matrix}

\frac{\partial η_{1}}{\partial θ} = (\begin{matrix} τ S c_{c}^{^{'}} (u_{t}) \\ τ S c_{c}^{^{'}} (u_{t}) p_{t - 1} \\ τ S c_{c}^{^{'}} (u_{t}) z_{t - 1} / n_{top} \\ S c_{c} (u_{t}) \end{matrix}), \frac{\partial η_{2}}{\partial θ} = (\begin{matrix} (1 - τ) S c_{c}^{^{'}} (u_{t}) \\ (1 - τ) S c_{c}^{^{'}} (u_{t}) p_{t - 1} \\ (1 - τ) S c_{c}^{^{'}} (u_{t}) z_{t - 1} / n_{top} \\ - S c_{c} (u_{t}) \end{matrix})

with

A (η_{1}, η_{2}) = \log B (η_{1}, η_{2}), B (η_{1}, η_{2}) = \sum_{i = n_{bot}}^{n_{top}} h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i)),

u_{t} = w + α_{1} p_{t - 1} + β_{1} z_{t - 1} / n_{top}

and

p_{t} = S c_{c} (u_{t})

.

Furthermore, the Hessian matrix (denoted as

H_{T} (θ)

) for model (4) is obtained by further differentiation of the score equation, i.e.,

H_{T} (θ) = - \sum_{t = 1}^{T} \frac{\partial^{2} l_{t}}{\partial θ \partial θ^{⊤}}

with

\frac{\partial^{2} l_{t}}{\partial θ \partial θ^{⊤}}

equaling to

\begin{matrix} (T_{1} - A_{1}^{'}) \frac{\partial^{2} η_{1}}{\partial θ \partial θ^{⊤}} + (T_{2} - A_{2}^{'}) \frac{\partial^{2} η_{2}}{\partial θ \partial θ^{⊤}} - A_{11}^{″} \frac{\partial η_{1}}{\partial θ} \frac{\partial η_{1}}{\partial θ^{⊤}} - A_{22}^{″} \frac{\partial η_{2}}{\partial θ} \frac{\partial η_{2}}{\partial θ^{⊤}} - (A_{12}^{″} + A_{21}^{″}) \frac{\partial η_{1}}{\partial θ} \frac{\partial η_{2}}{\partial θ^{⊤}}, \end{matrix}

where

A_{1}^{'} : = A_{1}^{'} (η_{1}, η_{2})

,

A_{2}^{'} : = A_{2}^{'} (η_{1}, η_{2})

,

A_{i j}^{″} : = A_{i j}^{″} (η_{1}, η_{2}) = \partial A_{i}^{'} (η_{1}, η_{2}) / \partial η_{j}

,

\forall i, j = 1, 2

and

\frac{\partial^{2} η_{1}}{\partial θ \partial θ^{⊤}} = (\begin{matrix} τ S c_{c}^{″} & τ S c_{c}^{″} p_{t - 1} & τ S c_{c}^{″} z_{t - 1} / n_{top} & S c_{c}^{'} \\ τ S c_{c}^{″} p_{t - 1} & τ S c_{c}^{″} p_{t - 1}^{2} & τ S c_{c}^{″} p_{t - 1} z_{t - 1} / n_{top} & p_{t - 1} S c_{c}^{'} \\ τ S c_{c}^{″} z_{t - 1} / n_{top} & τ S c_{c}^{″} p_{t - 1} z_{t - 1} / n_{top} & τ S c_{c}^{″} z_{t - 1}^{2} / n_{top}^{2} & S c_{c}^{'} z_{t - 1} / n_{top} \\ S c_{c}^{'} & p_{t - 1} S c_{c}^{'} & S c_{c}^{'} z_{t - 1} / n_{top} & 0 \end{matrix}),

\frac{\partial^{2} η_{2}}{\partial θ \partial θ^{⊤}} = (\begin{matrix} (1 - τ) S c_{c}^{″} & (1 - τ) S c_{c}^{″} p_{t - 1} & (1 - τ) S c_{c}^{″} z_{t - 1} / n_{top} & - S c_{c}^{'} \\ (1 - τ) S c_{c}^{″} p_{t - 1} & (1 - τ) S c_{c}^{″} p_{t - 1}^{2} & (1 - τ) S c_{c}^{ȃ} p_{t - 1} z_{t - 1} / n_{top} & - p_{t - 1} S c_{c}^{'} \\ (1 - τ) S c_{c}^{″} z_{t - 1} / n_{top} & (1 - τ) S c_{c}^{″} p_{t - 1} z_{t - 1} / n_{top} & (1 - τ) S c_{c}^{″} z_{t - 1}^{2} / n_{top}^{2} & - S c_{c}^{'} z_{t - 1} / n_{top} \\ - S c_{c}^{'} & - p_{t - 1} S c_{c}^{'} & - S c_{c}^{'} z_{t - 1} / n_{top} & 0 \end{matrix})

with

S c_{c}^{'} : = S c_{c}^{'} (u_{t})

and

S c_{c}^{″} : = S c_{c}^{″} (u_{t})

.

Lemma 1.

Denote

g (x, η_{1}, η_{2}) = T_{1} (x) η_{1} + T_{2} (x) η_{2} - A (η_{1}, η_{2})

. For all

x \in R

,

g (x, η_{1}, η_{2}) = g (x, η_{1}^{'}, η_{2}^{'})

if and only if

η_{1} = η_{1}^{'}

and

η_{2} = η_{2}^{'}

, where

h (x) = \frac{{(n_{top} - n_{bot} + 2)}^{2}}{(x - n_{bot} + 1) (n_{top} - x + 1)}

,

T_{1} (x) = \log (x - n_{bot} + 1) / (n_{top} - n_{bot} + 2))

,

T_{2} (x) = \log ((n_{top} - x + 1) / (n_{top} - n_{bot} + 2))

,

A (η_{1}, η_{2}) = \log (\sum_{i = n_{bot}}^{n_{top}} h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i)))

,

n_{bot} = 0

or 1 and

n_{top}

is considered a known quantity.

Proof.

Note that

g (x, η_{1}, η_{2})

is continuously differentiable; hence,

\begin{matrix} \frac{\partial g (x, η_{1}, η_{2})}{\partial η_{1}} = T_{1} (x) - \frac{\sum_{i = 0}^{n} h (i) T_{1} (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))}{\sum_{i = 0}^{n} h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))}, \\ \frac{\partial g (x, η_{1}, η_{2})}{\partial η_{2}} = T_{2} (x) - \frac{\sum_{i = 0}^{n} h (i) T_{2} (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))}{\sum_{i = 0}^{n} h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))} . \end{matrix}

Because

(\sum h (i) T_{1} (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))) / (\sum h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i)))

is strictly increasing in terms of

η_{1}

or

η_{2}

, so does for

(\sum h (i) T_{2} (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))) / (\sum h (i) \exp (η_{1} T_{1} (i) + η_{2} T_{2} (i))) .

Hence,

\partial g (x, η_{1}, η_{2}) / \partial η_{1} = \partial g (x, η_{1}^{^{'}}, η_{2}^{^{'}}) / \partial η_{2}

if and only if

η_{1} = η_{1}^{^{'}}

and

η_{2} = η_{2}^{^{'}}

.

To sum up,

g (x, η_{1}, η_{2}) = g (x, η_{1}^{^{'}}, η_{2}^{^{'}})

if and only if

η_{1} = η_{1}^{^{'}}

and

η_{2} = η_{2}^{^{'}}

,

\forall x \in R

. □

Assumption 2.

If there exists a

t \geq 1

such that

Z_{t} (θ_{0}) = Z_{t} (θ)

,

P {(z | F_{t - 1})}_{θ_{0}}

a.s., then

θ = θ_{0},

where

P {(z | F_{t - 1})}_{θ_{0}} = P {(Z_{t} = z | F_{t - 1})}_{θ_{0}}

is the probability measure under the true parameter

θ_{0}

and

F_{t - 1} .

Assumption 2 establishes the identification of the ScDBGARCH(1,1) model based on Lemma 1.

Theorem 2.

Let

{Z_{t}, t \in Z}

be a stationary and ergodic sequence with a finite range and its conditional mean process

{λ_{t}}

satisfy (4), the contraction condition (6). If Assumptions 1 and 2 hold, then, as

T \to \infty

, we obtain the following results:

(1): There exists an estimator ${\hat{θ}}_{2}^{c m l}$ such that ${\hat{θ}}^{c m l} \overset{a . s .}{\to} θ$ ;
(2): $\sqrt{T} ({\hat{θ}}^{c m l} - θ) \overset{d}{\to} N (0, H^{- 1} (θ) I (θ) H^{- 1} (θ))$ ,

where

I (θ) : = E (\frac{\partial l_{t} (θ)}{\partial θ} \frac{\partial l_{t} (θ)}{\partial θ^{⊤}})

and

H (θ) : = - E (\frac{\partial^{2} l_{t} (θ)}{\partial θ \partial θ^{⊤}}) .

The proof of Theorem 2 is similar to Theorem 4 in Chen et al. [12]. We omit it.

4. Real Data Example

In this section, we reconsider the number of districts with new cases of measles infection per week in the year 2016–2017 reported in

n = 38

of Germany’s districts. The dataset is taken from the “SurvStat” data (https://survstat.rki.de/Content/Query/Main.aspx) (accessed on 10 December 2022), which have been reported to the Robert Koch Institute by local and state health departments. See Figure 1 for its sample path.

By communication, the sample mean and variances are 4.3173 and 8.3546, respectively. The ACF and PACF plots are given in Figure 9, respectively.

Besides the ScDBGARCH(1,1) model, the ScBBGARCH(1,1) model given in (7) and the ScBGARCH(1,1) model given in (8) with

c = 0.01

, we also choose the following compared models:

BARCH(p) model [10] with

$Z_{t} | F_{t - 1} \sim Bin (n, p_{t}), p_{t} = a_{0} + \sum_{k = 1}^{p} a_{k} Z_{t - k} / n, p = 1, 2;$
logit-BARCH(p) model [12] with

$Z_{t} | F_{t - 1} \sim Bin (n, p_{t}), logit (p_{t}) = α_{0} + \sum_{k = 1}^{p} logit (α_{k}) Z_{t - k}, p = 1, 2;$
score-BARCH(1) model [12] with

$Z_{t} | F_{t - 1} \sim Bin (n, p_{t}), logit (p_{t}) = α_{0} + α_{1} logit (p_{t - 1}) + α_{2} s_{t}, s_{t} = n p_{t} - Z_{t};$
BGARCH(1,1) model [11] with

$Z_{t} | F_{t - 1} \sim Bin (n, p_{t}), p_{t} = α_{0} + α_{1} p_{t - 1} + α_{2} Z_{t - 1} / n;$
logit-BBGARCH(1,1) model [13] with $Z_{t} | F_{t - 1} \sim BB (n, p_{t}, ϕ)$ and its mean process ${λ_{t}}$ satisfying $logit (λ_{t}) = w + α_{1} logit (λ_{t - 1}) + β_{1} Z_{t - 1}$ .

In the following, we use the above models to fit the measles infection’s data by the CML method and compare their estimated standard error (SE), −log-likelihood (−log-lik), AIC and BIC, where SE is computed by Theorem 2 and

\hat{ϕ} = 1 / (1 + \hat{τ})

. The CML estimates and approximated standard errors of parameters (including the fitted values of −log-lik, AIC and BIC) are summarized in Table 1.

From Table 1, we have the following observations. For the BARCH-type models, the BARCH(2) model with a linear transformation takes the smallest −log-lik, AIC and BIC. For the GARCH-type models, the ScDBGARCH(1,1) model takes the smallest −log-lik, AIC and BIC, followed by the ScBGARCH(1,1) model, which may be attributed to the merits of the soft-clipping function.For all compared models, the ScDBGARCH(1,1) model takes the smallest −log-lik, AIC and BIC. Hence, the ScDBGARCH(1,1) model is more suitable for the measles data.

To further check the adequacy of the ScDBGARCH(1,1) model, we analyze its Pearson residuals, which are defined by

e_{t} = (Z_{t} - {\hat{μ}}_{t}) / \sqrt{{\hat{σ}}_{t}^{2}}

with

{\hat{μ}}_{t} = \sum_{z = 0}^{n} Z_{t} P (Z_{t} = z | F_{t - 1})

and

{\hat{σ}}_{t}^{2} = \sum_{z = 0}^{n} {(z - {\hat{μ}}_{t})}^{2} P (Z_{t} = z | F_{t - 1}) .

As discussed in Weiß [23], “for an adequate model, its fitted standardized Pearson residuals are expected to be uncorrelated with a mean about 0 and a variance about 1”.

First, we calculate that the mean and variance of the Pearson residuals of the ScDBGARCH(1,1) model are

- 0.0059

and

1.0107

, which implies that the ScDBGARCH(1,1) model demonstrates adequacy. Second, we give its residual analysis in Figure 10, which also shows that this model does rather well.

Third, we consider the fitted values of the Ljung–Box test based on lags k = 3, 5, 7, 9, 11, 13, and 15, including their p-values and their critical values (

χ_{0.95}^{2} (k)

) with 0.05 confidence, and summarize them in Table 2.

Table 2 shows that all of the Ljung–Box statistics are less than the corresponding critical values, and the p-values are much greater than the significant level 0.05. Hence, both of them further illustrate the availability of the ScDBGARCH(1,1) model in analyzing the measles data. To sum up, the ScDBGARCH(1,1) model shows better performance in analyzing the measles data.

5. Concluding and Discussion

This paper considers a new and flexible soft-clipping discrete beta GARCH(1,1) model, which not only allows positive correlation, but also negative correlation, as well as under-dispersion, equi-dispersion and over-dispersion. We discuss some properties of the new model, the CML estimate of the parameters involved in the novel model, and the large-sample property of the CML estimate. The applicability and superior of the ScDBGARCH model are illustrated by a real data example.

Like linear binomial ARCH/GARCH-type models [10,11], logit binomial ARCH-type models [12] or beta-binomial GARCH-type models [13], the ScDBGARCH model is applicable to analyze stationary non-negative data with a finite range and will be invalid for data with some time trends. Two natural methods arise, and both of them deserve a detailed analysis in a future project.

One popular method is incorporated into the covariate processes when constructing a new model. Similar to the logit-BBGARCHX model [24] and the PARX model [25], one can establish a model with covariates, taking the ScDBGARCH(1,1) model as an example:

\begin{matrix} \{\begin{matrix} Z_{t} | F_{t - 1} \sim {DB}^{2} (n n_{top}, n_{bot}, p_{t}, τ), \\ p_{t} = S c_{c} (w + α_{1} p_{t - 1} + β_{1} Z_{t - 1} / n_{top} + f (X_{t - 1}, γ)), \end{matrix} \end{matrix}

where

n_{top} \in N

is a predetermined upper limit of the range,

n_{bot} = 0

or 1 is a predetermined lower limit of the range,

X_{t} = (X_{1 t}, X_{2 t}, \dots, X_{d t})

is a d-dimensional exogenous covariate vector,

F_{t}

is the

σ

-field generated by

{Z_{s}, λ_{s}, X_{s}, \forall s < t}

,

f (\cdot, γ) : R^{d} \to R

,

γ

is the additional parameter vector involved in

f (\cdot, \cdot)

, and

(w, α_{1}, β_{1}, τ)

is the parameter vector with

τ > 0

,

| α_{1} | < 1, | β_{1} | < 1

and

| α_{1} | + | β_{1} | < 1

. When discussing the statistical property, an essential and unavoidable point is the specific form of

f (\cdot, \cdot)

. See Chen and Khamthong [26] for Markov-switching cases.

Specially, if the considered data have a periodic trend, one can consider a s-periodically distributed sequence

{Z_{t}, t \in Z}

and its mean process

{λ_{t}}

satisfying (4), i.e., the s-periodicity of

{Z_{t}, t \in Z}

is understood in the sense that

Z_{t} \overset{d}{=} Z_{k s + τ}

for all

k, t \in Z, τ = 1, 2, \dots, s

, where

\overset{d}{=}

denotes equality in distribution. To highlight the periodicity, one can consider the model by letting

t = k s + τ

,

\forall k \in Z, \forall τ = 1, 2, \dots, s

, and

\begin{matrix} \{\begin{matrix} Z_{k s + τ} | F_{k s + τ - 1} \sim {DB}^{2} (n, p_{k s + τ}, ϕ_{k s + τ}), \\ λ_{k s + τ} : = (n + 2) p_{k s + τ} - 1 = S_{c} (w_{k s + τ} + α_{1, k s + τ} λ_{k s + τ - 1} + β_{1, k s + τ} Z_{k s + τ - 1}), \end{matrix} \end{matrix}

where

| α_{1, τ} | < 1

,

| β_{1, τ} | < 1

and

\prod_{τ = 1}^{s} | α_{1, τ} | + | β_{1, τ} | < 1

. See Aknouche et al. [27] for a general periodic mixed Poisson autoregression.

The other popular method is to remove the time trend by using the difference method, but having a negative value emerge (besides non-negative bounded data), i.e.,

Z

-valued bounded data emerge. As far as we know, existing GARCH-type models are constructed by random rounding operators (see Liu and Yuan [28]), some

Z

-valued discrete distributions (see Alomani et al. [29], Carallo et al. [30], Cui et al. [31]), difference of two independent non-negative INGARCH models (see Gonçalves and Mendes-Lopes [32]) and non-negative INGARCH models multiplying by some special

Z

-valued discrete random variables (see Xu and Zhu [33]). However, they focus on

Z

-valued data with infinite range and cannot apply to bounded data. Hence, a future project in term of the

Z

-valued bounded data deserves to be considered.

In addition, as discussed in Chen et al. [6], the Conway–Maxwell–Poisson–binomial AR model shows better performance in analyzing bounded time series counts with under-dispersion, equi-dispersion and over-dispersion. A class of the Conway–Maxwell–Poisson–binomial GARCH model deserves to be considered to analyze volatility for integer-valued time series with a finite range. Similar to Bulla et al. [34] and Chen et al. [35], a signed Conway–Maxwell–Poisson–binomial (SCMPB) thinning operator and a bivariate INAR model based on the SCMPB thinning operator also deserve to be considered to analyze bivariate dependent time series with finite ranges.

Funding

Chen’s work is funded by Natural Science Foundation of Henan Province (No. 222300420127) and Postdoctoral research in Henan Province (No. 202103051).

Data Availability Statement

The number of districts with new cases of measles infection per week in the year 2016–2017 reported in

n = 38

Germany’s districts is taken from the “SurvStat” (https://survstat.rki.de/Content/Query/Main.aspx) on 12 December 2019.

Acknowledgments

The author thanks the Editor-in-Chief and the anonymous referees for the valuable comments and suggestions that resulted in a substantial improvement of this paper. We acknowledge the constructive suggestions from Fukang Zhu of Jilin University on the work.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

$\| x \|$	absolute of x, $x \in R$ ;
lr	likelihood ratio;
$\leq_{s t}$	stochastic small;
$\overset{d}{=}$	equality in distribution.
$\overset{a . s .}{\to}$	almost surely convergence;
$\overset{d}{\to}$	convergence in distribution.

Appendix A. Auxiliary Results

Lemma A1.

Let

S c_{c} (x) = c \log \frac{1 + \exp (x / c)}{1 + \exp ((x - 1) / c)}, \forall x \in R, c > 0 .

Then,

(1): $S c_{c}^{^{'}} (x) = \frac{\exp (x / c)}{1 + \exp (x / c)} - \frac{\exp ((x - 1) / c)}{1 + \exp ((x - 1) / c)}$ and $| S_{c}^{^{'}} (x) | \leq 1 / 2;$
(2): $\forall x_{1} \in R, x_{2} \in R$ and $x_{1} \neq x_{2}$ , $| S c_{c} (x_{2}) - S c_{c} (x_{1}) | \leq \frac{1}{2} | x_{2} - x_{1} |;$
(3): $S c_{c}^{^{″}} (x) = \frac{\exp (x / c)}{c {(1 + \exp (x / c))}^{2}} - \frac{\exp ((x - 1) / c)}{c {(1 + \exp ((x - 1) / c))}^{2}}$ and $| S c_{c}^{″} (x) | \leq \frac{1}{2 c}$ ;
(4): $S c_{c}^{‴} (x) = \frac{\exp (x / c) (\exp (x / c) - 1)}{c^{2} {(1 + \exp ((x - 1) / c))}^{3}} - \frac{\exp (x - 1 / c) (\exp ((x - 1) / c) - 1)}{c^{2} {(1 + \exp ((x - 1) / c))}^{3}}$ and $| S c_{c}^{‴} (x) | \leq \frac{1}{4 c^{2}}$ .

Proof.

(1) Because

S c_{c} (x)

is a continuously differentiable function in

R

,

S c_{c}^{'} (x)

exists and

S c_{c}^{'} (x) = \frac{\exp (x / c)}{1 + \exp (x / c)} - \frac{\exp ((x - 1) / c)}{1 + \exp ((x - 1) / c)}

and

| S c_{c}^{'} (x) | \leq |\frac{\exp (x / c)}{1 + \exp (x / c)}| + |\frac{\exp ((x - 1) / c)}{1 + \exp ((x - 1) / c)}| \leq \frac{1}{4} + \frac{1}{4} = \frac{1}{2}

by Lemma 4 in [12].

(2) By using the mean value theorem, there exists at least one point

δ \in (x_{1}, x_{2}), \forall x_{1} \neq x_{2},

such that

S c_{c} (x_{2}) - S c_{c} (x_{1}) = S c_{c}^{^{'}} (δ) (x_{2} - x_{1}),

where

S c_{c}^{^{'}} (δ) = \frac{\exp (ξ / c)}{1 + \exp (δ / c)} - \frac{\exp ((δ - 1) / c)}{1 + \exp ((δ - 1) / c)}

. Hence,

| S c_{c}^{^{'}} (δ) | \leq 1 / 2

and

| S c_{c} (x_{2}) - S_{c} (x_{1}) | \leq \frac{1}{2} | x_{2} - x_{1} |, \forall x_{1} \neq x_{2} .

(3) According to item (1),

S c_{c}^{^{'}} (x)

is a continuously differentiable function in

R

, thus

S c_{c}^{^{″}} (x)

exists and

S c_{c}^{^{″}} (x) = \frac{\exp (x / c)}{c {(1 + \exp (x / c))}^{2}} - \frac{\exp ((x - 1) / c)}{c {(1 + \exp ((x - 1) / c))}^{2}}

. Furthermore,

\begin{matrix} | S c_{c}^{^{″}} (x) | \leq |\frac{\exp (x / c)}{c {(1 + \exp (x / c))}^{2}}| + |\frac{\exp ((x - 1) / c)}{c {(1 + \exp ((x - 1) / c))}^{2}}| \leq 1 / (4 c) + 1 / (4 c) = 1 / (2 c) \end{matrix}

by

{(a + b)}^{2} \geq 4 a b, \forall a \in R, \forall b \in R

.

(4) By (3),

S c_{c}^{^{″}} (x)

is a continuously differentiable function in

R

, and thus,

S c_{c}^{^{‴}} (x)

exists and

S c_{c}^{^{‴}} (x) = \frac{\exp (x / c) (\exp (x / c) - 1)}{c^{2} {(1 + \exp (x / c))}^{3}} - \frac{\exp ((x - 1) / c) (\exp ((x - 1) / c) - 1)}{c^{2} {(1 + \exp ((x - 1) / c))}^{3}} .

Furthermore, by using Lemma 4 in [12], we obtain

\begin{matrix} | S c_{c}^{^{‴}} (x) | \leq |\frac{\exp (x / c) (\exp (x / c) - 1)}{c^{2} {(1 + \exp (x / c))}^{3}}| + |\frac{\exp ((x - 1) / c) (\exp ((x - 1) c) - 1)}{c^{2} {(1 + \exp ((x - 1) / c))}^{3}}| \\ \leq \frac{1}{c^{2}} |\frac{\exp (2 x / c)}{{(1 + \exp (x / c))}^{3}}| + \frac{1}{c^{2}} |\frac{\exp (2 (x - 1) / c)}{{(1 + \exp ((x - 1) / c))}^{3}}| \\ \leq \frac{1}{4 c^{2}} . \end{matrix}

The proof is complete. □

Lemma A2.

Let

X \sim {DB}^{1} (n_{bot}, n_{top}, α, β)

with

n_{bot} = 0

and

n_{top} = n

. If

n \to + \infty

, then

(1).: $E (X) \approx (n + 2) \frac{α}{α + β} - 1 = (n + 2) μ_{b} - 1,$
(2).: $Var (X) \approx {(n + 2)}^{2} σ_{b}^{2},$
(3).: $BID = \frac{n Var (X)}{E X (n - E X)} \approx \frac{n ϕ {(n + 2)}^{2} p (1 - p)}{{(n + 2)}^{2} p (1 - p) - (n + 1)} \{\begin{matrix} > 1, if p (1 - p) > \frac{n + 1}{(1 - n ϕ) {(n + 2)}^{2}}, \\ = 1, if p (1 - p) = \frac{n + 1}{(1 - n ϕ) {(n + 2)}^{2}}, \\ < 1, if p (1 - p) < \frac{n + 1}{(1 - n ϕ) {(n + 2)}^{2}}, \end{matrix}$ where $μ_{b} = p = α / (α + β)$ and $σ_{b}^{2} = ϕ μ_{b} (1 - μ_{b})$ with $ϕ = 1 / (1 + α + β)$ .

Proof.

By (2), we compute that

\begin{matrix} E (X) & = \frac{1}{Z (α, β)} \sum_{x = 0}^{n} x f (\frac{x + 1}{n + 2}) = \frac{{(n + 2)}^{2}}{Z (α, β)} \sum_{x = 0}^{n} (\frac{x + 1}{n + 2} - \frac{1}{n + 2}) f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} \\ = \frac{{(n + 2)}^{2}}{Z (α, β)} \sum_{x = 0}^{n} \frac{x + 1}{n + 2} f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} - \frac{{(n + 2)}^{2}}{Z (α, β)} \sum_{x = 0}^{n} \frac{1}{n + 2} f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} \\ \approx \frac{{(n + 2)}^{2}}{Z (α, β)} \int_{0}^{1} x f (x) d x - \frac{n + 2}{Z (α, β)} \int_{0}^{1} f (x) d x \\ \approx \frac{{(n + 2)}^{2}}{n + 2} \int_{0}^{1} x f (x) d x - \frac{n + 2}{n + 2} \int_{0}^{1} f (x) d x \\ \approx (n + 2) \frac{α}{α + β} - 1 = (n + 2) μ_{b} - 1, \\ E (X^{2}) & = \frac{1}{Z (α, β)} \sum_{x = 0}^{n} x^{2} f (\frac{x + 1}{n + 2}) \\ = \frac{{(n + 2)}^{3}}{Z (α, β)} \sum_{x = 0}^{n} (\frac{{(x + 1)}^{2}}{{(n + 2)}^{2}} - \frac{2 (x + 1)}{{(n + 2)}^{2}} + \frac{1}{{(n + 2)}^{2}}) f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} \\ = \frac{{(n + 2)}^{3}}{Z (α, β)} \sum_{x = 0}^{n} {(\frac{x + 1}{n + 2})}^{2} f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} - \frac{2 {(n + 2)}^{2}}{Z (α, β)} \sum_{x = 0}^{n} \frac{x + 1}{n + 2} f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} \\ + \frac{n + 2}{Z (α, β)} \sum_{x = 0}^{n} f (\frac{x + 1}{n + 2}) \frac{1}{n + 2} \\ = \frac{{(n + 2)}^{3}}{Z (α, β)} (\int_{0}^{1} x^{2} f (x) d x - {(\int_{0}^{1} x f (x) d x)}^{2}) + \frac{{(n + 2)}^{3}}{Z (α, β)} {(\int_{0}^{1} x f (x) d x)}^{2} \\ - \frac{2 {(n + 2)}^{2}}{Z (α, β)} \int_{0}^{1} x f (x) d x + \frac{n + 2}{Z (α, β)} \int_{0}^{1} f (x) d x \\ \approx {(n + 2)}^{2} \frac{α β}{{(α + β)}^{2} (1 + α + β)} + {(n + 2)}^{2} \frac{α^{2}}{{(α + β)}^{2}} - 2 (n + 2) \frac{α}{α + β} + 1 \\ = {(n + 2)}^{2} σ_{b}^{2} + {(n + 2)}^{2} μ_{b}^{2} - 2 (n + 2) μ_{b} + 1, \end{matrix}

where

σ_{b}^{2} = \frac{α β}{{(α + β)}^{2} (1 + α + β)}

and

μ_{b} = \frac{α}{α + β}

. Hence,

\begin{matrix} Var (X) & = E (X^{2}) - {(E X)}^{2} \approx {(n + 2)}^{2} σ_{b}^{2} + {(n + 2)}^{2} μ_{b}^{2} - 2 (n + 2) μ_{b} + 1 - {((n + 2) μ_{b} - 1)}^{2} \\ = {(n + 2)}^{2} σ_{b}^{2} . \end{matrix}

Hence, the binomial index of dispersion (BID) of X satisfies

\begin{matrix} BID = \frac{n Var (X)}{E X (n - E X)} \approx \frac{n ϕ {(n + 2)}^{2} p (1 - p)}{{(n + 2)}^{2} p (1 - p) - (n + 1)} \{\begin{matrix} > 1, if p (1 - p) > \frac{n + 1}{(1 - n ϕ) {(n + 2)}^{2}}, \\ = 1, if p (1 - p) = \frac{n + 1}{(1 - n ϕ) {(n + 2)}^{2}}, \\ < 1, if p (1 - p) < \frac{n + 1}{(1 - n ϕ) {(n + 2)}^{2}} . \end{matrix} \end{matrix}

The proof is complete. □

References

McKenzie, E. Some simple models for discrete variate time series. J. Am. Water Resour. Bull. 1985, 21, 645–650. [Google Scholar] [CrossRef]
Weiß, C.H.; Testik, M.C. On the Phase I analysis for monitoring time-dependent count processes. IIE Trans. 2015, 47, 294–306. [Google Scholar] [CrossRef]
Möller, T.A.; Weiß, C.H.; Kim, H.Y.; Sirchenko, A. Modeling zero inflation in count data time series with bounded support. Methodol. Comput. Appl. Probab. 2018, 20, 589–609. [Google Scholar] [CrossRef]
Chen, H.; Li, Q.; Zhu, F. Binomial AR(1) processes with innovational outliers. Commun. Stat. Theory Methods 2021, 50, 446–472. [Google Scholar] [CrossRef]
Kang, Y.; Wang, D.; Yang, K. Extended binomial AR(1) processes with generalized binomial thinning operator. Commun. Stat. Theory Methods 2020, 49, 3498–3520. [Google Scholar] [CrossRef]
Chen, H.; Zhang, J.; Liu, X. A Conway-Maxwell-Poisson-Binomial AR(1) model for bounded time series data. Entropy 2023, 25, 126. [Google Scholar] [CrossRef]
Shmueli, G.; Minka, T.P.; Kadane, J.B.; Borle, S.; Boatwright, P. A useful distribution for fitting discrete data: Revival of the Conway-Maxwell-Poisson distribution. Appl. Stat. 2005, 54, 127–142. [Google Scholar] [CrossRef]
Zhang, R.; Wang, D.; Li, C. Flexible binomial AR(1) processes using copulas. J. Stat. Plan. Inference 2022, 219, 306–332. [Google Scholar] [CrossRef]
Weiß, C.H.; Pollett, P.K. Binomial autoregressive processes with density-dependent thinning. J. Time Ser. Anal. 2014, 35, 115–132. [Google Scholar] [CrossRef]
Ristić, M.M.; Weiß, C.H.; Janjić, A.D. A binomial integer-valued ARCH model. Int. J. Biostat. 2016, 12, 20150051. [Google Scholar] [CrossRef]
Lee, Y.; Lee, S. CUSUM test for general nonlinear integer–valued GARCH models: Comparison study. Ann. Inst. Stat. Math. 2019, 71, 1033–1057. [Google Scholar] [CrossRef]
Chen, H.; Li, Q.; Zhu, F. Two classes of dynamic binomial integer-valued ARCH models. Braz. J. Probab. Stat. 2020, 34, 685–711. [Google Scholar] [CrossRef]
Chen, H.; Li, Q.; Zhu, F. A new class of integer-valued GARCH models for time series of bounded counts with extra-binomial variation. AStA Adv. Stat. Anal. 2022, 106, 243–270. [Google Scholar] [CrossRef]
Liu, M.; Zhu, F.; Zhu, K. Modeling normalcy-dominant ordinal time series: An application to air quality level. J. Time Ser. Anal. 2022, 43, 460–478. [Google Scholar] [CrossRef]
Liu, M.; Li, Q.; Zhu, F. Modeling air quality level with a flexible categorical autoregression. Stoch. Environ. Res. Risk Assess. 36, 2835–2845. [CrossRef]
Weiß, C.H.; Jahn, M. Soft-clipping INGARCH models for time series of bounded Counts. Stat. Model. 2022. forthcoming. [Google Scholar] [CrossRef]
Weiß, C.H.; Zhu, F.; Hoshiyar, A. Softplus INGARCH models. Stat. Sin. 2022, 32, 1099–1120. [Google Scholar] [CrossRef]
Klimek, M.D.; Perelstein, M. Neural network-based approach to phase space integration. SciPost Phys. 2020, 9, 053. [Google Scholar] [CrossRef]
Turner, R. A new versatile discrete distribution. R J. 2021, 13, 485–506. [Google Scholar] [CrossRef]
Davis, R.A.; Liu, H. Theory and inference for a class of observation-driven models with application to time series of counts. Stat. Sin. 2016, 26, 1673–1707. [Google Scholar]
Wang, Z. One mixed negative binomial distribution with application. J. Stat. Plan. Inference 2011, 141, 1153–1160. [Google Scholar] [CrossRef]
Wu, W.; Shao, X. Limit theorems for iterated random functions. J. Appl. Probab. 2004, 41, 425–436. [Google Scholar] [CrossRef]
Weiß, C.H. An Introduction to Discrete-Valued Time Series; John Wiley & Sons: Chichester, UK, 2018. [Google Scholar]
Chen, H.; Li, Q.; Zhu, F. A covariate-driven beta-binomial integer-valued GARCH model for bounded counts with an application. Metrika 2023. forthcoming. [Google Scholar] [CrossRef]
Agosto, A.; Cavaliere, G.; Kristensen, D.; Rahbek, A. Modeling corporate defaults: Poisson autoregressions with exogenous covariates (PARX). J. Empir. Financ. 2016, 38, 640–663. [Google Scholar] [CrossRef]
Chen, C.W.S.; Khamthong, K. Bayesian modelling of nonlinear negative binomial integer-valued GARCHX models. Stat. Model. 2020, 20, 537–561. [Google Scholar] [CrossRef]
Aknouche, A.; Bentarzi, W.; Demouche, N. On periodic ergodicity of a general periodic mixed Poisson autoregression. Stat. Probab. Lett. 2018, 134, 15–21. [Google Scholar] [CrossRef]
Liu, T.; Yuan, X. Random rounded integer-valued autoregressive conditional heteroskedastic process. Stat. Pap. 2013, 54, 645–683. [Google Scholar] [CrossRef]
Alomani, G.A.; Alzaid, A.A.; Omair, M.A. A Skellam INGARCH model. Braz. J. Probab. Stat. 2018, 32, 200–214. [Google Scholar] [CrossRef]
Carallo, G.; Casarin, R.; Robert, C.P. Generalized Poisson difference autoregressive processes. arXiv 2020, arXiv:2002.04470. [Google Scholar]
Cui, Y.; Li, Q.; Zhu, F. Flexible bivariate Poisson integer-valued GARCH model. Ann. Inst. Stat. Math. 2020, 72, 1449–1477. [Google Scholar] [CrossRef]
Gonçalves, E.; Mendes-Lopes, N. Signed compound Poisson integer-valued GARCH processes. Commun. Stat. Theory Methods 2020, 49, 5468–5492. [Google Scholar] [CrossRef]
Xu, Y.; Zhu, F. A new GJR-GARCH model for Z-valued time series. J. Time Ser. Anal. 2022, 43, 490–500. [Google Scholar] [CrossRef]
Bulla, J.; Chesneau, C.; Kachour, M. A bivariate first-order signed integer-valued autoregressive process. Commun. Stat. Theory Methods 2017, 46, 6590–6604. [Google Scholar] [CrossRef]
Chen, H.; Zhu, F.; Liu, X. Two-step conditional least squares estimation for the bivariate Z-valued INAR(1) model with bivariate Skellam innovations. Commun. Stat. Theory Methods 2023. forthcoming. [Google Scholar] [CrossRef]

Figure 1. Path of the measles infection counts.

Figure 2. Plots of the soft-clipping function.

Figure 3. Plots of the BID when

n_{bot} = 0

.

Figure 3. Plots of the BID when

n_{bot} = 0

.

Figure 4. Plots of the BID when

n_{bot} = 1

.

Figure 4. Plots of the BID when

n_{bot} = 1

.

Figure 5. Plots of attainable pairs of ACF(2) against ACF(1) for

n_{top} = 10

with c = 0.01.

Figure 5. Plots of attainable pairs of ACF(2) against ACF(1) for

n_{top} = 10

with c = 0.01.

Figure 6. Plots of attainable pairs of ACF(2) against ACF(1) for

n_{top} = 2

with c = 0.01.

Figure 6. Plots of attainable pairs of ACF(2) against ACF(1) for

n_{top} = 2

with c = 0.01.

Figure 7. Plots of BID for

n_{top} = 10

with

c = 0.01

.

Figure 7. Plots of BID for

n_{top} = 10

with

c = 0.01

.

Figure 8. Plots of BID for

n_{top} = 2

with

c = 0.01

.

Figure 8. Plots of BID for

n_{top} = 2

with

c = 0.01

.

Figure 9. Measles infection’s counts: (a) ACF, (b) PACF.

Figure 10. Pearson residual analysis: (a) ACF, (b) PACF.

Table 1. Estimates and SEs in parentheses for the measles infection counts.

Model	Estimates				−log-lik	AIC	BIC
BARCH(1)	${\hat{a}}_{0}$	${\hat{a}}_{1}$			212.6574	429.3148	434.6036
	0.0367	0.6844
	(0.0071)	(0.0651)
BARCH(2)	${\hat{a}}_{0}$	${\hat{a}}_{1}$	${\hat{a}}_{2}$		204.8729	415.7457	423.6789
	0.0270	0.4056	0.3669
	(0.0072)	(0.0991)	(0.1005)
logit-BARCH(1)	${\hat{a}}_{0}$	${\hat{a}}_{1}$			215.9542	415.7457	423.5789
	−2.8248	0.1608
	(0.1002)	(0.0161)
logit-BARCH(2)	${\hat{a}}_{0}$	${\hat{a}}_{1}$	${\hat{a}}_{2}$		207.5645	421.1290	429.0622
	−2.9473	0.1042	0.0827
	(0.1087)	(0.0221)	(0.0220)
score-BARCH(1)	${\hat{α}}_{0}$	${\hat{α}}_{1}$	${\hat{α}}_{2}$		213.1136	432.2272	440.1604
	−0.6178	0.6777	0.1192
	(0.1525)	(0.0751)	(0.0160)
BGARCH(1,1)	${\hat{a}}_{0}$	${\hat{a}}_{1}$	${\hat{a}}_{2}$		212.4199	430.8399	438.7730
	0.0332	0.0175	0.6923
	(0.0087)	(0.0263)	(0.0675)
ScBGARCH(1,1)	$\hat{w}$	${\hat{α}}_{1}$	${\hat{β}}_{1}$		204.9207	415.8414	423.7746
	0.2209	0.5123	0.4292
	(0.2235)	(0.1002)	(0.0836)
ScDBGARCH(1,1)	$\hat{w}$	${\hat{α}}_{1}$	${\hat{β}}_{1}$	$\hat{ϕ}$	203.5400	415.0799	425.575
	0.2130	0.4926	0.4506	0.0196
	(0.2297)	(0.0963)	(0.0832)	(0.0028)
ScBBGARCH(1,1)	$\hat{w}$	${\hat{α}}_{1}$	${\hat{β}}_{1}$	$\hat{ϕ}$	211.9121	431.8242	442.4017
	0.3188	0.4946	0.4401	0.0202
	(0.3267)	(0.1333)	(0.1116)	(0.0174)
logit-BBGARCH(1,1)	$\hat{w}$	${\hat{α}}_{1}$	${\hat{β}}_{1}$	$\hat{ϕ}$	208.9151	425.8302	436.4078
	−1.7203	0.4288	0.1137	0.0020
	(0.2657)	(0.0933)	(0.0176)	(0.0040)

Table 2. Values of the Ljung–Box test for the measles data.

Lag k	3	5	7	9	11	13	15
p-value	0.7736	0.9519	0.9642	0.9916	0.9950	0.9800	0.9934
$χ_{0.95}^{2} (k)$	7.8147	11.0705	14.0671	16.9190	19.6751	22.3620	24.9958
Ljung–Box statistic	1.1144	1.1245	1.9191	1.9895	2.6083	4.7636	4.8430

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, H. A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection. Stats 2023, 6, 293-311. https://doi.org/10.3390/stats6010018

AMA Style

Chen H. A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection. Stats. 2023; 6(1):293-311. https://doi.org/10.3390/stats6010018

Chicago/Turabian Style

Chen, Huaping. 2023. "A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection" Stats 6, no. 1: 293-311. https://doi.org/10.3390/stats6010018

Article Menu

A New Soft-Clipping Discrete Beta GARCH Model and Its Application on Measles Infection

Abstract

1. Introduction

2. Model Formulation and Stability Properties

2.1. Discrete Beta Distribution

2.2. Discrete Beta GARCH(1,1) Model with a Nearly Linear Structure

3. Parameter Estimation

4. Real Data Example

5. Concluding and Discussion

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Auxiliary Results

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI