A Finite-State Stationary Process with Long-Range Dependence and Fractional Multinomial Distribution

Lee, Jeonghwa

doi:10.3390/fractalfract6100596

Open AccessArticle

A Finite-State Stationary Process with Long-Range Dependence and Fractional Multinomial Distribution

by

Jeonghwa Lee

Department of Statistics, Truman State University, Kirksville, MO 63501, USA

Fractal Fract. 2022, 6(10), 596; https://doi.org/10.3390/fractalfract6100596

Submission received: 17 September 2022 / Revised: 9 October 2022 / Accepted: 11 October 2022 / Published: 14 October 2022

(This article belongs to the Special Issue Numerical Solution and Applications of Fractional Differential Equations)

Download Versions Notes

Abstract

:

We propose a discrete-time, finite-state stationary process that can possess long-range dependence. Among the interesting features of this process is that each state can have different long-term dependency, i.e., the indicator sequence can have a different Hurst index for different states. Furthermore, inter-arrival time for each state follows heavy tail distribution, with different states showing different tail behavior. A possible application of this process is to model over-dispersed multinomial distribution. In particular, we define a fractional multinomial distribution from our model.

Keywords:

long-range dependence; Hurst index; over-dispersed multinomial distribution

1. Introduction

Long-range dependence (LRD) refers to a phenomenon where correlation decays slowly with the time lag in a stationary process in a way that the correlation function is no longer summable. This phenomenon was first observed by Hurst [1,2] and since then it has been observed in many fields such as economics, hydrology, internet traffic, queueing networks, etc. [3,4,5,6]. In a second order stationary process, LRD can be measured by the Hurst index H [7,8],

H = inf {h : \underset{n \to \infty}{lim sup} n^{- 2 h + 1} \sum_{k = 1}^{n} c o v (X_{1}, X_{k}) < \infty} .

Note that

H \in (0, 1),

and if

H \in (1 / 2, 1),

the process possesses a long-memory property.

Among the well-known stochastic processes that are stationary and possess long-range dependence are fractional Gaussian noise (FGN) [9] and fractional autoregressive integrated moving average processes (FARIMA) [10,11].

Fractional Gaussian noise

X_{j}

is a mean-zero, stationary Gaussian process with covariance function:

γ (j) : = c o v (X_{0}, X_{j}) = \frac{v a r (X_{0})}{2} {(| j + 1 |}^{2 H} - {2 | j |}^{2 H} {+ | j - 1 |}^{2 H})

where

H \in (0, 1)

is the Hurst parameter. The covariance function obeys the power law with exponent

2 H - 2

for large lag,

γ (j) \sim v a r (X_{0}) H (2 H - 1) j^{2 H - 2} as j \to \infty .

If

H \in (1 / 2, 1),

then the covariance function decreases slowly with the power law, and

\sum_{j} γ (j) = \infty

, i.e., it has the long-memory property.

A FARIMA(p, d, q) process

{X_{t}}

is the solution of:

ϕ (B) ▿^{d} X_{t} = θ (B) ϵ_{t},

where

p, q

are positive integers, d is real, B is the backward shift,

B X_{t} = X_{t - 1},

and the fractional-differencing operator

▿^{d}

, autoregressive operator

ϕ

, and moving average operator

θ

are, respectively,

\begin{matrix} ▿^{d} & = {(1 - B)}^{d} = \sum_{k = 1}^{\infty} \frac{d (d - 1) \dots (d + 1 - k)}{k!} {(- B)}^{k}, \\ ϕ (B) & = 1 - ϕ_{1} B - ϕ_{2} B^{2} \dots - ϕ_{p} B^{p}, \\ θ (B) & = 1 - θ_{1} B - θ_{2} B^{2} \dots - θ_{q} B^{q} . \end{matrix}

where

{ϵ_{t}}

is the white-noise process, which consists of iid random variables with the finite second moment. Here, the parameter d manages the long-term dependence structure, and by its relation to the Hurst index,

H = d + 1 / 2,

d \in (0, 1 / 2)

corresponds to the long-range dependence in the FARIMA process.

Another class of stationary processes that can possess long-range dependence is from the countable-state Markov process [12]. In a stationary, positive recurrent, irreducible, aperiodic Markov chain, the indicator sequence of visits to a certain state is long-range dependent if and only if return time to the state has an infinite second moment, and this is possible only when the Markov chain has infinite state space. Moreover, if one state has the infinite second moment of return time, then all the other states also have the infinite second moment of return time, and all the states have the same rate of dependency; that is, the indicator sequence of each state is long-range dependence with the same Hurst index.

In this paper, we develop a discrete-time finite-state stationary process that can possess long-range dependence. We define a stationary process

{X_{i}, i \in N}

where the number of possible outcomes of

X_{i}

is finite,

S = {0, 1, \dots, m}

for any

m \in N,

and for

k = 1, 2, \dots, m,

c o v (I_{{X_{i} = k}}, I_{{X_{j} = k}}) = c_{k}^{'} {| i - j |}^{2 H_{k} - 2},

(1)

for any

i, j \in N, i \neq j,

and some constants

c_{k}^{'} \in R_{+}, H_{k} \in (0, 1) .

This leads to:

c o v (X_{i}, X_{j}) \sim c_{k^{'}}^{'} {| i - j |}^{2 H_{k^{'}} - 2} as | i - j | \to \infty,

(2)

where

k^{'} = a r g m a x_{k} {H_{k}; k = 1, \dots, m} .

If

H_{k^{'}} = max {H_{k}; k = 1, \dots, m} \in (1 / 2, 1)

, (1.2) implies that as

n \to \infty,

\sum_{i = 1}^{n} c o v (X_{1}, X_{i})

diverges with the rate of

{| n |}^{2 H_{k^{'}} - 1}

, and the process is said to have long-memory with Hurst parameter

H_{k^{'}}

. Furthermore, from (1.1), for

k = {1, \dots, m},

the process

{I_{{X_{i} = k}}; i = 1, 2, \dots}

is long-range dependence if

H_{k} \in (1 / 2, 1) .

In particular, if

H_{i} \neq H_{j},

then the states

“ i ”

and

“ j ”

produce different levels of dependence. For example, if

H_{i} < 1 / 2 < H_{j},

then the state

“ j ”

produces a long-memory counting process whereas state

“ i ”

produces a short-memory process.

A possible application of our stochastic process is to model the over-dispersed multinomial distribution. In the multinomial distribution, there are n trials, each trial results in one of the finite outcomes, and the outcomes of the trials are independent and identically distributed. When applying the multinomial model to real data, it is often observed that the variance is larger than what it is assumed to be, which is called over-dispersion, due to the violation of the assumption that trials are independent and have identical distribution [13,14], and there have been several ways to model an overdispersed multinomial distribution [15,16,17,18].

Our stochastic process provides a new method to model an over-dispersed multinomial distribution by introducing dependency among trials. In particular, the variance of the number of a certain outcomes among n trials is asymptotically proportional to the fractional exponent of

n,

from which we define:

Y_{k} : = \sum_{i = 1}^{n} I_{{X_{i} = k}} for k = 1, 2, \dots, m,

and call the distribution of

(Y_{1}, Y_{2}, \dots, Y_{m})

the fractional multinomial distribution.

The work in this paper is an extension of the earlier work of the generalized Bernoulli process [19], and the process in this paper is reduced to the generalized Bernoulli process if there are only two states in the possible outcomes of

X_{i}

, e.g.,

S = {0, 1}

.

In Section 2, a finite state stationary process that can possess long-range dependence is developed. In Section 3, the properties of our model are investigated with regard to tail behavior and moments of inter-arrival time of a certain state

“ k ”

, and conditional probability of observing a state

“ k ”

given the past observations in the process. In Section 4, the fractional multinomial distribution is defined, followed by the conclusions in Section 5. Some proofs of propositions and theorems are in Section 6.

Throughout this paper,

{i, i_{0}, i_{1}, \dots}, {i^{'}, i_{0}^{'}, i_{1}^{'}, \dots} \subset N,

with

i_{0} < i_{1} < i_{2} < \dots,

and

i_{0}^{'} < i_{1}^{'} < i_{2}^{'} < \dots .

For any set

A = {i_{0}, i_{1}, \dots, i_{n}},

| A | = n + 1,

the number of elements in the set

A,

and for the empty set, we define

| \emptyset | = 0 .

2. Finite-State Stationary Process with Long-Range Dependence

We define the stationary process

{X_{i}, i \in N}

where the set of possible outcomes of

X_{i}

is finite,

S = {0, 1, \dots, m},

for

m \in N

, with the probability that we observe a state

“ k ”

at time i is

P (X_{i} = k) = p_{k} > 0,

for

k = 0, 1, \dots, m,

and

\sum_{k = 0}^{m} p_{k} = 1 .

For any set

A = {i_{0}, i_{1}, \dots, i_{n}} \subset N

, define the operator:

\begin{matrix} L_{H, p, c}^{*} (A) : = p \prod_{j = 1, \dots, n} (p + c | i_{j} - i_{j - 1} |^{2 H - 2}) . \end{matrix}

If

A = \emptyset,

define

L_{H, p, c}^{*} (A) : = 1,

and if

A = {i_{0}}, L_{H, p, c}^{*} (A) : = p .

Let

H = (H_{1}, H_{2}, \dots, H_{m}), p = (p_{1}, p_{2}, \dots, p_{m}), c = (c_{1}, c_{2}, \dots, c_{m})

be vectors of length

m,

and

H, p, c \in {(0, 1)}^{m} .

We are now ready to define the following operators.

Definition 1.

Let

A_{0}, A_{1}, \dots, A_{m} \subset N

be pairwise disjoint, and

A_{0} = n^{'} > 0 .

Define,

\begin{matrix} L_{H, p, c}^{*} (A_{1}, A_{2}, \dots, A_{m}) : = \prod_{k = 1, \dots m} L_{H_{k}, p_{k}, c_{k}}^{*} (A_{k}), \end{matrix}

and,

D_{H, p, c}^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) : = \sum_{ℓ = 0}^{n^{'}} {(- 1)}^{ℓ} \sum_{\begin{matrix} | B | = ℓ \\ B \subset A_{0} \end{matrix}} \sum_{\begin{matrix} B_{i} \subset B \\ B_{i} \cap B_{j} = \emptyset \\ \cup B_{i} = B \end{matrix}} L_{H, p, c}^{*} (A_{1} \cup B_{1}, A_{2} \cup B_{2}, \dots, A_{m} \cup B_{m}) .

For ease of notation, we denote

D_{H, p, c}^{*},

L_{H, p, c}^{*},

and

L_{H_{k}, p_{k}, c_{k}}^{*}

by

D^{*}, L^{*}, L_{k}^{*},

respectively. Note that if

A_{0} = {i_{0}},

D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) = \prod_{k = 1, \dots, m} L_{k}^{*} (A_{k}) (1 - \sum_{k^{'} = 1}^{m} \frac{L_{k^{'}}^{*} (A_{k^{'}} \cup {i_{0}})}{L_{k^{'}}^{*} (A_{k^{'}})}) .

(3)

For any pairwise disjoint sets

A_{0}, A_{1}, \dots A_{m} \subset N,

if

D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) > 0,

then

{X_{i}; i \in N}

is well defined stationary process with the following probabilities:

\begin{matrix} P (\cap_{i \in A_{k}} {X_{i} = k}) = L_{k}^{*} (A_{k}), for k = 1, \dots, m, \end{matrix}

(4)

\begin{matrix} P (\cap_{k = 1, \dots, m} \cap_{i \in A_{k}} {X_{i} = k}) = \prod_{k = 1, \dots, m} L_{k}^{*} (A_{k}), \end{matrix}

(5)

\begin{matrix} P (\cap_{k = 0, \dots, m} \cap_{i \in A_{k}} {X_{i} = k}) = D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) . \end{matrix}

(6)

In particular, if the stationary process with the probability above is well defined, then, for

k, k^{'} = 1, \dots, m,

we have:

\begin{matrix} P (X_{i} = k, X_{j} = k) & = p_{k} (p_{k} + c_{k} {| j - i |}^{2 H_{k} - 2}), \\ P (X_{i} = k, X_{j} = k^{'}) & = p_{k} p_{k^{'}}, \end{matrix}

\begin{matrix} P (X_{i} = 0, X_{j} = 0) & = 1 - 2 \sum_{k = 1, \dots, m} P (X_{i} = k) + \sum_{k, k^{'} = 1, \dots, m} P (X_{i} = k, X_{j} = k^{'}) \\ = 1 - 2 \sum_{k = 1}^{m} p_{k} + \sum_{k = 1}^{m} p_{k} (p_{1} + p_{2} + \dots + p_{m} + c_{k} {| i - j |}^{2 H_{k} - 2}) \\ = p_{0}^{2} + \sum_{k = 1}^{m} p_{k} c_{k} {| i - j |}^{2 H_{k} - 2}, \\ P (X_{i} = k, X_{j} = 0) & = P (X_{i} = 0, X_{j} = k) = p_{k} (1 - p_{1} - p_{2} - \dots - p_{m} - c_{k} {| i - j |}^{2 H_{k} - 2}) \\ = p_{k} (p_{0} - c_{k} {| i - j |}^{2 H_{k} - 2}) . \end{matrix}

As a result, for

i \neq j, i, j \in N, k \neq k^{'}, k, k^{'} \in {1, 2, \dots, m},

\begin{matrix} c o v (I_{{X_{i} = k}}, I_{{X_{j} = k}}) = p_{k} c_{k} {| i - j |}^{2 H_{k} - 2}, \end{matrix}

(7)

\begin{matrix} c o v (I_{{X_{i} = k}}, I_{{X_{j} = k^{'}}}) = 0, \end{matrix}

(8)

\begin{matrix} c o v (I_{{X_{i} = 0}}, I_{{X_{j} = 0}}) = \sum_{k = 1}^{m} p_{k} c_{k} {| i - j |}^{2 H_{k} - 2}, \end{matrix}

(9)

\begin{matrix} c o v (I_{{X_{i} = k}}, I_{{X_{j} = 0}}) = - p_{k} c_{k} {| i - j |}^{2 H_{k} - 2} . \end{matrix}

(10)

Note that

({I_{{X_{i} = 1}}}_{i \in N}, {I_{{X_{i} = 2}}}_{i \in N}, \dots, {I_{{X_{i} = m}}}_{i \in N})

are m generalized Bernoulli processes with Hurst parameter,

H_{1}, H_{2}, \dots, H_{m}

, respectively (see [19]). However, they are not independent, since for

ℓ \neq k, ℓ \in {1, 2, \dots, m},

P ({I_{{X_{i} = ℓ}} = 1} \cap {I_{{X_{i} = k}} = 1}) = 0 \neq P (I_{{X_{i} = ℓ}} = 1) P (I_{{X_{i} = k}} = 1) = p_{ℓ} p_{k} .

Further, we have,

\begin{matrix} c o v (X_{i}, X_{j}) & = E (X_{i} X_{j}) - E (X_{i}) E (X_{j}) \\ = \sum_{k, k^{'}} k k^{'} P (I_{{X_{i} = k}} = 1, I_{{X_{j} = k^{'}}} = 1) - \sum_{k, k^{'}} k k^{'} p_{k} p_{k^{'}} \\ = \sum_{k = 1, \dots, m} k^{2} p_{k} c_{k} {| i - j |}^{2 H_{k} - 2} . \end{matrix}

Therefore, the process

{X_{i}}_{i \in N}

possesses long-range dependence if

min {H_{1}, \dots, H_{k}} > 1 / 2 .

All the results that appear in this paper are valid regardless of how the finite-state space of

X_{i}

is defined. More specifically, given that:

D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) > 0

for any pairwise disjoint sets

A_{0}, A_{1}, \dots A_{m} \subset N,

we can define probability (4)–(6) with any state space

S = {s_{0}, s_{1}, s_{2}, \dots, s_{m}} \subset R

for any

m \in N

in the following way.

\begin{matrix} P (\cap_{i \in A_{k}} {X_{i} = s_{k}}) = L_{k}^{*} (A_{k}), for k = 1, \dots, m, \\ P (\cap_{k = 1, \dots, m} \cap_{i \in A_{k}} {X_{i} = s_{k}}) = \prod_{k = 1, \dots, m} L_{k}^{*} (A_{k}), \\ P (\cap_{k = 0, \dots, m} \cap_{i \in A_{k}} {X_{i} = s_{k}}) = D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) . \end{matrix}

Note that the only difference is that the space

“ k ”

is replaced by

“ s_{k} ”

. As a result, we can obtain the same results as (7)–(10), except that

I_{{X_{i} = k}}

is replaced by

I_{{X_{i} = s_{k}}},

and we get:

\begin{matrix} c o v (X_{i}, X_{j}) & = c o v (X_{i} - s_{0}, X_{j} - s_{0}) \\ = \sum_{k, k^{'} = 1, \dots, m} s_{k} s_{k}^{'} P (I_{{X_{i} = s_{k}}} = 1, I_{{X_{j} = s_{k}^{'}}} = 1) - \sum_{k, k^{'} = 1, \dots, m} s_{k} s_{k}^{'} p_{k} p_{k^{'}} \\ = \sum_{k = 1, \dots, m} {(s_{k} - s_{0})}^{2} p_{k} c_{k} {| i - j |}^{2 H_{k} - 2} . \end{matrix}

In a similar way, all the results in this paper can be easily transfered to any finite-state space

S \subset R .

For the sake of simplicity, we assume

S = {0, 1, \dots, m}, m \in N,

without loss of generality, and define

S^{0} : = {1, \dots, m}

.

Now, we will give a restriction on the parameter values,

{H_{k}, p_{k}, c_{k}; k \in S^{0}}

, which will make

D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) > 0

for any pairwise disjoint sets

A_{0}, \dots A_{m} \subset N

; therefore, the process

{X_{i}}

is well-defined with the probability (4)–(6).

ASSUMPTIONS:

(A.1)

c_{k}, H_{k}, p_{k} \in (0, 1)

for

k \in S^{0} .

(A.2) For any

i_{0} < i_{1} < i_{2}

,

i_{0}, i_{1}, i_{2} \in N,

\sum_{k = 1}^{m} \frac{(p_{k} + c_{k} | i_{1} - i_{0} |^{2 H_{k} - 2}) (p_{k} + c_{k} | i_{2} - i_{1} |^{2 H_{k} - 2})}{p_{k} + c_{k} {| i_{2} - i_{0} |}^{2 H_{k} - 2}} < 1 .

(11)

For the rest of the paper, it is assumed that ASSUMPTIONS (A.1, A.2) hold.

Remark 1.

(a). (11) holds if,

\sum_{k = 1}^{m} \frac{(p_{k} + c_{k}) (p_{k} + c_{k})}{p_{k} + c_{k} 2^{2 H_{k} - 2}} < 1,

since,

\frac{(p_{k} + c_{k} | i_{1} - i_{0} |^{2 H_{k} - 2}) (p_{k} + c_{k} | i_{2} - i_{1} |^{2 H_{k} - 2})}{(p_{k} + c_{k} | i_{2} - i_{0} |^{2 H_{k} - 2})}

is maximized when

i_{2} - i_{0} = 2, i_{1} - i_{0} = 1,

as it was seen in Lemma 2.1 of [19].

(b). If

(i_{1} - i_{0}) / (i_{2} - i_{0}) \to 0, (i_{2} - i_{1}) / (i_{2} - i_{0}) \to 1

with

i_{2} - i_{0} \to \infty

in (11), then we have:

\sum_{k = 1}^{m} p_{k} + c_{k} {| i_{1} - i_{0} |}^{2 H_{k} - 2} < 1,

(12)

and this, together with (11), implies that for any set

{A_{k}, i_{k}^{'}} \subset N,

\sum_{k = 1}^{m} \frac{L_{k}^{*} (A_{k} \cup {i_{k}^{'}})}{L_{k}^{*} (A_{k})} < 1 .

This means that for any

A_{0} = {i_{0}} \subset N,

D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) > 0

by (3).

(c). From (12),

\sum_{k = 1}^{m} c_{k} < 1 - \sum_{k = 1}^{m} p_{k} = p_{0} .

(d). If

m = 1,

(11) is reduced to (2.7) in the Lemma 2.1 in [19].

Now we are ready to show that

{X_{i}, i \in N}

is well defined with probability (4)–(6).

Proposition 1.

For any disjoint sets

A_{0}, A_{1}, A_{2}, \dots, A_{m} \subset N, A_{0} \neq \emptyset,

D^{*} (A_{1}, A_{2}, \dots, A_{m}; A_{0}) > 0 .

The next theorem shows that the stochastic process

{X_{i}, i \in N}

defined with probability (4)–(6) is stationary, and it has long-range dependence if

max {H_{k}, k \in S^{0}} > 1 / 2 .

Furthermore, the indicator sequence of each state is stationary, and has long-range dependence if its Hurst exponent is greater than 1/2.

Theorem 1.

{X_{i}, i \in N}

is a stationary process with the following properties.

i.

P (X_{i} = k) = p_{k}, f o r k \in S^{0} .

ii.

c o v (I_{{X_{i} = k}}, I_{{X_{j} = k}}) = p_{k} c_{k} {| i - j |}^{2 H_{k} - 2}, f o r k \in S^{0},

and

c o v (I_{{X_{i} = 0}}, I_{{X_{j} = 0}}) \sim p_{k^{'}} c_{k^{'}} {| i - j |}^{2 H_{k^{'}} - 2}, a s | i - j | \to \infty

where

k^{'} = a r g m a x_{k} H_{k} .

iii.

c o v (X_{i}, X_{j}) = \sum_{k = 1}^{m} k^{2} p_{k} c_{k} {| i - j |}^{2 H_{k} - 2}, f o r i \neq j .

Proof.

By Proposition 1,

{X_{i}}

is a well-defined stationary process with probability (4)–(6). The other results follow by (7)–(10). □

3. Tail Behavior of Inter-Arrival Time and Other Properties

For

k \in S^{0},

{I_{{X_{i} = k}}}_{i \in N}

is a stationary process in which the event

{X_{i} = k}

is recurrent, persistent, and aperiodic (here, we follow the terminology and definition in [20]). We define a random variable

T_{k k}^{i}

as the inter-arrival time between the i-th

“ k ”

from the previous

“ k ”

, i.e.,

T_{k k}^{i} : = inf {i > 0 : X_{i + T_{k k}^{i - 1}} = k},

with

T_{k k}^{0} : = 0 .

Since

{I_{{X_{i} = k}}}_{i \in N}

is GBP with parameters

(H_{k}, p_{k}, c_{k})

for

k \in S^{0},

T_{k k}^{2}, T_{k k}^{3}, \dots

are iid (see page 9 [21]). Therefore, we will denote the inter-arrival time between two consecutive observations of k as

T_{k k} .

The next Lemma is directly obtained from Theorem 3.6 in [21].

Lemma 1.

For

k \in S^{0},

the inter-arrival time for state k,

T_{k k},

satisfies the following.

i.

T_{k k}

has a mean of

1 / p_{k}

. It has an infinite second moment if

H_{k} \in (1 / 2, 1) .

ii.

P (T_{k k} > t) = t^{2 H_{k} - 3} L_{k} (t),

where

L_{k}

is a slowly varying function that depends on the parameter

H_{k}, p_{k}, c_{k}

.

The first result i in Lemma 1 is similar to Lemma 1 in [22]. However, here, we have a finite-state stationary process, whereas countable-state space Markov chain was assumed in [22]. Now, we investigate the conditional probabilities and the uniqueness of our process.

Theorem 2.

Let

A_{0}, A_{1}, \dots, A_{m}

be disjoint subsets of

N .

For any

ℓ \in S^{0}

such that

max A_{ℓ} > max A_{0},

and for

i^{'} \notin \cup_{k = 0}^{m} A_{k}

such that

i^{'} > max A_{ℓ},

the conditional probability satisfies the following:

\begin{matrix} P (X_{i^{'}} = ℓ | \cap_{k = 0, \dots, m} \cap_{i \in A_{k}} {X_{i} = k}) = p_{ℓ} + c_{ℓ} {| i^{'} - max A_{ℓ} |}^{2 H_{ℓ} - 2} . \end{matrix}

If there has been no interruption of “0” after the last observation of “ℓ”, then the chance to observe “ℓ” depends on the distance between the current time and the last time of observation of “ℓ”, regardless of how other states appeared in the past. This can be considered as a generalized Markov property. Moreover, this chance to observe

“ ℓ ”

decreases as the distance increases, following the power law with exponent

2 H_{ℓ} - 2

.

Proof.

The result follows from the fact that:

\begin{matrix} P ({X_{i^{'}} = ℓ} \cap_{\begin{matrix} i \in A_{k} \\ k \in S^{0} \end{matrix}} {X_{i} = k}) = P (\cap_{i \in A_{k}, k \in S^{0}} {X_{i} = k}) \times (p_{ℓ} + c_{ℓ} | i^{'} - max A_{ℓ} |^{2 H_{ℓ} - 2}), \end{matrix}

since there is no

i \in A_{0}

between

i^{'}

and

max A_{ℓ} .

□

In a countable state space Markov chain, long-range dependence is possible only when it has infinite state space, and additionally if it is stationary, positive recurrent, irreducible, aperiodic Markov chain, then each state should have the same long-term memory, i.e., sequence indicators have the same Hurst exponent for all states [22]. By relaxing the Markov property, long-range dependence was made possible in a finite-state stationary process, also with different Hurst parameter for different states.

Theorem 3.

Let

A_{0}, A_{1}, \dots, A_{m}

be disjoint subsets of

N .

For

ℓ \in S^{0}

such that

max A_{ℓ} < max A_{0}

, and

i_{1}^{'}, i_{2}^{'}, i_{3}^{'} \notin \cup_{k = 0}^{m} A_{k}

such that

i_{1}^{'}, i_{2}^{'}, i_{3}^{'} > max A_{0},

and

i_{2}^{'} > i_{3}^{'},

the conditional probability satisfies the following:

a.

p_{ℓ} + c_{ℓ} {| i_{1}^{'} - max A_{ℓ} |}^{2 H_{ℓ} - 2} > P (X_{i_{1}^{'}} = ℓ | \cap_{i \in A_{k}, k \in S^{0}} {X_{i} = k}) .

b.

\frac{P (X_{i_{2}^{'}} = ℓ | \cap_{i \in A_{k}, k \in S^{0}} {X_{i} = k})}{P (X_{i_{3}^{'}} = ℓ | \cap_{i \in A_{k}, k \in S^{0}} {X_{i} = k})} > \frac{p_{ℓ} + c_{ℓ} {| i_{2}^{'} - max A_{ℓ} |}^{2 H_{ℓ} - 2}}{p_{ℓ} + c_{ℓ} {| i_{3}^{'} - max A_{ℓ} |}^{2 H_{ℓ} - 2} .}

Theorem 4.

A stationary process with (4)–(6) is the unique stationary process that satisfies

i. for

k \in S

:

P (X_{i} = k) = p_{k}, w h e r e p_{k} > 0 a n d \sum_{k = 0}^{m} p_{k} = 1,

ii. for

k \in S^{0}

and any

i, j \in N, i \neq j,

c o v (I_{{X_{i} = k}}, I_{{X_{j} = k}}) = c_{k}^{'} {| i - j |}^{2 H_{k} - 2},

for some constants

c_{k}^{'} \in R_{+}, H_{k} \in (0, 1),

iii. for any sets,

A \subset S^{0}

and

{i_{k}; k \in A} \subset N,

P (\cap_{k \in A} {X_{i_{k}} = k}) = \prod_{k \in A} p_{k},

iv. for

ℓ \in S^{0},

there is a function

h_{ℓ} (\cdot)

such that,

\begin{matrix} P (X_{i^{'}} = ℓ | \cap_{i \in A_{k}, k \in S^{0}} {X_{i} = k}) = h_{ℓ} (i^{'} - max A_{ℓ}) \end{matrix}

for disjoint subsets,

A_{0}, A_{1}, \dots, A_{m}, {i^{'}} \subset N

, such that

A_{ℓ} \neq \emptyset,

i^{'} > max A_{ℓ},

and

max A_{ℓ} > max A_{0}

(

A_{0}

can be the empty set).

Proof.

Let

X^{*}

be a stationary process that satisfies i–

i v

. By

i, i i,

P (X_{i_{0}}^{*} = k, X_{i_{1}}^{*} = k) = c o v (I_{{X_{i_{0}}^{*} = k}}, I_{{X_{i_{1}}^{*} = k}}) + p_{k}^{2} = c_{k}^{'} {| i_{0} - i_{1} |}^{2 H_{k} - 2} + p_{k}^{2},

which results in:

h_{k} (i_{0} - i_{1}) = P (X_{i_{1}}^{*} = k | X_{i_{0}}^{*} = k) = p_{k} + (c_{k}^{'} / p_{k}) {| i_{0} - i_{1} |}^{2 H_{k} - 2} .

Therefore, by

i v,

\begin{matrix} P (X_{i_{0}}^{*} = k, X_{i_{1}}^{*} = k, X_{i_{2}}^{*} = k, \dots, X_{i_{n}}^{*} = k) & = p_{k} \prod_{j = 1}^{n} h_{k} (i_{j} - i_{j - 1}) \\ = L_{k}^{*} ({i_{0}, i_{2}, \dots, i_{n}}), \end{matrix}

where

L_{k}^{*} = L_{H_{k}, p_{k}, c_{k}^{'} / p_{k}}^{*}

. Furthermore, by applying iii, iv to

X^{*}

,

\begin{matrix} P (\cap_{i \in A_{k}, k \in S^{0}} {X_{i} = k}) = \prod_{k = 1, \dots, m} L_{k}^{*} (A_{k}) . \end{matrix}

This implies that

X^{*}

satisfies (4)–(6) with

c_{k} = c_{k}^{'} / p_{k}

for

k \in S^{0} .

□

4. Fractional Multinomial Distribution

In this section, we define a fractional multinomial distribution that can serve as an over-dispersed multinomial distribution.

Note that

\sum_{i = 1}^{n} I_{{X_{i} = k}}

has mean

n p_{k}

for

k \in S .

Further, as

n \to \infty,

v a r (\sum_{i = 1}^{n} I_{{X_{i} = k}}) \sim {\begin{matrix} (p_{k} (1 - p_{k}) + \frac{c_{k}^{'}}{2 H_{k} - 1}) n & H_{k} \in (0, 1 / 2), \\ c_{k}^{'} n ln n & H_{k} = 1 / 2, \\ \frac{c_{k}^{'}}{2 H_{k} - 1} {| n |}^{2 H_{k}}, & H_{k} \in (1 / 2, 1), \end{matrix}

for

k \in S^{0}

, and,

v a r (\sum_{i = 1}^{n} I_{{X_{i} = 0}}) \sim {\begin{matrix} (p_{k^{'}} (1 - p_{k^{'}}) + \frac{c_{k^{'}}^{'}}{2 H_{k^{'}} - 1}) n & H_{k^{'}} \in (0, 1 / 2), \\ c_{k^{'}}^{'} n ln n & H_{k^{'}} = 1 / 2, \\ \frac{c_{k^{'}}^{'}}{2 H_{k^{'}} - 1} {| n |}^{2 H_{k^{'}}}, & H_{k^{'}} \in (1 / 2, 1), \end{matrix}

where

k^{'} = a r g m a x_{k} {H_{k}; k \in S^{0}}

, and

c_{k}^{'} = p_{k} c_{k} .

It also has the following covariance.

c o v (\sum_{i = 1}^{n} I_{{X_{i} = k}}, \sum_{i = 1}^{n} I_{{X_{i} = k^{'}}}) = - n p_{k} p_{k^{'}},

c o v (\sum_{i = 1}^{n} I_{{X_{i} = 0}}, \sum_{i = 1}^{n} I_{{X_{i} = k}}) = - n p_{0} p_{k} - \sum_{\begin{matrix} i \neq j \\ i, j = 1, \dots, n \end{matrix}} c_{k}^{'} {| i - j |}^{2 H_{k} - 2},

for

k, k^{'} \in S^{0} .

We define

Y_{k} : = \sum_{i = 1}^{n} I_{{X_{i} = k}},

for

k \in S,

and a fixed n, and call its distribution fractional multinomial distribution with parameters

n, p, H, c .

If

c = 0

,

(Y_{0}, Y_{1}, Y_{2}, \dots, Y_{m})

follows a multinomial distribution with parameters

n, p,

and

E (Y_{k}) = n p_{k}, v a r (Y_{k}) = n p_{k} (1 - p_{k}), c o v (Y_{k}, Y_{k^{'}}) = - n p_{k} p_{k^{'}},

for

k, k^{'} \in S, k \neq k^{'},

and

p_{0} = 1 - \sum_{i = 1}^{m} p_{i} .

If

c \neq 0,

(Y_{0}, Y_{1}, \dots, Y_{m})

can serve as over-dispersed multinomial random variables with:

E (Y_{k}) = n p_{k}, V a r (Y_{k}) = n p_{k} (1 - p_{k}) (1 + ψ_{n, k}),

where the over-dispersion parameter

ψ_{n, k}

is as follows.

ψ_{n, k} \sim {\begin{matrix} \frac{c}{(1 - p_{k}) (2 H_{k} - 1)} & if H_{k} \in (0, 1 / 2), \\ \frac{c ln n}{1 - p_{k}} - 1 & if H_{k} = 1 / 2, \\ \frac{c n^{2 H_{k} - 1}}{(1 - p_{k}) 2 H_{k} - 1} - 1 & if H_{k} \in (1 / 2, 1), \end{matrix}

for

k \in S^{0},

and,

ψ_{n, 0} \sim {\begin{matrix} \frac{c}{(1 - p_{k^{'}}) (2 H_{k^{'}} - 1)} & if H_{k^{'}} \in (0, 1 / 2), \\ \frac{c ln n}{1 - p_{k^{'}}} - 1 & if H_{k^{'}} = 1 / 2, \\ \frac{c n^{2 H_{k^{'}} - 1}}{(1 - p_{k^{'}}) 2 H_{k^{'}} - 1} - 1 & if H_{k^{'}} \in (1 / 2, 1), \end{matrix}

where

k^{'} = a r g m a x_{k} {H_{k}; k \in S^{0}},

as

n \to \infty .

If

H_{k} \in (0, 1 / 2),

the over-dispersion parameter

ψ_{n, k}

remains stable as n increases, whereas if

H_{k} \in (1 / 2, 1)

the over-dispersed parameter

ψ_{n, k}

increases with the rate of fractional exponent of n,

n^{2 H_{k} - 1} .

5. Conclusions

A new method for modeling long-range dependence in discrete-time finite-state stationary process was proposed. This model allows different states to have different Hurst indices except that for the base state “0”, the Hurst exponent is the maximum Hurst index of all other states. Inter-arrival time for each state follows a heavy tail distribution, and its tail behavior is different for different states. The other interesting feature of this process is that the conditional probability to observe a state “k” (k is not the base state “0”) depends on the Hurst index

H_{k}

and the time difference between the last observation of “k” and the current time, no matter how other states appeared in the past, given that there was no base state observed since the last observation of “k”. From the stationary process developed in this paper, we defined a fractional multinomial distribution that can express a wide range of over-dispersed multinomial distributions; each state can have a different over-dispersion parameter that can behave as an asymptotically constant or grow with a fractional exponent of the number of trials.

6. Proofs

Lemma 2.

For any

{a_{0}, a_{1}, \dots, a_{n}, a_{0}^{'}, a_{1}^{'}, \dots, a_{n}^{'}} \subset R_{+}

that satisfies

a_{0} - \sum_{i = 1}^{j} a_{i} > 0, a_{0}^{'} - \sum_{i = 1}^{j} a_{i}^{'} > 0

for

j = 1, 2, \dots, n

,

i. if,

\frac{a_{0}}{a_{0}^{'}} \geq \frac{a_{1}}{a_{1}^{'}} \geq \dots \geq \frac{a_{n}}{a_{n}^{'}},

then,

\frac{a_{0} - a_{1} - a_{2} - \dots - a_{n}}{a_{0}^{'} - a_{1}^{'} - a_{2}^{'} - \dots - a_{n}^{'}} \geq \frac{a_{0}}{a_{0}^{'}} .

ii. If,

\frac{a_{0}}{a_{0}^{'}} < \frac{a_{1}}{a_{1}^{'}} \leq \dots \leq \frac{a_{n}}{a_{n}^{'}},

then,

\frac{a_{0} - a_{1} - a_{2} - \dots - a_{n}}{a_{0}^{'} - a_{1}^{'} - a_{2}^{'} - \dots - a_{n}^{'}} \leq \frac{a_{0}}{a_{0}^{'}} .

iii. For any

{a_{0}, a_{1}, \dots, a_{n}, a_{0}^{'}, a_{1}^{'}, \dots, a_{n}^{'}} \subset R_{+}

,

max_{i} \frac{a_{i}}{a_{i}^{'}} \geq \frac{a_{1} + a_{2} + \dots + a_{n}}{a_{1}^{'} + a_{2}^{'} + \dots + a_{n}^{'}} \geq min_{i} \frac{a_{i}}{a_{i}^{'}} .

Proof.

i and ii were proved in Lemma 5.2 in [19].

For iii, define

b_{j}

such that,

\frac{a_{j}}{a_{j}^{'}} = b_{j} .

Then,

\frac{a_{1} + a_{2} + \dots + a_{n}}{a_{1}^{'} + a_{2}^{'} + \dots + a_{n}^{'}} = \frac{b_{1} a_{1}^{'} + b_{2} a_{2}^{'} + \dots + b_{n} a_{n}^{'}}{a_{1}^{'} + a_{2}^{'} + \dots + a_{n}^{'}}

which is weighted average of

{b_{j}, j = 1, \dots, n}

. □

To ease our notation, we will denote:

L^{*} (A_{1}, A_{2}, \dots, A_{k - 1}, A_{k} \cup {i}, A_{k + 1}, \dots, A_{m})

by,

L^{*} (\dots, A_{k} \cup {i}, \dots),

and,

L^{*} (\dots, A_{k} \cup {i}, A_{k^{'}} \cup {j}, \dots) = L^{*} (A_{1}^{*}, A_{2}^{*}, \dots, A_{m}^{*})

where, if

k \neq k^{'}

,

A_{i}^{*} = {\begin{matrix} A_{i} if i \neq k, k,^{'} \\ A_{i} \cup {i} if i = k, \\ A_{i} \cup {j} if i = k^{'}, \end{matrix}

and if

k = k^{'}

,

A_{i}^{*} = {\begin{matrix} A_{i} if i \neq k, \\ A_{i} \cup {i \cup} if i = k . \end{matrix}

D^{*} (\dots, A_{k} \cup {i}, \dots)

and

D^{*} (\dots, A_{k} \cup {i}, A_{k^{'}} \cup {j}, \dots)

are also defined in a similar way.

Lemma 3.

For any disjoint sets

A_{1}, \dots, A_{m}, {i_{0}, i_{1}} \subset N

,

i.

D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}}) > 0

ii.

D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}, i_{1}}) > 0

Proof.

i.

\begin{matrix} D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}}) & = \prod_{k = 1}^{m} L_{k}^{*} (A_{k}) (1 - \sum_{k^{'} = 1}^{m} \frac{L_{k^{'}}^{*} (A_{k^{'}} \cup {i_{0}})}{L_{k^{'}}^{*} (A_{k^{'}})}) \\ = \prod_{k = 1}^{m} L_{k}^{*} (A_{k}) (1 - \sum_{k^{'} = 1}^{m} \frac{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}, i_{0}})}{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}})}) \end{matrix}

where

i_{1, k^{'}}, i_{2, k^{'}} \in A_{k^{'}}

are two closest elements to

i_{0}

among

A_{k^{'}}

such that if

min A_{k^{'}} < i_{0} < max A_{k^{'}},

then

i_{1, k^{'}} < i_{0} < i_{2, k^{'}},

if

i_{0} > max A_{k^{'}},

then

i_{1, k^{'}} < i_{2, k^{'}} < i_{0},

if

i_{0} < min A_{k^{'}},

then

i_{0} < i_{1, k^{'}} < i_{2, k^{'}},

and if

A_{k^{'}} = \emptyset,

then

i_{1, k^{'}} = i_{2, k^{'}} = \emptyset

. Therefore,

\begin{matrix} \frac{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}, i_{0}})}{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}})} \\ = {\begin{matrix} \frac{(p_{k^{'}} + c_{k^{'}} | i_{1, k^{'}} - i_{0} |^{2 H_{k^{'}} - 2}) (p_{k^{'}} + c_{k^{'}} | i_{0} - i_{2, k^{'}} |^{2 H_{k^{'}} - 2})}{p_{k^{'}} + c_{k^{'}} {| i_{1, k^{'}} - i_{2, k^{'}} |}^{2 H_{k^{'}} - 2}} & if min A_{k^{'}} < i_{0} < max A_{k^{'}}, \\ p_{k^{'}} + c_{k^{'}} {| max A_{k^{'}} - i_{0} |}^{2 H_{k^{'}} - 2} & if i_{0} > max A_{k^{'}}, \\ p_{k^{'}} + c_{k^{'}} {| min A_{k^{'}} - i_{0} |}^{2 H_{k^{'}} - 2} & if i_{0} < min A_{k^{'}}, \\ p_{k^{'}} & if A_{k^{'}} = \emptyset . \end{matrix} \end{matrix}

By (11),

\sum_{k^{'} = 1}^{m} \frac{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}, i_{0}})}{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}})} < 1,

and the result is derived.

ii. Since,

\begin{matrix} D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}, i_{1}}) & = D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}}) - \sum_{k = 1}^{m} D^{*} (\dots, A_{k} \cup {i_{1}}, \dots; {i_{0}}), \end{matrix}

it is sufficient if we show:

\begin{matrix} \frac{L^{*} (A_{1}, A_{2}, \dots, A_{m}) - \sum_{k = 1}^{m} L^{*} (\dots, A_{k} \cup {i_{0}}, \dots)}{\sum_{k^{'} = 1}^{m} L^{*} (\dots, A_{k^{'}} \cup {i_{1}}, \dots) - \sum_{k, k^{'} = 1}^{m} L^{*} (\dots, A_{k} \cup {i_{0}}, A_{k^{'}} \cup {i_{1}}, \dots)} > 1 . \end{matrix}

Note that:

\begin{matrix} \frac{L^{*} (A_{1}, A_{2}, \dots, A_{m})}{\sum_{k^{'} = 1}^{m} L^{*} (\dots, A_{k^{'}} \cup {i_{1}}, \dots)} = \frac{1}{\sum_{k^{'} = 1}^{m} \frac{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}, i_{0}})}{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}})}}, \end{matrix}

which is non-increasing as set

A_{k}

increases for

k = 1, \dots, m

. That is,

\frac{L^{*} (A_{1}, A_{2}, \dots, A_{m})}{\sum_{k^{'} = 1}^{m} L^{*} (\dots, A_{k^{'}} \cup {i_{1}}, \dots)} \leq \frac{L^{*} (A_{1}^{'}, A_{2}^{'}, \dots, A_{m}^{'})}{\sum_{k^{'} = 1}^{m} L^{*} (\dots, A_{k^{'}}^{'} \cup {i_{1}}, \dots)}

for any sets

A_{k} \subseteq A_{k}^{'}, k = 1, 2, \dots, m .

Therefore,

\begin{matrix} \frac{L^{*} (A_{1}, A_{2}, \dots, A_{m})}{\sum_{k^{'} = 1}^{m} L^{*} (\dots, A_{k^{'}} \cup {i_{1}}, \dots)} > \frac{\sum_{k = 1}^{m} L^{*} (\dots, A_{k} \cup {i_{0}}, \dots)}{\sum_{k, k^{'} = 1}^{m} L^{*} (\dots, A_{k} \cup {i_{0}}, A_{k^{'}} \cup {i_{1}}, \dots)} \end{matrix}

by iii of Lemma 2. By i of Lemma 2 combined with the fact that:

\frac{1}{\sum_{k^{'} = 1}^{m} \frac{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}, i_{0}})}{L_{k^{'}}^{*} ({i_{1, k^{'}}, i_{2, k^{'}}})}} > 1

from (11), the result is derived. □

Note that for any disjoint sets

A_{1}, A_{2}, \dots, A_{m}, {i_{0}, i_{1}, \dots, i_{n}}

\begin{matrix} D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}, i_{1}, \dots, i_{n}}) & = D^{*} (A_{1}, A_{2}, \dots, A_{m}; {i_{0}, i_{1}, \dots, i_{n - 1}}) \\ - D^{*} (A_{1} \cup {i_{n}}, A_{2}, \dots, A_{m}; {i_{0}, i_{1}, \dots, i_{n - 1}}) \\ - D^{*} (A_{1}, A_{2} \cup {i_{n}}, \dots, A_{m}; {i_{0}, i_{1}, \dots, i_{n - 1}}) \\ \dots \\ - D^{*} (A_{1}, A_{2}, \dots, A_{m} \cup {i_{n}}; {i_{0}, i_{1}, \dots, i_{n - 1}}) . \end{matrix}

Let us denote:

\begin{matrix} \sum_{k = 1}^{m} D^{*} (A_{1}, \dots, A_{k - 1}, A_{k} \cup {i_{n}}, A_{k + 1} \dots, A_{m}; {i_{0}, i_{1}, \dots, i_{n - 1}}) \end{matrix}

by:

\sum_{k = 1}^{m} D^{*} (\dots, A_{k} \cup {i_{n}}, \dots; {i_{0}, i_{1}, \dots, i_{n - 1}}) .

Proof of Proposition 1.

We will show by mathematical induction that

{X_{i_{1}}, \dots, X_{i_{n}}}

is a random vector with probability (4)–(6) for any n and any

{i_{1}, i_{2}, \dots, i_{n}} \subset N

. For

n = 1,

it is trivial. For

n = 2,

it is proved by Lemma 3. Let us assume that

{X_{i_{1}}, \dots, X_{i_{n^{'} - 1}}}

is a random vector with probability (4)–(6) for any

{i_{1}, i_{2}, \dots, i_{n^{'} - 1}} \subset N

. We will prove that

{X_{i_{1}}, \dots, X_{i_{n^{'}}}}

is a random vector for any

{i_{1}, i_{2}, \dots, i_{n^{'}}} \subset N .

Without loss of generality, fix a set

{i_{1}, i_{2}, \dots, i_{n^{'}}} \subset N .

To prove that

{X_{i_{1}}, \dots, X_{i_{n^{'}}}}

is a random vector with probability (4)–(6), we need to show that

D^{*} (A_{1}, \dots, A_{m}; A_{0}) > 0

for any pairwise disjoint sets,

A_{0}, \dots, A_{m},

such that

\cup_{k = 0}^{m} A_{k} = {i_{1}, \dots, i_{n^{'}}} .

If

| A_{0} | = 0

or 1, then the result follows from the definition of

D^{*}

and Lemma 3, respectively. Therefore, we assume that

| A_{0} | \geq 2,

A_{0} = {i_{0}^{'}, i_{1}^{'}, \dots, i_{n_{0}}^{'}},

and

max A_{0} = i_{n_{0}}^{'} .

Let

A_{0}^{'} = A_{0} / {i_{n_{0}}^{'}} .

We will first show that for any such sets,

\frac{D^{*} (A_{1}, \dots, A_{m}; A_{0}^{'})}{\sum_{ℓ = 1}^{m} D^{*} (\dots, A_{ℓ} \cup {i_{n_{0}}^{'}}, \dots; A_{0}^{'})} > 1 .

(13)

(13) is equivalent to

D^{*} (A_{1}, \dots, A_{m}; A_{0}) > 0 .

For fixed

ℓ \in {1, 2, \dots, m},

define the following vectors of length

m - 1,

\begin{matrix} H^{ℓ} & = (H_{1}, \dots, H_{ℓ - 1}, H_{ℓ + 1}, \dots, H_{m}), \\ p^{ℓ} & = (p_{1}, \dots, p_{ℓ - 1}, p_{ℓ + 1}, \dots, p_{m}), \\ c^{ℓ} & = (c_{1}, \dots, c_{ℓ - 1}, c_{ℓ + 1}, \dots, c_{m}) . \end{matrix}

We also define:

D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}) : = D_{H^{ℓ}, p^{ℓ}, c^{ℓ}}^{*} (A_{1}, \dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots, A_{m}; A_{0}) .

Since

{X_{i}; i \in \cup_{k = 1}^{m} A_{k} \cup A_{0}^{'}}

is a random vector with (4)–(6),

D^{*} (\dots, A_{ℓ}, \dots; A_{0}^{'}) > 0,

and it can be written as:

\begin{matrix} D^{*} (\dots, A_{ℓ},, \dots; A_{0}^{'}) = P (\cap_{i \in A_{0}^{'}} {X_{i} = 0} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ}} X_{i} = ℓ}) \\ = P (\cap_{i \in A_{0}^{'}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ}} {X_{i} = ℓ}) \\ - P (\cap_{i \in A_{0}^{'} / {i_{0}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{0}^{'}}} {X_{i} = ℓ}) \\ - P (\cap_{i \in A_{0}^{'} / {i_{0}^{'}, i_{1}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{1}^{'}}} {X_{i} = ℓ} \cap {X_{i_{0}^{'}} = 0}) \\ - P (\cap_{i \in A_{0}^{'} / {i_{0}^{'}, i_{1}^{'}, i_{2}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{2}^{'}}} {X_{i} = ℓ} \cap_{i \in {i_{0}^{'}, i_{1}^{'}}} {X_{i} = 0}) \\ ⋮ \\ - P (\cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{n_{0} - 1}^{'}}} {X_{i} = ℓ} \cap_{i \in A_{0}^{'} / {i_{n_{0} - 1}^{'}}} {X_{i} = 0}) . \end{matrix}

(14)

Note that:

\begin{matrix} P (\cap_{i \in A_{0}^{'}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ}} {X_{i} = ℓ}) \\ = P (\cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ}} {X_{i} = ℓ}) \\ - P (\cap_{i \in A_{0}^{'}} {X_{i} \in {1, \dots, ℓ - 1, ℓ + 1, \dots, m}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ}} {X_{i} = ℓ}) \\ = L_{ℓ}^{*} (A_{ℓ}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}^{'}), \end{matrix}

(15)

and:

\begin{matrix} P (\cap_{i \in {i_{j + 1}^{'}, \dots, i_{n_{0} - 1}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \\ \cap_{i \in A_{ℓ} \cup {i_{j}^{'}}} {X_{i} = ℓ} \cap_{i \in {i_{0}^{'}, \dots, i_{j - 1}^{'}}} {X_{i} = 0}) \\ = P (\cap_{i \in A_{0}^{'} / {i_{j}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{j}^{'}}} {X_{i} = ℓ}) \\ - \sum_{i^{*} \in A_{0}^{'}, i^{*} < i_{j}^{'}} P (\cap_{i \in A_{0}^{'} / {i_{j}^{'}, i^{*}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{j}^{'}, i^{*}}} {X_{i} = ℓ}) \\ + \sum_{\begin{matrix} i^{*}, i^{* *} \in A_{0}^{'}, \\ i^{*} < i^{* *} < i_{j}^{'} \end{matrix}} P (\cap_{i \in A_{0}^{'} / {i_{j}^{'}, i^{*}, i^{* *}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{j}^{'}, i^{*}, i^{* *}}} {X_{i} = ℓ}) \\ ⋮ \\ {(- 1)}^{j} P (\cap_{i \in A_{0}^{'} / {i_{j}^{'}, i_{0}^{'}, i_{1}^{'}, \dots, i_{j - 1}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{j}^{'}, i_{0}^{'}, i_{1}^{'}, \dots, i_{j - 1}^{'}}} {X_{i} = ℓ}) \\ = \sum_{\begin{matrix} C \cap D = \emptyset \\ C = \emptyset or max C < i_{j}^{'} \\ C \cup D = A_{0}^{'} / {i_{j}^{'}} \end{matrix}} {(- 1)}^{| C |} L_{ℓ}^{*} (A_{ℓ} \cup {i_{j}^{'}} \cup C) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; D) \end{matrix}

(16)

where

| \emptyset | = 0

. Therefore, by (14)–(16),

\begin{matrix} D^{*} (\dots, A_{ℓ}, \dots; A_{0}^{'}) = L_{ℓ}^{*} (A_{ℓ}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}^{'}) \\ + \sum_{j = 0}^{n_{0} - 1} \sum_{\begin{matrix} C \cap D = \emptyset \\ C = \emptyset or \max C < i_{j}^{'} \\ C \cup D = A_{0}^{'} / \{i_{j}^{'}\} \end{matrix}} {(- 1)}^{| C | + 1} L_{ℓ}^{*} (A_{ℓ} \cup \{i_{j}^{'}\} \cup C) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; D) . \end{matrix}

(17)

(17) can also be derived by the definition of

L_{ℓ}^{*}, D^{*},

without using probability for

{X_{i}; i \in \cup_{k = 1}^{m} A_{k} \cup A_{0}^{'}}

. In the same way, using the definition of

L_{ℓ}^{*}, D^{*},

\begin{matrix} D^{*} (\dots, A_{ℓ} \cup \{i_{n_{0}}^{'}\}, \dots; A_{0}^{'}) = L_{ℓ}^{*} (A_{ℓ} \cup \{i_{n_{0}}^{'}\}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}^{'}) \\ + \sum_{j = 0}^{n_{0} - 1} \sum_{\begin{matrix} C \cap D = \emptyset \\ C = \emptyset or \max C < i_{j}^{'} \\ C \cup D = A_{0}^{'} / \{i_{j}^{'}\} \end{matrix}} {(- 1)}^{| C | + 1} L_{ℓ}^{*} (A_{ℓ} \cup \{i_{n_{0}}^{'}, i_{j}^{'}\} \cup C) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; D) \end{matrix}

(18)

Note that, for

j = 0, 1, \dots, n_{0} - 1,

\begin{matrix} g_{H, p, c} (A_{1}, \dots, A_{ℓ} \cup {i_{n_{0}}^{'}}, \dots, A_{m}; A_{0}^{'}; i_{j}^{'}) : = \\ \sum_{\begin{matrix} C \cap D = \emptyset \\ C = \emptyset or max C < i_{j}^{'} \\ C \cup D = A_{0}^{'} / {i_{j}^{'}} \end{matrix}} {(- 1)}^{| C | + 1} L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}, i_{j}^{'}} \cup C) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; D) < 0, \end{matrix}

since we have:

\begin{matrix} g_{H, p, c} (A_{1}, \dots, A_{ℓ}, \dots, A_{m}; A_{0}^{'}; i_{j}^{'}) = \\ - P (\cap_{i \in {i_{j + 1}^{'}, \dots, i_{n_{0} - 1}^{'}}} {X_{i} \in {0, ℓ}} \cap_{\begin{matrix} i \in A_{k} \\ k = 1, \dots, m \\ k \neq ℓ \end{matrix}} {X_{i} = k} \cap_{i \in A_{ℓ} \cup {i_{j}^{'}}} {X_{i} = ℓ} \\ \cap_{i \in {i_{0}^{'}, \dots, i_{j - 1}^{'}}} {X_{i} = 0}) < 0 \end{matrix}

by (16), and:

f_{H_{ℓ}, p_{ℓ}, c_{ℓ}} (A_{ℓ}; i_{j}^{'}; i_{n_{0}}^{'}) : = \frac{g_{H, p, c} (A_{1}, \dots, A_{ℓ}, \dots, A_{m}; A_{0}^{'}; i_{j}^{'})}{g_{H, p, c} (A_{1}, \dots, A_{ℓ} \cup {i_{n_{0}}^{'}}, \dots, A_{m}; A_{0}^{'}; i_{j}^{'})} > 1 .

(19)

The last inequality is due to the fact that:

\begin{matrix} \frac{g_{H, p, c} (A_{1}, \dots, A_{ℓ}, \dots, A_{m}; A_{0}^{'}; i_{j}^{'})}{g_{H, p, c} (A_{1}, \dots, A_{ℓ} \cup {i_{n_{0}}^{'}}, \dots, A_{m}; A_{0}^{'}; i_{j}^{'})} \\ = \frac{\sum_{j = 0}^{n_{0} - 1} \sum_{\begin{matrix} C \subseteq A_{0}^{'} / {i_{j}^{'}} \\ C = \emptyset or max C < i_{j}^{'} \end{matrix}} {(- 1)}^{| C | + 1} L_{ℓ}^{*} (A_{ℓ} \cup {i_{j}^{'}} \cup C)}{\sum_{j = 0}^{n_{0} - 1} \sum_{\begin{matrix} C \subseteq A_{0}^{'} / {i_{j}^{'}} \\ C = \emptyset or max C < i_{j}^{'} \end{matrix}} {(- 1)}^{| C | + 1} L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}, i_{j}^{'}} \cup C)}, \end{matrix}

and for any set C such that

max C < i_{j}^{'}

or

C = \emptyset,

\frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{j}^{'}} \cup C)}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}, i_{j}^{'}} \cup C)} = \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{j}^{'}})}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}, i_{j}^{'}})} > 1

by (11). More specifically,

f_{H_{ℓ}, p_{ℓ}, c_{ℓ}} (A_{ℓ}; i_{j}^{'}; i_{n_{0}}^{'}) = \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{j}^{'}} \cup C)}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}, i_{j}^{'}} \cup C)} = \frac{L_{ℓ}^{*} (i_{ℓ, j, 1}, i_{ℓ, j, 2})}{L_{ℓ}^{*} (i_{ℓ, j, 1}, i_{ℓ, j, 2}, i_{n_{0}}^{'})}

(20)

where

i_{ℓ, j, 1}, i_{ℓ, j, 2}

are the two closest elements to

i_{n_{0}}^{'}

among

A_{ℓ} \cup {i_{j}^{'}}

. That is,

i_{ℓ, j, 1}, i_{ℓ, j, 2} \in A_{ℓ} \cup {i_{j}^{'}}

are two closest elements to

i_{n_{0}}^{'}

such that if

min A_{ℓ} \cup {i_{j}^{'}} < i_{n_{0}}^{'} < max A_{ℓ},

then

i_{ℓ, j, 1} < i_{n_{0}}^{'} < i_{ℓ, j, 2},

and if

i_{n_{0}}^{'} > max A_{ℓ} \cup {i_{j}^{'}},

then

i_{ℓ, j, 1} < i_{ℓ, j, 2} < i_{n_{0}}^{'} .

\begin{matrix} \frac{L_{ℓ}^{*} ({i_{ℓ, j, 1}, i_{ℓ, j, 2}})}{L_{ℓ}^{*} ({i_{ℓ, j, 1}, i_{ℓ, j, 2}, i_{n^{'}}})} \\ = {\begin{matrix} \frac{p_{ℓ} + c_{ℓ} {| i_{ℓ, j, 1} - i_{ℓ, j, 2} |}^{2 H_{ℓ} - 2}}{(p_{ℓ} + c_{ℓ} | i_{ℓ, j, 1} - i_{n^{'}} |^{2 H_{ℓ} - 2}) (p_{ℓ} + c_{ℓ} | i_{n^{'}} - i_{ℓ, j, 2} |^{2 H_{ℓ} - 2})} & if min A_{ℓ} \cup {i_{j}^{'}} < i_{n^{'}} < max A_{ℓ}, \\ \frac{1}{p_{ℓ} + c_{ℓ} {| i_{ℓ, j, 2} - i_{n^{'}} |}^{2 H_{ℓ} - 2}} & if i_{n^{'}} > max A_{ℓ} \cup {i_{j}^{'}}, \end{matrix} \end{matrix}

which is non-increasing as j increases since

i_{j}^{'} < i_{n_{0}}^{'} .

Therefore,

f_{H_{ℓ}, p_{ℓ}, c_{ℓ}} (A_{ℓ}; i_{j}^{'}; i_{n_{0}}^{'})

is non- increasing as j increases. Also, for fixed

j, C

such that

max C < i_{j}^{'}

or

C = \emptyset

,

\begin{matrix} \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}, i_{j}^{'}} \cup C)}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{j}^{'}} \cup C)} \geq \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}})}{L_{ℓ}^{*} (A_{ℓ})} \end{matrix}

(21)

by the fact that

\frac{L_{ℓ}^{*} (A \cup {i})}{L_{ℓ}^{*} (A)}

is non-decreasing as the set A increases.

Combining the above facts with (17) and (18), and by i of Lemma 2,

\begin{matrix} \frac{L_{ℓ}^{*} (A_{ℓ})}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}})} \leq \frac{D^{*} (\dots, A_{ℓ}, \dots; A_{0}^{'})}{D^{*} (\dots, A_{ℓ} \cup {i_{n_{0}}^{'}}, \dots; A_{0}^{'})} . \end{matrix}

Therefore,

\begin{matrix} \frac{D^{*} (A_{1}, \dots, A_{m}; A_{0}^{'})}{\sum_{ℓ = 1}^{m} D^{*} (\dots, A_{ℓ} \cup {i_{n_{0}}^{'}}, \dots; A_{0}^{'})} \geq \frac{1}{\sum_{ℓ = 1}^{m} \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{n_{0}}^{'}})}{L_{ℓ}^{*} (A_{ℓ})}} > 1, \end{matrix}

which proves (13) and,

D^{*} (A_{1}, \dots, A_{m}; A_{0}) > 0 .

□

Proof of Theorem 3.

a. Let

A_{0} = {i_{0}, i_{1}, \dots, i_{n}} .

Note that:

\begin{matrix} P (X_{i_{1}^{'}} = ℓ | \cap_{k = 0, \dots, m} \cap_{i \in A_{k}} {X_{i} = k}) = \frac{D^{*} (\dots, A_{ℓ} \cup {i_{1}^{'}}, \dots; A_{0})}{D^{*} (A_{1}, \dots, A_{m}; A_{0})} = \\ \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{1}^{'}}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}) + \sum_{j = 0}^{n} g_{H, p, c} (\dots, A_{ℓ} \cup {i_{1}^{'}}, \dots; A_{0}; i_{j})}{L_{ℓ}^{*} (A_{ℓ}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}) + \sum_{j = 0}^{n} g_{H, p, c} (A_{1}, \dots, A_{m}; A_{0}; i_{j})} . \end{matrix}

Since,

\frac{g_{H, p, c} (A_{1}, \dots, A_{ℓ} \cup {i_{1}^{'}}, \dots, A_{m}; A_{0}; i_{j})}{g_{H, p, c} (A_{1}, \dots, A_{m}; A_{0}; i_{j})}

is non-decreasing as j increases, and by (19) and (20):

\frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{1}^{'}})}{L_{ℓ}^{*} (A_{ℓ})} \leq \frac{g_{H, p, c} (A_{1}, \dots, A_{ℓ} \cup {i_{1}^{'}}, \dots, A_{m}; A_{0}; i_{j})}{g_{H, p, c} (A_{1}, \dots, A_{ℓ}, \dots, A_{m}; A_{0}; i_{j})},

the result follows by ii of Lemma 2.

b.

\begin{matrix} \frac{P (X_{i_{2}^{'}} = ℓ | \cap_{k = 0, \dots, m} \cap_{i \in A_{k}} {X_{i} = k})}{P (X_{i_{3}^{'}} = ℓ | \cap_{k = 0, \dots, m} \cap_{i \in A_{k}} {X_{i} = k})} = \frac{D^{*} (\dots, A_{ℓ} \cup {i_{2}^{'}}, \dots; A_{0})}{D^{*} (\dots, A_{ℓ} \cup {i_{3}^{'}}, \dots; A_{0})} = \\ \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{2}^{'}}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}) + \sum_{j = 0}^{n} g_{H, p, c} (\dots, A_{ℓ} \cup {i_{2}^{'}}, \dots,; A_{0}; i_{j})}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{3}^{'}}) D_{(- ℓ)}^{*} (\dots, A_{ℓ - 1}, A_{ℓ + 1}, \dots; A_{0}) + \sum_{j = 0}^{n} g_{H, p, c} (\dots, A_{ℓ} \cup {i_{3}^{'}}, \dots; A_{0}; i_{j})} . \end{matrix}

For fixed

j, C

such that

max C < i_{j}

,

\begin{matrix} \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{2}^{'}, i_{j}} \cup C)}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{3}^{'}, i_{j}} \cup C)} \leq \frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{2}^{'}})}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{3}^{'}})}, \end{matrix}

and,

\frac{L_{ℓ}^{*} (A_{ℓ} \cup {i_{2}^{'}, i_{j}} \cup C)}{L_{ℓ}^{*} (A_{ℓ} \cup {i_{3}^{'}, i_{j}} \cup C)}

is non-increasing as j increases. Therefore, the result follows by i of Lemma 2. □

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

References

Hurst, H. Long-term storage capacity of reservoirs. Civ. Eng. Trans. 1951, 116, 770–808. [Google Scholar] [CrossRef]
Hurst, H. Methods of using long-term storage in reservoirs. Proc. Inst. Civ. Eng. 1956, 5, 519–543. [Google Scholar] [CrossRef]
Benson, D.A.; Meerschaert, M.M.; Baeumer, B.; Scheffler, H.-P. Aquifer operator-scaling and the effect on solute mixing and dispersion. Water Resour. Res. 2006, 42, W01415. [Google Scholar] [CrossRef] [Green Version]
Delgado, R. A reflected fBm limit for fluid models with ON/OFF sources under heavy traffic. Stoch. Processes Their Appl. 2007, 117, 188–201. [Google Scholar] [CrossRef] [Green Version]
Majewski, K. Fractional Brownian heavy traffic approximations of multiclass feedforward queueing networks. Queueing Syst. 2005, 50, 199–230. [Google Scholar] [CrossRef]
Samorodnitsky, G. Stochastic Processes and Long Range Dependence; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Daley, J.; Vesilo, R. Long range dependence of point processes, with queueing examples. Stoch. Processes Their Appl. 1997, 70, 265–282. [Google Scholar] [CrossRef] [Green Version]
Daley, J.; Vesilo, R. Long range dependence of inputs and outputs of classical queues. Fields Inst. Commun. 2000, 28, 179–186. [Google Scholar]
Mandelbrot, B.; Van Ness, J. Fractional Brownian motions, fractional noises and applications. SIAM Rev. 1968, 10, 422–437. [Google Scholar] [CrossRef]
Hosking, J.R.M. Fractional differencing. Biometrika 1981, 68, 165–176. [Google Scholar] [CrossRef]
Hosking, J.R.M. Modeling persistence in hydrological time series using fractional differencing. Water Resour. Res. 1984, 20, 1898–1908. [Google Scholar] [CrossRef]
Carpio, K.J.E. Long-Range Dependence of Markov Chains. Ph.D. Thesis, The Australian National University, Canberra, Australia, 2006. [Google Scholar]
Dean, C.B.; Lundy, E.R. Overdispersion. Wiley StatsRef: Statistics Reference Online. 2014. Available online: https://onlinelibrary.wiley.com/doi/10.1002/9781118445112.stat06788.pub2 (accessed on 9 October 2022).
Poortema, K. On modelling overdispersion of counts. Stat. Neerl. 1999, 53, 5–20. [Google Scholar] [CrossRef]
Afroz, F. Estimating Overdispersion in Sparse Multinomial Data. Ph.D. Thesis, The University of Otago, Dunedin, New Zealand, 2018. [Google Scholar]
Afroz, F.; Shabuz, Z.R. Comparison Between Two Multinomial Overdispersion Models Through Simulation. Dhaka Univ. J. Sci. 2020, 68, 45–48. [Google Scholar] [CrossRef]
Landsman, V.; Landsman, D.; Bang, H. Overdispersion models for correlated multinomial data: Applications to blinding assessment. Stat. Med. 2019, 38, 4963–4976. [Google Scholar] [CrossRef] [PubMed]
Mosimann, J.E. On the Compound Multinomial Distribution, the Multivariate β- Distribution, and Correlations Among Proportions. Biometrika 1962, 49, 65–82. [Google Scholar]
Lee, J. Generalized Bernoulli process and fractional binomial distribution. Depend. Model. 2021, 9, 1–12. [Google Scholar] [CrossRef]
Feller, W. An Introduction to Probability Theory and Its Applications, 3rd ed.; John Wiley: New York, NY, USA, 1968; Volume 1. [Google Scholar]
Lee, J. Generalized Bernoulli process and fractional binomial distribution II. arXiv 2022, arXiv:2209.01516. [Google Scholar]
Carpio, K.J.E.; Daley, D.J. Long-Range Dependence of Markov Chains in Discrete Time on Countable State Space. J. Appl. Probab. 2007, 44, 1047–1055. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lee, J. A Finite-State Stationary Process with Long-Range Dependence and Fractional Multinomial Distribution. Fractal Fract. 2022, 6, 596. https://doi.org/10.3390/fractalfract6100596

AMA Style

Lee J. A Finite-State Stationary Process with Long-Range Dependence and Fractional Multinomial Distribution. Fractal and Fractional. 2022; 6(10):596. https://doi.org/10.3390/fractalfract6100596

Chicago/Turabian Style

Lee, Jeonghwa. 2022. "A Finite-State Stationary Process with Long-Range Dependence and Fractional Multinomial Distribution" Fractal and Fractional 6, no. 10: 596. https://doi.org/10.3390/fractalfract6100596

Article Menu

A Finite-State Stationary Process with Long-Range Dependence and Fractional Multinomial Distribution

Abstract

1. Introduction

2. Finite-State Stationary Process with Long-Range Dependence

3. Tail Behavior of Inter-Arrival Time and Other Properties

4. Fractional Multinomial Distribution

5. Conclusions

6. Proofs

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI