Article

Building Multivariate Time-Varying Smooth Transition Correlation GARCH Models, with an Application to the Four Largest Australian Banks

by Anthony D. Hall 1, Annastiina Silvennoinen 1 and Timo Teräsvirta 2,3,*

1 National Centre for Econometric Research (NCER), Queensland University of Technology, Brisbane, QLD 4000, Australia
2 Aarhus BSS, Aarhus University, DK-8210 Aarhus V, Denmark
3 Center for Applied Statistics and Economics (C.A.S.E.), Humboldt-Universität zu Berlin, DE-10178 Berlin, Germany
* Author to whom correspondence should be addressed.
Econometrics 2023, 11(1), 5; https://doi.org/10.3390/econometrics11010005
Submission received: 15 November 2022 / Revised: 26 January 2023 / Accepted: 27 January 2023 / Published: 6 February 2023

Abstract: This paper proposes a methodology for building Multivariate Time-Varying STCC–GARCH models. The novel contributions in this area are the specification tests related to the correlation component, the extension of the general model to allow for additional correlation regimes, and a detailed exposition of the systematic, improved modelling cycle required for such nonlinear models. An accompanying R package implements the steps of the modelling cycle. Simulations demonstrate the robustness of the recommended model building approach. The modelling cycle is illustrated using daily return series for Australia's four largest banks.

1. Introduction

Recently, Silvennoinen and Teräsvirta (2021) introduced a new multivariate GARCH model called the Multivariate Time-Varying Smooth Transition GARCH model (MTV model). This is a model that explicitly accounts for nonstationarities that are common in daily return series. The authors considered maximum likelihood (ML) estimation of the parameters of the model and, under suitable conditions, proved the consistency and asymptotic normality of the resulting ML estimators.
Before actually estimating an MTV model, however, the model builder has to make a number of data-driven decisions needed for specifying the parametric structure of the model. Further, the estimated structure has to be evaluated by statistical tests to reveal its potential weaknesses. Silvennoinen and Teräsvirta (2021) did not, however, discuss any model building issues, leaving them for further research. The present work is intended to fill this void.
As with many other multivariate GARCH models, the MTV model is based on the decomposition of the conditional covariance matrix due to Bollerslev (1990), in which the conditional covariance matrix is decomposed into conditional variances and a conditional correlation matrix. In the MTV model, however, it is assumed that the conditional variances can be nonstationary, while a nested special case, (weak) stationarity, is a testable hypothesis.
Likewise, the correlations in this model are time-varying such that its time-varying correlation matrix nests a constant correlation matrix. Due to the parametric structure of this nonlinear correlation matrix, the constancy of correlations has to be tested (and rejected) before fitting a model with time-varying correlations.
Since, as will be explained later, these testing situations, both in the variances and in the correlation matrix, are nonstandard, specification is an important part of building MTV models. Furthermore, the estimated MTV model has to be evaluated before it is used, from which it follows that techniques for this part of the model building process have to be examined as well.
In order to illustrate the MTV model building, we consider the Australian banking sector. This is an oligopoly dominated by four banks, commonly called the ‘Big Four’. In early 2020, they represented approximately 19% of the market value of the ASX200 share index and held about 80% of the home loan market in Australia; see Figure 1. Consequently, the banking sector is a major component for many Australian superannuation and other investment funds. As to the Big Four daily returns, their volatility cannot automatically be assumed to be stationary.
Furthermore, the correlations, even when time-varying, cannot a priori be assumed to fluctuate around a constant level, which is one of the assumptions in many popular multivariate GARCH models. Applying the flexible MTV model to these return series is, therefore, an interesting exercise. An in-depth analysis of the Australian banking sector is beyond the scope of this paper; however, modelling the daily returns of the Big Four over a period of almost 30 years serves as a useful example of how our MTV model building techniques work and are applied in practice.
The modelling process is data driven, requires user input, and consists of several steps. For this reason, we developed an R package, called mtvgarch, to help users build MTV models; the version of R used is 4.1.0. The package includes, among other things, the estimation routines as well as the necessary specification and evaluation tests. The code is maintained in a private GitHub repository and can be obtained upon request.
The plan of the paper is as follows. The MTV model is introduced in Section 2, followed by details of the stages and procedures related to the model building in Section 3. Model specification is considered in Section 4, estimation in Section 5 and evaluation in Section 6. Section 7 is devoted to the illustration of the complete modelling cycle on the Big Four volatilities and correlations. Our conclusions can be found in Section 8. There are also appendices containing material, such as relevant test statistics, simulation results, details of the estimation algorithm, and estimated equations.

2. The MTV Model

The MTV model used in this paper belongs to the family of multivariate GARCH models introduced by Bollerslev (1990). In the original model, the conditional correlations were constant; hence the name Constant Conditional Correlation (CCC-GARCH) model. This assumption, which made the resulting model rather parsimonious, was later found to be too restrictive in applications, and time-varying correlations were introduced simultaneously by Engle (2002) (dynamic conditional correlations, DCC) and Tse and Tsui (2002) (varying correlations, VC). In these models, the conditional variance components are typically assumed to be stationary, and the correlations are assumed, at least implicitly, to fluctuate around a constant level.
In order to consider the MTV model as defined in Silvennoinen and Teräsvirta (2021), we introduce some notation. The observable stochastic $N \times 1$ vector $\varepsilon_t$ is decomposed in a customary fashion as
$$\varepsilon_t = H_t^{1/2} z_t = S_t D_t P_t^{1/2} \zeta_t, \tag{1}$$
where $H_t = S_t D_t P_t D_t S_t$ is an $N \times N$ covariance matrix, and $\zeta_t \sim \mathrm{iid}(\mathbf{0}, I_N)$. We also define $z_t = P_t^{1/2} \zeta_t$, a vector of independent random variables with $\mathsf{E} z_t = \mathbf{0}$ and a positive definite, deterministically varying covariance matrix $\mathrm{cov}(z_t) = P_t$. The structure of $P_t$ will be defined later. The deterministic matrix $S_t = \mathrm{diag}(g_{1t}^{1/2}, \ldots, g_{Nt}^{1/2})$ has positive diagonal elements for all $t$, and $D_t = \mathrm{diag}(h_{1t}^{1/2}, \ldots, h_{Nt}^{1/2})$ contains the conditional standard deviations of the elements of $S_t^{-1}\varepsilon_t = (\varepsilon_{1t}/g_{1t}^{1/2}, \ldots, \varepsilon_{Nt}/g_{Nt}^{1/2})'$. As in Silvennoinen and Teräsvirta (2021) and earlier univariate papers, beginning with Amado and Teräsvirta (2008), and in the multivariate time-varying GARCH article by Amado and Teräsvirta (2014), the diagonal elements of $S_t^2$ are defined as follows:
$$g_{it} = g_i(t/T) = \delta_{i0} + \sum_{j=1}^{r_i} \delta_{ij} G_{ij}(t/T, \gamma_{ij}, \mathbf{c}_{ij}), \quad i = 1, \ldots, N, \tag{2}$$
where $\delta_{i0} > 0$ is a known constant, $\delta_{ij} \neq 0$, $j = 1, \ldots, r_i$, and the (generalised) logistic function
$$G_{ij}(t/T, \gamma_{ij}, \mathbf{c}_{ij}) = \Bigl(1 + \exp\Bigl\{-\gamma_{ij} \prod_{k=1}^{K_{ij}} (t/T - c_{ijk})\Bigr\}\Bigr)^{-1}, \tag{3}$$
where $\gamma_{ij} > 0$ and $\mathbf{c}_{ij} = (c_{ij1}, \ldots, c_{ijK_{ij}})'$ such that $c_{ij1} \leq \cdots \leq c_{ijK_{ij}}$. The conditions $\gamma_{ij} > 0$, $c_{ij1} \leq \cdots \leq c_{ijK_{ij}}$, and $\delta_{ij} \neq 0$, $j = 1, \ldots, r_i$, are identification restrictions. Assuming $\delta_{i0}$ in (2) is known is another one. Furthermore, to prevent exchangeability of the components in (2), restrictions are needed on $\mathbf{c}_{ij}$. As an example, if $K_{ij} = 1$ for $j = 1, \ldots, r_i$, one can assume (for instance) that $c_{i11} < \cdots < c_{i r_i 1}$.
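To fix ideas, the transition function (3) is straightforward to compute. The following Python sketch (illustrative only; the paper's own code is the R package mtvgarch, and the function name here is ours) evaluates the generalised logistic for an arbitrary location vector c:

```python
import numpy as np

def transition(t_over_T, gamma, c):
    """Generalised logistic transition G(t/T; gamma, c) of Equation (3):
    slope gamma > 0 and location vector c of length K. Illustrative sketch."""
    t_over_T = np.asarray(t_over_T, dtype=float)
    prod = np.ones_like(t_over_T)
    for ck in np.atleast_1d(c):
        # product over the K location parameters
        prod = prod * (t_over_T - ck)
    return 1.0 / (1.0 + np.exp(-gamma * prod))
```

With K = 1 the function increases monotonically from 0 to 1 and equals 1/2 at t/T = c; with K = 2 it is non-monotonic, returning towards its starting level after the second location.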
As discussed in earlier papers, the idea of $g_{it}$ is to normalise or rescale the observations. Left-multiplying (1) by $S_t^{-1}$ yields
$$\phi_t = S_t^{-1}\varepsilon_t = D_t z_t,$$
where each element of $\phi_t$ is assumed to have a standard weakly stationary GARCH representation. In our work, the conditional variances have a GARCH or GJR-GARCH(1,1) structure; see Glosten et al. (1993) for the latter:
$$h_{it} = \alpha_{i0} + \alpha_{i1}\phi_{i,t-1}^2 + \kappa_{i1} I(\phi_{i,t-1} < 0)\phi_{i,t-1}^2 + \beta_{i1} h_{i,t-1}, \tag{4}$$
where $I(A)$ is an indicator function: $I(A) = 1$ when $A$ occurs, zero otherwise. A higher-order structure is possible, although there do not seem to be applications of the GJR-GARCH model of order greater than one.
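The recursion (4) can be sketched as a variance filter. The following Python function is an illustrative implementation, not the mtvgarch routine; initialising $h_0$ at the unconditional variance is our own convention:

```python
import numpy as np

def gjr_garch_filter(phi, alpha0, alpha1, kappa1, beta1):
    """Filter the conditional variance h_t of Equation (4) through a
    GJR-GARCH(1,1) recursion for the rescaled series phi. Sketch only."""
    T = len(phi)
    h = np.empty(T)
    # initialise at the unconditional variance implied by the persistence
    h[0] = alpha0 / (1.0 - alpha1 - kappa1 / 2.0 - beta1)
    for t in range(1, T):
        neg = 1.0 if phi[t - 1] < 0 else 0.0  # indicator I(phi_{t-1} < 0)
        h[t] = (alpha0 + (alpha1 + kappa1 * neg) * phi[t - 1] ** 2
                + beta1 * h[t - 1])
    return h
```

The asymmetry term gives a negative shock of a given size a larger impact on next-period variance than a positive shock of the same size, consistent with the leverage effect the GJR specification is designed to capture.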
The conditional covariance matrix $\mathsf{E}\{\phi_t \phi_t' \mid \mathcal{F}_{t-1}\} = D_t P_t D_t$. In order to describe the correlation structure, we employ the Double Smooth Transition Conditional Correlation (DSTCC) model by Silvennoinen and Teräsvirta (2009). In that model, assuming that the transition variable is $t/T$ throughout, the time-varying correlation matrix $P_t$ is defined as
$$\begin{aligned} P_t = {} & (1 - G_2(t/T, \gamma_2, \mathbf{c}_2))\{(1 - G_1(t/T, \gamma_1, \mathbf{c}_1)) P_{(11)} + G_1(t/T, \gamma_1, \mathbf{c}_1) P_{(21)}\} \\ & + G_2(t/T, \gamma_2, \mathbf{c}_2)\{(1 - G_1(t/T, \gamma_1, \mathbf{c}_1)) P_{(12)} + G_1(t/T, \gamma_1, \mathbf{c}_1) P_{(22)}\}, \end{aligned} \tag{5}$$
where $P_{(ij)}$, $i, j = 1, 2$, are four positive definite correlation matrices, not equal to each other, and
$$G_i(t/T, \gamma_i, \mathbf{c}_i) = \Bigl(1 + \exp\Bigl\{-\gamma_i \prod_{k=1}^{K_i} (t/T - c_{ik})\Bigr\}\Bigr)^{-1}, \quad \gamma_i > 0, \tag{6}$$
where $\mathbf{c}_i = (c_{i1}, \ldots, c_{iK_i})'$, $c_{i1} < \cdots < c_{iK_i}$, $i = 1, 2$. This variant of the DSTCC model is called the Time-Varying Correlation (TVC) model to emphasise its deterministic rather than stochastic nature, hence the removal of the term 'Conditional' from its name. For the Big Four application, we simplify the definition (5) slightly by assuming $P_{(12)} = P_{(22)}$, and therefore (5) becomes
$$P_t = (1 - G_2(t/T, \gamma_2, \mathbf{c}_2))\{(1 - G_1(t/T, \gamma_1, \mathbf{c}_1)) P_{(1)} + G_1(t/T, \gamma_1, \mathbf{c}_1) P_{(2)}\} + G_2(t/T, \gamma_2, \mathbf{c}_2) P_{(3)}, \tag{7}$$
where re-indexing the matrices highlights the interpretation that there are two transitions over time: one from $P_{(1)}$ to $P_{(2)}$, and the other from a convex combination of these two to $P_{(3)}$. Since $P_{(1)}$, $P_{(2)}$, and $P_{(3)}$ are positive definite, $P_t$ is positive definite as a convex combination of the three matrices. This simplified version of the TVC model is especially useful when modelling correlations that shift from one state to the next as a function of time. To that end, the obvious extension to $n$ such transitions is best expressed as a recursion:
$$\begin{aligned} P_t^{(0)} &= P_{(1)}, \\ P_t^{(n)} &= (1 - G_n(t/T, \gamma_n, \mathbf{c}_n)) P_t^{(n-1)} + G_n(t/T, \gamma_n, \mathbf{c}_n) P_{(n+1)}. \end{aligned} \tag{8}$$
When $G_2(t/T, \gamma_2, \mathbf{c}_2) \equiv 1$ and $N = 2$, (5) and (7) collapse into the smooth transition correlation GARCH model by Berben and Jansen (2005) or, if the transition variable in $G_1$ is stochastic and $N \geq 2$, into the smooth transition conditional correlation GARCH model of Silvennoinen and Teräsvirta (2005, 2015). An MTV-Conditional Correlation GARCH model with GARCH equations similar to the ones here but a differently defined stochastic $P_t$ was discussed in Amado and Teräsvirta (2014). It may be noted that Feng (2006) introduced another multivariate Conditional Correlation type GARCH model with deterministically varying correlations. In that model, the variation is described nonparametrically, and the model can be viewed as a generalisation of the univariate model in Feng (2004).
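The recursion (8) translates directly into code. The sketch below (Python, illustrative; the function and argument names are ours, not mtvgarch's) builds $P_t$ at a given $t/T$ from a list of n + 1 correlation matrices and n transitions:

```python
import numpy as np

def correlation_path(t_over_T, P_list, gammas, c_list):
    """Recursion (8): start from P_(1) and apply n transitions, each a
    convex combination driven by a logistic G_n in rescaled time.
    P_list holds the n+1 correlation matrices; gammas and c_list the
    transition parameters. Illustrative sketch only."""
    def G(x, gamma, c):
        prod = np.prod([x - ck for ck in np.atleast_1d(c)])
        return 1.0 / (1.0 + np.exp(-gamma * prod))

    P = P_list[0]
    for n in range(len(gammas)):
        g = G(t_over_T, gammas[n], c_list[n])
        # convex combination keeps P_t positive definite
        P = (1.0 - g) * P + g * P_list[n + 1]
    return P
```

Because each step is a convex combination of positive definite matrices, the output remains positive definite at every t, as noted in the text.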

3. The Three Stages of Model Building

The MTV model is rather general and nests many models. To take one example, fitting an MTV model when a nested CCC-GARCH model generates the data leads to inconsistent parameter estimates. For this reason, building adequate MTV models requires care, and a systematic approach is necessary. Selecting a candidate from this family of models is a data-driven process, and statistical inference has to be used to obtain a model that passes the available misspecification tests.
In this work, we follow the classical approach to model building advocated by Box and Jenkins (1970) and later applied to nonlinear models of the conditional mean; see, for example, Teräsvirta et al. (2010, Ch. 16). It has also been applied to building single-equation MTV-GARCH models; see Amado and Teräsvirta (2017) and Amado et al. (2017). The idea is to first specify the model (select a member from the family of MTV models) and, once this has been done, to estimate its parameters. At the evaluation stage, the estimated model is subjected to a battery of misspecification tests. These three stages, specification, estimation, and evaluation, will be considered in the next three sections. The emphasis will be on specification and evaluation as maximum likelihood estimation of the parameters of the MTV model was already considered in Silvennoinen and Teräsvirta (2021).

4. Specification of the MTV Model

4.1. Specification of the Univariate Variance Equations

Specification of the MTV model begins with the univariate volatility equations. This was first discussed in Amado and Teräsvirta (2017). The idea is to begin with a GARCH(1,1) model by Bollerslev (1986) or the GJR-GARCH model by Glosten et al. (1993) and to test the hypothesis that the multiplicative deterministic component is constant. The single-equation MTV-GARCH model has the following form:
$$\varepsilon_{it} = z_{it} h_{it}^{1/2} g_{it}^{1/2}, \tag{9}$$
where $z_{it} \sim \mathrm{iid}(0, 1)$, the conditional variance $h_{it}$ is defined as in (4) with $\phi_{it} = \varepsilon_{it}/g_{it}^{1/2}$, and the deterministic positive-valued function $g_{it} = g_i(t/T)$ is defined as in (2) and (3).
Positivity of (2) imposes the following restrictions on $\delta_{ij}$, $j = 1, \ldots, r_i$:
$$\delta_{i0} + \sum_{j=1}^{r_i} \delta_{ij} G_{ij}(r, \gamma_{ij}, \mathbf{c}_{ij}) > 0$$
for all $r \in [0, 1]$.
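In practice, this positivity restriction can be screened numerically by evaluating $g_i$ on a fine grid over [0, 1]. The following sketch (Python, illustrative; a grid check of our own devising, not a formal verification) does exactly that:

```python
import numpy as np

def g_is_positive(delta0, deltas, gammas, c_list, grid=2001):
    """Screen the positivity restriction on g_i by evaluating
    delta_i0 + sum_j delta_ij * G_ij(r) over a fine grid of r in [0, 1].
    A numerical check, not a proof of positivity."""
    r = np.linspace(0.0, 1.0, grid)
    g = np.full_like(r, float(delta0))
    for delta, gamma, c in zip(deltas, gammas, c_list):
        prod = np.ones_like(r)
        for ck in np.atleast_1d(c):
            prod *= (r - ck)
        g += delta / (1.0 + np.exp(-gamma * prod))  # logistic G_ij of (3)
    return bool(np.all(g > 0.0))
```

Such a check is useful during estimation because the $\delta_{ij}$ may be negative individually as long as the sum stays positive everywhere on [0, 1].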
Typically in applications, $K_{ij} = 1$ or $2$ in (3). There are two specification issues: determining $r_i$ and choosing $K_{ij}$, $j = 1, \ldots, r_i$. It is possible that $g_i(t/T) = \delta_{i0} > 0$, that is, that $g_i(t/T)$ is a positive constant. In this case, the MTV-GARCH model collapses into a standard GARCH or GJR-GARCH equation.
Amado and Teräsvirta (2017) solved the problem of choosing $r_i$ by first estimating the GARCH model and then testing the hypothesis of a constant $g_i(t/T)$ against the alternative $r_i = 1$ in (2) using a Lagrange multiplier type test. The test can be viewed as a misspecification test of the estimated GARCH model. If the null hypothesis is rejected, an MTV-GARCH model with a single transition is estimated, and the hypothesis $r_i = 1$ is tested against $r_i = 2$. Sequential testing continues until the first non-rejection of the null hypothesis.
The number of transitions is determined in this order because of an identification problem: the model with $r_i + 1$ transitions is not identified if the true number of transitions is $r_i$. The shape of the logistic function, controlled by the parameter $K_{ij}$, can be determined using the sequence of tests familiar from the specification of smooth transition autoregressive (STAR) models; see Teräsvirta (1994) or Teräsvirta et al. (2010, Ch. 16). Details can be found in Amado and Teräsvirta (2017).
More recently, Silvennoinen and Teräsvirta (2016) considered testing the constancy of $g_i(t/T)$ before estimating the GARCH model, that is, assuming $h_{it} = 1$ in (9); the details are laid out in Appendix A.1. Ignoring the conditional heteroskedasticity distorts the size of the test, so the size has to be adjusted by simulation. It turns out that the power of the size-adjusted test is considerably better than when the test is used as a standard misspecification test. Reasons for this improvement are discussed in Silvennoinen and Teräsvirta (2016).
A major difficulty with this approach is that, while the parameters of the conditional variance component $h_{it}$ under the null hypothesis are known in simulations, in practice this is not the case. The underlying 'null' GARCH process has to be generated artificially. In doing so, special attention must be paid to the persistence of the (GJR-)GARCH process, measured by $\alpha_{i1} + \kappa_{i1}/2 + \beta_{i1}$ in (4) when $g_{it} \equiv 1$. In fact, the asymmetry parameter has no practical importance for the purpose of calibrating the test statistic distribution, so it is sufficient to restrict attention to the standard GARCH process. Other features, such as the implied kurtosis or the relative sizes of $\alpha$ and $\beta$ corresponding to a particular level of persistence, have only a negligible effect on the performance of the test.
A practical problem is that it is not possible to estimate this measure of persistence when the null hypothesis does not hold, that is, when $g_{it}$ is not constant over time. How this difficulty is handled has an effect on the power of the test. We study two approaches that are discussed in more detail in Appendix B.1. The first consists of visually identifying a period of time over which there appears to be no change in the overall level of baseline volatility; a standard GARCH(1,1) model is then estimated over this subperiod. The second approach is to use rolling window variance targeting. This means that the intercept in the GARCH equation is time-varying, and its value at each point in time is set so that it matches the unconditional variance obtained from a window around that point in time.
Simulations discussed in Appendix B.1 experiment with the choice of window size. Both of these methods provide GARCH parameter and persistence estimates that are used for calibrating the null distribution of the test statistic and calculating p-values.
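As an illustration of the second approach, a rolling-window intercept can be computed as follows (Python sketch; the centred window and the treatment of the sample edges are our own assumptions, not necessarily those of Appendix B.1):

```python
import numpy as np

def rolling_intercept(phi, alpha1, beta1, window=400):
    """Rolling-window variance targeting: at each t, set the GARCH
    intercept so that the implied unconditional variance matches the
    sample variance of a window centred on t,
    alpha_0(t) = (1 - alpha_1 - beta_1) * local_variance(t).
    Sketch under the symmetric GARCH(1,1) used for calibration."""
    T = len(phi)
    half = window // 2
    alpha0 = np.empty(T)
    for t in range(T):
        lo, hi = max(0, t - half), min(T, t + half)  # truncate at the edges
        alpha0[t] = (1.0 - alpha1 - beta1) * np.var(phi[lo:hi])
    return alpha0
```

When the baseline volatility is constant, the implied intercept is (approximately) constant as well, which is what makes the device useful for generating the artificial 'null' GARCH process.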

4.2. Specification of Time-Varying Correlations

After the MTV-GARCH equations have been specified and estimated assuming the errors are uncorrelated, the next step is to specify the time-varying correlation structure. This is done by sequential testing. First, the constancy of the correlations is tested against the model with a single transition, i.e., $G_2(t/T, \gamma_2, \mathbf{c}_2) \equiv 1$ in (5). The null model is an MTV-Constant Correlation GARCH model as in Bollerslev (1990), except that the GARCH equations are MTV-GARCH equations. If this model is rejected, the one-transition model is estimated and tested against (5) or (7). If this specification is also rejected, the alternative with two transitions is estimated. This is repeated until no further evidence of time-variation in the correlations is detected.
As discussed in Silvennoinen and Teräsvirta (2005, 2015), the MTV model with one transition is only identified under the alternative, which invalidates standard asymptotic inference. The identification problem can be circumvented by approximating the transition function (6) by its Taylor expansion around the null hypothesis, H0: $\gamma_1 = 0$. The form of the expansion depends on the order of the exponent in (6).
The test can be constructed along the lines presented in the appendix of Silvennoinen and Teräsvirta (2005).1 See also Silvennoinen and Teräsvirta (2021). To derive the test statistic, consider the first-order Taylor expansion of (6) around $\gamma_1 = 0$, assuming $K_1 = 2$. It has the following form:
$$G_1(t/T, \gamma_1, \mathbf{c}_1) = \Bigl(1 + \exp\Bigl\{-\gamma_1 \prod_{k=1}^{K_1} (t/T - c_{1k})\Bigr\}\Bigr)^{-1} = \frac{1}{2} + \frac{1}{4}(t/T - c_{11})(t/T - c_{12})\gamma_1 + R_2(t/T; \gamma_1), \tag{10}$$
where $R_2(t/T; \gamma_1)$ is the remainder. Using (10), the one-transition version of (5) becomes
$$\begin{aligned} P_t &= (P_{(1)} - P_{(2)})\Bigl(\frac{1}{2} - \frac{\gamma_1 c_{11} c_{12}}{4}\Bigr) + P_{(2)} + (t/T)(P_{(1)} - P_{(2)})\frac{\gamma_1 (c_{11} + c_{12})}{4} - (t/T)^2 (P_{(1)} - P_{(2)})\frac{\gamma_1}{4} - (P_{(1)} - P_{(2)}) R_2(t/T; \gamma_1) \\ &= P_{(A0)} + (t/T) P_{(A1)} + (t/T)^2 P_{(A2)} - (P_{(1)} - P_{(2)}) R_2(t/T; \gamma_1), \end{aligned} \tag{11}$$
where $P_{(A0)} = (P_{(1)} - P_{(2)})(1/2 - \gamma_1 c_{11} c_{12}/4) + P_{(2)}$, $P_{(A1)} = (P_{(1)} - P_{(2)})\gamma_1 (c_{11} + c_{12})/4$, $P_{(A2)} = -(P_{(1)} - P_{(2)})\gamma_1/4$, and $P_{(1)} \neq P_{(2)}$. The main diagonals of $P_{(A1)}$ and $P_{(A2)}$ consist of zeroes. Setting $\rho_A = (\rho_{A0}', \rho_{A1}', \rho_{A2}')'$, where $\rho_{Ai} = \mathrm{vecl}(P_{(Ai)})$, $i = 0, 1, 2$, the new null hypothesis is H0: $\rho_{A1} = \rho_{A2} = \mathbf{0}_{N(N-1)/2}$.2
Note that a simpler version of the test assumes $K_1 = 1$ and yields a similar approximation, although without the term $(t/T)^2 P_{(A2)}$; the new null in this case is H0: $\rho_{A1} = \mathbf{0}_{N(N-1)/2}$. This version of the test is more powerful than the former when the time-variation in the correlations is monotonic. However, especially over longer time horizons, this may not always be the case, and the squared term of the expansion is able to capture at least some nonmonotonic changes.
The details of the ensuing LM-type test statistic for the test of constant correlations are presented in Appendix A.3, and the test for an additional transition in the correlations is laid out in Appendix A.4.
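The spirit of the expansion-based test can be conveyed by a schematic auxiliary regression: regress each cross-product of standardised residuals on powers of t/T and measure the explanatory power of the trend terms. The Python sketch below only mimics this idea; it is not the LM statistic of Appendix A.3, and the chi-squared comparison in the comment is heuristic:

```python
import numpy as np

def trend_in_correlation_stat(z, order=2):
    """Schematic check for time-variation in correlations: regress each
    cross-product z_i * z_j of standardised residuals on (t/T)^k,
    k = 0..order, and sum T*R^2 over pairs. Mimics the spirit of the
    Taylor-expansion based LM test, but is not the exact statistic."""
    T, N = z.shape
    tau = np.arange(1, T + 1) / T
    X = np.column_stack([tau ** k for k in range(order + 1)])
    stat, df = 0.0, 0
    for i in range(N):
        for j in range(i + 1, N):
            y = z[:, i] * z[:, j]
            beta, *_ = np.linalg.lstsq(X, y, rcond=None)
            resid = y - X @ beta
            r2 = 1.0 - resid.var() / y.var()
            stat += T * r2
            df += order
    return stat, df  # heuristically compared with a chi-squared(df) value
```

Under constant correlations the trend terms should explain essentially nothing; a correlation shift midway through the sample inflates the statistic sharply.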

5. Estimation of the MTV Model

After specifying the deterministic components of the model, both in the GARCH equations and in the correlations, one can estimate the complete model with conditional heteroskedasticity included. The log-likelihood of the MTV-STCC-GARCH model has the form
$$\ln f(\zeta_t \mid \theta) \propto -\frac{1}{2}\sum_{i=1}^N \ln g_{it}(\theta_{g_i}) - \frac{1}{2}\sum_{i=1}^N \ln h_{it}(\theta_{h_i}) - \frac{1}{2}\ln |P_t(\theta_P)| - \frac{1}{2}\varepsilon_t'\{S_t(\theta_g) D_t(\theta_g, \theta_h) P_t(\theta_P) D_t(\theta_g, \theta_h) S_t(\theta_g)\}^{-1}\varepsilon_t,$$
where the full parameter vector $\theta = (\theta_h', \theta_g', \theta_P')'$ is partitioned according to the relevant functions: the conditional variances in (4), $\theta_h = (\theta_{h_1}', \ldots, \theta_{h_N}')'$ with $\theta_{h_i} = (\alpha_{i0}, \alpha_{i1}, \kappa_{i1}, \beta_{i1})'$, $i = 1, \ldots, N$; the deterministic variance components in (2) and (3), $\theta_g = (\theta_{g_1}', \ldots, \theta_{g_N}')'$ with $\theta_{g_i} = (\delta_{i1}, \gamma_{i1}, \mathbf{c}_{i1}', \ldots, \delta_{ir_i}, \gamma_{ir_i}, \mathbf{c}_{ir_i}')'$, $i = 1, \ldots, N$; and the correlations, $\theta_P = (\mathrm{vecl}(P_{(1)})', \ldots, \gamma_1, \ldots, \mathbf{c}_1', \ldots)'$, where the number of correlation matrices as well as of transition functions (6) is determined by the choice of the model, (5), (7), or (8).
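A single time-t contribution to this log-likelihood can be evaluated directly from the decomposition. The following Python sketch (illustrative only; not the mtvgarch implementation) mirrors the four terms of the expression above:

```python
import numpy as np

def loglik_t(eps_t, g_t, h_t, P_t):
    """Time-t log-likelihood contribution (up to an additive constant):
    -0.5*sum(log g) - 0.5*sum(log h) - 0.5*log|P| - 0.5*eps'(SDPDS)^{-1}eps,
    where S = diag(sqrt(g)) and D = diag(sqrt(h)). Illustrative sketch."""
    S = np.diag(np.sqrt(g_t))
    D = np.diag(np.sqrt(h_t))
    H = S @ D @ P_t @ D @ S          # covariance decomposition H_t
    _, logdetP = np.linalg.slogdet(P_t)
    quad = eps_t @ np.linalg.solve(H, eps_t)
    return -0.5 * (np.sum(np.log(g_t)) + np.sum(np.log(h_t)) + logdetP + quad)
```

Summing these contributions over t gives the objective maximised in the estimation cycle described below.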
We make the following assumptions; see Silvennoinen and Teräsvirta (2021):
AN1. In (4), $\alpha_{i0} > 0$; either $\alpha_{i1} > 0$ and $\alpha_{i1} + \kappa_{i1} \geq 0$, or $\alpha_{i1} \geq 0$ and $\alpha_{i1} + \kappa_{i1} > 0$; $\beta_{i1} \geq 0$; and $\alpha_{i1} + \kappa_{i1}/2 + \beta_{i1} < 1$ for $i = 1, \ldots, N$.
AN2. The parameter subspaces $\{\alpha_{i0} \times \alpha_{i1} \times \kappa_{i1} \times \beta_{i1}\}$, $i = 1, \ldots, N$, are compact, the whole space $\Theta_h$ is compact, and the true parameter value $\theta_{h0}$ is an interior point of $\Theta_h$.
AN3. $\zeta_t \sim \mathrm{iid}\,\mathcal{N}(\mathbf{0}, I_N)$.
AN1 is the necessary and sufficient weak stationarity condition for the $i$th first-order GJR-GARCH equation. Assumption AN2 is a standard regularity condition required for proving the asymptotic normality of the maximum likelihood estimators of $\theta_{h_i}$, $i = 1, \ldots, N$. AN3 (normality) is a strong condition; however, it is needed here for the proofs to go through; see Silvennoinen and Teräsvirta (2021). These assumptions are sufficient for the maximum likelihood estimators of the GARCH parameters in single-equation GARCH models to be consistent and asymptotically normal.
The parameters are estimated in turn: first, the $\theta_{g_i}$ are estimated to obtain starting values for the joint estimation of $\theta_g$ and $\theta_P$. This is done assuming $h_{it}(\theta_{h_i}) \equiv 1$, $i = 1, \ldots, N$. Amado and Teräsvirta (2013) showed in the single-equation GJR-GARCH case that, under regularity conditions, the maximum likelihood estimator of $\theta_{g_i}$ is consistent and asymptotically normal. Silvennoinen and Teräsvirta (2021) generalised this result to MTV models. That means that joint estimation of $\theta_g$ and $\theta_P$ by maximum likelihood produces consistent estimates of these parameter vectors.
If $\hat{\theta}_g$ and $\hat{\theta}_P$ are consistent and Assumptions AN1, AN2, and AN3 hold, then, by Theorem 3.3 of Song et al. (2005), the maximum likelihood estimator of $\theta_h$ is consistent and asymptotically normal. After estimating $\theta_h$, the parameter vectors $\theta_g$ and $\theta_P$ are re-estimated. The iteration continues until convergence. Song et al. (2005) showed that the final maximum likelihood estimator of $\theta$ is consistent and asymptotically normal. A more detailed description of maximisation by parts applied to the present situation can be found in Appendix C; see also Silvennoinen and Teräsvirta (2021).
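The logic of maximisation by parts can be illustrated on a toy problem with closed-form block updates. The sketch below is purely schematic (the actual algorithm for the MTV model is given in Appendix C); it alternates between two parameter blocks, mirroring the alternation between $(\theta_g, \theta_P)$ and $\theta_h$:

```python
def maximise_by_parts(update_a, update_b, a0, b0, tol=1e-10, max_iter=1000):
    """Alternate between updating block a (b held fixed) and block b
    (a held fixed) until neither block moves. Schematic analogue of the
    estimation cycle described in the text."""
    a, b = a0, b0
    for _ in range(max_iter):
        a_new = update_a(b)
        b_new = update_b(a_new)
        if abs(a_new - a) < tol and abs(b_new - b) < tol:
            return a_new, b_new
        a, b = a_new, b_new
    return a, b

# Toy objective f(a, b) = -(a - 2b)^2 - (b - 1)^2 with closed-form block
# maximisers; the joint maximum is at a = 2, b = 1.
a_hat, b_hat = maximise_by_parts(lambda b: 2.0 * b,
                                 lambda a: (2.0 * a + 1.0) / 5.0,
                                 a0=0.0, b0=0.0)
```

Each block update weakly increases the toy objective, and the iteration converges geometrically to the joint maximiser, which is the behaviour the maximisation-by-parts argument of Song et al. (2005) formalises for likelihoods.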

6. Evaluation of the MTV Model

Once the model has been specified and estimated, it has to be evaluated in order to find potential misspecifications. The tests in Section 4.1 were used to guide the choice of the functional form of the deterministic component, and a rejection of the null was seen as evidence that the current model was still lacking in its specification. In that sense, the tests in Section 4.1 serve as both specification and evaluation tests. It is worth reiterating that these specification tests were constructed at the stage when the GARCH part was not yet specified, that is, with $h_{it} = 1$ in (4). However, when the deterministic part passes these tests and an MTV-GARCH equation is subsequently estimated, there is room for additional checks of model misspecification, beyond the presently final model specification.
The tests in Amado and Teräsvirta (2017) are available for this purpose. They fall into three categories. In the first one, the deterministic component is additively misspecified. In the context of the current MTV-GARCH model, the relevant case is a test for yet another transition in (2). The second test assesses the GARCH equation for additive misspecification. The concern here is the validity of the maximum lags p or q. The final test is the ‘test of no remaining ARCH’, which is based on the idea of a sufficiently well-specified model managing to clear any autocorrelation from the squared standardised residuals. The test that suits each of these situations (or its robustified version to avoid the assumption of normality) is conveniently performed following a set of steps outlined in Appendix A.2.
It is worth stating that the tests here are applied to $\hat{P}_t^{-1/2}\varepsilon_t$, one series at a time. Attempting to perform the tests in the complete $N$-variate system simultaneously would open up a vast number of permutations of the various misspecification options. To manage the task, the recommendation is to focus on the univariate specifications one at a time, even while acknowledging some potential for deviation from the asymptotically exact results. Simulations in Appendix B.2 indicate that applying the tests to the pre-filtered data has very little impact on the distributions of the test statistics. While the standard form of the misspecification tests suffers from minor oversizing, this is mostly corrected when using the robust version of the test.
The test for an additional transition in the correlations in Section 4.2 may also be used as an evaluation test. It is based on the completely specified univariate and correlation components, and therefore its role as a misspecification test of a complete model is justified. The number of degrees of freedom in this test quickly becomes large with increasing N. One way of restricting this growth would be to assume that, under the alternative, only the eigenvalues of the correlation matrix are changing over time. The alternative would be a correlation matrix only if all correlations were identical (see Engle and Kelly (2012)); however, an LM test can nevertheless be built on this assumption.
Write the correlation matrix as $P_t = Q_t \Lambda_t Q_t'$, where $P_t$ is defined as in (5), $\Lambda_t$ is the diagonal matrix of eigenvalues, and $Q_t$ contains the corresponding eigenvectors. Simplify this by assuming $Q_t = Q$ and approximate $\Lambda_t$ by $\Psi_t = \sum_{k=0}^K \Psi_k (t/T)^k$. Under the null hypothesis, $K = 0$. The resulting test statistic is derived, and its small-sample properties are studied, in Kang et al. (2022).
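The construction behind this alternative can be sketched as follows (Python, illustrative; the function and coefficient layout are ours, and the sketch ignores the unit-diagonal constraint that a genuine correlation matrix must satisfy):

```python
import numpy as np

def eigen_varying_P(tau, Q, Psi):
    """Build P_t = Q diag(lambda_t) Q' with fixed eigenvectors Q and
    eigenvalues approximated by the polynomial Psi_0 + Psi_1*tau + ...,
    where Psi is a (K+1) x N array of coefficients. Under the null K = 0
    the eigenvalues, and hence P_t, are constant. Illustrative sketch."""
    K = Psi.shape[0] - 1
    lam = sum(Psi[k] * tau ** k for k in range(K + 1))  # eigenvalues at tau
    return Q @ np.diag(lam) @ Q.T
```

Because only the N eigenvalue paths vary under the alternative, the number of parameters grows linearly in N rather than quadratically, which is the dimension-reduction motivation given in the text.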

7. Big Four Results

7.1. Main Features of the Australian Banking Sector 1990–2020

In order to provide some background for our empirical results, we shall now draw attention to a number of interesting features of the Australian banking sector between the years 1990 and 2020. In 1990, the Australian government adopted an intervention policy called ‘six pillars’. It covered the four biggest Australian banks commonly referred to as the ‘Big Four’, the Commonwealth Bank of Australia (CBA), Westpac Banking Corporation (WBC), National Australia Bank (NAB) and Australia and New Zealand Banking Group (ANZ), listed in descending order of market share, as well as two insurers (AMP Limited and National Mutual).
This policy stated that further mergers of these institutions would not be accepted. The basic idea was to ensure a competitive banking market. In 1997, the policy became 'four pillars' as the insurers were left outside the arrangement. Since its establishment, it has mostly enjoyed the support of the two main political parties, and the proponents of the policy have argued that it has contributed to the stability and strength of the Australian financial sector. The government also maintained sympathetic policy settings, which allowed the banks to recapitalise in the 1990s and 2000s.
During this period, there were financial losses at Westpac (a $1.6 billion loss in 1992, leaving it close to insolvency), at ANZ (poorly executed international expansions) and subsequently at NAB (the purchase of the US mortgage originator and servicer Homeside led to $2.2 billion in losses in 2002). Although the Big Four were not allowed to merge with each other, greater financial concentration due to mergers with other financial institutions was seen as acceptable: in 2008, Westpac and CBA acquired St. George and BankWest, then the fifth and sixth largest Australian banks, respectively. The impact can be seen in the right panel of Figure 1 as the Big Four's share of the ASX200 Financials Index drastically increased.
The recent history of the pillars policy coincides with a few major incidents and changes. These include not only the dot-com boom of the late 1990s and early 2000s and the global financial crisis (GFC) nearly ten years later but also events with more localised impacts, such as a number of regulatory changes (the Basel guidelines), the most recent mining boom that started around 2005 and was interrupted by the GFC, and technology-driven market disruptions (non-bank lenders and payment providers). Since the GFC, the banks have enjoyed substantial government support, including a deposit guarantee, and, as already noted, have come to dominate the home mortgage market with an 80% market share.
During the first decade of the millennium, a few of the above-mentioned events positioned the Big Four in an increasingly competitive environment. The stagnation of the housing market and the removal of barriers to changing mortgage providers may also have contributed to this trend; however, the most notable event was the announcement in 2003 that Basel II was to be implemented in Australia by the end of 2007. The idea behind the updated accord was to level inequalities amongst the internationally active banks and to set expectations regarding capital adequacy requirements. The Australian Prudential Regulation Authority, in charge of overseeing the uptake of the accord, worked extensively with numerous Authorised Deposit-Taking Institutions, the industry, and other relevant bodies during 2005 to 2007, aiming to ensure that the adoption of Basel II addressed all relevant aspects of the implementation process, its goals, and its impacts.
Fear of being put at a competitive disadvantage relative to their international counterparts, in both international and domestic operations, coupled with an opportunity for reduced regulatory capital, incentivised the banks to signal early their preference to conform to the accord.3 As a result, Australia was amongst the first nations to have fully implemented the framework, on 1 January 2008.

7.2. Modelling the Error Variances

The daily return series for the Big Four used in this paper extend from 2 January 1992 to 31 January 2020. As suggested in the Introduction, these series may not be adequately described by a weakly stationary GARCH or GJR-GARCH model. From the plots in Figure 2, it is seen that the amplitude of clusters varies for all four banks, in particular during and after the financial crisis beginning in 2008. The crisis was preceded by a rather tranquil period between 2003 and 2008. This variation also shows in the autocorrelation functions of the squared returns in Figure 3. In all four cases, the autocorrelations decay very slowly as a function of the lag length, which suggests nonstationarity.
For this reason, modelling the returns has to begin with a test of the stationarity hypothesis. As discussed in Section 4.1, the slow-moving ‘baseline’ volatility is specified first, followed by the inclusion of the GJR-GARCH component. The test statistic from Appendix A.1 is calibrated using two methods described in Appendix B.1. The first is based on choosing a period during which the amplitude of clusters seems constant; the selected period extends from November 2003 to October 2007. In the second, a rolling window is moved over the observation period, with a window size of 400 chosen by simulations, as outlined in Appendix B.1.
To enhance the performance of the test, the entire sample of over 7000 observations is broken into subsections. Once one transition is found and estimated, the test is applied both before and after this transition to determine whether there is another transition on either side. The process continues until the null of no transition is no longer rejected. After the deterministic component has been specified and its parameters tentatively estimated, the GJR-GARCH equations are estimated together with the time-varying component to form a complete TV-GJR-GARCH model. The estimated equations are then checked for signs of misspecification using the tests from Amado and Teräsvirta (2017).
Results for the estimated TV-GJR-GARCH equations appear in Table 1, and the deterministic components are in Appendix D. For comparison, the GJR-GARCH equations without the deterministic component are also reported in this table. In all four cases, the persistence decreases strongly after rescaling the returns with the TV component.
This is also indirectly obvious from the autocorrelations in Figure 4 that are considerably smaller and decay faster than the ones in Figure 3. The main cause for this decrease lies in the coefficient of the lagged conditional variance, whose estimate shrinks in the process. Estimates of the asymmetry parameter κ i 1 slightly increase, and thus asymmetry becomes more pronounced when nonstationarity is properly modelled. Table 1 also contains the kurtosis estimates for the two GJR-GARCH processes, obtained using definitions in He and Teräsvirta (1999). It is seen that, in all four cases, rescaling lowers the kurtosis to values close to three.
Figure 5 contains the estimated transitions. There are two conspicuous features in these graphs. One is the downward shift around 2004, which, for WBC, is a long and rather smooth decline. This coincides with the local events discussed in the previous Section. The other is that, for all four banks, the deterministic component remains higher after 2010 than it was before 2008.
For WBC, the deterministic component slowly but steadily declines after the crisis, whereas, for the three others, it remains constant. This can also be seen from the estimated equations behind the figures reported in Appendix D.
Effects of the deterministic component g i t on the GARCH equations also become obvious by comparing the conditional variances from the GJR-GARCH equations in Figure 6 with the TV-GJR-GARCH ones in Figure 7. Clearly, for all four banks, the nonstationarity around 2008–2010 in the former figure is no longer visible in the latter.

7.3. Modelling the Error Correlations

The stability of correlations over time, as discussed in Section 4.2, is tested using the test statistic (A5) in Appendix A.3. The p-value of the test is very close to zero, and thus the null hypothesis is rejected. A TVC model is then estimated with a single monotonic time transition. The adequacy of this specification is tested with the test for an additional transition.
The resulting p-value is 0.467; therefore, the single time transition is deemed sufficient. The estimation results for the TVC component of the model, presented in Table 2 (see also Figure 8), indicate that the correlations between the standardised residuals are stable, around 0.49–0.60 depending on the bank, from 1992 until mid-to-late 2006. At that point, the correlations begin a steady increase to the range 0.78–0.83, which they reach by early 2008. The final correlations are not only large but also remarkably similar.
In addition to considering the complete Big Four system, we repeated the analysis using bivariate models. The outcome was similar: a single transition was sufficient for all pairs. Furthermore, inspection of the pairwise correlation estimates in Table 3 reveals strong similarity between the four-variate and bivariate estimates. This is illustrated in Figure 9. It should be noted that the shift in the correlation of the ANZ–NAB pair is estimated as a step function.
This happens when the speed of transition increases without bound because the likelihood function effectively becomes ‘flat’ with respect to that particular parameter. The solution is to fix the speed at a large value and proceed with the estimation of the remaining parameters.
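The flatness of the likelihood in the transition speed is easy to illustrate numerically: once the logistic transition is effectively a step function, further increases in the speed parameter leave the fitted transition, and hence the likelihood, essentially unchanged. A minimal sketch, assuming the standard logistic transition function:

```python
import numpy as np

def G(r, gamma, c):
    """Logistic transition function; r is rescaled time t/T."""
    with np.errstate(over="ignore"):   # exp() may overflow harmlessly for huge gamma
        return 1.0 / (1.0 + np.exp(-gamma * (r - c)))

r = np.linspace(0.0, 1.0, 100)         # grid that does not hit the location c exactly
step = (r > 0.5).astype(float)         # limiting step function at c = 0.5

# Beyond a moderate speed, increasing gamma barely moves G towards the step:
for gamma in (50.0, 5000.0, 50000.0):
    print(gamma, np.max(np.abs(G(r, gamma, 0.5) - step)))
```

At γ = 5000 the transition is already numerically indistinguishable from a step on a daily grid, so the data carry no further information about γ; fixing it at a large value is then the practical solution.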
The steady increase of the correlations among the Big Four over time, as well as the maintained high correlation state since the GFC, could be linked to a few changes in the Australian financial sector. For instance, the time-varying correlation structure may reflect the four pillars policy and the resulting financial concentration, which was further strengthened by the acquisition of the next largest banks by CBA and WBC in 2008. Further contributing factors that may have made the banks look more similar from the investors’ point of view include the regulatory requirements brought by the implementation of Basel II, the easing of restrictions that directly impacted the home mortgage market that the Big Four now dominated, and the stagnation of the housing credit market. Furthermore, since the GFC, all four banks have enjoyed substantial government support.
The fact that serious effort has been made to model the volatilities and correlations separately allows for observations on the timing and magnitude of those features without the cross-contamination that would occur if covariances were examined instead. It is often noted that correlations increase during turbulent times. For the Big Four, however, this is not quite what happened. The calm volatility period (from 2003 until late 2007) overlaps with the period of smoothly increasing correlations (16 months leading to early 2008). Furthermore, it is notable that the GFC has a tremendous impact on the volatilities, whereas the correlations have by then settled to their high levels and exhibit no further change.

8. Conclusions

In this paper, a data-driven modelling cycle for building MTV-GARCH models for asset returns was constructed and illustrated with an empirical example. The paper complements Silvennoinen and Teräsvirta (2021), which presented the asymptotic theory for this model but, as already mentioned in the Introduction, contains no discussion of practical model-building issues. All three phases of the cycle (model specification, estimation, and evaluation) were covered. Specification includes testing the constancy of the GARCH equations against multiplicatively time-varying GARCH. This is a nonstandard testing problem, as the MTV-GARCH model is not identified when the null hypothesis holds.
Furthermore, constructing the null model requires new techniques because the conditional variance is not observed; two alternatives for solving this problem are presented. Simulations reported in an online appendix show that the proposed testing procedure has reasonable small-sample properties. In specifying the correlation structure, the constancy of correlations has to be tested, and a relevant test for this nonstandard testing situation was developed.
The GARCH equations and the correlation structure, both nonlinear, were estimated jointly, and the technical details of this process were presented. Misspecification tests for the estimated model were derived to be used for the evaluation of the estimated model. The application to the four main Australian banks, the Big Four, demonstrated the use of the modelling cycle. The GARCH equations were found to be multiplicatively time-varying, and the correlations also changed over time. The estimation results indicated that the amplitude of the volatility clusters declines to a lower level around 2004 and temporarily rises during the GFC.
The (positive) correlations were found to be nonconstant, and they increased to a fairly high level already before the GFC. This shift, which is also established by estimating pairwise models for the six pairs of banks separately, has no single explanation. Generally speaking, it may be argued that, whatever the aforementioned events before 2008 have meant for the ability of the banks to compete with each other, from the investors’ viewpoint they have become increasingly similar. Given their size, these four banks may represent a systemic risk to the Australian financial sector or the Australian economy in general.
Finally, the paper is accompanied by an R-package entitled mtvgarch, which is maintained in a private GitHub repository and contains all the econometric tools necessary for building MTV-GARCH models.
The appendices contain additional material to the paper. Appendix A provides details of the TVV-model specification, the MTV-GARCH model evaluation, the test of constant correlations, and, finally, the test for an additional transition in the correlations. The simulation studies in Appendix B explore aspects of the specification and evaluation of the GARCH equations and the size and sensitivity of the test of constant correlations. Appendix C presents the details of maximisation by parts. The estimated deterministic components of the four banks’ transition equations are presented in Appendix D.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/econometrics11010005/s1.

Author Contributions

Conceptualization, A.D.H., A.S. and T.T.; methodology, T.T. and A.S.; software, A.S.; validation, A.S.; formal analysis, A.S. and T.T.; data curation, A.D.H.; writing A.S. and T.T.; visualization, A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Supplementary Material that includes data for the application, source code for the estimation and simulations, as well as the MTVGARCH package version used in the production of this paper can be found at https://econ.au.dk/research/researchcentres/creates/research/creates-research-papers/supplementary-downloads/rp-2021-13 (accessed on 26 January 2023).

Acknowledgments

This research was supported by the Center for Research in Econometric Analysis of Time Series (CREATES). An earlier version of this paper with the title ‘Four Australian Banks and the Multivariate Time-Varying Smooth Transition Correlation GARCH model’ appeared as CREATES Research Papers No. 2021-13. Part of the work was conducted when the third author was visiting the School of Economics and Finance of Queensland University of Technology, Brisbane, whose kind hospitality is gratefully acknowledged. We would also like to thank Glen Wade for their work with the R-package. Material from this paper was presented at the 26th Annual Symposium of the Society for Nonlinear Dynamics and Econometrics, Tokyo, March 2018; the workshop ‘Frontiers in Econometrics’, Queensland University of Technology, Brisbane, July 2018; the Quantitative Methods in Finance 2018 Conference (UTS), Sydney, December 2018; the 13th International Conference of the ERCIM WG on Computational and Methodological Statistics, London, December 2020; the International Association of Applied Econometrics Conference, Rotterdam, July 2021; and the Workshop on Financial Econometrics, Örebro University, November 2021. Comments from the participants are gratefully acknowledged. The responsibility for any errors and shortcomings in this work remains ours.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Test Statistics

Appendix A.1. Test Statistic for TVV-Model Specification

In order to specify g t , we not only test constancy but also specify the number of transitions before estimating the GARCH component of the model. Amado and Teräsvirta (2013) showed that maximum likelihood estimators of the corresponding time-varying variance (TVV) model, assuming that there is no conditional heteroskedasticity, are consistent and asymptotically normal. This forms the basis for constructing Lagrange multiplier type tests for testing r against r + 1 transitions. Consider, for notational simplicity, testing one transition against two. Omitting the subscript i, the TVV model is (9) with h t = 1 , and
g t = δ 0 + δ 1 G 1 ( t / T , γ 1 , c 1 ) + δ 2 G 2 ( t / T , γ 2 , c 2 ) , γ i > 0 , i = 1 , 2 .
The null hypothesis is γ 2 = 0 , in which case G 2 ( t / T , γ 2 , c 2 ) ≡ 1 / 2 . To circumvent the identification problem (the model with one transition is only identified when the alternative γ 2 > 0 is true), we follow Luukkonen et al. (1988) and approximate the second transition by a third-order Taylor expansion around the null hypothesis. After reparametrisation, this yields
g t = δ 0 * 0 + δ 1 G 1 ( t / T , γ 1 , c 1 ) + ψ 1 t / T + ψ 2 ( t / T ) 2 + ψ 3 ( t / T ) 3 , γ 1 > 0 .
We may call (9) with (A1) the auxiliary TVV model. The parameters ψ i = γ 2 ψ ˜ i , where ψ ˜ i ≠ 0 , i = 1 , 2 , 3 . The new null hypothesis in (A1) equals H 0 ′ : ψ 1 = ψ 2 = ψ 3 = 0 . The remainder term of the expansion can be ignored because, when we construct a Lagrange multiplier test, the model is only estimated under H 0 (or H 0 ′ ), and, under this hypothesis, the order of the Taylor expansion equals zero. The remainder is present only under the alternative, and thus ignoring it when H 0 ′ is valid does not affect the asymptotic size of the test. It does make a positive contribution to the power of the test when H 0 ′ does not hold.
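The role of the Taylor expansion can be checked numerically. Writing the logistic function as G = σ(x) with x = γ 2 ( t / T − c 2 ), its third-order expansion around γ 2 = 0 is 1/2 + x/4 − x³/48; substituting this into δ 2 G 2 and collecting powers of t / T yields the polynomial terms in (A1). A small sketch, assuming the standard logistic form of G:

```python
import numpy as np

def G(r, gamma, c):
    """Logistic transition function of rescaled time r = t/T."""
    return 1.0 / (1.0 + np.exp(-gamma * (r - c)))

def G_taylor3(r, gamma, c):
    # third-order Taylor expansion of the logistic around gamma = 0:
    # sigma(x) = 1/2 + x/4 - x^3/48 + O(x^5), with x = gamma*(r - c)
    x = gamma * (r - c)
    return 0.5 + x / 4.0 - x**3 / 48.0

r = np.linspace(0.0, 1.0, 101)
# near the null (small gamma) the cubic approximation is very accurate ...
err_small = np.max(np.abs(G(r, 0.5, 0.3) - G_taylor3(r, 0.5, 0.3)))
# ... and it deteriorates as gamma moves away from zero, as expected
err_large = np.max(np.abs(G(r, 5.0, 0.3) - G_taylor3(r, 5.0, 0.3)))
print(err_small, err_large)
```

Since the LM test only requires estimation under the null, the approximation error far from γ 2 = 0 is immaterial for the size of the test, as noted above.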
Assume (again, for notational simplicity) that K 1 = 1 in (A1), so c 1 = c 1 (a scalar). The log-likelihood for observation t of the auxiliary TVV model equals
t = k ( 1 / 2 ) ln g t ( 1 / 2 ) ε t 2 g t
and the corresponding element of the score is
t θ 1 = 1 2 ( ε t 2 g t 1 ) 1 g t g t θ 1 ,
where θ 1 = ( δ 0 * 0 , δ 1 , γ 1 , c 1 , ψ 1 , ψ 2 , ψ 3 ) . Denoting G 1 ( t / T ) = G 1 ( t / T , γ 1 , c 1 ) , the partial derivative in (A2) is g t / θ 1 = ( g 1 ( t / T ) , τ t ) where
g 1 ( t / T ) = ( 1 , G 1 ( t / T ) , G 1 γ ( t / T ) , G 1 c ( t / T ) G 1 γ ( t / T ) )
with G 1 γ ( t / T ) = G 1 ( t / T ) ( 1 − G 1 ( t / T ) ) ( t / T − c 1 ) , G 1 c ( t / T ) = − γ 1 G 1 ( t / T ) ( 1 − G 1 ( t / T ) ) , and τ t = ( t / T , ( t / T ) 2 , ( t / T ) 3 ) . Define the true parameter vector under H 0 as θ 1 0 = ( δ 0 * 0 , δ 1 0 , γ 1 0 , c 1 0 , 0 , 0 , 0 ) . If z t is normally distributed, the corresponding element of the information matrix under H 0 has the form
B t = 1 4 E ( ε t 2 g t 1 ) 2 B 11 t B 12 t B 21 t B 22 t = 1 2 B 11 t B 12 t B 21 t B 22 t ,
where, letting g t 0 = g 0 ( t / T ) = δ 0 * 0 + δ 1 0 G 1 ( t / T , γ 1 0 , c 1 0 ) and denoting G 1 0 ( t / T ) = G 1 ( t / T , γ 1 0 , c 1 0 ) ,
B 11 t = 1 2 ( g 0 ( t / T ) ) 2 g 1 0 ( t / T ) g 1 0 ( t / T ) , B 12 t = 1 2 ( g 0 ( t / T ) ) 2 g 1 0 ( t / T ) τ t
and
B 22 t = 1 2 ( g 0 ( t / T ) ) 2 τ t τ t .
Let
g 1 0 ( r ) = ( 1 , G 1 0 ( r ) , G 1 γ 0 ( r ) , G 1 c 0 ( r ) )
and r = ( r , r 2 , r 3 ) . We state the following lemma:
Lemma A1.
Under the null hypothesis and assuming z t iid N ( 0 , 1 ) , the information matrix
B = lim T 1 2 T t = 1 T B 11 t B 12 t B 21 t B 22 t = 1 2 B 11 B 12 B 21 B 22 ,
where
B 11 = 1 2 0 1 ( g 0 ( r ) ) 2 g 1 0 ( r ) g 1 0 ( r ) d r , B 12 = 1 2 0 1 ( g 0 ( r ) ) 2 g 1 0 ( r ) r d r
and
B 22 = 1 2 0 1 ( g 0 ( r ) ) 2 r r d r .
Proof. 
The ‘sample’ information matrix with T observations equals
1 4 T t = 1 T B t = 1 4 T t = 1 T B 11 t B 12 t B 21 t B 22 t .
Consider the ( 1 , 2 ) element of B 11 ( T ) = ( 1 / T ) t = 1 T B 11 t :
[ B 11 ( T ) ] 12 = ( 1 / T ) t = 1 T ( g 0 ( t / T ) ) 2 G 1 0 ( t / T ) ,
which is an average of T values of the logistic cumulative distribution function. Let [ T r ] = t be the integer closest to T r . Then,
( 1 / T ) t = 1 T ( g 0 ( t / T ) ) 2 G 1 0 ( t / T ) = t = 1 T t / T ( t + 1 ) / T ( g * 0 ( [ T r ] / T ) ) 2 G 1 0 ( [ T r ] / T ) d r = 1 / T ( T + 1 ) / T ( g * 0 ( [ T r ] / T ) ) 2 G 1 0 ( [ T r ] / T ) d r 0 1 ( g * 0 ( r ) ) 2 G 1 0 ( r ) d r
as T . The other elements of B 11 = lim T B 11 ( T ) , are derived in a similar fashion. In matrix form,
B 11 = 1 2 0 1 ( g * 0 ( r ) ) 2 g 1 0 ( r ) g 1 0 ( r ) d r .
The blocks B 12 and B 22 are obtained similarly.   □
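The Riemann-sum argument in the proof is easy to verify numerically. The sketch below simplifies the integrand to the logistic transition itself; for γ = 5 and c = 0.5, symmetry of the logistic around c gives the exact integral 1/2:

```python
import numpy as np

def G(r, gamma, c):
    return 1.0 / (1.0 + np.exp(-gamma * (r - c)))

def riemann_mean(f, T):
    """(1/T) * sum_{t=1}^T f(t/T), the sample average used in Lemma A1."""
    return np.mean(f(np.arange(1, T + 1) / T))

f = lambda r: G(r, 5.0, 0.5)   # a simplified element of the information matrix
for T in (100, 1000, 10000):
    print(T, riemann_mean(f, T))   # converges to the integral, 0.5
```

The approximation error shrinks at rate 1/T, matching the convergence of the ‘sample’ information matrix to its limit B.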
Since the maximum likelihood estimators of the parameters of the auxiliary TVV model under H 0 are consistent, we may construct the LM test for the hypothesis H 0 : ψ = ( ψ 1 , ψ 2 , ψ 3 ) = 0 . Denoting the relevant block of the score by
s 2 ( θ ^ 1 ) = 1 2 T t = 1 T ( ε t 2 g ^ t 1 ) 1 g ^ t g t ψ ,
where g t / ψ = τ t and
g ^ t = δ ^ 0 + δ ^ 1 ( 1 + exp { − γ ^ 1 ( t / T − c ^ 1 ) } ) − 1 .
Then, assuming z t = ε t / g t 1 / 2 is standard normal under H 0 , the test statistic has the following form:
L M T 1 = ( T / 2 ) s 2 ( θ ^ 1 ) ( B 22 B 21 B 11 1 B 12 ) 1 s 2 ( θ ^ 1 ) ,
where θ ^ 1 = ( δ ^ 0 * , δ ^ 1 , γ ^ 1 , c ^ 1 , 0 , 0 , 0 ) ; see, for example, Godfrey (1988, p. 14). In order to make (A3) operational, the blocks of B are replaced by their consistent counterparts.
When constancy of the error variance is tested against a single transition, g t ≡ δ 0 , g 1 ( t / T ) = 1 (scalar), and g t / ψ = τ t as before. Then, B 11 = ( 2 ( δ 0 0 ) 2 ) 1 ,
B 12 = 1 2 ( δ 0 0 ) 2 0 1 r d r and B 22 = 1 2 ( δ 0 0 ) 2 0 1 r r d r .
The test statistic (A3) becomes
L M T 0 = T ( δ 0 0 ) 2 2 δ ^ 0 2 s 2 ( θ ^ 1 ) ( B 22 B 21 B 11 1 B 12 ) 1 s 2 ( θ ^ 1 ) ,
where
s 2 ( θ ^ 1 ) = 1 2 T t = 1 T ( ε t 2 δ ^ 0 1 ) τ t .
When the elements of the covariance matrix are replaced by their consistent estimators in (A4), the ratio ( δ 0 0 ) 2 / δ ^ 0 2 equals unity.
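A sketch of the constancy statistic (A4), assuming normal errors, no GARCH, and closed-form plug-ins for the information-matrix blocks (the integrals of powers of r over [0, 1] are available analytically). The scaling below is chosen so that the statistic is asymptotically χ²(3) under the null; the exact normalisation in a given implementation may differ:

```python
import numpy as np

def lm_constancy(eps):
    """LM-type statistic: constant unconditional variance against a
    single smooth transition, in the spirit of (A4)."""
    T = len(eps)
    r = np.arange(1, T + 1) / T
    tau = np.column_stack([r, r**2, r**3])        # tau_t = (t/T, (t/T)^2, (t/T)^3)
    d0 = np.mean(eps**2)                          # ML estimate of delta_0 under H0
    s2 = (0.5 / T) * ((eps**2 / d0 - 1.0) @ tau)  # score block s2
    m = np.array([1/2, 1/3, 1/4])                 # integrals of r^k
    M = np.array([[1/(i + j + 1) for j in (1, 2, 3)] for i in (1, 2, 3)])
    S = M - np.outer(m, m)        # Schur complement B22 - B21 B11^{-1} B12, up to scale
    return 2.0 * T * s2 @ np.linalg.solve(S, s2)  # approx. chi^2(3) under H0

rng = np.random.default_rng(7)
T = 2000
r = np.arange(1, T + 1) / T
lm_null = lm_constancy(rng.normal(0.0, 1.0, T))                          # constant variance
lm_alt = lm_constancy(rng.normal(0.0, 1.0, T) * np.sqrt(1.0 + 3.0 * r))  # trending variance
print(lm_null, lm_alt)   # the deterministic trend in variance inflates the statistic
```

As discussed below, in applications to GARCH-type data the null distribution of such a statistic must be calibrated by simulation before use.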
As already mentioned, conditional heteroskedasticity is ignored in setting up the test. For this reason, the test statistic (A3) is likely to be size distorted when applied to financial time series of sufficiently high frequency—that is, when GARCH-type volatility clustering is present. In applications, its size has to be adjusted by calibrating its distribution to reflect the persistence of the GARCH effect present in the data. This is the topic of discussion in Appendix B.1.

Appendix A.2. Test Statistic for MTV-GARCH Model Evaluation

In this section, the evaluation tests of the univariate MTV-GARCH equations are presented in an easy-to-implement fashion. The full details can be found in Amado and Teräsvirta (2017).
The test statistic is computed from the following components: ζ ^ t , r 1 t , and r 2 t . ζ ^ t = ε t / ( h ^ t g ^ t ) 1 / 2 are the residuals, r 1 t contains the derivatives of the functions h t and g t with respect to the parameters that govern the MTV-GARCH model under the null, θ g and θ h :
r 1 t = ( g ^ t 1 g t θ ^ g + h ^ t 1 h t θ ^ g , h ^ t 1 h t θ ^ h ) ,
evaluated at the estimated parameters θ ^ g and θ ^ h . These are recursively calculated, and depend on the prevailing MTV-GARCH model under the null. For example, one could have g t = δ 0 + δ 1 G 1 ( t / T , γ 1 , c 1 ) + δ 2 G 2 ( t / T , γ 2 , ( c 21 , c 22 ) ) and h t = α 0 + α 1 ε t 1 2 / g t 1 + κ 1 I ( ε t 1 < 0 ) ε t 1 2 / g t 1 + β 1 h t 1 . Here, θ g = ( δ 1 , γ 1 , c 1 , δ 2 , γ 2 , c 21 , c 22 ) and θ h = ( α 0 , α 1 , κ 1 , β 1 ) . Then,
g t θ g = ( G 1 , δ 1 G 1 γ 1 , δ 1 G 1 c 1 , G 2 , δ 2 G 2 γ 2 , δ 2 G 2 c 21 , δ 2 G 2 c 22 )
where
G 1 γ 1 = G 1 ( 1 − G 1 ) ( t / T − c 1 ) , G 1 c 1 = − G 1 ( 1 − G 1 ) γ 1 , G 2 γ 2 = G 2 ( 1 − G 2 ) ( t / T − c 21 ) ( t / T − c 22 ) , G 2 c 21 = − G 2 ( 1 − G 2 ) γ 2 ( t / T − c 22 ) , G 2 c 22 = − G 2 ( 1 − G 2 ) γ 2 ( t / T − c 21 ) .
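These analytic derivatives are easy to verify against finite differences. The sketch below assumes the standard single-location logistic G 1 ; note that the derivative with respect to the location carries a negative sign:

```python
import numpy as np

def G(r, gamma, c):
    """Single-location logistic transition function."""
    return 1.0 / (1.0 + np.exp(-gamma * (r - c)))

def dG_dgamma(r, gamma, c):
    # dG/dgamma = G (1 - G) (r - c)
    return G(r, gamma, c) * (1.0 - G(r, gamma, c)) * (r - c)

def dG_dc(r, gamma, c):
    # dG/dc = -gamma G (1 - G)
    return -gamma * G(r, gamma, c) * (1.0 - G(r, gamma, c))

# check both derivatives against central finite differences
r, gamma, c, h = 0.4, 3.0, 0.6, 1e-6
num_g = (G(r, gamma + h, c) - G(r, gamma - h, c)) / (2 * h)
num_c = (G(r, gamma, c + h) - G(r, gamma, c - h)) / (2 * h)
print(abs(num_g - dG_dgamma(r, gamma, c)), abs(num_c - dG_dc(r, gamma, c)))
```

The second-order transition G 2 and the recursive GARCH derivatives above can be checked in exactly the same way.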
The GARCH equation derivatives are formed recursively as
h t θ g = g t 1 ( α 1 ε t 1 2 / g t 1 + κ 1 I ( ε t 1 < 0 ) ε t 1 2 / g t 1 ) g t 1 θ g + β 1 h t 1 θ g
and
h t θ h = ( ε t 1 2 / g t 1 , I ( ε t 1 < 0 ) ε t 1 2 / g t 1 , h t 1 ) + β 1 h t 1 θ h
From this example, it should be easy to extend the null model to include more additive deterministic terms and/or have a higher order GARCH equation with or without asymmetric terms.
One extension regarding the deterministic part should be mentioned. It is often convenient to replace the slope parameter γ with e η . In this case, θ g = ( δ 1 , η 1 , c 1 , δ 2 , η 2 , c 21 , c 22 ) , and
G 1 η 1 = G 1 ( 1 − G 1 ) e η 1 ( t / T − c 1 ) , G 1 c 1 = − G 1 ( 1 − G 1 ) e η 1 , G 2 η 2 = G 2 ( 1 − G 2 ) e η 2 ( t / T − c 21 ) ( t / T − c 22 ) , G 2 c 21 = − G 2 ( 1 − G 2 ) e η 2 ( t / T − c 22 ) , G 2 c 22 = − G 2 ( 1 − G 2 ) e η 2 ( t / T − c 21 ) .
Vector r 2 t contains the derivatives of the misspecified part. Details in the most commonly encountered situations will be given shortly. The number of variables (columns) in r 2 t defines the degrees of freedom in the χ 2 -distribution for the test statistic under the null.
Given the three components, the LM-test is performed as follows:
  • Compute S S R 0 = ∑ t = 1 T ( ζ ^ t 2 − 1 ) 2 .
  • Regress ζ ^ t 2 − 1 on ( r 1 t , r 2 t ) , and form the sum of squared residuals S S R 1 .
  • Compute the test statistic L M = T ( S S R 0 − S S R 1 ) / S S R 0 .
The robust version that does not rely on the normality of the error term is formed as follows:
  • Regress r 2 t on r 1 t and obtain residuals w t . When r 2 t has more than one variable, run the regression for each of them separately and, thereby, obtain a set of residuals w t .
  • Regress 1 on ( ζ ^ t 2 − 1 ) w t and form the sum of squared residuals S S R .
  • Compute the test statistic L M R = T − S S R .
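The two regression-based procedures translate directly into code. Below is a least-squares sketch with randomly generated stand-ins for ζ ^ t , r 1 t and r 2 t ; in practice these come from the estimated null model:

```python
import numpy as np

def lm_test(zeta2m1, r1, r2):
    """Nonrobust TR^2-form LM statistic from the three steps above."""
    T = len(zeta2m1)
    ssr0 = np.sum(zeta2m1**2)
    X = np.column_stack([r1, r2])
    beta, *_ = np.linalg.lstsq(X, zeta2m1, rcond=None)
    ssr1 = np.sum((zeta2m1 - X @ beta)**2)
    return T * (ssr0 - ssr1) / ssr0

def lm_test_robust(zeta2m1, r1, r2):
    """Robust version: LM_R = T - SSR from regressing 1 on (zeta^2 - 1) w."""
    T = len(zeta2m1)
    b, *_ = np.linalg.lstsq(r1, r2, rcond=None)
    w = r2 - r1 @ b                     # residuals, one column per r2 variable
    Z = zeta2m1[:, None] * w
    ones = np.ones(T)
    g, *_ = np.linalg.lstsq(Z, ones, rcond=None)
    return T - np.sum((ones - Z @ g)**2)

rng = np.random.default_rng(3)
T = 1000
zeta = rng.normal(size=T)
r1 = np.column_stack([np.ones(T), rng.normal(size=T)])   # stand-in for r1t
r2 = rng.normal(size=(T, 2))                             # stand-in for r2t
print(lm_test(zeta**2 - 1, r1, r2), lm_test_robust(zeta**2 - 1, r1, r2))
```

With two columns in r2t, both statistics are compared with the χ²(2) distribution under the null.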
The first case seeks to find evidence of misspecification of the deterministic part of the MTV-GARCH model. The conditional variance is of the form
σ t 2 = h t ( g t + f t ) ,
where the additive term f t is zero under the null of the model being correctly specified. The case that we consider here is the one of testing r against r + 1 transitions in the deterministic part. The additive term is linearised and reparameterised, after which, it becomes
f t = δ 0 * + δ 1 * t / T + δ 2 * ( t / T ) 2 + δ 3 * ( t / T ) 3 .
The derivative component for the alternative is then
r 2 t = g ^ t 1 ( 1 , t / T , ( t / T ) 2 , ( t / T ) 3 ) .
The second case addresses misspecification in the GARCH part:
σ t 2 = ( h t + f t ) g t ,
where the additive term f t is again zero under the null. A common scenario is that f t increases either the ARCH or the GARCH order (but not both). An example of the former is GARCH(1,1) vs. GARCH(1,2), in which case, f t = α 2 ε t 2 2 / g t 2 , and therefore
r 2 t = h ^ t 1 ε t 2 2 / g ^ t 2 .
If the model is a GJR one, and the potential increase in the order of the ARCH term extends to the asymmetric terms as well, then f t = α 2 ε t 2 2 / g t 2 + κ 2 I ( ε t 2 < 0 ) ε t 2 2 / g t 2 , and
r 2 t = h ^ t 1 ( ε t 2 2 / g ^ t 2 , I ( ε t 2 < 0 ) ε t 2 2 / g ^ t 2 ) .
An example of the latter is GARCH(1,1) vs. GARCH(2,1), which leads to f t = β 2 h t 2 , and thus
r 2 t = h ^ t 1 h ^ t 2 .
The third case is the test of no remaining ARCH. This is a test against multiplicative misspecification,
σ t 2 = h t g t f t ,
where f t = 1 under the null. If the alternative is that there is ARCH of order m left unaccounted for, then
r 2 t = ( ζ ^ t 1 2 , , ζ ^ t m 2 ) .

Appendix A.3. Test of Constant Correlations

The log-likelihood of the auxiliary MTV model for observation t assuming K = 2 equals
ln f A ( ζ t | θ ) = ( 1 / 2 ) i = 1 N ln g i t ( 1 / 2 ) i = 1 N ln h i t ( 1 / 2 ) ln | P A t | ( 1 / 2 ) ε t { S t D t P A t D t S t } 1 ε t ,
where
P A t = P ( A 0 ) + ( t / T ) P ( A 1 ) + ( t / T ) 2 P ( A 2 )
and g i t = δ i 0 + δ i 1 G i 1 ( t / T , γ i 1 , c i 1 ) ; only one transition for notational simplicity, and h i t is as in (4). The first sub-block of the score corresponding to the deterministic variance component under H 0 becomes
s t ( θ g i ) = 1 2 ( g i t 1 g i t θ g i + h i t 1 h i t θ g i ) ( 1 e i P ( A 0 ) 1 z t z t e i ) ,
where e i = ( 0 i 1 , 1 , 0 N i ) , i = 1 , , N , and 0 0 is an empty vector. The sub-block corresponding to the GARCH parameters under H 0 is
s t ( θ h i ) = 1 2 ( h i t 1 h i t θ h i ) ( 1 e i P ( A 0 ) 1 z t z t e i ) ,
i = 1 , , N . The remaining sub-blocks under H 0 equal
s t ( ρ A j ) = 1 2 vec ( P A t ) ρ A j { vec ( P ( A 0 ) 1 ) ( P ( A 0 ) 1 P ( A 0 ) 1 ) vec ( z t z t ) } = 1 2 ( t / T ) j U { vec ( P ( A 0 ) 1 ) ( P ( A 0 ) 1 P ( A 0 ) 1 ) vec ( z t z t ) } ,
j = 0 , 1 , 2 , where U = vec( P ( A j ) ) / ρ A j consists of zeroes and ones and is identical for all j. The N 2 × N ( N 1 ) / 2 matrix U is a column-wise collection of vectorised indicator matrices that identify the locations of the particular correlation parameters within the matrix P A t . For example, the first correlation parameter in ρ A j is located in positions (2,1) and (1,2) in P A t . An indicator matrix corresponding to this parameter has ones in those positions and zeros elsewhere. This vectorised indicator matrix is then the first column of matrix U and so on. Consequently, the 3 N ( N 1 ) / 2 × N 2 matrix ∂vec ( P A t ) / ρ A equals
vec ( P A t ) ρ A = 1 ( t / T ) ( t / T ) 2 U .
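The construction of U can be sketched directly from this description. The hypothetical helper `make_U` below builds the vectorised indicator columns in column-major order of the lower triangle, so that for N = 3 the product U′ vec(P) returns each correlation twice (once from each symmetric position):

```python
import numpy as np

def make_U(N):
    """N^2 x N(N-1)/2 selection matrix: column k is the vectorised
    indicator of the k-th correlation's two positions in P."""
    cols = []
    for j in range(N):               # column-major order of the lower triangle
        for i in range(j + 1, N):
            E = np.zeros((N, N))
            E[i, j] = E[j, i] = 1.0  # the two symmetric positions of one correlation
            cols.append(E.ravel(order="F"))
    return np.column_stack(cols)

# sanity check for N = 3
P = np.array([[1.0, 0.2, 0.5],
              [0.2, 1.0, 0.3],
              [0.5, 0.3, 1.0]])
U = make_U(3)
print(U.T @ P.ravel(order="F"))   # -> [0.4, 1.0, 0.6], each correlation twice
```

Each column of U contains exactly two ones, which is what makes ∂vec(P)/∂ρ a simple zero-one matrix identical across the polynomial terms j.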
The information matrix for observation t under H 0 is quite similar to, but simpler than, the corresponding one in Silvennoinen and Teräsvirta (2021). In order to give the matrix a proper expression, we need the commutation matrix K , an N 2 × N 2 matrix whose ( i , j ) block equals e j e i —that is, [ K ] i j = e j e i ; see, for example, Lütkepohl (1996, pp. 115–18). Let the superscript 0 indicate that the corresponding entity is evaluated under H 0 (for example, g i t 0 equals g i t | H 0 , and g i t 0 / θ g i equals g i t / θ g i | H 0 ). The matrix is defined in the following lemma.
Lemma A2.
The expectations of the nine blocks of the information matrix at (rescaled) time t / T under H 0 : ρ A 1 = ρ A 2 = 0 N ( N 1 ) / 2 are
B t 0 = E s t ( θ 0 ) s t ( θ 0 ) = E s t ( θ g 0 ) s t ( θ g 0 ) s t ( θ g 0 ) s t ( θ h 0 ) s t ( θ g 0 ) s t ( ρ A ) s t ( θ h 0 ) s t ( θ g 0 ) s t ( θ h 0 ) s t ( θ h 0 ) s t ( θ h 0 ) s t ( ρ A ) s t ( ρ A ) s t ( θ g 0 ) s t ( ρ A ) s t ( θ h 0 ) s t ( ρ A ) s t ( ρ A ) .
The ( i , j ) sub-block of B 11 t = E s t ( θ g 0 ) s t ( θ g 0 ) , i j , equals
[ B 11 t ] i j = E s t ( θ g i 0 ) s t ( θ g j 0 ) = 1 4 ( 1 g i t 0 g i t 0 θ g i + 1 h i t 0 h i t 0 θ g i ) ( 1 g j t 0 g j t 0 θ g j + 1 h j t 0 h j t 0 θ g j ) e i P ( A 0 ) 1 e j e i P ( A 0 ) e j .
When i = j ,
[ B 11 t ] i i = E s t ( θ g i 0 ) s t ( θ g i 0 ) = 1 4 ( 1 g i t 0 g i t 0 θ g i + 1 h i t 0 h i t 0 θ g i ) ( 1 g i t 0 g i t 0 θ g i + 1 h i t 0 h i t 0 θ g i ) ( 1 + e i P ( A 0 ) 1 e i ) .
The ( i , j ) sub-block of B 22 t = E s t ( θ h 0 ) s t ( θ h 0 ) , i j , equals
[ B 22 t ] i j = E s t ( θ h i 0 ) s t ( θ h j 0 ) = 1 4 ( 1 h i t 0 h i t 0 θ h i ) ( 1 h j t 0 h j t 0 θ h j ) e i P ( A 0 ) 1 e j e i P ( A 0 ) e j .
When i = j ,
[ B 22 t ] i i = E s t ( θ h i 0 ) s t ( θ h i 0 ) = 1 4 ( 1 h i t 0 h i t 0 θ h i ) ( 1 h i t 0 h i t 0 θ h i ) ( 1 + e i P ( A 0 ) 1 e i ) .
The ( i , j ) sub-block of B 12 t = E s t ( θ g 0 ) s t ( θ h 0 ) , i j , equals
[ B 12 t ] i j = E s t ( θ g i 0 ) s t ( θ h j 0 ) = 1 4 ( 1 g i t 0 g i t 0 θ g i + 1 h i t 0 h i t 0 θ g i ) ( 1 h j t 0 h j t 0 θ h j ) e i P ( A 0 ) 1 e j e i P ( A 0 ) e j .
When i = j ,
[ B 12 t ] i i = E s t ( θ g i 0 ) s t ( θ h i 0 ) = 1 4 ( 1 g i t 0 g i t 0 θ g i + 1 h i t 0 h i t 0 θ g i ) ( 1 h i t 0 h i t 0 θ h i ) ( 1 + e i P ( A 0 ) 1 e i ) .
Furthermore, the ( i , j ) sub-block of E s t ( θ g 0 ) s t ( ρ A ) equals
[ B 13 t ] i j = E s t ( θ g i 0 ) s t ( ρ A j ) = 1 4 ( t / T ) j ( 1 g i t 0 g i t 0 θ g i + 1 h i t 0 h i t 0 θ g i ) { ( e i e i ) ( P ( A 0 ) 1 I N ) + ( e i e i ) ( I N P ( A 0 ) 1 ) } U ,
i = 1 , , N ; j = 0 , 1 , 2 . The ( i , j ) sub-block of E s t ( θ h 0 ) s t ( ρ A ) equals
[ B 23 t ] i j = E s t ( θ h i 0 ) s t ( ρ A j ) = 1 4 ( t / T ) j ( 1 h i t 0 h i t 0 θ h i ) { ( e i e i ) ( P ( A 0 ) 1 I N ) + ( e i e i ) ( I N P ( A 0 ) 1 ) } U ,
i = 1 , , N ; j = 0 , 1 , 2 . Finally, the ( i , j ) sub-block of the last block is equal to
[ B 33 t ] i j = E s t ( ρ A i ) s t ( ρ A j ) = 1 4 ( t / T ) i + j U M A U ,
i , j = 0 , 1 , 2 , where
M A = P ( A 0 ) 1 P ( A 0 ) 1 + ( P ( A 0 ) 1 I N ) K ( P ( A 0 ) 1 I N ) .
Proof. 
In order to define the test statistic, let B 13 · j be the ( i , j ) blocks of B 13 where i { 1 , , N } —that is,
B 13 · j = [ B 13 ] 1 j , , [ B 13 ] N j , j = 0 , 1 , 2
and define B 23 · j similarly. Partition the matrix B as follows:
B ˜ 11 = B 11 B 12 B 13 · 0 B 12 B 22 B 23 · 0 B 13 · 0 B 23 · 0 [ B 33 ] 00 ,
B ˜ 12 = B 13 · 1 B 13 · 2 [ B 33 ] 01 [ B 33 ] 02
and
B ˜ 22 = [ B 33 ] 11 [ B 33 ] 12 [ B 33 ] 12 [ B 33 ] 22 .
Next, define
x ^ j t = 1 2 ( t T ) j U { vec ( P ^ ( A 0 ) 1 ) ( P ^ ( A 0 ) 1 P ^ ( A 0 ) 1 ) vec ( z ^ t z ^ t ) } ,
j = 1 , 2 , where z ^ t and P ^ ( A 0 ) equal z t and P ( A 0 ) estimated under H 0 , respectively. The test statistic
L M T = T ( 1 T t = 1 T x ^ 1 t , 1 T t = 1 T x ^ 2 t ) { B ˜ 22 B ˜ 12 ( B ˜ 11 0 ) 1 B ˜ 12 } 1 ( 1 T t = 1 T x ^ 1 t , 1 T t = 1 T x ^ 2 t )
has an asymptotic χ 2 -distribution with N ( N 1 ) degrees of freedom when H 0 holds. To make the test statistic operational, the sub-blocks of the information matrix in (A5) have to be replaced by consistent plug-in estimators.

Appendix A.4. Test for an Additional Transition in the Correlations

The test statistic for an additional transition is constructed in the same way as in Appendix A.3, and the blocks related to the volatility components are identical. However, all blocks related to the correlation need modifications to include the parameters governing the time-varying correlation that exists under the null. This includes both the parameters in the correlation matrices under the null, and their corresponding transition parameters.
Let us define x h i t = h i t 1 h i t θ h i , x g i t = g i t 1 g i t θ g i + h i t 1 h i t θ g i . Let us also partition the linearised correlation model as P A t = P A t 0 + t / T P ( A 1 ) + ( t / T ) 2 P ( A 2 ) , where P A t 0 contains the time-varying correlation model under the null. When testing L transitions against L + 1 transitions, P A t 0 contains L + 1 correlation matrices P ( 1 ) , , P ( L + 1 ) and L transition functions G l ( t / T , γ l , c l ) (here, we assume K l = 1 for simplicity), l = 1 , , L . The information matrix is approximated by its consistent estimator
B ^ = T 1 t = 1 T E t 1 [ s t ( θ 0 ) s t ( θ 0 ) ] ,
where θ 0 = ( θ g , θ h , θ G , θ ρ A 0 , θ ρ A 1 ) , in which θ G contains the transition parameters from the L transitions that are present under the null, θ ρ A 0 = ( ρ ( 1 ) , , ρ ( L + 1 ) ) and θ ρ A 1 = ( ρ ( A 1 ) , ρ ( A 2 ) ) . From here on, the expressions are evaluated at the true parameter values under the null (we omit the additional superscripts of 0 to keep the notation simple).
With this notation, the $(i,j)$ sub-block of $\hat{B}_{11}$ equals
$$[\hat{B}_{11}]_{ij} = \frac{1}{4T}\sum_{t=1}^{T} x_{g_it}x_{g_jt}'\,e_i'(P_{At}^{0})^{-1}e_j\,e_i'P_{At}^{0}e_j.$$
When $i = j$,
$$[\hat{B}_{11}]_{ii} = \frac{1}{4T}\sum_{t=1}^{T} x_{g_it}x_{g_it}'\bigl(1 + e_i'(P_{At}^{0})^{-1}e_i\bigr).$$
Similarly, the $(i,j)$ sub-block of $\hat{B}_{22}$ is equal to
$$[\hat{B}_{22}]_{ij} = \frac{1}{4T}\sum_{t=1}^{T} x_{h_it}x_{h_jt}'\,e_i'(P_{At}^{0})^{-1}e_j\,e_i'P_{At}^{0}e_j.$$
When $i = j$,
$$[\hat{B}_{22}]_{ii} = \frac{1}{4T}\sum_{t=1}^{T} x_{h_it}x_{h_it}'\bigl(1 + e_i'(P_{At}^{0})^{-1}e_i\bigr).$$
The $(i,j)$ sub-block of $\hat{B}_{12}$ equals
$$[\hat{B}_{12}]_{ij} = \frac{1}{4T}\sum_{t=1}^{T} x_{g_it}x_{h_jt}'\,e_i'(P_{At}^{0})^{-1}e_j\,e_i'P_{At}^{0}e_j.$$
When $i = j$,
$$[\hat{B}_{12}]_{ii} = \frac{1}{4T}\sum_{t=1}^{T} x_{g_it}x_{h_it}'\bigl(1 + e_i'(P_{At}^{0})^{-1}e_i\bigr).$$
The next blocks deal with the transition parameters. Define $x_{Gt} = \partial\,\mathrm{vec}\,P_{At}/\partial\theta_G'$. The $l$th block of $\partial\,\mathrm{vec}\,P_{At}/\partial\theta_G'$ is
$$\prod_{i=l+1}^{L}(1 - G_{it})\,\mathrm{vec}\bigl(P^{(l+1)} - P_t^{(l-1)}\bigr)\frac{\partial G_{lt}}{\partial\theta_{G_l}'}$$
using the recursion in (8). The block $\hat{B}_{33}$ is equal to
$$\hat{B}_{33} = \frac{1}{4T}\sum_{t=1}^{T} x_{Gt}'M_{A}x_{Gt}.$$
The $i$th sub-block of $\hat{B}_{13}$ equals
$$[\hat{B}_{13}]_i = \frac{1}{4T}\sum_{t=1}^{T} x_{g_it}\bigl(e_i'(P_{At}^{0})^{-1}\otimes e_i' + e_i'\otimes e_i'(P_{At}^{0})^{-1}\bigr)x_{Gt}$$
and the $i$th sub-block of $\hat{B}_{23}$ equals
$$[\hat{B}_{23}]_i = \frac{1}{4T}\sum_{t=1}^{T} x_{h_it}\bigl(e_i'(P_{At}^{0})^{-1}\otimes e_i' + e_i'\otimes e_i'(P_{At}^{0})^{-1}\bigr)x_{Gt}.$$
Next, we consider the blocks related to the correlations. The matrix $\partial\,\mathrm{vec}(P_{At})/\partial\theta_{\rho_{A0}}'$ equals
$$\frac{\partial\,\mathrm{vec}(P_{At})}{\partial\theta_{\rho_{A0}}'} = v_t'\otimes U = \Bigl(\prod_{l=1}^{L}(1-G_{lt}),\; G_{1t}\prod_{l=2}^{L}(1-G_{lt}),\; G_{2t}\prod_{l=3}^{L}(1-G_{lt}),\;\ldots,\; G_{L-1,t}(1-G_{Lt}),\; G_{Lt}\Bigr)\otimes U$$
and the matrix $\partial\,\mathrm{vec}(P_{At})/\partial\theta_{\rho_{A1}}'$ equals
$$\frac{\partial\,\mathrm{vec}(P_{At})}{\partial\theta_{\rho_{A1}}'} = \bigl(t/T,\,(t/T)^2\bigr)\otimes U.$$
The block $\hat{B}_{44}$ is equal to
$$\hat{B}_{44} = \frac{1}{4T}\sum_{t=1}^{T} v_t v_t'\otimes U'M_{A}U$$
and the $(i,j)$ sub-block of $\hat{B}_{55}$ is equal to
$$[\hat{B}_{55}]_{ij} = \frac{1}{4T}\sum_{t=1}^{T} (t/T)^{i+j}\,U'M_{A}U$$
for $i,j = 1,2$. The $i$th sub-block of $\hat{B}_{14}$ is equal to
$$[\hat{B}_{14}]_i = \frac{1}{4T}\sum_{t=1}^{T} x_{g_it}\bigl(e_i'(P_{At}^{0})^{-1}\otimes e_i' + e_i'\otimes e_i'(P_{At}^{0})^{-1}\bigr)(v_t'\otimes U)$$
and the $(i,j)$ sub-block of $\hat{B}_{15}$ is equal to
$$[\hat{B}_{15}]_{ij} = \frac{1}{4T}\sum_{t=1}^{T} (t/T)^j x_{g_it}\bigl(e_i'(P_{At}^{0})^{-1}\otimes e_i' + e_i'\otimes e_i'(P_{At}^{0})^{-1}\bigr)U,$$
$j = 1,2$. The corresponding sub-blocks of $\hat{B}_{24}$ and $\hat{B}_{25}$ are
$$[\hat{B}_{24}]_i = \frac{1}{4T}\sum_{t=1}^{T} x_{h_it}\bigl(e_i'(P_{At}^{0})^{-1}\otimes e_i' + e_i'\otimes e_i'(P_{At}^{0})^{-1}\bigr)(v_t'\otimes U)$$
and
$$[\hat{B}_{25}]_{ij} = \frac{1}{4T}\sum_{t=1}^{T} (t/T)^j x_{h_it}\bigl(e_i'(P_{At}^{0})^{-1}\otimes e_i' + e_i'\otimes e_i'(P_{At}^{0})^{-1}\bigr)U,$$
$j = 1,2$. The block $\hat{B}_{34}$ equals
$$\hat{B}_{34} = \frac{1}{4T}\sum_{t=1}^{T} x_{Gt}'M_{A}(v_t'\otimes U).$$
The $i$th sub-block of $\hat{B}_{35}$ equals
$$[\hat{B}_{35}]_i = \frac{1}{4T}\sum_{t=1}^{T} (t/T)^i x_{Gt}'M_{A}U,$$
$i = 1,2$. Finally, the $i$th sub-block of $\hat{B}_{45}$ is equal to
$$[\hat{B}_{45}]_i = \frac{1}{4T}\sum_{t=1}^{T} (t/T)^i (v_t'\otimes U)'M_{A}U.$$
Next, define
$$\hat{x}_{jt} = \frac{1}{2}\Bigl(\frac{t}{T}\Bigr)^j U'\bigl\{\mathrm{vec}\bigl((\hat{P}_{A0t})^{-1}\bigr) - \bigl((\hat{P}_{A0t})^{-1}\otimes(\hat{P}_{A0t})^{-1}\bigr)\mathrm{vec}(\hat{z}_t\hat{z}_t')\bigr\},$$
$j = 1, 2$, where $\hat{z}_t$ and $\hat{P}_{A0t}$ equal $z_t$ and $P_{A0t}$ estimated under $H_0$, respectively. The test statistic
$$LM_T = T\Bigl(\frac{1}{T}\sum_{t=1}^{T}\hat{x}_{1t}',\; \frac{1}{T}\sum_{t=1}^{T}\hat{x}_{2t}'\Bigr)[\hat{B}^{-1}]_{SW}\Bigl(\frac{1}{T}\sum_{t=1}^{T}\hat{x}_{1t}',\; \frac{1}{T}\sum_{t=1}^{T}\hat{x}_{2t}'\Bigr)',$$
where $[\hat{B}^{-1}]_{SW}$ is the $N(N-1)\times N(N-1)$ block in the south-west corner of the inverse of $\hat{B}$. As the matrix $\hat{B}$ can have a large dimension, its inverse can be obtained using block inversion methods, perhaps applied recursively. The test statistic has an asymptotic $\chi^2$-distribution with $N(N-1)$ degrees of freedom when $H_0$ holds.
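Operationally, the statistic only requires the stacked sample means of the auxiliary score contributions and the relevant block of the inverse information matrix. Below is a minimal sketch in Python (not the authors' R package); the inputs `x1`, `x2`, and `B_inv_sw` are hypothetical arrays standing in for the quantities defined above:

```python
import numpy as np

def lm_statistic(x1, x2, B_inv_sw):
    """Compute LM_T = T * xbar' [B^{-1}]_SW xbar, which is asymptotically
    chi-squared with N(N-1) degrees of freedom under H0.

    x1, x2   : (T, d) arrays of the auxiliary scores x_hat_{1t}, x_hat_{2t},
               with d = N(N-1)/2
    B_inv_sw : (2d, 2d) south-west block of the inverse information matrix
    """
    T = x1.shape[0]
    xbar = np.concatenate([x1.mean(axis=0), x2.mean(axis=0)])  # stacked means
    return T * xbar @ B_inv_sw @ xbar
```

In practice, `B_inv_sw` would come from inverting the plug-in estimate of $\hat{B}$, blockwise if the dimension is large.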

Appendix B. Simulations of Test Statistics

Appendix B.1. Tests of GARCH Equations

The test for slowly moving baseline volatility has a statistic whose distribution is sensitive to the high-frequency (GARCH) volatility. For this reason, one cannot use the asymptotic distribution; instead, the null distribution must be generated by simulation. Furthermore, Silvennoinen and Teräsvirta (2016) showed that the size of the test is distorted if the GARCH parameterisation deviates from the true one. For this reason, a few alternative approaches to estimating the GARCH parameters, and especially the persistence, have been investigated. It should be noted that estimating a GARCH model without taking the nonstationarity into account yields overestimated persistence, thereby distorting the null distribution of the test statistic and rendering the test outcomes unreliable. These estimates are given in Table A1.
The baseline volatility may be very different in different series. Therefore, one should not skip visual inspection of the returns, nor rely solely on general rules of thumb. If there are sufficiently long sections of data over which the general level of volatility remains constant, it is advisable to estimate the GARCH parameters over such a subsample. In the present case, there are a couple of relatively constant volatility sections, for example, one from November 2003 until October 2007.
The parameter estimates for that calm subperiod are in Table A1. Comparison with the estimates from the entire period GARCH model makes it clear that the neglected nonstationarity has biased the estimates, resulting in high persistence and kurtosis. As the data set has a sufficiently long span of GARCH-type clustering without (visually) significant movement in the general baseline level, relevant estimates are obtained by using that subsample only.
Another approach consists of estimating the GARCH equation over a rolling window such that the intercept is time-varying, targeting the unconditional volatility over each window, while the other parameters are assumed to be constant over the entire sample period and estimated in the usual way. The choice of the window length should consider the general recommendations regarding the sample size when attempting GARCH estimation. Too long a window will be impacted by the slowly changing baseline volatility level, whereas too short a window will yield very uncertain GARCH estimates.
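The mechanics of the rolling-window approach can be illustrated as follows. This is a sketch, not the authors' implementation: the window is a trailing one, returns are assumed to have zero mean, and `alpha` and `beta` are treated as known constants; the window length of 400 is the one adopted in the application:

```python
import numpy as np

def rolling_vt_intercept(returns, alpha, beta, window=400):
    """GARCH(1,1) intercept by rolling-window variance targeting:
    omega_t = (1 - alpha - beta) * (local second moment of the returns),
    so that the implied unconditional variance tracks the window average
    while alpha and beta stay constant over the whole sample."""
    r2 = np.asarray(returns) ** 2
    omega = np.empty(len(r2))
    for t in range(len(r2)):
        lo = max(0, t - window + 1)
        # zero-mean returns assumed, so the mean of squares is the variance
        omega[t] = (1.0 - alpha - beta) * r2[lo:t + 1].mean()
    return omega
```

A window that is too long averages over changes in the baseline level; one that is too short makes the targeted variance, and hence the intercept, very noisy.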
To investigate the properties of this approach, we ran a simulation experiment with a few different baseline volatilities. The window widths varied from 250 to 1000 observations. Figure A1, Figure A2 and Figure A3 depict the distributions of the GARCH estimates and the derived persistence and kurtosis measures, as explained in He and Teräsvirta (1999), for a selection of baseline volatilities and window widths. Based on these experiments, we concluded that a window width of 400 observations yields sufficiently robust results for our application.
The resulting GARCH estimates are reported in Table A1, and they are quite similar to the ones obtained for the aforementioned calm period. This can be interpreted as support for the rolling window method, particularly in situations where visual inspection of data does not reveal a sufficiently long period of constant unconditional volatility.
Overall, it is clear that simply using the GARCH estimates from the entire sample to calibrate the null distribution of the test statistic for the specification of the deterministic component of the volatility is not recommended. For comparison, Table A1 also reports the GARCH estimates from a TV-GARCH model in which the TV specification has been completed. The estimated persistence is higher than that obtained from the calm period or the rolling-window variance-targeting method; however, as discussed in Silvennoinen and Teräsvirta (2016), underestimation of persistence has a less severe impact on the performance of the TV specification test than overestimation does.
Table A1. Specification stage for the deterministic component in volatilities of each of the four banks. α ˜ and β ˜ are the initial estimates used for calibrating the test statistic distribution. The rolling window method allows the GARCH intercept to adjust to target the unconditional variance in a window of size 400. The ‘calm period’ selects the continuous period from November 2003 to October 2007, which has very little visible variation in the baseline volatility. For comparison, the GARCH estimates from the entire sample period are reported along with the final estimates from the TV-GARCH model.
| Method | Bank | $\tilde{\alpha}$ | $\tilde{\beta}$ | Persistence | Kurtosis |
|---|---|---|---|---|---|
| Rolling window 400 | ANZ | 0.090 | 0.836 | 0.926 | 3.38 |
| Rolling window 400 | CBA | 0.087 | 0.850 | 0.937 | 3.43 |
| Rolling window 400 | NAB | 0.095 | 0.817 | 0.912 | 3.36 |
| Rolling window 400 | WBC | 0.085 | 0.858 | 0.943 | 3.45 |
| Calm period | ANZ | 0.073 | 0.852 | 0.925 | 3.24 |
| Calm period | CBA | 0.081 | 0.842 | 0.923 | 3.29 |
| Calm period | NAB | 0.066 | 0.829 | 0.896 | 3.14 |
| Calm period | WBC | 0.091 | 0.806 | 0.897 | 3.28 |
| Entire period GARCH only | ANZ | 0.065 | 0.927 | 0.992 | 6.40 |
| Entire period GARCH only | CBA | 0.089 | 0.890 | 0.979 | 4.83 |
| Entire period GARCH only | NAB | 0.104 | 0.867 | 0.971 | 4.85 |
| Entire period GARCH only | WBC | 0.075 | 0.911 | 0.986 | 5.08 |
| Entire period TV-GARCH | ANZ | 0.078 | 0.880 | 0.957 | 3.50 |
| Entire period TV-GARCH | CBA | 0.091 | 0.860 | 0.950 | 3.61 |
| Entire period TV-GARCH | NAB | 0.107 | 0.825 | 0.931 | 3.62 |
| Entire period TV-GARCH | WBC | 0.084 | 0.878 | 0.962 | 3.70 |
Figure A1. Simulated distributions of GARCH estimates and implied persistence and kurtosis measures for a selection of window widths. The baseline g t has a single transition. The dotted vertical lines indicate the true values of the parameters α , β , persistence, and kurtosis.
Figure A2. Simulated distributions of GARCH estimates and implied persistence and kurtosis measures for a selection of window widths. The baseline g t has an asymmetric double transition. The dotted vertical lines indicate the true values of the parameters α , β , persistence, and kurtosis.
Figure A3. Simulated distributions of GARCH estimates and implied persistence and kurtosis measures for a selection of window widths. The baseline g t has two double transitions. The dotted vertical lines indicate the true values of the parameters α , β , persistence, and kurtosis.

Appendix B.2. Evaluation Tests of GARCH Equations

The fact that the evaluation tests discussed in Appendix A.2 are applied to the pre-filtered data $\hat{P}_t^{-1/2}\varepsilon_t$ is known to potentially alter the distribution of the test statistic. In this section, we present simulation results showing that the size of the tests remains practically unchanged, rendering the tests applicable in the proposed way.
The simulation uses 2000 observations on a bivariate TV-GARCH model parametrised as $h_t = 0.10 + 0.05\,\varepsilon_{t-1}^2/g_{t-1} + 0.85\,h_{t-1}$, $g_t = 1 + 3\,(1+\exp\{-e^{3}(t/T-0.5)\})^{-1}$. These are coupled with a CCC model with $\rho = 0.5$, and then with an STCC model parametrised as $\rho^{(1)} = 0.3$, $\rho^{(2)} = 0.7$, $G_t = (1+\exp\{-e^{2.5}(t/T-0.5)\})^{-1}$. The noise terms are iid standard normal. Two estimation procedures were used, a two-step and a multi-step one.
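This DGP can be simulated directly. The sketch below assumes the standard logistic form with a negative sign in the exponent, with slopes $e^3$ and $e^{2.5}$ as in the text; the function name and starting value for $h_t$ are choices of this illustration, not the authors' code:

```python
import numpy as np

def simulate_mtv_stcc(T=2000, seed=1):
    """Simulate the bivariate TV-GARCH/STCC design described above."""
    rng = np.random.default_rng(seed)
    u = np.arange(1, T + 1) / T
    g = 1.0 + 3.0 / (1.0 + np.exp(-np.e ** 3 * (u - 0.5)))     # baseline volatility
    G = 1.0 / (1.0 + np.exp(-np.exp(2.5) * (u - 0.5)))         # correlation transition
    rho = (1.0 - G) * 0.3 + G * 0.7                            # rho moves 0.3 -> 0.7
    eps = np.zeros((T, 2))
    h = np.full(2, 0.10 / (1.0 - 0.05 - 0.85))                 # start at unconditional level
    for t in range(T):
        z = rng.standard_normal(2)
        z2 = rho[t] * z[0] + np.sqrt(1.0 - rho[t] ** 2) * z[1]  # correlated shocks
        zt = np.array([z[0], z2])
        eps[t] = np.sqrt(h * g[t]) * zt
        h = 0.10 + 0.05 * eps[t] ** 2 / g[t] + 0.85 * h         # next-period h_t
    return eps, g, rho
```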
First step: The individual TV-GARCH models are estimated, assuming the series are uncorrelated.
Second step: Estimate the correlation model conditional on the volatility model estimates from the previous step. Then, estimate the TV-GARCH models conditional on the correlation estimates.
The misspecification tests are then calculated using the TV-GARCH estimates from the second step, and the data are pre-filtered with the correlation estimates from the second step. The multi-step procedure continues repeating the second step until no further improvement is achieved.
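The pre-filtering step removes the estimated contemporaneous correlation from the residuals. A small sketch of the symmetric inverse square root commonly used for this (an assumption of this illustration; any root satisfying $S'S = P^{-1}$ would serve the same purpose):

```python
import numpy as np

def inv_sqrt(P):
    """Symmetric inverse square root P^{-1/2} via the eigendecomposition
    of a positive definite correlation matrix P."""
    w, V = np.linalg.eigh(P)
    return V @ np.diag(w ** -0.5) @ V.T

P_hat = np.array([[1.0, 0.5],
                  [0.5, 1.0]])   # hypothetical correlation estimate
S = inv_sqrt(P_hat)              # filtering matrix
# S @ P_hat @ S is the identity, so S @ eps_t has uncorrelated components
```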
Table A2. Size simulation for the three types of misspecification tests in Amado and Teräsvirta (2017). 2000 replications. T = 2000 , N = 2 . MS1: g t additively misspecified, alternative linearised with a first-order term only; MS2-a: GARCH(1,1) vs. GARCH(1,2); MS2-b: GARCH(1,1) vs. GARCH(2,1); MS3: test for remaining ARCH, lag 1.
| Procedure | Test | Standard 10% | Standard 5% | Standard 1% | Robust 10% | Robust 5% | Robust 1% |
|---|---|---|---|---|---|---|---|
| CCC two-step | MS1 | 0.146 | 0.085 | 0.020 | 0.132 | 0.074 | 0.016 |
| CCC two-step | MS2-a | 0.122 | 0.064 | 0.012 | 0.101 | 0.048 | 0.013 |
| CCC two-step | MS2-b | 0.143 | 0.080 | 0.017 | 0.108 | 0.051 | 0.008 |
| CCC two-step | MS3 | 0.125 | 0.061 | 0.010 | 0.104 | 0.054 | 0.010 |
| STCC two-step | MS1 | 0.134 | 0.074 | 0.023 | 0.121 | 0.055 | 0.015 |
| STCC two-step | MS2-a | 0.123 | 0.059 | 0.015 | 0.101 | 0.045 | 0.013 |
| STCC two-step | MS2-b | 0.122 | 0.062 | 0.019 | 0.087 | 0.044 | 0.010 |
| STCC two-step | MS3 | 0.115 | 0.058 | 0.015 | 0.100 | 0.050 | 0.011 |
| CCC multi-step | MS1 | 0.145 | 0.083 | 0.022 | 0.133 | 0.073 | 0.014 |
| CCC multi-step | MS2-a | 0.116 | 0.062 | 0.015 | 0.097 | 0.052 | 0.009 |
| CCC multi-step | MS2-b | 0.133 | 0.069 | 0.018 | 0.100 | 0.046 | 0.010 |
| CCC multi-step | MS3 | 0.120 | 0.062 | 0.016 | 0.107 | 0.060 | 0.014 |
| STCC multi-step | MS1 | 0.147 | 0.084 | 0.023 | 0.135 | 0.068 | 0.012 |
| STCC multi-step | MS2-a | 0.130 | 0.059 | 0.011 | 0.103 | 0.046 | 0.006 |
| STCC multi-step | MS2-b | 0.120 | 0.067 | 0.016 | 0.090 | 0.039 | 0.005 |
| STCC multi-step | MS3 | 0.112 | 0.055 | 0.012 | 0.104 | 0.047 | 0.009 |
From Table A2, it is evident that the standard form of the tests is slightly oversized. The robust version of the tests, on the other hand, seems to behave well, and there is no need for any adjustments of the test statistics or their distributions. Therefore, the procedure of removing the correlations between the series prior to applying the evaluation tests can be recommended.

Appendix B.3. Tests of Correlations

The simulation experiment investigates the size of the test in an environment where the multivariate model is correctly specified. The number of data series considered in the system is $N = 2, 5, 10, 20$. The length varies from $T = 25$ for the bivariate systems, which is relevant for time series systems in macro applications, up to $T = 1000$, which, in turn, is considered to be a fairly small sample size for high-frequency returns data. The length of the time series places a constraint on the dimension of the model; that is, the parametric alternative is only feasible if the number of parameters remains comfortably below the number of available data points. We simulated the test both assuming that $D_t \equiv I_N$ and allowing for conditional heteroskedasticity in the model, $D_t \neq I_N$.
When $D_t \equiv I_N$, we found that the results were fairly independent of the structure of the correlations. We used both equicorrelation and Toeplitz matrices in our simulations, and the results remained the same. Table A3 contains the results of a simulation in which $D_t \equiv I_N$ and the $N \times N$ correlation matrix $P = [\rho_{ij}]$ is an equicorrelation matrix with weak ($\rho = 1/3$) and moderately strong ($\rho = 2/3$) correlation. The table also reports the results from using a Toeplitz correlation matrix with $\rho_{ij} = \rho^{|i-j|}$, $i,j = 1,\ldots,N$, where $\rho = 0.5$ represents moderate to weak correlation and $\rho = 0.9$ strong to moderate correlation. It is seen that the empirical size of the test is rather close to the nominal one already when $N = 2$ and $T = 100$. The size holds up across the various correlation patterns.
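The two correlation structures used in the simulations are straightforward to construct; a small sketch:

```python
import numpy as np

def equicorrelation(N, rho):
    """N x N correlation matrix with unit diagonal and constant
    off-diagonal entry rho."""
    return (1.0 - rho) * np.eye(N) + rho * np.ones((N, N))

def toeplitz_corr(N, rho):
    """N x N Toeplitz correlation matrix with rho_ij = rho**|i-j|."""
    idx = np.arange(N)
    return rho ** np.abs(idx[:, None] - idx[None, :])
```

For the parameter values used in the tables (e.g. $\rho = 1/3$ or $\rho = 0.5$), both constructions are positive definite.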
Table A3. Size-study: Test of constant correlations. Data are generated as an MTV-CCC with an equicorrelation coefficient of 0.33 (CEC33) and 0.67 (CEC67) and a Toeplitz structure with a correlation coefficient of 0.5 (CTC50) and 0.9 (CTC90). Tests are based on the first-order polynomial approximation. A total of 5000 replications.
| N | T | CEC33 1% | CEC33 5% | CEC33 10% | CEC67 1% | CEC67 5% | CEC67 10% | CTC50 1% | CTC50 5% | CTC50 10% | CTC90 1% | CTC90 5% | CTC90 10% |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 25 | 0.023 | 0.076 | 0.132 | 0.022 | 0.069 | 0.128 | 0.024 | 0.074 | 0.130 | 0.022 | 0.070 | 0.126 |
| 2 | 50 | 0.015 | 0.063 | 0.116 | 0.016 | 0.064 | 0.115 | 0.016 | 0.064 | 0.115 | 0.015 | 0.062 | 0.109 |
| 2 | 100 | 0.011 | 0.056 | 0.104 | 0.010 | 0.054 | 0.102 | 0.011 | 0.056 | 0.103 | 0.010 | 0.051 | 0.101 |
| 2 | 250 | 0.012 | 0.055 | 0.108 | 0.010 | 0.054 | 0.107 | 0.011 | 0.055 | 0.106 | 0.009 | 0.053 | 0.108 |
| 2 | 500 | 0.010 | 0.051 | 0.097 | 0.009 | 0.049 | 0.097 | 0.010 | 0.050 | 0.096 | 0.009 | 0.050 | 0.094 |
| 2 | 1000 | 0.010 | 0.048 | 0.099 | 0.010 | 0.048 | 0.095 | 0.010 | 0.046 | 0.097 | 0.010 | 0.049 | 0.092 |
| 5 | 100 | 0.011 | 0.054 | 0.112 | 0.011 | 0.053 | 0.110 | 0.011 | 0.056 | 0.112 | 0.011 | 0.053 | 0.111 |
| 5 | 250 | 0.014 | 0.054 | 0.099 | 0.012 | 0.051 | 0.099 | 0.013 | 0.053 | 0.100 | 0.012 | 0.051 | 0.101 |
| 5 | 500 | 0.010 | 0.050 | 0.104 | 0.010 | 0.053 | 0.106 | 0.009 | 0.052 | 0.101 | 0.010 | 0.054 | 0.105 |
| 5 | 1000 | 0.010 | 0.056 | 0.102 | 0.010 | 0.052 | 0.103 | 0.009 | 0.053 | 0.100 | 0.008 | 0.053 | 0.103 |
| 10 | 250 | 0.013 | 0.055 | 0.112 | 0.013 | 0.057 | 0.112 | 0.013 | 0.057 | 0.110 | 0.012 | 0.054 | 0.115 |
| 10 | 500 | 0.009 | 0.049 | 0.101 | 0.010 | 0.049 | 0.104 | 0.008 | 0.053 | 0.103 | 0.010 | 0.050 | 0.103 |
| 10 | 1000 | 0.011 | 0.052 | 0.102 | 0.011 | 0.054 | 0.105 | 0.011 | 0.053 | 0.099 | 0.012 | 0.056 | 0.103 |
| 20 | 1000 | 0.012 | 0.056 | 0.106 | 0.012 | 0.057 | 0.106 | 0.013 | 0.056 | 0.103 | 0.012 | 0.056 | 0.107 |
We next turn to the case $D_t \neq I_N$. Table A4 and Table A5 contain the results of size simulations in which the sensitivity of the test is examined against combinations of GARCH persistence and kurtosis as well as a selection of correlation strengths (the equicorrelation and Toeplitz structures described above). The test is generally well sized.
Table A4. Size-study: Test of constant correlations. Data are generated as an MTV-GARCH-CEC with persistence of 0.95 and 0.97, kurtosis of 4 and 6, and an equicorrelation coefficient of 0.33 and 0.67. Tests are based on the first-order polynomial approximation. A total of 2500 replications.
| Persistence | N | T | CEC33/k4 1% | CEC33/k4 5% | CEC33/k4 10% | CEC33/k6 1% | CEC33/k6 5% | CEC33/k6 10% | CEC67/k4 1% | CEC67/k4 5% | CEC67/k4 10% | CEC67/k6 1% | CEC67/k6 5% | CEC67/k6 10% |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.95 | 2 | 500 | 0.012 | 0.056 | 0.108 | 0.016 | 0.056 | 0.106 | 0.016 | 0.070 | 0.122 | 0.016 | 0.092 | 0.122 |
| 0.95 | 2 | 1000 | 0.009 | 0.044 | 0.103 | 0.009 | 0.042 | 0.097 | 0.011 | 0.045 | 0.093 | 0.009 | 0.044 | 0.097 |
| 0.95 | 2 | 2000 | 0.008 | 0.042 | 0.094 | 0.007 | 0.042 | 0.090 | 0.010 | 0.052 | 0.099 | 0.009 | 0.046 | 0.092 |
| 0.95 | 5 | 500 | 0.006 | 0.062 | 0.118 | 0.006 | 0.070 | 0.114 | 0.018 | 0.076 | 0.140 | 0.018 | 0.082 | 0.146 |
| 0.95 | 5 | 1000 | 0.016 | 0.060 | 0.119 | 0.016 | 0.061 | 0.112 | 0.016 | 0.059 | 0.115 | 0.018 | 0.060 | 0.112 |
| 0.95 | 5 | 2000 | 0.010 | 0.058 | 0.108 | 0.008 | 0.051 | 0.102 | 0.016 | 0.060 | 0.116 | 0.010 | 0.052 | 0.098 |
| 0.95 | 10 | 500 | 0.016 | 0.058 | 0.118 | 0.020 | 0.064 | 0.114 | 0.020 | 0.068 | 0.116 | 0.024 | 0.080 | 0.128 |
| 0.95 | 10 | 1000 | 0.018 | 0.053 | 0.104 | 0.015 | 0.051 | 0.101 | 0.014 | 0.061 | 0.111 | 0.017 | 0.063 | 0.110 |
| 0.95 | 10 | 2000 | 0.014 | 0.072 | 0.126 | 0.012 | 0.060 | 0.112 | 0.018 | 0.082 | 0.142 | 0.013 | 0.062 | 0.118 |
| 0.97 | 2 | 500 | 0.010 | 0.056 | 0.114 | 0.012 | 0.054 | 0.118 | 0.020 | 0.072 | 0.114 | 0.014 | 0.068 | 0.120 |
| 0.97 | 2 | 1000 | 0.011 | 0.043 | 0.102 | 0.011 | 0.044 | 0.103 | 0.012 | 0.047 | 0.107 | 0.013 | 0.048 | 0.103 |
| 0.97 | 2 | 2000 | 0.009 | 0.046 | 0.094 | 0.007 | 0.042 | 0.089 | 0.010 | 0.056 | 0.108 | 0.012 | 0.050 | 0.093 |
| 0.97 | 5 | 500 | 0.004 | 0.066 | 0.124 | 0.012 | 0.056 | 0.104 | 0.012 | 0.088 | 0.152 | 0.018 | 0.086 | 0.164 |
| 0.97 | 5 | 1000 | 0.015 | 0.063 | 0.113 | 0.014 | 0.067 | 0.114 | 0.018 | 0.063 | 0.121 | 0.019 | 0.060 | 0.125 |
| 0.97 | 5 | 2000 | 0.010 | 0.060 | 0.110 | 0.008 | 0.050 | 0.100 | 0.015 | 0.060 | 0.118 | 0.012 | 0.050 | 0.101 |
| 0.97 | 10 | 500 | 0.012 | 0.062 | 0.108 | 0.016 | 0.070 | 0.112 | 0.016 | 0.072 | 0.112 | 0.022 | 0.086 | 0.148 |
| 0.97 | 10 | 1000 | 0.016 | 0.053 | 0.100 | 0.015 | 0.056 | 0.107 | 0.015 | 0.063 | 0.113 | 0.018 | 0.057 | 0.110 |
| 0.97 | 10 | 2000 | 0.015 | 0.074 | 0.132 | 0.014 | 0.058 | 0.108 | 0.016 | 0.088 | 0.142 | 0.010 | 0.063 | 0.112 |
Table A5. Size-study: Test of constant correlations. Data are generated as an MTV-GARCH-CTC with persistence of 0.95 and 0.97, kurtosis of 4 and 6, and a correlation matrix with a Toeplitz structure with a correlation coefficient of 0.5 and 0.9. Tests are based on the first-order polynomial approximation. A total of 2500 replications.
| Persistence | N | T | CTC50/k4 1% | CTC50/k4 5% | CTC50/k4 10% | CTC50/k6 1% | CTC50/k6 5% | CTC50/k6 10% | CTC90/k4 1% | CTC90/k4 5% | CTC90/k4 10% | CTC90/k6 1% | CTC90/k6 5% | CTC90/k6 10% |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.95 | 2 | 500 | 0.010 | 0.064 | 0.102 | 0.010 | 0.070 | 0.106 | 0.018 | 0.094 | 0.136 | 0.026 | 0.088 | 0.146 |
| 0.95 | 2 | 1000 | 0.009 | 0.041 | 0.097 | 0.011 | 0.042 | 0.103 | 0.014 | 0.053 | 0.096 | 0.020 | 0.062 | 0.104 |
| 0.95 | 2 | 2000 | 0.008 | 0.044 | 0.096 | 0.009 | 0.044 | 0.090 | 0.017 | 0.066 | 0.120 | 0.014 | 0.048 | 0.098 |
| 0.95 | 5 | 500 | 0.006 | 0.062 | 0.118 | 0.010 | 0.058 | 0.114 | 0.020 | 0.120 | 0.212 | 0.050 | 0.134 | 0.210 |
| 0.95 | 5 | 1000 | 0.014 | 0.060 | 0.112 | 0.018 | 0.064 | 0.113 | 0.027 | 0.076 | 0.134 | 0.034 | 0.093 | 0.144 |
| 0.95 | 5 | 2000 | 0.011 | 0.057 | 0.110 | 0.008 | 0.052 | 0.105 | 0.020 | 0.075 | 0.142 | 0.018 | 0.058 | 0.110 |
| 0.95 | 10 | 500 | 0.012 | 0.070 | 0.120 | 0.016 | 0.080 | 0.128 | 0.040 | 0.114 | 0.172 | 0.078 | 0.150 | 0.230 |
| 0.95 | 10 | 1000 | 0.012 | 0.049 | 0.100 | 0.013 | 0.051 | 0.102 | 0.019 | 0.078 | 0.127 | 0.032 | 0.089 | 0.147 |
| 0.95 | 10 | 2000 | 0.019 | 0.072 | 0.127 | 0.014 | 0.059 | 0.111 | 0.033 | 0.110 | 0.178 | 0.018 | 0.077 | 0.140 |
| 0.97 | 2 | 500 | 0.014 | 0.066 | 0.114 | 0.018 | 0.068 | 0.116 | 0.016 | 0.082 | 0.134 | 0.030 | 0.104 | 0.164 |
| 0.97 | 2 | 1000 | 0.009 | 0.044 | 0.101 | 0.008 | 0.042 | 0.099 | 0.016 | 0.051 | 0.112 | 0.022 | 0.063 | 0.119 |
| 0.97 | 2 | 2000 | 0.010 | 0.050 | 0.102 | 0.009 | 0.044 | 0.092 | 0.024 | 0.070 | 0.120 | 0.015 | 0.052 | 0.100 |
| 0.97 | 5 | 500 | 0.014 | 0.056 | 0.128 | 0.008 | 0.074 | 0.130 | 0.024 | 0.134 | 0.208 | 0.052 | 0.160 | 0.256 |
| 0.97 | 5 | 1000 | 0.013 | 0.059 | 0.112 | 0.016 | 0.066 | 0.123 | 0.022 | 0.082 | 0.157 | 0.037 | 0.102 | 0.164 |
| 0.97 | 5 | 2000 | 0.014 | 0.062 | 0.112 | 0.010 | 0.052 | 0.101 | 0.028 | 0.086 | 0.145 | 0.020 | 0.066 | 0.116 |
| 0.97 | 10 | 500 | 0.018 | 0.080 | 0.128 | 0.022 | 0.088 | 0.130 | 0.040 | 0.114 | 0.172 | 0.100 | 0.188 | 0.278 |
| 0.97 | 10 | 1000 | 0.012 | 0.054 | 0.105 | 0.013 | 0.054 | 0.107 | 0.019 | 0.078 | 0.127 | 0.030 | 0.104 | 0.181 |
| 0.97 | 10 | 2000 | 0.016 | 0.072 | 0.132 | 0.016 | 0.062 | 0.110 | 0.033 | 0.110 | 0.178 | 0.026 | 0.089 | 0.150 |
However, an interesting aspect is the slight oversizing that appears when kurtosis decreases (which means shifting the relative weight from $\alpha$ to $\beta$ in the GARCH equation while keeping the persistence constant). A change in persistence does not seem to affect the size of the test. As the dimension of the system increases, the test does not perform equally well, and increasing the sample size does not seem to counteract this (the simulations use $T = 500, 1000, 2000$).
In yet another simulation (results not reported here), we considered the effects of misspecifying the conditional heteroskedasticity on the correlation test. More specifically, when $D_t \neq I_N$ but conditional heteroskedasticity is ignored, the test is, as may be expected, heavily oversized. The obvious conclusion is that the constancy of correlations can only be tested after specifying and estimating both $D_t$ and $S_t$.

Appendix C. Details of Maximisation by Parts

This appendix outlines the estimation algorithm, which derives from Silvennoinen and Teräsvirta (2021). The estimation proceeds as follows.
1. Assume $\ln h_{it}(\theta_{h_i}, \theta_{g_i}) = 0$, $i = 1,\ldots,N$, and estimate the parameters $\theta_g = (\theta_{g_1}',\ldots,\theta_{g_N}')'$ equation by equation, assuming $P_t(\theta_\rho) = I_N$. Denote the estimate $S_t(\hat{\theta}_g^{(1,1)})$. This means that the deterministic components $g_i(t/T, \theta_{g_i})$ have been estimated once, including the intercept $\delta_{i0}$ in (2).
2. Estimate $P_t(\theta_\rho)$ given $\theta_g = \hat{\theta}_g^{(1,1)}$. This requires a separate iteration because $P_t(\theta_\rho)$ is nonlinear in parameters; see (5) and (6). Denote the estimate $P_t(\hat{\theta}_\rho^{(1,1)})$.
3. Re-estimate $S_t(\theta_g)$ assuming $P_t(\theta_\rho) = P_t(\hat{\theta}_\rho^{(1,1)})$. This yields $S_t(\hat{\theta}_g^{(1,2)})$. Then, re-estimate $P_t(\theta_\rho)$ given $\theta_g = \hat{\theta}_g^{(1,2)}$. Iterate until convergence. Let the result after $R_1$ iterations be $S_t(\theta_g) = S_t(\hat{\theta}_g^{(1,R_1)})$ and $P_t(\theta_\rho) = P_t(\hat{\theta}_\rho^{(1,R_1)})$. The resulting estimates are maximum likelihood ones under the assumption $D_t(\theta_h, \theta_g) = I_N$.
4. Estimate $\theta_h$ from $D_t(\theta_h, \hat{\theta}_g^{(1,R_1)})$ using $P_t(\theta_\rho) = P_t(\hat{\theta}_\rho^{(1,R_1)})$. This is a standard multivariate conditional correlation GARCH estimation step as in Bollerslev (1990), because $S_t(\hat{\theta}_g^{(1,R_1)})$ is fixed and does not affect the maximum, and $P_t(\hat{\theta}_\rho^{(1,R_1)})$ is known. In total, Steps 1–4 form the first iteration of the maximisation algorithm. Denote the estimate $\hat{\theta}_h^{(1)}$.
5. Estimate $\theta_g$ from $S_t(\theta_g)$, keeping $D_t(\hat{\theta}_h^{(1)}, \hat{\theta}_g^{(1,R_1)})$ and $P_t(\hat{\theta}_\rho^{(1,R_1)})$ fixed. This step is analogous to the first part of Step 3; the difference is that $D_t(\hat{\theta}_h^{(1)}, \hat{\theta}_g^{(1,R_1)}) \neq I_N$. Denote the estimate $S_t(\hat{\theta}_g^{(2,1)})$.
6. Estimate $P_t(\theta_\rho)$ given $\theta_g = \hat{\theta}_g^{(2,1)}$ and $\theta_h = \hat{\theta}_h^{(1)}$. Denote the estimate $P_t(\hat{\theta}_\rho^{(2,1)})$. Iterate until convergence, $R_2$ iterations. The result: $S_t(\theta_g) = S_t(\hat{\theta}_g^{(2,R_2)})$ and $P_t(\theta_\rho) = P_t(\hat{\theta}_\rho^{(2,R_2)})$.
7. Estimate $\theta_h$ from $D_t(\theta_h, \hat{\theta}_g^{(2,R_2)})$ using $P_t(\theta_\rho) = P_t(\hat{\theta}_\rho^{(2,R_2)})$ ($S_t(\hat{\theta}_g^{(2,R_2)})$ is fixed). The result: $\theta_h = \hat{\theta}_h^{(2)}$. This completes the second full iteration.
8. Repeat Steps 5–7 and iterate until convergence.
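The alternating structure of these steps can be sketched as a loop. The toy example below is not the authors' R package: the ML steps are replaced by deliberately crude stand-ins (a moving-average variance for the volatility step and a constant sample correlation for the correlation step) purely to illustrate the iteration logic of maximisation by parts:

```python
import numpy as np

def maximisation_by_parts(eps, max_iter=25, tol=1e-10):
    """Alternate between a volatility step and a correlation step until the
    correlation estimate stops changing.  In the real algorithm each step
    maximises the likelihood over its own parameter block with the other
    blocks held fixed; here the estimators are simple moment-based proxies."""
    T, N = eps.shape
    sigma2 = np.full((T, N), eps.var(axis=0))           # start: constant variance
    rho_old = np.eye(N)
    for _ in range(max_iter):
        z = eps / np.sqrt(sigma2)                       # standardised residuals
        rho = np.corrcoef(z, rowvar=False)              # correlation step
        kernel = np.ones(51) / 51.0                     # volatility step: local
        sigma2 = np.column_stack(                       # average of squared obs
            [np.convolve(eps[:, i] ** 2, kernel, mode="same") for i in range(N)])
        sigma2 = np.maximum(sigma2, 1e-12)              # keep variances positive
        if np.max(np.abs(rho - rho_old)) < tol:
            break                                       # converged
        rho_old = rho
    return sigma2, rho
```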
For identification reasons, $\delta_{i0}$, $i = 1,\ldots,N$, is frozen to $\delta_{i0} = \hat{\delta}_{i0}^{(1,R_1)}$. This frees the intercepts in $\theta_{h_i}$. Any positive constant would do for $\delta_{i0}$; however, for numerical reasons, the intercepts are fixed to the values they obtain after the first iteration, when $\theta_h$ has not yet been estimated a single time.
In practice, when estimating the slope parameters in the transition functions, it may be useful to apply the transformation $\gamma_{ij} = \exp\{\eta_{ij}\}$, in which case $\gamma_{ij}$ need not be restricted when $\eta_{ij}$ is bounded away from $-\infty$. The motivation for this transformation is that estimating $\eta_{ij}$ instead of $\gamma_{ij}$ is numerically convenient in cases where $\gamma_{ij}$ is large; see Goodwin et al. (2011) or Silvennoinen and Teräsvirta (2016) for discussion.
Another alternative, proposed by Chan and Theoharakis (2011), is to redefine the slope parameter as $\gamma_{ij} = 1/\eta_{ij}^2$ and estimate $\eta_{ij}$. The authors show that this also alleviates the convergence problems sometimes found when $\gamma_{ij}$ is large. Ekner and Nejstgaard (2013) aim at the same effect by rescaling $\gamma_{ij}$ to vary between zero and one.
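A small numerical illustration of why such reparameterisations help, assuming the standard logistic transition: for a steep transition, the derivative of $G$ with respect to $\gamma$ itself is nearly zero, so the likelihood is almost flat in $\gamma$, whereas the derivative with respect to $\eta = \log\gamma$ is rescaled by $\gamma$ and remains workable (the particular values of $u$, $c$, and $\gamma$ below are illustrative choices):

```python
import numpy as np

def G(u, gamma, c):
    """Logistic transition function G(u; gamma, c)."""
    return 1.0 / (1.0 + np.exp(-gamma * (u - c)))

u, c, gamma = 0.52, 0.50, 300.0            # steep transition, u near the location
Gu = G(u, gamma, c)
dG_dgamma = (u - c) * Gu * (1.0 - Gu)      # analytic derivative w.r.t. gamma
dG_deta = gamma * dG_dgamma                # chain rule for eta = log(gamma)
```

Here `dG_dgamma` is of order $10^{-5}$ while `dG_deta` is of order $10^{-2}$, which is the numerical advantage the transformation exploits.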

Appendix D. Estimated Transition Equations

This appendix contains the estimated deterministic components of the TV-GARCH equations (standard deviation estimates in parentheses). Note that the intercept is fixed after the first iteration; hence, it does not have a standard deviation estimate.
$$\text{ANZ:}\quad \hat{g}_{1t} = 2.28 - \underset{(0.059)}{1.234}\Bigl(1+\exp\bigl\{-\underset{(1.223)}{5.715}\bigl(t/T - \underset{(0.003)}{0.404}\bigr)\bigr\}\Bigr)^{-1} + \underset{(1.518)}{12.316}\Bigl(1+\exp\bigl\{-\underset{(0.392)}{5.875}\bigl(t/T - \underset{(0.002)}{0.571}\bigr)\bigr\}\Bigr)^{-1} - \underset{(1.514)}{11.704}\Bigl(1+\exp\bigl\{-\underset{(0.166)}{4.459}\bigl(t/T - \underset{(0.004)}{0.623}\bigr)\bigr\}\Bigr)^{-1}.$$
$$\text{CBA:}\quad \hat{g}_{2t} = 1.35 - \underset{(0.054)}{0.525}\Bigl(1+\exp\bigl\{-\underset{(2.545)}{5.638}\bigl(t/T - \underset{(0.007)}{0.407}\bigr)\bigr\}\Bigr)^{-1} + \underset{(1.871)}{9.257}\Bigl(1+\exp\bigl\{-\underset{(0.374)}{5.117}\bigl(t/T - \underset{(0.004)}{0.574}\bigr)\bigr\}\Bigr)^{-1} - \underset{(1.867)}{8.944}\Bigl(1+\exp\bigl\{-\underset{(0.252)}{4.504}\bigl(t/T - \underset{(0.006)}{0.621}\bigr)\bigr\}\Bigr)^{-1}.$$
$$\text{NAB:}\quad \hat{g}_{3t} = 1.07 + \underset{(1.273)}{3.843}\Bigl(1+\exp\bigl\{-\underset{(0.130)}{2.518}\bigl(t/T - \underset{(0.034)}{0.303}\bigr)\bigr\}\Bigr)^{-1} - \underset{(1.114)}{3.491}\Bigl(1+\exp\bigl\{-\underset{(0.329)}{3.787}\bigl(t/T - \underset{(0.008)}{0.373}\bigr)\bigr\}\Bigr)^{-1} + \underset{(5.692)}{20.026}\Bigl(1+\exp\bigl\{-\underset{(0.229)}{4.926}\bigl(t/T - \underset{(0.003)}{0.576}\bigr)\bigr\}\Bigr)^{-1} - \underset{(5.676)}{20.039}\Bigl(1+\exp\bigl\{-\underset{(0.123)}{4.183}\bigl(t/T - \underset{(0.006)}{0.609}\bigr)\bigr\}\Bigr)^{-1}.$$
$$\text{WBC:}\quad \hat{g}_{4t} = 2.45 - \underset{(0.554)}{3.120}\Bigl(1+\exp\bigl\{-\underset{(0.124)}{2.194}\bigl(t/T - \underset{(0.034)}{0.534}\bigr)\bigr\}\Bigr)^{-1} + \underset{(12.524)}{25.782}\Bigl(1+\exp\bigl\{-\underset{(0.158)}{4.569}\bigl(t/T - \underset{(0.006)}{0.585}\bigr)\bigr\}\Bigr)^{-1} - \underset{(12.682)}{23.616}\Bigl(1+\exp\bigl\{-\underset{(0.375)}{4.767}\bigl(t/T - \underset{(0.007)}{0.607}\bigr)\bigr\}\Bigr)^{-1}.$$
The locations of the transitions are remarkably similar across the four equations. The first transition of the WBC equation is very slow: its effect extends over the whole estimation period and is the reason for the post-crisis decline in the value of $\hat{g}_{4t}$; see Figure 5.
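For completeness, a fitted component such as the ANZ equation above can be evaluated on a grid of rescaled time. This is a sketch using the point estimates only (standard errors omitted); the negative sign inside the exponent is the standard logistic convention assumed here:

```python
import numpy as np

def logistic(u, gamma, c):
    """Logistic transition function G(u; gamma, c)."""
    return 1.0 / (1.0 + np.exp(-gamma * (u - c)))

def g_hat_anz(u):
    """Fitted deterministic volatility component for ANZ, built from the
    point estimates reported in Appendix D."""
    return (2.28
            - 1.234 * logistic(u, 5.715, 0.404)
            + 12.316 * logistic(u, 5.875, 0.571)
            - 11.704 * logistic(u, 4.459, 0.623))

u = np.linspace(0.0, 1.0, 501)   # rescaled time t/T over the sample
g = g_hat_anz(u)                 # g stays positive over the sample
```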

Notes

1.
2. The operator $\mathrm{vecl}(\cdot)$ stacks the subdiagonal elements of its argument matrix.
3. See Explanatory Statement, Banking (prudential standard) Determination 2007, Nos 5, 12 and 15. https://www.legislation.gov.au/Details/F2007L04593/ (accessed on 26 January 2023), https://www.legislation.gov.au/Details/F2007L04600/ (accessed on 26 January 2023) and https://www.legislation.gov.au/Details/F2007L04603/ (accessed on 26 January 2023).

References

1. Amado, Cristina, and Timo Teräsvirta. 2008. Modelling Conditional and Unconditional Heteroskedasticity with Smoothly Time-Varying Structure. SSE/EFI Working Paper Series in Economics and Finance 691. Stockholm: Stockholm School of Economics.
2. Amado, Cristina, and Timo Teräsvirta. 2013. Modelling volatility by variance decomposition. Journal of Econometrics 175: 153–65.
3. Amado, Cristina, and Timo Teräsvirta. 2014. Conditional correlation models of autoregressive conditional heteroscedasticity with nonstationary GARCH equations. Journal of Business and Economic Statistics 32: 69–87.
4. Amado, Cristina, and Timo Teräsvirta. 2017. Specification and testing of multiplicative time-varying GARCH models with applications. Econometric Reviews 36: 421–46.
5. Amado, Cristina, Annastiina Silvennoinen, and Timo Teräsvirta. 2017. Modelling and forecasting WIG20 daily returns. Central European Journal of Economic Modelling and Econometrics 9: 173–200.
6. Berben, Robert-Paul, and W. Jos Jansen. 2005. Comovement in international equity markets: A sectoral view. Journal of International Money and Finance 24: 832–57.
7. Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–27.
8. Bollerslev, Tim. 1990. Modelling the coherence in short-run nominal exchange rates: A multivariate generalized ARCH model. Review of Economics and Statistics 72: 498–505.
9. Box, George E. P., and Gwilym M. Jenkins. 1970. Time Series Analysis: Forecasting and Control. San Francisco: Holden-Day.
10. Chan, Felix, and Billy Theoharakis. 2011. Estimating m-regimes STAR–GARCH model using QMLE with parameter transformation. Mathematics and Computers in Simulation 81: 1385–96.
11. Ekner, Line, and Emil Nejstgaard. 2013. Parameter Identification in the Logistic STAR Model. Discussion Paper 13-07. København: Department of Economics, University of Copenhagen.
12. Engle, Robert F. 2002. Dynamic conditional correlation: A simple class of multivariate generalized autoregressive conditional heteroskedasticity models. Journal of Business and Economic Statistics 20: 339–50.
13. Engle, Robert, and Bryan Kelly. 2012. Dynamic equicorrelation. Journal of Business and Economic Statistics 30: 212–28.
14. Feng, Yuanhua. 2004. Simultaneously modeling conditional heteroskedasticity and scale change. Econometric Theory 20: 563–96.
15. Feng, Yuanhua. 2006. A Local Dynamic Conditional Correlation Model. MPRA Paper 1592. Edinburgh: Maxwell Institute for Mathematical Sciences, Heriot-Watt University.
16. Glosten, Lawrence R., Ravi Jagannathan, and David E. Runkle. 1993. On the relation between the expected value and the volatility of the nominal excess return on stocks. Journal of Finance 48: 1779–801.
17. Godfrey, Leslie G. 1988. Misspecification Tests in Econometrics. Cambridge: Cambridge University Press.
18. Goodwin, Barry K., Matthew T. Holt, and Jeffrey P. Prestemon. 2011. North American oriented strand board markets, arbitrage activity, and market price dynamics: A smooth transition approach. American Journal of Agricultural Economics 93: 993–1014.
19. He, Changli, and Timo Teräsvirta. 1999. Properties of moments of a family of GARCH processes. Journal of Econometrics 92: 173–92.
20. Kang, Jian, Johan Stax Jakobsen, Annastiina Silvennoinen, Timo Teräsvirta, and Glen Wade. 2022. A parsimonious test of constancy of a positive definite correlation matrix in a multivariate time-varying GARCH model. Econometrics 10: 30.
21. Lütkepohl, Helmut. 1996. Handbook of Matrices. Chichester: John Wiley & Sons.
22. Luukkonen, Ritva, Pentti Saikkonen, and Timo Teräsvirta. 1988. Testing linearity against smooth transition autoregressive models. Biometrika 75: 491–99.
23. Silvennoinen, Annastiina, and Timo Teräsvirta. 2005. Multivariate Autoregressive Conditional Heteroskedasticity with Smooth Transitions in Conditional Correlations. SSE/EFI Working Paper Series in Economics and Finance No. 577. Stockholm: Stockholm School of Economics.
24. Silvennoinen, Annastiina, and Timo Teräsvirta. 2009. Modelling multivariate autoregressive conditional heteroskedasticity with the double smooth transition conditional correlation GARCH model. Journal of Financial Econometrics 7: 373–411.
25. Silvennoinen, Annastiina, and Timo Teräsvirta. 2015. Modeling conditional correlations of asset returns: A smooth transition approach. Econometric Reviews 34: 174–97.
26. Silvennoinen, Annastiina, and Timo Teräsvirta. 2016. Testing constancy of unconditional variance in volatility models by misspecification and specification tests. Studies in Nonlinear Dynamics and Econometrics 20: 347–64.
27. Silvennoinen, Annastiina, and Timo Teräsvirta. 2021. Consistency and asymptotic normality of maximum likelihood estimators of the multiplicative time-varying smooth transition correlation GARCH model. Econometrics and Statistics, in press.
28. Song, Peter X.-K., Yanqin Fan, and John D. Kalbfleisch. 2005. Maximization by parts in likelihood inference. Journal of the American Statistical Association 100: 1145–58.
29. Teräsvirta, Timo. 1994. Specification, estimation, and evaluation of smooth transition autoregressive models. Journal of the American Statistical Association 89: 208–18.
30. Teräsvirta, Timo, Dag Tjøstheim, and Clive W. J. Granger. 2010. Modelling Nonlinear Economic Time Series. Oxford: Oxford University Press.
  31. Tse, Yiu Kuen, and Albert K. C. Tsui. 2002. A multivariate generalized autoregressive conditional heteroscedasticity model with time-varying correlations. Journal of Business and Economic Statistics 20: 351–62. [Google Scholar] [CrossRef]
Figure 1. The market capitalisation of the Big Four as percentage of ASX200 (left) and of ASX200 Financials Index (right).
Figure 2. Daily returns of the Big Four, from 2 January 1992 to 31 January 2020.
Figure 3. The first 100 autocorrelations of squared returns.
Figure 4. The first 100 autocorrelations of squared standardised returns $\varepsilon_{it}^2/\hat{g}_{it}$.
Figure 5. Estimated multiplicative component $\hat{g}_{it}^{1/2}$ (solid curve) and the absolute returns $|\varepsilon_{it}|$ (grey area).
Figure 6. Estimated conditional variance $\hat{h}_{it}$ from the GJR-GARCH model.
Figure 7. Estimated conditional variance $\hat{h}_{it}$ from the TV-GJR-GARCH model.
Figure 8. Estimated correlations. Vertical lines correspond to October 2007 and February 2008.
Figure 9. Estimated pairwise correlations.
Table 1. Univariate estimation results for the four banks. GJR is the GJR-GARCH(1,1) equation, and TV-GJR is the TV-GJR-GARCH equation; standard errors in parentheses.

             $\alpha_{i0}$   $\alpha_{i1}$   $\kappa_{i1}$   $\beta_{i1}$    Persistence  Kurtosis
ANZ  GJR     0.020 (0.005)   0.039 (0.007)   0.044 (0.008)   0.929 (0.008)   0.991        3.76
     TV-GJR  0.111 (0.016)   0.015 (0.005)   0.046 (0.007)   0.792 (0.027)   0.831        3.02
CBA  GJR     0.035 (0.005)   0.060 (0.008)   0.063 (0.011)   0.886 (0.010)   0.977        3.66
     TV-GJR  0.107 (0.014)   0.021 (0.006)   0.065 (0.010)   0.813 (0.020)   0.867        3.06
NAB  GJR     0.065 (0.009)   0.077 (0.010)   0.075 (0.014)   0.850 (0.014)   0.964        3.68
     TV-GJR  0.152 (0.019)   0.021 (0.006)   0.058 (0.009)   0.731 (0.030)   0.780        3.03
WBC  GJR     0.031 (0.006)   0.045 (0.007)   0.058 (0.010)   0.910 (0.009)   0.985        3.70
     TV-GJR  0.079 (0.011)   0.015 (0.004)   0.041 (0.006)   0.829 (0.020)   0.864        3.02
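The persistence figures in Table 1 can be recovered from the estimated coefficients: for a GJR-GARCH(1,1) with a symmetric innovation distribution, the asymmetry term $\kappa_{i1}$ applies to half the shocks on average, so persistence is $\alpha_{i1} + \kappa_{i1}/2 + \beta_{i1}$. A minimal sketch of this standard calculation (the function name is illustrative, not from the paper's R-package); small discrepancies against the table come from rounding of the reported coefficients:

```python
def gjr_persistence(alpha1: float, kappa1: float, beta1: float) -> float:
    """Persistence of a GJR-GARCH(1,1) process, assuming a symmetric
    innovation distribution so E[I(eps < 0)] = 1/2."""
    return alpha1 + kappa1 / 2.0 + beta1

# Check against the ANZ GJR row of Table 1 (reported persistence 0.991):
print(round(gjr_persistence(0.039, 0.044, 0.929), 3))  # 0.99
```

Applying the same check to the TV-GJR rows reproduces the much lower persistence values (e.g. 0.830 for ANZ), which is the usual effect of removing the slowly moving component $g_{it}$ before fitting the GARCH equation.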
Table 2. Estimation results for the four banks’ time-varying correlations. A total of 90% of the estimated transition is between the dates 18 October 2006 and 28 February 2008. The centre point of the location corresponds to 28 June 2007 with ± two standard error ranges of 11 May–13 August 2007.

$P^{(1)}$:
        ANZ            CBA            NAB
CBA   0.485 (0.011)
NAB   0.503 (0.010)  0.525 (0.010)
WBC   0.606 (0.009)  0.500 (0.011)  0.492 (0.011)

$P^{(2)}$:
        ANZ            CBA            NAB
CBA   0.782 (0.006)
NAB   0.808 (0.005)  0.787 (0.005)
WBC   0.830 (0.004)  0.818 (0.005)  0.814 (0.005)

Transition parameters: $c$ = 0.552 (0.002), $\eta$ = 5.020 (0.162).
Note: $\eta = \ln\gamma$; see Appendix A.2.
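The note above indicates the slope is estimated on the log scale, $\eta = \ln\gamma$, so the reported 5.020 corresponds to $\gamma = e^{5.020} \approx 151$. Assuming the standard logistic transition of rescaled time used in TV-GARCH models, $G(s) = (1 + \exp(-\gamma(s - c)))^{-1}$ with $s = t/T$, the estimates map back to the transition path as in this sketch (the width formula is an illustrative back-of-the-envelope, not from the paper):

```python
import math

def transition(s: float, c: float, eta: float) -> float:
    """Logistic transition G(s) = 1 / (1 + exp(-gamma * (s - c))),
    where s = t/T is rescaled time and the slope is gamma = exp(eta)."""
    gamma = math.exp(eta)
    return 1.0 / (1.0 + math.exp(-gamma * (s - c)))

c, eta = 0.552, 5.020  # Table 2 estimates
print(transition(c, c, eta))  # 0.5: the transition is halfway at s = c

# With gamma ~ 151, the move from G = 0.05 to G = 0.95 spans only
# 2*ln(19)/gamma units of rescaled time -- a sharp but smooth shift.
print(round(2 * math.log(19) / math.exp(eta), 3))
```

The large $\gamma$ is consistent with the narrow 90% transition window reported in the caption (late 2006 to early 2008, out of a 28-year sample).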
Table 3. Estimation results for the four banks’ time-varying bivariate correlations.

$P^{(1)}$:
        ANZ            CBA            NAB
CBA   0.484 (0.011)
NAB   0.510 (0.010)  0.518 (0.011)
WBC   0.607 (0.009)  0.504 (0.011)  0.490 (0.011)

$P^{(2)}$:
        ANZ            CBA            NAB
CBA   0.784 (0.006)
NAB   0.811 (0.005)  0.785 (0.005)
WBC   0.831 (0.004)  0.816 (0.005)  0.812 (0.005)

Transition parameters $c$:
        ANZ            CBA            NAB
CBA   0.550 (0.005)
NAB   0.567 (0.001)  0.532 (0.008)
WBC   0.555 (0.005)  0.547 (0.004)  0.549 (0.004)

Transition parameters $\eta$:
        ANZ            CBA            NAB
CBA   4.764 (0.280)
NAB   7.000 (—)      4.514 (0.341)
WBC   5.198 (0.308)  4.761 (0.254)  4.873 (0.294)

Note: $\eta = \ln\gamma$; see Appendix A.2.