
On the Proper Computation of the Hausman Test Statistic in Standard Linear Panel Data Models: Some Clarifications and New Results

by Julie Le Gallo 1,*,† and Marc-Alexandre Sénégas 2,†
1 Center of Economics and Sociology Applied to Rural Areas, UMR1041, l’Institut Agro Dijon, INRAE, University Bourgogne Franche-Comté, 21000 Dijon, France
2 Bordeaux School of Economics, UMR CNRS6060, University of Bordeaux, 33000 Bordeaux, France
* Author to whom correspondence should be addressed.
† These authors contributed equally to this work.
Econometrics 2023, 11(4), 25; https://doi.org/10.3390/econometrics11040025
Submission received: 7 June 2023 / Revised: 31 October 2023 / Accepted: 6 November 2023 / Published: 8 November 2023

Abstract:
We provide new analytical results for the implementation of the Hausman specification test statistic in a standard panel data model, comparing the version based on the estimators computed from the untransformed random effects model specification under Feasible Generalized Least Squares and the one computed from the quasi-demeaned model estimated by Ordinary Least Squares. We show that the quasi-demeaned model cannot provide a reliable magnitude when implementing the Hausman test in a finite sample setting, although it is the most common approach used to produce the test statistic in econometric software. The difference between the Hausman statistics computed under the two methods can be substantial and even lead to opposite conclusions for the test of orthogonality between the regressors and the individual-specific effects. Furthermore, this difference remains important even with large cross-sectional dimensions as it mainly depends on the within-between structure of the regressors and on the presence of a significant correlation between the individual effects and the covariates in the data. We propose to supplement the test outcomes that are provided in the main econometric software packages with some metrics to address the issue at hand.

1. Introduction

As is well known, the implementation of the Hausman specification test (Hausman 1978) might be affected, in practice and in a finite sample setting, by a non-positive-definiteness or (in)definiteness problem for the variance-covariance matrix corresponding to the difference between the efficient estimator and the consistent estimator1. This, in turn, can potentially lead to a negative value of the test statistic, which makes it unreliable for interpreting the test outcome. This problem is usually mentioned in the context of models using instrumental variables (IV) (Baum et al. 2003, pp. 19–22; Staiger and Stock 1997, pp. 567–68), where the Hausman test is performed to assess the endogeneity of the regressors given a set of instruments. In this case, one solution to ensure a symmetric positive definite (hereafter, SPD) covariance matrix is to use a common and identical estimator for the variance of the (idiosyncratic) error term when confronting the Ordinary Least Squares (OLS) and the IV estimators (Hayashi 2000, pp. 220–33; Baum et al. 2003, pp. 19–22).
Yet, although the issue was pointed out as a warning by Hausman himself when he addressed the case of a static, balanced panel data model in his seminal presentation of the test (Hausman 1978, footnote 25, p. 1267), it has not, to the best of our knowledge, been further formally examined in this specific framework2. This is surprising, as one of the most widespread applications of the Hausman test is for assessing the relevance of the random effects (RE) versus fixed effects (FE) specification in a panel data model. In our view, this application deserves attention in its own right, and this article aims at filling this gap. Specifically, our contributions are twofold.
We first provide new and detailed analytical results for the implementation of the Hausman test in the case of a static and balanced panel data model with individual effects. In particular, we show that the test statistic is unreliable in a finite sample if the variance of the RE estimator is computed on the basis of the estimation of the quasi-demeaned model (which we denote QDM in what follows)3 rather than through the conventional and direct implementation of Feasible Generalized Least Squares (FGLS) on the RE panel data model. This result follows directly from the way in which standard errors are computed under the QDM approach, which can lead to a positive definiteness problem for the covariance matrix in the Hausman test statistic formula. To establish the unreliability, we perform a systematic analysis of the difference between the Hausman statistics computed under the two approaches, as well as of the behavior of the statistic that uses the estimates based on the QDM regression framework. In particular, we show that the latter mainly depends on the within-between structure of the regressors and on the presence of a significant correlation between the individual effects and the covariates in the data4.
Second, based on a review of the main existing econometric software programs that deal with panel data models, we show that the vast majority of the related packages in those programs implement, by default, the Hausman test using the unreliable statistic. This leads us to assess different ways to supplement the test outcomes provided by those programs and/or to circumvent the reliability problem potentially raised by the use of the statistic involved.
The outline of the paper is as follows. First, in Section 2, we show how the two versions of the Hausman test statistic can produce diverging results for some well-known textbook examples in the context of panel data models. Then, in Section 3, we formalize the implications of the two approaches for the Hausman statistic and derive new analytical results regarding the comparison of the two versions of the statistic that follow from them. We also revisit the textbook examples and provide some simulation results in the one-regressor case. Finally, in Section 4, we detail how the Hausman test is implemented in a variety of econometric software programs dealing with panel data models and discuss, on the basis of this review, some possible ways to implement this test in a reliable and robust manner with this software. The last section concludes.

2. Motivation

In this section, we illustrate the extent to which significant differences in the values of the Hausman test statistic may arise depending on the approach adopted to estimate the panel data model parameters and, more particularly, those pertaining to the variance components under the random effects (RE) specification.

2.1. Notation

All the case studies considered below are for a standard, linear, static, and balanced panel data model with individual effects (also called the one-way error-component panel data model). Accordingly, we consider the following relationship:
$$y_{it} = \alpha + X_{it}' \cdot \beta + u_{it} \qquad (1)$$
where $i = 1, \ldots, N$ denotes the cross-section dimension; $t = 1, \ldots, T$ the time-series dimension; $y_{it}$ the $it$-th observation of the dependent variable; $X_{it}$ a column vector of the $it$-th observation on the $K$ explanatory variables; $\alpha$ an unknown scalar and $\beta$ a $K \times 1$ vector of unknown parameters, both to be estimated5. The error term $u_{it}$ is assumed to take the following composite form:
$$u_{it} = \alpha_i^* + \varepsilon_{it} \qquad (2)$$
where $\alpha_i^*$ denotes the (unobservable) individual-specific (or individual) effect, also called the individual component of $u_{it}$, and $\varepsilon_{it}$ denotes the idiosyncratic component of $u_{it}$. Furthermore, we assume that $\alpha_i^*$ and $\varepsilon_{it}$ are independent of each other and that $\alpha_i^* \sim IID(0, \sigma_\alpha^2)$ and $\varepsilon_{it} \sim IID(0, \sigma_\varepsilon^2)$. It is also assumed that $E[X_{is} \cdot \varepsilon_{it}] = 0$ for all $s = 1, \ldots, T$.
As is well known, two alternative specifications are usually considered regarding the correlation between the individual effect, α i * and the regressors contained in X i t . On the one hand, the random-effects (RE) model assumes that this correlation is zero, ensuring that X i t is strictly exogenous for u i t in (1). On the other hand, the ‘fixed effects’ model allows for a non-zero correlation between the individual effect and the regressors. The Within transformation is then used to obtain an unbiased estimate for β .
The Hausman specification test (Hausman 1978) is widely used for testing the no-correlation assumption underlying the RE specification. In our setting, the test is based on the asymptotic properties of the RE and Within (or fixed-effects) estimators of $\beta$. Both estimators are consistent under the null hypothesis (of no correlation) while, under the alternative, only the Within estimator is consistent, as the RE estimator is (asymptotically) biased. Accordingly, the test statistic is built on a distance measure between the Within and RE estimators. In its implementable version, the statistic reads:
$$HMO_j \equiv (\hat{\beta}_W - \hat{\beta}_{RE})' \cdot \left[ \widehat{var}[\hat{\beta}_W] - \widehat{var}_j[\hat{\beta}_{RE}] \right]^{-1} \cdot (\hat{\beta}_W - \hat{\beta}_{RE})$$
where $\hat{\beta}_W$ and $\widehat{var}[\hat{\beta}_W]$ denote the Within (or fixed effects) estimator of $\beta$ and a consistent estimator of its asymptotic covariance matrix, respectively; $\hat{\beta}_{RE}$ and $\widehat{var}_j[\hat{\beta}_{RE}]$ denote the RE estimator of $\beta$ and a consistent estimator of its asymptotic covariance matrix, respectively. The index $j = 1, 2$ indicates that two approaches can be considered to obtain a consistent estimator of the asymptotic covariance matrix, leading to two versions of the test statistic (see Section 3.3).
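The computation behind $HMO_j$ can be made concrete in a few lines. The following numpy sketch (our own illustrative helper with made-up numbers, not code from any package) shows that the only difference between the two versions of the statistic is which covariance estimator is plugged in for the RE estimator:

```python
import numpy as np

def hausman_statistic(b_w, b_re, var_w, var_re):
    # HMO_j = (b_W - b_RE)' [var(b_W) - var_j(b_RE)]^{-1} (b_W - b_RE);
    # which version (j = 1 or 2) is obtained depends entirely on the
    # covariance estimator supplied for the RE estimator.
    q = np.asarray(b_w, float) - np.asarray(b_re, float)
    diff = np.asarray(var_w, float) - np.asarray(var_re, float)
    return float(q @ np.linalg.solve(diff, q))

# purely illustrative numbers
b_w = np.array([1.2, -0.5])
b_re = np.array([1.0, -0.4])
var_w = np.diag([0.04, 0.02])
var_re = np.diag([0.01, 0.005])
hmo = hausman_statistic(b_w, b_re, var_w, var_re)
```

With these diagonal matrices the statistic reduces to $0.2^2/0.03 + 0.1^2/0.015 = 2$; the same helper accepts either $\widehat{var}_1$ or $\widehat{var}_2$ for `var_re`.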

2.2. Motivating Examples

We reproduce some outcomes of panel model estimations for well-known case study applications taken from the main textbooks in the field. In the following tables, Std. Err._1 and Std. Err._2 denote the two sets of standard errors for the parameters implied by the use of two different estimators for the covariance matrix of the RE estimator (detailed below). The values of the two related versions of the Hausman test statistic are denoted $HMO_1$ and $HMO_2$. Other variables and parameters are shown in the tables; their definitions and interpretations are left for Section 3, where we further comment on those results.

2.2.1. Motivating Example 1: Gasoline

Baltagi (2005) provides an interesting example of an important difference between the two versions of the Hausman test statistic in a study on the determinants of gasoline demand over the period 1960–1978 across 18 OECD countries6. The following specification is adopted:
$$\log[Gas/Car] = \alpha + \beta_1 \cdot \log[Y/N] + \beta_2 \cdot \log[P_{MG}/P_{GDP}] + \beta_3 \cdot \log[Car/N] + u$$
where $Gas/Car$ is motor gasoline consumption per auto, $Y/N$ is real income per capita, $P_{MG}/P_{GDP}$ is the real motor gasoline price and $Car/N$ denotes the stock of cars per capita.
Table 1 clearly shows that the two values of the Hausman statistic deviate strongly from each other, even if the null hypothesis is rejected in both cases. It is also interesting to observe that the two sets of standard errors remain close to each other7.

2.2.2. Motivating Example 2: Airline

Greene (2000) relies on the following specification concerning the cost function in the airline industry8:
$$\log[cost] = \alpha + \beta_1 \cdot \log[Q] + \beta_2 \cdot \log[fuelprice] + \beta_3 \cdot loadfactor + u$$
where $cost$ is the total cost, in $1000; $Q$ is output, measured in “revenue passenger miles” (index number); $fuelprice$ is the fuel price and $loadfactor$ is a rate of capacity utilization: the average rate at which seats on the airline’s planes are filled. The dataset consists of six firms observed yearly for 15 years (1970 to 1984).
In this case (Table 2), $HMO_1 > HMO_2$, while the two values are much closer and clearly lower than in the previous case. They both lead to the rejection of the null. Also, the two sets of standard errors are quite close.
The airline case provides further interesting outcomes when the specification is estimated with two covariates. We single out the regression with $\log[fuelprice]$ and $loadfactor$ as the two covariates. The corresponding estimation results are provided in Table 3.
This new set of results is interesting for the negative sign of the computed $HMO_2$ and, furthermore, in that $HMO_1 > |HMO_2|$. Using the absolute value of the $HMO_2$ statistic to perform the Hausman test (as some software packages do), the null hypothesis would not be rejected, contrary to the outcome implied by $HMO_1$. Note at this point that these results are obtained for a sample in which the variance of the covariates is heavily concentrated in the within dimension.

2.2.3. Motivating Example 3: Wage determination

In another application, Greene (2012) relies on Cornwell and Rupert (2008)’s study about the determinants of the returns to schooling. The dataset is a balanced panel of 595 observations on heads of households that runs over the period (1976–1982). Among the specifications examined by Greene, we consider the following one:
$$\log[wage] = \alpha + \beta_1 \cdot EXP + \beta_2 \cdot EXP^2 + \beta_3 \cdot WKS + \beta_4 \cdot OCC + \beta_5 \cdot IND + \beta_6 \cdot SOUTH + \beta_7 \cdot SMSA + \beta_8 \cdot MS + \beta_9 \cdot UNION + u$$
where $EXP$ denotes the number of years of full-time work experience; $WKS$, the number of weeks worked; $OCC = 1$ if the status of the occupation is blue-collar, 0 if not; $IND = 1$ if the individual works in a manufacturing industry, 0 if not; $SOUTH = 1$ if the individual resides in the South, 0 if not; $SMSA = 1$ if the individual resides in a city, 0 if not; $MS = 1$ if the individual is married, 0 if not; $UNION = 1$ if the individual's wage is set by a union contract, 0 if not; lastly, $\log[wage]$ denotes the log of the (yearly) wage9.
The estimation results are presented in Table 4. Here, even with a large sample (4765 observations), we observe significant differences between the two Hausman statistics (which are very large) as well as between the two sets of standard errors for the random effects model (the ratio of the latter, provided by $h$ (see later), is close to $1.3$).

3. The Two Versions of the Hausman Test Statistic

The two versions of the Hausman statistic refer to two possible approaches for estimating the covariance matrix of the estimator of the parameters of the model given in its RE specification (1) and (2). We start with a formal and explicit presentation of these approaches, since their implications for the computation of the Hausman test statistic have remained largely unnoticed.

3.1. The Original Hausman Test Specification in a Balanced Panel Data Model

We rewrite model (1) and (2), stacking the observations over the time and cross-sectional dimensions:
$$y = \iota_{NT} \cdot \alpha + X \cdot \beta + u \qquad (3)$$
where $y$ is the $(NT \times 1)$ vector of the observations of the dependent variable; $\iota_{NT}$ a vector of ones of dimension $NT$; $X$ the $(NT \times K)$ matrix including the observations of the $K$ explanatory variables; and $u \equiv (u_{11} \ldots u_{1T} \ldots u_{N1} \ldots u_{NT})'$ the $(NT \times 1)$ vector of the composite error terms. The covariance matrix of $u$ is denoted by $\Omega$. Given the properties of $\alpha_i^*$ and $\varepsilon_{it}$, it takes the following form:
$$\Omega = \sigma_\varepsilon^2 \cdot \Omega^*$$
with $\Omega^* = [W + (1/\psi^2) \cdot B]$, $\psi^2 \equiv \sigma_\varepsilon^2 / \sigma_{\alpha\varepsilon}^2$, $\sigma_{\alpha\varepsilon}^2 \equiv \sigma_\varepsilon^2 + T \cdot \sigma_\alpha^2$, $B \equiv I_N \otimes (J_T/T)$ (the Between operator), $W \equiv I_{NT} - B$ (the Within operator), where $\otimes$ is the Kronecker product, $I_N$ (resp. $I_T$) the $(N \times N)$ (resp. $(T \times T)$) identity matrix and $J_T = \iota_T \cdot \iota_T'$ a matrix of ones of dimension $T$.
It is useful to define $\Omega_C^*$ such that $\Omega_C^* = \Omega^* \cdot (I_{NT} - B_{NT}) = W + (1/\psi^2) \cdot B_C$, with $B_C$ denoting the centered Between operator defined as $B_C \equiv B - B_{NT}$ and with $B_{NT} \equiv \iota_{NT} \cdot (\iota_{NT}' \cdot \iota_{NT})^{-1} \cdot \iota_{NT}'$. We also use $\Omega^{*-\frac{1}{2}}$ defined by $\Omega^{*-\frac{1}{2}} \cdot \Omega^{*-\frac{1}{2}} = \Omega^{*-1}$. As $B$ and $W$ are symmetric and idempotent, $\Omega^{*-1} = [W + \psi^2 \cdot B]$ and, in turn, $\Omega^{*-\frac{1}{2}} = [W + \psi \cdot B]$. The same applies to $\Omega_C^{*-\frac{1}{2}}$ and $\Omega_C^{*-1}$, with $B_C$ replacing $B$ in the previous formulas.
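These operator definitions are straightforward to verify numerically. A short numpy check (with arbitrary small dimensions and an arbitrary value of $\psi$) confirms that $B$ and $W$ are symmetric, idempotent and mutually orthogonal, that $[W + \psi \cdot B]$ squares to $\Omega^{*-1} = [W + \psi^2 \cdot B]$, and that $\Omega_C^* = \Omega^* \cdot (I_{NT} - B_{NT})$:

```python
import numpy as np

N, T = 4, 3
NT = N * T

B = np.kron(np.eye(N), np.ones((T, T)) / T)   # Between operator: replaces each obs by its group mean
W = np.eye(NT) - B                            # Within operator: deviations from group means
iota = np.ones((NT, 1))
B_NT = iota @ iota.T / NT                     # projector on the overall mean
B_C = B - B_NT                                # centered Between operator

sym_idem = (np.allclose(B, B.T) and np.allclose(B @ B, B)
            and np.allclose(W @ W, W))
orthogonal = np.allclose(W @ B, np.zeros((NT, NT)))

psi = 0.5
# [W + psi*B] is a square root of Omega*^{-1} = [W + psi^2*B]
root_ok = np.allclose((W + psi * B) @ (W + psi * B), W + psi**2 * B)
# and Omega_C* = Omega* (I - B_NT) = W + (1/psi^2) B_C
omega_star = W + (1 / psi**2) * B
centered_ok = np.allclose(omega_star @ (np.eye(NT) - B_NT),
                          W + (1 / psi**2) * B_C)
```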
Assume, first, that the variance components (and thus $\Omega$, $\Omega^*$ and $\Omega_C^*$) are known. Then, it is well established that:
(1) The RE estimator of $\beta$ ($\hat{\beta}_{RE}$) corresponds to the Generalized Least Squares (GLS) estimator of $\beta$ in (3), noted $\hat{\beta}_{GLS}$, and is given by (4) with its covariance matrix by (5):
$$\hat{\beta}_{RE} = \hat{\beta}_{GLS} = \left[ X' \cdot \Omega_C^{*-1} \cdot X \right]^{-1} \cdot X' \cdot \Omega_C^{*-1} \cdot y \qquad (4)$$
$$var[\hat{\beta}_{RE}] = var[\hat{\beta}_{GLS}] = \sigma_\varepsilon^2 \cdot \left[ X' \cdot \Omega_C^{*-1} \cdot X \right]^{-1} \qquad (5)$$
(2) The Within (or fixed effects) estimator of $\beta$, $\hat{\beta}_W$, is given by (6) and its covariance matrix by (7):
$$\hat{\beta}_W = \left[ X_W' \cdot X_W \right]^{-1} \cdot X_W' \cdot y_W \qquad (6)$$
$$var[\hat{\beta}_W] = \sigma_\varepsilon^2 \cdot \left[ X_W' \cdot X_W \right]^{-1} \qquad (7)$$
with $y_W \equiv W \cdot y$ and $X_W \equiv W \cdot X$.
The GLS estimator $\hat{\beta}_{GLS}$ can alternatively be obtained using the quasi-demeaned model (also called the partial Within transformation model) built from (3) with the premultiplying factor10 $\Omega^{*-\frac{1}{2}}$:
$$y^{**} = \omega^{**} \cdot \alpha + X^{**} \cdot \beta + u^{**} \qquad (8)$$
with $\omega^{**} \equiv \psi \cdot \iota_{NT}$; $X^{**} \equiv \Omega^{*-\frac{1}{2}} \cdot X$; $y^{**} \equiv \Omega^{*-\frac{1}{2}} \cdot y$; $u^{**} \equiv \Omega^{*-\frac{1}{2}} \cdot u$. It is easy to check that the OLS estimator of $\beta$ in (8), denoted $\hat{\beta}_{OLS}$, corresponds to $\hat{\beta}_{GLS}$11. Also:
$$var[\hat{\beta}_{OLS}] = var[\hat{\beta}_{GLS}] = \sigma_\varepsilon^2 \cdot \left[ X^{*\prime} \cdot X^* \right]^{-1} \qquad (9)$$
where $X^* \equiv \Omega_C^{*-\frac{1}{2}} \cdot X$, which corresponds to (5) as $X^{*\prime} \cdot X^* = X' \cdot \Omega_C^{*-1} \cdot X$.
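The numerical equivalence between the direct GLS formula (4) and OLS on the quasi-demeaned model (8) is easy to check on simulated data (with $\psi$ treated as known; the data-generating process below is arbitrary and purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
N, T, K = 30, 5, 2
NT = N * T
# regressors with an individual-level component, plus a simple outcome
X = rng.normal(size=(NT, K)) + np.repeat(rng.normal(size=(N, 1)), T, axis=0)
y = 1.0 + X @ np.array([0.5, -1.0]) + rng.normal(size=NT)

B = np.kron(np.eye(N), np.ones((T, T)) / T)
W = np.eye(NT) - B
iota = np.ones((NT, 1))
B_C = B - iota @ iota.T / NT

psi = 0.6                                # variance ratio treated as known here
Om_C_inv = W + psi**2 * B_C

# direct GLS slope estimator, eq. (4)
b_gls = np.linalg.solve(X.T @ Om_C_inv @ X, X.T @ Om_C_inv @ y)

# OLS on the quasi-demeaned model (8), intercept column omega** = psi * iota
Om_inv_half = W + psi * B
Z = np.column_stack([psi * np.ones(NT), Om_inv_half @ X])
coef = np.linalg.lstsq(Z, Om_inv_half @ y, rcond=None)[0]
b_ols = coef[1:]                         # drop the intercept

same = np.allclose(b_gls, b_ols)
```

The two slope vectors coincide exactly, which is the algebraic equivalence invoked in the text.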
Based on the former estimators, the Hausman specification test statistic initially proposed by Hausman (1978) is:
$$HMO^* \equiv \hat{q}' \cdot \left[ var[\hat{\beta}_W] - var[\hat{\beta}_{RE}] \right]^{-1} \cdot \hat{q} \qquad (10)$$
with $\hat{q} \equiv \hat{\beta}_W - \hat{\beta}_{RE}$ and $var[\hat{z}]$ denoting the (finite sample) exact covariance matrix of $\hat{z}$. Under the null hypothesis of no correlation, $HMO^*$ is asymptotically distributed as a $\chi^2(K)$.

3.2. Two Estimation Procedures

In practice, $\Omega$, $\Omega^*$ and $\Omega_C^*$ are unknown and replaced by consistent estimators (noted $\hat{\Omega}$, $\hat{\Omega}^*$ and $\hat{\Omega}_C^*$, respectively12). In this case, the Hausman test statistic is written as:
$$HMO \equiv \hat{\beta}_{\Delta WRE}' \cdot \left[ \widehat{var}[\hat{\beta}_W] - \widehat{var}_j[\hat{\beta}_{RE}] \right]^{-1} \cdot \hat{\beta}_{\Delta WRE}$$
where:
(1) $\hat{\beta}_{\Delta WRE} \equiv \hat{\beta}_W - \hat{\beta}_{RE}$ with, now, $\hat{\beta}_{RE} = \hat{\beta}_{FGLS}$, indicating that the RE estimator corresponds to the Feasible Generalized Least Squares (FGLS) estimator of $\beta$ ($\hat{\beta}_{FGLS}$), accounting for the use of $\hat{\Omega}_C^*$ instead of $\Omega_C^*$.
(2) $\widehat{var}[\hat{z}]$ is a consistent estimator of the asymptotic covariance matrix of $\hat{z}$ built upon the finite-sample estimator of the (exact) variance of $\hat{z}$, with $j = 1, 2$ indicating that, for the RE estimator, two approaches are available to compute this matrix.
Under suitable conditions assumed to hold in what follows13, $\hat{\beta}_{FGLS}$ and $\hat{\beta}_{GLS}$ are asymptotically equivalent and $HMO$ is, like $HMO^*$, asymptotically distributed as a $\chi^2(K)$.
We now discuss the choice of variance component estimators that are required to compute $HMO$.
First, a consistent estimator for $var[\hat{\beta}_W]$ is $\widehat{var}[\hat{\beta}_W] = \hat{\sigma}_{\varepsilon w}^2 \cdot (X_W' \cdot X_W)^{-1}$, where $\hat{\sigma}_{\varepsilon w}^2 \equiv (\hat{u}_W' \cdot \hat{u}_W)/(N(T-1)-K)$ denotes the consistent estimator of $\sigma_\varepsilon^2$ built from the OLS estimation associated with the Within (transformed) regression model, $\hat{u}_W$ denoting the $(NT \times 1)$ vector of the related residuals.
Second, to obtain $\widehat{var}[\hat{\beta}_{FGLS}]$, two approaches are possible.

3.2.1. Approach 1: The (Direct) FGLS Approach

Relying on the asymptotic equivalence between $\hat{\beta}_{FGLS}$ and $\hat{\beta}_{GLS}$, a consistent asymptotic covariance matrix estimator for $var[\hat{\beta}_{RE}]$ can be obtained directly from expression (5), where $\hat{\Omega}_C^*$ is substituted for $\Omega_C^*$ and $\hat{\sigma}_\varepsilon^2$ for $\sigma_\varepsilon^2$, which yields:
$$\widehat{var}_1[\hat{\beta}_{RE}] \equiv \widehat{var}_1[\hat{\beta}_{FGLS}] = \hat{\sigma}_\varepsilon^2 \cdot \left[ X' \cdot \hat{\Omega}_C^{*-1} \cdot X \right]^{-1} \qquad (11)$$
with $\hat{\Omega}_C^{*-1} = [W + \hat{\psi}^2 \cdot B_C]$ and $\hat{\psi}^2 \equiv (\hat{\sigma}_\varepsilon^2 / \hat{\sigma}_{\alpha\varepsilon}^2)$. The estimator $\hat{\sigma}_\varepsilon^2$ in (11) should logically be chosen as the same estimator as the one entering into14 $\hat{\psi}^2$ for computing $\hat{\Omega}_C^*$, which also appears in (11). One usually relies on the estimator based on the ‘fixed effects’ model for this purpose, so that we set $\hat{\sigma}_\varepsilon^2 = \hat{\sigma}_{\varepsilon w}^2$ (Swamy-Arora approach). We use that correspondence in what follows.

3.2.2. Approach 2: The Quasi-Demeaning Approach

A consistent (asymptotic) covariance matrix estimator for $\hat{\beta}_{RE}$ can alternatively be obtained from the quasi-demeaned regression model (8) considered in its feasible version, with $\hat{\Omega}^{*-\frac{1}{2}} = W + \hat{\psi} \cdot B$ (where $\hat{\psi} = \sqrt{\hat{\psi}^2}$) substituting for $\Omega^{*-\frac{1}{2}}$. We refer to this feasible version of the QDM model as the FQDM regression model. In this case, and relying again on the asymptotic equivalence between $\hat{\beta}_{GLS}$ and $\hat{\beta}_{FGLS}$, the resulting “plug-in” estimator can be computed from the formula giving the covariance matrix of the OLS estimator of $\beta$ in (8), i.e., (9), where $\tilde{X}^*$ substitutes for $X^*$ and $\hat{\sigma}_\varepsilon^{*2}$ for $\sigma_\varepsilon^2$:
$$\widehat{var}_2[\hat{\beta}_{RE}] \equiv \widehat{var}_2[\hat{\beta}_{FGLS}] = \hat{\sigma}_\varepsilon^{*2} \cdot \left[ \tilde{X}^{*\prime} \cdot \tilde{X}^* \right]^{-1} \qquad (12)$$
In line with the quasi-demeaning approach, the computation of $\hat{\sigma}_\varepsilon^{*2}$ is, here, usually considered a byproduct of the OLS estimation of the FQDM regression model. Accordingly, $\hat{\sigma}_\varepsilon^{*2}$ is computed as $\hat{\sigma}_\varepsilon^{*2} \equiv (\hat{\tilde{u}}^{**\prime} \cdot \hat{\tilde{u}}^{**})/(NT - (K+1))$, where $\hat{\tilde{u}}^{**}$ denotes the $NT$ vector of the OLS residuals in the FQDM regression model.
From (11) and (12), we observe that $\widehat{var}_1[\hat{\beta}_{RE}] \neq \widehat{var}_2[\hat{\beta}_{RE}]$ as soon as $\hat{\sigma}_{\varepsilon w}^2 \neq \hat{\sigma}_\varepsilon^{*2}$. Thus, the two approaches differ in providing two distinct estimators of the variance of the RE estimator insofar as they rely on two different estimators of the variance component $\sigma_\varepsilon^2$. This leads to what we call, in the following, a disturbance variance disconnect problem.15
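Under the Swamy-Arora choice $\hat{\sigma}_\varepsilon^2 = \hat{\sigma}_{\varepsilon w}^2$, the two approaches can be run side by side on a simulated panel to obtain the two residual-variance estimators and the ratio $h = \hat{\sigma}_\varepsilon^{*2}/\hat{\sigma}_{\varepsilon w}^2$ at the heart of the disconnect. The sketch below follows the definitions above on our own toy data-generating process (all numbers are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
N, T, K = 50, 6, 2
NT = N * T

# one-way error-component DGP; individual effects correlated with X
alpha_i = rng.normal(size=(N, 1))
X = rng.normal(size=(NT, K)) + 0.8 * np.repeat(alpha_i, T, axis=0)
u = np.repeat(alpha_i, T, axis=0).ravel() + rng.normal(size=NT)
y = 1.0 + X @ np.array([0.5, -1.0]) + u

B = np.kron(np.eye(N), np.ones((T, T)) / T)
Wm = np.eye(NT) - B

# Within regression -> sigma_hat_eps_w^2, df = N(T-1) - K
Xw, yw = Wm @ X, Wm @ y
bw = np.linalg.lstsq(Xw, yw, rcond=None)[0]
uw = yw - Xw @ bw
s2_w = (uw @ uw) / (N * (T - 1) - K)

# Between regression -> sigma_hat_alpha_eps^2, df = N - (K+1)
Zb = np.column_stack([np.ones(NT), B @ X])
ub = B @ y - Zb @ np.linalg.lstsq(Zb, B @ y, rcond=None)[0]
s2_ae = (ub @ ub) / (N - (K + 1))

psi_hat = np.sqrt(s2_w / s2_ae)          # Swamy-Arora psi_hat

# FQDM OLS -> sigma_hat_eps*^2, df = NT - (K+1), and the ratio h
Om_half = Wm + psi_hat * B
Zq = np.column_stack([psi_hat * np.ones(NT), Om_half @ X])
uq = Om_half @ y - Zq @ np.linalg.lstsq(Zq, Om_half @ y, rcond=None)[0]
s2_star = (uq @ uq) / (NT - (K + 1))

h = s2_star / s2_w
```

Whenever `h` differs from 1, the two covariance estimators (11) and (12), and hence the two test statistics, differ.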

3.3. Comparing the Two Versions

From the previous results, it follows that two possible expressions are available for an implementable version of the Hausman test statistic, depending on which estimator of the asymptotic covariance matrix for $\hat{\beta}_{RE}$ is chosen.

3.3.1. Two Statistics

Using $\widehat{var}_1[\hat{\beta}_{RE}]$, we have:
$$HMO_1 \equiv \hat{\beta}_{\Delta WRE}' \cdot \left[ \hat{\sigma}_{\varepsilon w}^2 \cdot \left( X_W' \cdot X_W \right)^{-1} - \hat{\sigma}_{\varepsilon w}^2 \cdot \left( \tilde{X}^{*\prime} \cdot \tilde{X}^* \right)^{-1} \right]^{-1} \cdot \hat{\beta}_{\Delta WRE} \qquad (13)$$
or, using $\widehat{var}_2[\hat{\beta}_{RE}]$, we have:
$$HMO_2 \equiv \hat{\beta}_{\Delta WRE}' \cdot \left[ \hat{\sigma}_{\varepsilon w}^2 \cdot \left( X_W' \cdot X_W \right)^{-1} - \hat{\sigma}_\varepsilon^{*2} \cdot \left( \tilde{X}^{*\prime} \cdot \tilde{X}^* \right)^{-1} \right]^{-1} \cdot \hat{\beta}_{\Delta WRE} \qquad (14)$$
Hausman (1978) originally proposed using $HMO_1$, which he considered the legitimate computational version of $HMO^*$ (Hausman 1978, footnote 25, p. 1267):16
“Note that the elements of $\hat{q}$ and its standard errors are simply calculated given the estimates $\hat{\beta}_{FE}$ and of $\hat{\beta}_{FGLS}$ and their standard errors, making sure to adjust to use the fixed effects estimate of $\sigma_\varepsilon^2$”.
On the other hand, as we will see in Section 4, most software programs compute the Hausman test statistic based on the second measure, $HMO_2$. Consequently, it is important to analyze how $HMO_2$ behaves in relation to $HMO_1$ in finite sample settings. For that purpose, define the ratio $h \equiv \hat{\sigma}_\varepsilon^{*2} / \hat{\sigma}_{\varepsilon w}^2$. $HMO_2$ can then be rewritten as:
$$HMO_2 = \hat{\beta}_{\Delta WRE}' \cdot \left( \hat{\sigma}_{\varepsilon w}^2 \right)^{-1} \cdot \left[ \left( X_W' \cdot X_W \right)^{-1} - h \cdot \left( \tilde{X}^{*\prime} \cdot \tilde{X}^* \right)^{-1} \right]^{-1} \cdot \hat{\beta}_{\Delta WRE} \qquad (15)$$
Comparing with (13), we note that $HMO_2$ diverges from $HMO_1$ whenever $h \neq 1$.
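Given the within and quasi-demeaned cross-product matrices and the two variance estimators, both statistics follow from (13) and (14) by direct plug-in. The helper below uses purely illustrative single-regressor numbers: here $h = 1.6$ lies below the single eigenvalue $h^* = 2$ of $H^*$ (defined in the next subsection), so the difference matrix stays positive definite and, since $h > 1$, $HMO_2 > HMO_1$:

```python
import numpy as np

def hmo_pair(q, s2_w, s2_star, XtXw, XtXstar_tilde):
    # HMO_1 (eq. 13) scales both covariance blocks by s2_w;
    # HMO_2 (eq. 14) scales the RE block by s2_star instead.
    q = np.asarray(q, float)
    Aw = s2_w * np.linalg.inv(XtXw)
    A1 = s2_w * np.linalg.inv(XtXstar_tilde)
    A2 = s2_star * np.linalg.inv(XtXstar_tilde)
    hmo1 = float(q @ np.linalg.solve(Aw - A1, q))
    hmo2 = float(q @ np.linalg.solve(Aw - A2, q))
    return hmo1, hmo2

# illustrative numbers: X_W'X_W = 50, X~*'X~* = 100, gap q = 0.3
q = np.array([0.3])
hmo1, hmo2 = hmo_pair(q, s2_w=1.0, s2_star=1.6,
                      XtXw=np.array([[50.0]]),
                      XtXstar_tilde=np.array([[100.0]]))
```

With these inputs, $HMO_1 = 0.09/0.01 = 9$ and $HMO_2 = 0.09/0.004 = 22.5$, so even when both versions reject, their magnitudes can differ markedly.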

3.3.2. Main Results

To go further into the comparison of the two versions of the Hausman test statistic, we rewrite their expressions as:
$$HMO_1 = \hat{\beta}_{\Delta WRE}' \cdot \left( \hat{\sigma}_{\varepsilon w}^2 \cdot \Gamma \right)^{-1} \cdot \hat{\beta}_{\Delta WRE} \qquad (16)$$
$$HMO_2 = \hat{\beta}_{\Delta WRE}' \cdot \left( \hat{\sigma}_{\varepsilon w}^2 \cdot \tilde{\Gamma} \right)^{-1} \cdot \hat{\beta}_{\Delta WRE} \qquad (17)$$
where we define $\Gamma = (X_W' \cdot X_W)^{-1} - (\tilde{X}^{*\prime} \cdot \tilde{X}^*)^{-1}$ and $\tilde{\Gamma} = (X_W' \cdot X_W)^{-1} - h \cdot (\tilde{X}^{*\prime} \cdot \tilde{X}^*)^{-1}$.
From (16) and (17), the comparison between $HMO_1$ and $HMO_2$ reduces to the comparison between $\tilde{\Gamma}$ and $\Gamma$. Note that $(\tilde{X}^{*\prime} \cdot \tilde{X}^*) = (X_W' \cdot X_W) + \hat{\psi}^2 \cdot (X_{BC}' \cdot X_{BC})$, with $X_{BC} \equiv B_C \cdot X$, and can be rewritten as $(\tilde{X}^{*\prime} \cdot \tilde{X}^*) = H^* \cdot (X_W' \cdot X_W)$, where $H^* \equiv I_K + \hat{\psi}^2 \cdot (X_{BC}' \cdot X_{BC}) \cdot (X_W' \cdot X_W)^{-1}$. It follows that $\Gamma$ writes as $\Gamma = (X_W' \cdot X_W)^{-1} \cdot (H^* - I_K) \cdot H^{*-1}$ and $\tilde{\Gamma}$ as $\tilde{\Gamma} = (X_W' \cdot X_W)^{-1} \cdot (H^* - h \cdot I_K) \cdot H^{*-1}$.
Finally, define $h_{\min}^* \equiv \min \sigma(H^*)$ and $h_{\max}^* \equiv \max \sigma(H^*)$, with $\sigma(H^*)$ denoting the spectrum of $H^*$ and with $1 < h_{\min}^* \le h_{\max}^*$.
We establish the following results (see Appendix A for details and related proofs):
  • $\Gamma$ is a symmetric positive definite (SPD) matrix. It follows that $HMO_1$ is a positive-definite quadratic form.
  • $\tilde{\Gamma}$ can be either a symmetric positive or negative definite matrix, or even an indefinite matrix, depending on specific conditions holding for $h$. As a consequence, $HMO_2$ can be of either sign (and even of indeterminate sign a priori) depending on the values taken by $h$. Specifically, we have:
    (a) $\tilde{\Gamma}$ is a symmetric positive definite (SPD) matrix iff $h < h_{\min}^*$. In this case, $HMO_2$ is a positive-definite quadratic form.
    (b) $\tilde{\Gamma}$ is a symmetric negative definite (SND) matrix iff $h > h_{\max}^*$. In this case, $HMO_2$ is a negative-definite quadratic form.
    (c) $\tilde{\Gamma}$ is indefinite iff $h_{\min}^* \le h \le h_{\max}^*$. In this case, $HMO_2$ can be of either sign, which is indeterminate a priori.
Based on those results, comparing $HMO_2$ with $HMO_1$ depends on whether $\tilde{\Gamma}$ is SPD or SND. We then have:
  • If $\tilde{\Gamma}$ is SPD, the relevant comparison relies upon the magnitude $(HMO_2 - HMO_1)$. We have $HMO_2 \gtrless HMO_1$ whenever $h \gtrless 1$.
  • If $\tilde{\Gamma}$ is SND, it necessarily follows that $HMO_2 < 0 < HMO_1$.
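This classification can be checked mechanically: form $H^*$, take its spectrum, and compare $h$ with $h_{\min}^*$ and $h_{\max}^*$. A sketch with made-up cross-product matrices (our own helper, illustrative inputs only):

```python
import numpy as np

def gamma_tilde_definiteness(XtXw, XtXbc, psi2_hat, h):
    # Classify Gamma~ via the spectrum of
    # H* = I_K + psi_hat^2 (X_BC'X_BC)(X_W'X_W)^{-1}.
    K = XtXw.shape[0]
    H_star = np.eye(K) + psi2_hat * XtXbc @ np.linalg.inv(XtXw)
    eig = np.linalg.eigvals(H_star).real   # similar to an SPD matrix -> real spectrum
    h_min, h_max = eig.min(), eig.max()
    if h < h_min:
        kind = "SPD"
    elif h > h_max:
        kind = "SND"
    else:
        kind = "indefinite"
    return kind, h_min, h_max

XtXw = np.diag([50.0, 10.0])               # within cross-products (illustrative)
XtXbc = np.diag([25.0, 40.0])              # centered between cross-products
kind_spd, hmin, hmax = gamma_tilde_definiteness(XtXw, XtXbc, psi2_hat=0.5, h=1.1)
kind_snd, _, _ = gamma_tilde_definiteness(XtXw, XtXbc, psi2_hat=0.5, h=3.5)
```

Here $H^* = \mathrm{diag}(1.25, 3.0)$, so $h = 1.1$ yields an SPD $\tilde{\Gamma}$ while $h = 3.5$ yields an SND one.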
In establishing those results, we directly echo the discussions, mentioned in the introduction, about the positive definiteness of the variance-covariance matrix estimator in the expression of the Hausman test statistic. As we observe, whether this matrix is SPD or not (which translates into whether $\Gamma$ or $\tilde{\Gamma}$ is SPD or not) depends on the choice of the estimator for $\sigma_\varepsilon^2$, which itself hinges on the approach adopted to estimate the variance components associated with the RE model. In other terms, $\Gamma$ is, by construction, an SPD matrix, and this has to be related to the use of the same estimator for $\sigma_\varepsilon^2$, i.e., $\hat{\sigma}_{\varepsilon w}^2$, in the computation of the covariance matrix estimator. On the other hand, this is not necessarily the case for $\tilde{\Gamma}$, and this has to do with the fact that two different estimators, $\hat{\sigma}_{\varepsilon w}^2$ and $\hat{\sigma}_\varepsilon^{*2}$, have been considered for $\sigma_\varepsilon^2$ ($h \neq 1$).
Table 5 provides an overview of all possible cases. A wide spectrum of outcomes can be obtained for the value of $HMO_2$ depending on the value of $h$. Unreliable results for the test may arise, notably, when $HMO_2$ is negative.

3.3.3. Back to the Case Studies

Based on $h$ and the proposed metrics for $H^*$, we can now analyze the mechanisms driving the various outcomes observed for the case studies selected in Section 2.2.
From Table 1, Table 2, Table 3 and Table 4, in 3 cases out of 4, $h_{\min}^* < h < h_{\max}^*$. It follows that in those cases, $\tilde{\Gamma}$ is an indefinite matrix. As a consequence, the sign of $HMO_2$ is a priori indeterminate, as are the relative magnitudes of $HMO_1$ and $HMO_2$. The observed outcome depends on the specific value taken by $\hat{\beta}_{\Delta WRE}$ for the sample considered.
Conversely, in the two-covariate regression case drawn from Greene, where $HMO_2$ is computed as a negative scalar, we logically have $h > h_{\max}^*$ and even $h > 2 \cdot h_{\max}^* - 1$, which is consistent with $HMO_1 > |HMO_2|$, as we indeed observe.

3.4. What about h?

As we have shown, the value of $h$ is key for determining the outcome of the Hausman test when it is measured through $HMO_2$. In this section, we analyze the main determinants of this ratio and provide an illustration through simulations in the single-regressor case.

3.4.1. Determinants

The value of $h$ essentially derives from the comparison between $\hat{\sigma}_{\varepsilon w}^2$ and $\hat{\sigma}_\varepsilon^{*2}$ and, therefore, from the two residual sums of squares that are associated with, respectively, the Within model ($\hat{u}_W' \cdot \hat{u}_W$) and the feasible quasi-demeaned model ($\hat{\tilde{u}}^{**\prime} \cdot \hat{\tilde{u}}^{**}$). We show (see Appendix B) that the following relationship holds between the two expressions:
$$\hat{\tilde{u}}^{**\prime} \cdot \hat{\tilde{u}}^{**} = \hat{u}_W' \cdot \hat{u}_W + \hat{\psi}^2 \cdot \hat{u}_B' \cdot \hat{u}_B + \Delta^* \qquad (18)$$
where $\hat{u}_B$ denotes the $NT$ vector of the OLS regression residuals for the Between (transformed) regression model $y_B = \iota_{NT} \cdot \alpha + X_B \cdot \beta + u_B$, with $y_B \equiv B \cdot y$, $X_B \equiv B \cdot X$, $u_B \equiv B \cdot u$, and where $\Delta^*$ is defined as:
$$\Delta^* \equiv \left( \hat{\beta}_W - \hat{\beta}_{RE} \right)' \cdot \Gamma^{-1} \cdot \left( \hat{\beta}_W - \hat{\beta}_{RE} \right) \qquad (19)$$
where $\Gamma$ is defined as in (16) and $\Gamma^{-1}$ can be written as $\Gamma^{-1} = (\hat{\psi}^2)^{-1} \cdot (X_W' \cdot X_W) \cdot (X_{BC}' \cdot X_{BC})^{-1} \cdot (\tilde{X}^{*\prime} \cdot \tilde{X}^*)$. From the definitions of $\hat{\sigma}_{\alpha\varepsilon}^2$ and $\hat{\sigma}_{\varepsilon w}^2$, we obtain $\hat{u}_B' \cdot \hat{u}_B = (N - (K+1)) \cdot \hat{\sigma}_{\alpha\varepsilon}^2$ and $\hat{u}_W' \cdot \hat{u}_W = (N(T-1) - K) \cdot \hat{\sigma}_{\varepsilon w}^2$, so that (18) can be rewritten as:
$$\hat{\sigma}_\varepsilon^{*2} = \left( 1 - \frac{K}{NT - (K+1)} \right) \cdot \hat{\sigma}_{\varepsilon w}^2 + \frac{\Delta^*}{NT - (K+1)} \qquad (20)$$
from which it follows that:
$$h = 1 + \eta \cdot \left( \frac{\Delta^*}{K \cdot \hat{\sigma}_{\varepsilon w}^2} - 1 \right)$$
with $\eta \equiv K/(NT - (K+1))$. Furthermore, considering the definition of $HMO_1$ given in (16), we have $\Delta^* = \hat{\sigma}_{\varepsilon w}^2 \cdot HMO_1$ and therefore:
$$h = 1 + \frac{HMO_1 - K}{NT - (K+1)} \qquad (21)$$
so that $h \gtrless 1$ whenever $HMO_1 \gtrless K$.
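Relationship (21) is an exact finite-sample identity, so it can be verified to machine precision on any simulated panel once all quantities are computed with the definitions above (Swamy-Arora $\hat{\psi}^2$, Within and FQDM residual variances). A numpy check under an arbitrary data-generating process of our own:

```python
import numpy as np

rng = np.random.default_rng(2)
N, T, K = 40, 5, 2
NT = N * T

alpha_i = rng.normal(size=(N, 1))
X = rng.normal(size=(NT, K)) + 0.6 * np.repeat(alpha_i, T, axis=0)
y = (1.0 + X @ np.array([0.5, -1.0])
     + np.repeat(alpha_i, T, axis=0).ravel() + rng.normal(size=NT))

B = np.kron(np.eye(N), np.ones((T, T)) / T)
Wm = np.eye(NT) - B
iota = np.ones((NT, 1))
B_C = B - iota @ iota.T / NT

# Within: beta_hat_W and sigma_hat_eps_w^2
Xw, yw = Wm @ X, Wm @ y
bW = np.linalg.lstsq(Xw, yw, rcond=None)[0]
s2_w = ((yw - Xw @ bW) @ (yw - Xw @ bW)) / (N * (T - 1) - K)

# Between: sigma_hat_alpha_eps^2 and psi_hat^2 (Swamy-Arora)
Zb = np.column_stack([np.ones(NT), B @ X])
ub = B @ y - Zb @ np.linalg.lstsq(Zb, B @ y, rcond=None)[0]
s2_ae = (ub @ ub) / (N - (K + 1))
psi2 = s2_w / s2_ae

# FGLS slope and FQDM residual variance
OmC_inv = Wm + psi2 * B_C
bRE = np.linalg.solve(X.T @ OmC_inv @ X, X.T @ OmC_inv @ y)
Om_half = Wm + np.sqrt(psi2) * B
Zq = np.column_stack([np.sqrt(psi2) * np.ones(NT), Om_half @ X])
uq = Om_half @ y - Zq @ np.linalg.lstsq(Zq, Om_half @ y, rcond=None)[0]
s2_star = (uq @ uq) / (NT - (K + 1))
h = s2_star / s2_w

# HMO_1 from (13)/(16), then the identity (21)
q = bW - bRE
Gamma = np.linalg.inv(Xw.T @ Xw) - np.linalg.inv(X.T @ OmC_inv @ X)
hmo1 = float(q @ np.linalg.solve(s2_w * Gamma, q))
h_from_identity = 1.0 + (hmo1 - K) / (NT - (K + 1))
```

The directly computed ratio `h` and `h_from_identity` coincide up to floating-point error, which is a useful internal consistency check for any implementation.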
Taking advantage of the relationship between $h$ and $HMO_1$, we identify two categories of determinants for $HMO_1$ and $h$ from (16) and (21):
  • The first category is related to the structure of the data at hand, i.e., the Between and Within components of the (empirical) covariance matrix of the explanatory variables. They are captured by the matrices $(X_W' \cdot X_W)$ and $(X_{BC}' \cdot X_{BC})$, which influence the structure of $H^*$ (and therefore the magnitude of its eigenvalues).
  • The second category is linked to the correlation between the individual effects $\alpha_i^*$ and the regressors contained in $X_{it}$, which determines the extent of the (asymptotic as well as finite sample) bias of $\hat{\beta}_{RE}$ (with respect to $\beta$). This affects the gap between $\hat{\beta}_W$ (which is unbiased) and $\hat{\beta}_{RE}$ and, therefore, the value of $\hat{\beta}_{\Delta WRE}$.
These same factors in turn influence the determination of $HMO_2$, the value of which is mostly linked to the comparison between $h$ and the eigenvalues of $H^*$.
With respect to the behavior of $HMO_2$ compared to $HMO_1$, two mechanisms are at work. Assume that $HMO_1$ is large because of a significant distance between the RE and Within estimators. On the one hand, we could expect $HMO_2$ to be large as well, so that both statistics correctly lead to rejecting the null hypothesis. Indeed, the larger $HMO_1$ is, the further $h$ lies above 1, and thus the more likely the condition $1 < h$ is to prevail, which leads to $HMO_2 > HMO_1$ as long as $h$ remains below $h_{\min}^*$. On the other hand, the larger $HMO_1$ is, the more likely it actually becomes that $h > h_{\min}^*$, and this could be all the more the case as the structure of the covariance matrix makes $h_{\min}^*$ (or even $h_{\max}^*$) relatively small. In this case, the sign of $HMO_2$ becomes indeterminate, which does not allow for a clear conclusion about the relative magnitude of the two test statistics and creates a possible divergence in the interpretation of the test.

3.4.2. Illustrations in the Single Regressor Case

To illustrate the role of the previous factors as well as to clarify their interpretation, we perform Monte-Carlo simulations on the behavior of the main magnitudes involved in the comparison between $HMO_1$ and $HMO_2$ in a single-regressor model ($K = 1$).

Preliminary Results

In such a setting, $X = x$ and the various expressions for the main estimators and statistics simplify accordingly. We measure the total sample variance of the observations of $x$ with $s_x^2$, where $s_x^2 = (N \cdot T)^{-1} \cdot x_T^2$ and $x_T^2 \equiv x' \cdot I_C \cdot x$, with $I_C \equiv I_{NT} - B_{NT}$. We then define $x_W^2 \equiv x' \cdot W \cdot x$ and $x_{BC}^2 \equiv x' \cdot B_C \cdot x$. By construction, we have $x_T^2 = x_{BC}^2 + x_W^2$. Hence, $x_W^2$ and $x_{BC}^2$ can be used as measures of, respectively, the Within and the Between components (up to an $NT$ factor) of the total (empirical) variance of the $NT$ observations contained in $x$. Finally, define $\theta_W \equiv x_W^2 / x_T^2$, the share of the Within variance in the total variance. Then, substituting, we obtain:
$$h_{min} = h_{max} \equiv h^* = 1 + \hat{\psi}^2 \cdot \frac{x_{BC}^2}{x_W^2} = 1 + \hat{\psi}^2 \cdot \frac{1 - \theta_W}{\theta_W}$$
$$HMO_1 = \nu \cdot x_W^2 \cdot \frac{h^*}{h^* - 1}, \qquad h = 1 + \eta \cdot \left( \nu \cdot x_W^2 \cdot \frac{h^*}{h^* - 1} - 1 \right), \qquad HMO_2 = \nu \cdot x_W^2 \cdot \frac{h^*}{h^* - h}$$
with $\nu = (\hat{\sigma}_\varepsilon^2)^{-1} \cdot (\hat{\beta}_{\Delta WRE})^2$ and $\eta = 1/(NT - 2)$.
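The closed forms above are easy to explore numerically. The sketch below is a minimal illustration with arbitrary assumed values for $\hat{\psi}^2$, $\theta_W$, $x_W^2$, $\nu$, $N$ and $T$ (none of them estimates from real data); it shows in particular how $HMO_2$ changes sign once $h$ exceeds $h^*$:

```python
import numpy as np

# Assumed illustrative inputs (not taken from the paper's examples):
psi2 = 0.25        # quasi-demeaning weight psi-hat squared
theta_w = 0.6      # within share of the total variance of x
xw2 = 50.0         # within sum of squares x_W^2
nu = 0.8           # (sigma_eps^2)^{-1} * (beta_hat_DeltaWRE)^2
N, T = 20, 10
eta = 1.0 / (N * T - 2)

# h_min = h_max = h* in the single-regressor case
h_star = 1.0 + psi2 * (1.0 - theta_w) / theta_w

HMO1 = nu * xw2 * h_star / (h_star - 1.0)
h = 1.0 + eta * (HMO1 - 1.0)
HMO2 = nu * xw2 * h_star / (h_star - h)

print(h_star, HMO1, h, HMO2)
```

With these values, $h > h^*$, so $HMO_2$ comes out negative while $HMO_1$ is large and positive, which is exactly the divergence discussed above.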

Design of Simulations

We perform Monte-Carlo simulations on the behavior of the previous four quantities, $h^*$, $h$, $HMO_1$ and $HMO_2$. The details of the simulation design are presented in Appendix C. We generate several series of $y_{it}^a$ based on a model $y_{it}^a = \alpha + \beta \cdot x_{it}^a + u_{it}^a$, where we fix $\alpha = \beta = 1$ and let the other parameters of the simulation vary: $N = (20, 40, 80)$, $T = (20, 40, 80)$, $s_x^2 = (0.01, 1, 100)$, $\theta_W = (0.1, 0.5, 0.9)$, $\sigma_u^2 = (0.01, 1, 100)$, $\rho_{xu} = (-0.99, -0.9, -0.5, 0, 0.5, 0.9, 0.99)$ and $\rho_u = (0.1, 0.5, 0.9)$. With the same notation as before, $\rho_{xu}$ is the correlation between $x$ and $u$ (on the cross-sectional dimension) and $\rho_u$ is the intra-class coefficient of the error term (the share of the individual effect variance in the total variance of $u$)17. Then, for each combination of the parameters, we perform 199 replications and compute the mean and the median of $HMO_1$ and $HMO_2$. To explore the main dimensions of variability of $HMO_1$ and $HMO_2$, we perform an ANCOVA analysis (Table A2 in Appendix C) where we regress, respectively, each of the means and medians on the levels of the different parameters and on the interactions that we found significant.

Results of the Simulations

The results are similar whether we consider the means or the medians of $HMO_1$ and $HMO_2$ over the replications for each combination of parameters. The values of $HMO_1$ and $HMO_2$ depend significantly on $T$, on $\theta_W$, on high absolute values of $\rho_{xu}$, and on $\rho_u$. Conversely, the scale parameters $s_x^2$ and $\sigma_u^2$ do not have a significant impact on $HMO_1$ and $HMO_2$.
The visual representation of the behavior of (the median) H M O 1 and H M O 2 for N = 80 , T = 80 , s x 2 = σ u 2 = 1 and ρ u = 0.5 is in Figure 1. This illustrates the strong dependence of the values of the Hausman statistics on θ W on the one hand and on the correlation between x and u on the other hand. In particular, for high absolute values of ρ x u and high values of θ W , H M O 2 even yields negative values.
This latter result is, for example, consistent with the regression outcomes obtained with log fuelprice as a single covariate in the framework of motivating Example 2 [Airline]. In this case (see Table A3), $HMO_2$ is computed as a negative scalar with $HMO_1 > HMO_2$. Note also the extremely large value of $\theta_W$ for this covariate, which partly drives the values taken by $h^*$ and $h$.
Finally, looking more closely at the distribution of negative values for $HMO_2$ (Figure A1 in Appendix C), we clearly see that their frequency increases with $\rho_u$, $\theta_W$ and with $\rho_{xu}$ (the correlation between $x$ and $u$) in absolute value. While the empirical size of both tests remains around 5% with little variation across the values of the parameters, the power of $HMO_2$ consequently drops to 0 for the highest values of $\rho_u$, $\theta_W$ and $\rho_{xu}$ (Figure A2 in Appendix C).

4. The Implementation of the Hausman Test in Standard Econometric Software Packages for Panel Data: A Brief Review and Discussion

In this section, we review how six well-known econometric software packages deal with the implementation of the Hausman test in a standard panel data model and provide some discussion.

4.1. Review

  • STATA programming commands for the estimation of random-effects panel data models (xtreg with the re option) rely on the specification of the quasi-demeaned model in (8). As a consequence, the random effects parameter estimates as well as their "conventional" covariance matrix estimate are provided as standard outputs of the OLS regression performed on that model. In particular, the vce(conventional) - default - command yields the (asymptotic) covariance matrix estimate based on the standard variance estimator for OLS regression. This corresponds to $\widehat{var}_2[\hat{\beta}_{FGLS}]$ with, accordingly, $\hat{\sigma}_{\varepsilon^*}^2$ used as the residual variance estimate. The default version of the command for implementing the standard Hausman test (the hausman command) accordingly corresponds to $HMO_2$.
  • The R plm package developed by Croissant and Millo (2008) allows estimating a wide range of panel data models with the R software. Regarding the random-effects specification, the estimation process can be implemented via the plm function, whose model argument takes the random option. Croissant and Millo (2008) point out that it could have been possible to program the computation of the covariance matrix estimator for $\hat{\beta}_{FGLS}$ directly from formula (11), "once the variance components have been estimated and hence the covariance matrix of errors". However, to limit the computational costs associated with the inversion of the $(NT \times NT)$ matrix $\hat{\Omega}$ or $\hat{\Omega}_C^*$ and the related memory required to store it, plm resorts to the specification and estimation of the quasi-demeaned model (8). The coefficients' covariance matrix estimator is then readily calculated by applying the standard OLS formulas, which, in the R language, go through the vcov() command.
    The phtest command computes the Hausman test in plm. Its main arguments are the two panel model objects that underlie the comparison (e.g., model = within and model = random). The corresponding estimates of the asymptotic covariance matrices provided under both models are thus used to compute the Hausman statistic, which corresponds to $HMO_2$.
  • EViews estimates the random effects models using feasible GLS. The first step refers to the estimation of the covariance matrix for the composite error formed by the effects and the idiosyncratic disturbance. The EViews 9 User's Guide II notes: "Once the component variances have been estimated, we form an estimator of the composite residual covariance, and then GLS transform the dependent and regressor data". As for the computation of the FGLS estimate, EViews uses the quasi-demeaned model specification and proceeds on this basis. However, the calculation of the related coefficients' covariance matrix is based on the direct application of formula (11) and thus corresponds to $\widehat{var}_1[\hat{\beta}_{FGLS}]$. The procedure for the Hausman test then corresponds to $HMO_1$.
  • MATLAB provides estimation methods for the standard fixed (Within), between, and random effects models with the panel data toolbox. Panel data models are estimated using the panel(·) function with the options argument set to re for the random effects model specification. The random effects FGLS estimates are based on the quasi-demeaned model, and the asymptotic variance-covariance matrix for statistical inference is accordingly provided by $\widehat{var}_2[\hat{\beta}_{FGLS}]$ (see Equation (18) in Alvarez et al. (2017)). Then, hausmantest computes the Hausman test, where the input of the hausmantest function requires the output structures of the two estimations to be compared. Accordingly, the statistic that is computed corresponds to $HMO_2$.
  • GAUSS: The GAUSS Time Series MT 3.0 (TSMT) suite provides a fixed effects and random effects models (TSCS) package that can be implemented through the tsmt library and the one-in-all tscsFit procedure. Another possibility is to use the pdlib GAUSS library and the randomEffects procedure in it. Both procedures implement the quasi-demeaning transformation on the original dataset and apply the standard OLS estimator to the transformed data so as to form the FGLS estimate. The covariance matrix estimate comes as a direct by-product of the OLS outcome, so that $\widehat{var}_2[\hat{\beta}_{FGLS}]$ is used. The Hausman test provided in the tscsFit procedure is implemented accordingly and corresponds to $HMO_2$.
  • SAS (SAS ETS 13.2) provides estimation methods for the standard fixed (Within), between, and random effects models in the balanced and unbalanced cases with the PANEL procedure toolbox. Standard panel data models are estimated using the PROC PANEL command with the MODEL statement specifying the regression model and the assumptions about the error structure. Specifically, FIXONE and RANONE must be used to specify the fixed-effect and the random-effect models, respectively (in the cross-sectional one-way case). In the latter case, various methods (but not the Swamy-Arora approach) are proposed to estimate, in the first stage, the variance components (through the VCOMP = option). It is explicitly indicated that, in the balanced case, the random effects FGLS estimates are then based on these variance component estimates through the quasi-demeaning approach, where 'the random effects $\beta$ is then the result of simple OLS on the transformed data' (see SAS ETS 13.2 User Manual (2014), p. 1417). The estimator for the asymptotic variance-covariance matrix is thus provided by $\widehat{var}_2[\hat{\beta}_{FGLS}]$. The Hausman statistic is automatically generated and reported as a conventional F statistic, with the statistic computed as $HMO_2$.

4.2. Discussion

As the previous review indicates, in all but one of the packages discussed above, the Hausman test is, by default, implemented through the computation of $HMO_2$ with the quasi-demeaning estimator. The rationale for such a choice is computational. Indeed, the quasi-demeaning approach avoids the inversion of the $(NT \times NT)$ matrix $\hat{\Omega}$ or $\hat{\Omega}_C^*$, which can be computationally costly (in terms of time and rounding errors). Conversely, the quasi-demeaned model only requires partially demeaning the variables, with $\theta \equiv 1 - \hat{\psi}$ as the partial demeaning factor. Yet, as a counterpart of this standard OLS regression, $\hat{\sigma}_{\varepsilon^*}^2$ is naturally chosen as the residual variance estimate, which, in turn, yields $HMO_2$; as we have seen, this might be an unreliable statistic for the Hausman test. In what follows, we explore some ways to circumvent the problems posed by the use of this statistic.
(1) First, it is possible to compute $HMO_1$ while still relying on the quasi-demeaning approach to estimate the parameters of the RE model (which, as mentioned before, is the default in the vast majority of available econometric software). This can easily be seen from relationship (21) that we established between $h$ and $HMO_1$. Once $h$ has been determined, which only requires the OLS residual sums of squares from the Within and the quasi-demeaned regressions, we can derive $HMO_1$. Hence, the following procedure can be suggested, if required, to supplement the existing programs.
  • Use the quasi-demeaning estimator to compute the RE estimator for β and σ ^ ε * 2 .
  • Use σ ^ ε * 2 and σ ^ ε w 2 (within regression) to compute h.
  • Rearranging (21), obtain $HMO_1$ from $h$ as: $HMO_1 = (NT - K - 1) \cdot (h - 1) + K$.
  • Implement the Hausman test on the basis of H M O 1 .
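The steps above can be sketched in a few lines of numpy. The illustration below makes several assumptions: a simulated single-regressor panel, Swamy-Arora variance components with $\hat{\psi}^2 = \hat{\sigma}_{\varepsilon w}^2 / \hat{\sigma}_1^2$, $\hat{\sigma}_{\varepsilon w}^2 = RSS_W/(NT-N-K)$ and $\hat{\sigma}_{\varepsilon^*}^2 = RSS^{**}/(NT-K-1)$; other degrees-of-freedom conventions would shift $h$ slightly. Under these conventions, the $HMO_1$ recovered from $h$ coincides with the direct quadratic form in $\hat{\beta}_W - \hat{\beta}_{FGLS}$:

```python
import numpy as np

rng = np.random.default_rng(0)
N, T, K = 30, 6, 1

# Simulated panel with individual effects correlated with the regressor
alpha = rng.normal(size=N)
x = (alpha[:, None] + rng.normal(size=(N, T))).ravel()
u = 0.8 * np.repeat(alpha, T) + rng.normal(size=N * T)
y = 1.0 + x + u

xbar, ybar = x.reshape(N, T).mean(1), y.reshape(N, T).mean(1)

# Within (FE) regression
xw, yw = x - np.repeat(xbar, T), y - np.repeat(ybar, T)
beta_w = (xw @ yw) / (xw @ xw)
sig2_w = ((yw - beta_w * xw) ** 2).sum() / (N * T - N - K)

# Between regression and Swamy-Arora psi-hat
xb, yb = xbar - xbar.mean(), ybar - ybar.mean()
beta_b = (xb @ yb) / (xb @ xb)
sig2_1 = T * ((yb - beta_b * xb) ** 2).sum() / (N - K - 1)
psi = np.sqrt(sig2_w / sig2_1)

# Step 1: quasi-demeaned OLS gives beta_FGLS and sigma*^2
xs = x - (1 - psi) * np.repeat(xbar, T)
ys = y - (1 - psi) * np.repeat(ybar, T)
Z = np.column_stack([np.ones(N * T), xs])
coef, *_ = np.linalg.lstsq(Z, ys, rcond=None)
beta_fgls = coef[1]
sig2_star = ((ys - Z @ coef) ** 2).sum() / (N * T - K - 1)

# Steps 2-3: h, then HMO1 recovered from relation (21)
h = sig2_star / sig2_w
HMO1_from_h = (N * T - K - 1) * (h - 1.0) + K

# Direct HMO1 from the quadratic form with Gamma (K = 1)
xw2, xbc2 = xw @ xw, T * (xb @ xb)
gamma = 1.0 / xw2 - 1.0 / (xw2 + psi ** 2 * xbc2)
HMO1_direct = (beta_w - beta_fgls) ** 2 / (sig2_w * gamma)

print(HMO1_from_h, HMO1_direct)
```

The two numbers agree up to numerical precision, so $HMO_1$ can indeed be obtained without ever inverting the $(NT \times NT)$ error covariance matrix.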
(2) Second, depending on the software packages considered, some programming options can be used to fix the potential ‘variance disconnect’ problem associated with the use of the statistic H M O 2 .
For example, in STATA, it is possible to use the sigmamore and/or sigmaless options when implementing the Hausman test. As indicated in the STATA instructions: "sigmamore and sigmaless specify that the two covariance matrices used in the test be based on a common estimate of disturbance variance. sigmamore specifies that the covariance matrices be based on the estimated disturbance variance from the efficient estimator. sigmaless specifies that the covariance matrices be based on the estimated disturbance variance from the consistent estimator". Following the lines of Hausman's seminal approach would lead to the choice of the sigmaless option, whereby the variance estimator based on the Within model would be used18. This ensures that the test is performed upon the $HMO_1$ statistic. The choice of sigmamore19 would imply considering a third test statistic, $HMO_3$, where the common disturbance variance estimator is based on the quasi-demeaned model, so that we would have: $HMO_3 = \hat{\beta}_{\Delta WRE}' \cdot (\hat{\sigma}_{\varepsilon^*}^2)^{-1} \cdot \Gamma^{-1} \cdot \hat{\beta}_{\Delta WRE}$. Comparing $HMO_3$ and $HMO_1$, we observe that $HMO_3 = HMO_1 / h$, so that $HMO_1 > HMO_3$ whenever $h > 1$. Conversely, whenever $h < 1$, relying on $HMO_3$ would favor (possibly unduly) the rejection of the null hypothesis.
Some packages also offer the possibility to rely on an alternative expression for the Hausman statistic that does not involve the RE estimator, so that it is immune to the variance disconnect problem. This expression was initially proposed by Hausman and Taylor (1981) and is based on the difference between the Within and Between estimators, $\hat{q}^{**} \equiv \hat{\beta}_W - \hat{\beta}_B$. Hausman and Taylor (1981, pp. 1382–83) establish that the resulting version of the Hausman statistic is numerically identical to the one built upon $\hat{q}$ and used above, that is: $\hat{q}' \cdot [\widehat{var}(\hat{q})]^{-1} \cdot \hat{q} = \hat{q}^{**\prime} \cdot [\widehat{var}(\hat{q}^{**})]^{-1} \cdot \hat{q}^{**}$. Such a solution can notably be implemented in the R plm package, using the phtest command and specifying as arguments model = within and model = between.
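This numerical equivalence is easy to verify. The sketch below is a self-contained illustration on simulated data with $K = 1$, using the common within-based disturbance variance $\hat{\sigma}_{\varepsilon w}^2$ on both sides and an arbitrarily fixed $\hat{\psi}$ (the identity holds for any $\hat{\psi} > 0$); it computes the statistic once from $\hat{q} = \hat{\beta}_{FGLS} - \hat{\beta}_W$ and once from $\hat{q}^{**} = \hat{\beta}_W - \hat{\beta}_B$:

```python
import numpy as np

rng = np.random.default_rng(1)
N, T = 25, 8

alpha = rng.normal(size=N)
x = (alpha[:, None] + rng.normal(size=(N, T))).ravel()
y = 1.0 + x + 0.7 * np.repeat(alpha, T) + rng.normal(size=N * T)

xbar, ybar = x.reshape(N, T).mean(1), y.reshape(N, T).mean(1)
xw, yw = x - np.repeat(xbar, T), y - np.repeat(ybar, T)
xb, yb = xbar - xbar.mean(), ybar - ybar.mean()

beta_w = (xw @ yw) / (xw @ xw)
beta_b = (xb @ yb) / (xb @ xb)

psi = 0.5                           # assumed value; any psi > 0 works
w, b = xw @ xw, T * (xb @ xb)       # within and (centered) between moments
s = w + psi ** 2 * b                # quasi-demeaned moment
beta_fgls = (psi ** 2 * b * beta_b + w * beta_w) / s   # matrix-weighted average

sig2_w = ((yw - beta_w * xw) ** 2).sum() / (N * T - N - 1)

# Version 1: q = beta_FGLS - beta_W with var = sig2_w * Gamma
q = beta_fgls - beta_w
gamma = 1.0 / w - 1.0 / s
stat1 = q ** 2 / (sig2_w * gamma)

# Version 2: q** = beta_W - beta_B with var = sig2_w * (1/w + 1/(psi^2 b))
qss = beta_w - beta_b
stat2 = qss ** 2 / (sig2_w * (1.0 / w + 1.0 / (psi ** 2 * b)))

print(stat1, stat2)
```

The two quadratic forms coincide exactly, as Hausman and Taylor (1981) establish.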
(3) Finally, two other approaches that depart from the separate estimation of the FE and RE model parameters - which underlies the standard implementation of the Hausman test - can be emphasized. They have the advantage of solving the disturbance variance estimator disconnect problem, while allowing, more generally, for a robust implementation of the Hausman test20.
(3.1) The first of these approaches relies on implementing an auxiliary regression, that was initially proposed by Hausman himself together with the presentation of the standard specification test (see also Mundlak 1978). This regression takes the following form:
$$\tilde{y}^{**} = \tilde{Z}^{**} \cdot \delta + X_W \cdot \gamma + \eta$$
with $\tilde{Z}^{**} \equiv [\tilde{\omega}^{**} \; \tilde{X}^{**}]$, where $\tilde{\omega}^{**} \equiv \hat{\psi} \cdot \iota_{NT}$ and $\tilde{X}^{**} \equiv \hat{\Omega}^{*-\frac{1}{2}} \cdot X$; $\tilde{y}^{**} \equiv \hat{\Omega}^{*-\frac{1}{2}} \cdot y$ and $\eta$ a vector of standard random disturbances.
It can be shown that the standard Wald test statistic for testing whether $\gamma = 0$ in the previous regression framework is equivalent to the standard Hausman test statistic expressed in terms of the difference between the Between and Within estimators (see Hausman and Taylor 1981 and above)21. Resorting to this auxiliary regression framework has two advantages. First, it involves only one covariance matrix estimator in the Wald test statistic formula, the one for $\gamma$, so that it is immune to the positive definiteness problem that can be encountered with the standard Hausman test statistic. Second, as underlined by Baltagi and Liu (2007), it can be made robust to heteroskedasticity of unknown form (see, also, Arellano 1993).22 Once the variables have been transformed to be included as regressors in the auxiliary regression framework, the latter can be implemented in a rather standard way in any of the econometric software packages we have reviewed supra.
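A closely related and even simpler variant is Mundlak's form of the auxiliary regression, which regresses the untransformed $y$ on $X$ and on the individual means $\bar{X}_i$: in a balanced panel, the OLS slope on $X$ is exactly $\hat{\beta}_W$ and the coefficient on the means is exactly $\hat{\beta}_B - \hat{\beta}_W$, so the Wald test on the latter reproduces the Between-versus-Within comparison. The sketch below is an illustration on simulated data (not the paper's exact GLS-transformed regression):

```python
import numpy as np

rng = np.random.default_rng(2)
N, T = 40, 5

alpha = rng.normal(size=N)
x = (alpha[:, None] + rng.normal(size=(N, T))).ravel()
y = 1.0 + x + 0.9 * np.repeat(alpha, T) + rng.normal(size=N * T)

xbar = np.repeat(x.reshape(N, T).mean(1), T)     # individual means, expanded

# Mundlak auxiliary regression: y on [1, x, xbar]
Z = np.column_stack([np.ones(N * T), x, xbar])
coef, *_ = np.linalg.lstsq(Z, y, rcond=None)
beta_x, gamma = coef[1], coef[2]

# Benchmarks: Within and Between estimators
ybar = np.repeat(y.reshape(N, T).mean(1), T)
xw, yw = x - xbar, y - ybar
beta_w = (xw @ yw) / (xw @ xw)
xbc = x.reshape(N, T).mean(1) - x.mean()
ybc = y.reshape(N, T).mean(1) - y.mean()
beta_b = (xbc @ ybc) / (xbc @ xbc)

print(beta_x, beta_w, gamma, beta_b - beta_w)
```

The slope on $x$ reproduces the Within estimator and the slope on the means reproduces $\hat{\beta}_B - \hat{\beta}_W$, which is the standard exact result for balanced panels (Mundlak 1978).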
(3.2) The second approach goes through implementing White (1982)’s reformulated Hausman specification test that is based on the Maximum Likelihood (ML) estimation of the FE and RE model parameters. The related test statistic takes the form:
$$H_W = n \cdot (\tilde{\beta}_{MLE}^W - \tilde{\beta}_{MLE}^{RE})' \cdot \tilde{S}^{-1} \cdot (\tilde{\beta}_{MLE}^W - \tilde{\beta}_{MLE}^{RE})$$
with $n \equiv N \cdot T$ and $\tilde{\beta}_{MLE}^W$ (resp. $\tilde{\beta}_{MLE}^{RE}$) denoting the ML estimator related to the Within (resp. RE) regression framework; $\tilde{S}$ serves as the covariance matrix estimator and involves the information matrices for both estimators of $\beta$, as well as outer products of the scores within and between the two models under concern. White (1982) shows that $\tilde{S}$ remains positive definite even under misspecification (including heteroskedasticity).
The last two procedures we have presented could even be suggested for use in the first place when assessing the relevance of the RE-model specification, as they allow for a globally robust implementation of the Hausman test in the context of panel data models (if only because they do not require to be directly based on the RE (FGLS) disturbance variance estimator).

5. Conclusions

In this paper, we provide new analytical results on the behavior of the Hausman statistic for the test of orthogonality between the individual effects and the error term in a static and balanced panel data model. We compare the Hausman statistic computed with the direct FGLS implementation and the Hausman statistic computed from the quasi-demeaned model. We show that the difference between the two statistics depends upon several parameters, in particular the between-within structure of the regressors. We show, by means of Monte Carlo simulations in the single regressor case and of a set of well-known textbook examples, that the difference can be substantial and that, in some cases, the Hausman statistic computed on the basis of the quasi-demeaned model can even yield large negative values. Therefore, despite its computational advantage, the quasi-demeaned model should not be used prima facie as the basis for the computation of the Hausman statistic. We suggest, if needed, supplementing the existing software instructions so as to be able to compute the relevant statistic in any case. Extensions include deriving these analytical results for unbalanced panel data models, two-way component models and dynamic panel models.

Author Contributions

Conceptualization, M.-A.S.; methodology, J.L.G. and M.-A.S.; validation, J.L.G.; formal analysis, M.-A.S.; investigation, M.-A.S.; resources, J.L.G.; data curation, J.L.G. and M.-A.S.; writing—original draft preparation, M.-A.S.; writing—review and editing, J.L.G. and M.-A.S.; visualization, J.L.G.; supervision, J.L.G.; project administration, M.-A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data are available upon request from the authors.

Acknowledgments

The authors thank the referees for their very useful comments and their careful reading of the first submitted version, which helped improve the paper. They also gratefully acknowledge the invaluable help of Alain Bachelot regarding the analysis of the definiteness of some matrices in the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Hausman Test Statistics

Appendix A.1. Matrix Conditions

We first prove the following lemma (denoted below as Lemma A1):
Lemma A1.
Let $R$ and $M$ be two (square) matrices of size $K$. We assume that $R$ and $(R \cdot M)$ are symmetric positive definite (SPD). Then, $M$ can be diagonalized and there exists a non-singular matrix $P$ such that:
$$P' \cdot R \cdot P = I \tag{A1}$$
and
$$P' \cdot R \cdot M \cdot P = diag[\lambda_i(M)] \tag{A2}$$
with $\lambda_i(M)$ denoting the $i$-th eigenvalue of matrix $M$ and $diag[\lambda_i(M)]$ the diagonal matrix whose elements are the eigenvalues of matrix $M$ $(i = 1, \ldots, K)$.
Proof. 
According to the spectral theorem (see Axler 2014), there exists a non-singular matrix $O_R$ such that
$$O_R^{-1} \cdot R \cdot O_R = diag[\lambda_i(R)], \qquad O_R' = O_R^{-1}, \qquad 0 < \lambda_i(R), \; \forall i$$
with $\lambda_i(R)$ denoting the $i$-th eigenvalue of matrix $R$ and $diag[\lambda_i(R)]$ the diagonal matrix whose elements are the eigenvalues of matrix $R$ $(i = 1, \ldots, K)$.
It follows that:
$$diag\left[\tfrac{1}{\sqrt{\lambda_i(R)}}\right] \cdot O_R' \cdot R \cdot O_R \cdot diag\left[\tfrac{1}{\sqrt{\lambda_i(R)}}\right] = I$$
so that:
$$Q' \cdot R \cdot Q = I, \quad \text{with } Q \equiv O_R \cdot diag\left[\tfrac{1}{\sqrt{\lambda_i(R)}}\right]$$
We then consider the symmetric (positive definite) matrix $C$, defined as:
$$C \equiv R \cdot M$$
Consider in turn the matrix $Q' \cdot C \cdot Q$, which is symmetric. The spectral theorem ensures that there is a non-singular matrix $O_C$ that satisfies:
$$O_C^{-1} \cdot (Q' \cdot C \cdot Q) \cdot O_C = diag[\lambda_i(Q' \cdot C \cdot Q)], \quad \text{with } O_C^{-1} = O_C' \tag{A4}$$
Define the following matrix:
$$P \equiv Q \cdot O_C$$
(A4) ensures that
$$P' \cdot C \cdot P = diag[\lambda_i(Q' \cdot C \cdot Q)] \tag{A6}$$
We have:
$$P' \cdot R \cdot P = O_C' \cdot Q' \cdot R \cdot Q \cdot O_C = O_C' \cdot O_C = I$$
which leads to (A1).
Taking this result into account and referring to (A6), we see that:
$$diag[\lambda_i(Q' \cdot C \cdot Q)] = P' \cdot R \cdot M \cdot P = P' \cdot R \cdot (P \cdot P^{-1}) \cdot M \cdot P = (P' \cdot R \cdot P) \cdot P^{-1} \cdot M \cdot P = P^{-1} \cdot M \cdot P$$
This shows that $M$ can be diagonalized and that
$$diag[\lambda_i(M)] = diag[\lambda_i(Q' \cdot C \cdot Q)] = P' \cdot R \cdot M \cdot P$$
which is (A2). □
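The construction in the proof is concrete enough to be checked numerically. The sketch below (an illustration with randomly generated matrices) builds $Q$ from the eigendecomposition of $R$, then $P = Q \cdot O_C$, and verifies (A1) and (A2):

```python
import numpy as np

rng = np.random.default_rng(3)
K = 4

A = rng.normal(size=(K, K))
R = A @ A.T + K * np.eye(K)            # SPD
Bm = rng.normal(size=(K, K))
C = Bm @ Bm.T + K * np.eye(K)          # SPD; plays the role of R @ M
M = np.linalg.solve(R, C)              # so that R @ M = C is SPD

lam_R, O_R = np.linalg.eigh(R)
Q = O_R @ np.diag(1.0 / np.sqrt(lam_R))        # Q' R Q = I
lam_C, O_C = np.linalg.eigh(Q.T @ C @ Q)
P = Q @ O_C                                     # the matrix P of the lemma

I_check = P.T @ R @ P                  # should be the identity, as in (A1)
D_check = P.T @ R @ M @ P              # should be diag of eigenvalues of M, as in (A2)
eig_M = np.sort(np.linalg.eigvals(M).real)

print(np.round(D_check, 6))
```

Note that $M = R^{-1} \cdot C$ is not symmetric in general, yet its eigenvalues are real and strictly positive, exactly as the lemma asserts.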

Appendix A.2. Applications

We obtain the following properties and results:
  • We first note that $\Gamma \equiv (X_W' \cdot X_W)^{-1} - (\tilde{X}^{*\prime} \cdot \tilde{X}^*)^{-1}$ and $\tilde{\Gamma} \equiv (X_W' \cdot X_W)^{-1} - h \cdot (\tilde{X}^{*\prime} \cdot \tilde{X}^*)^{-1}$ are two symmetric (real valued) matrices. This comes from the definition of $(\tilde{X}^{*\prime} \cdot \tilde{X}^*)$, which is computed as the sum of two symmetric matrices: $(\tilde{X}^{*\prime} \cdot \tilde{X}^*) = (X_W' \cdot X_W) + \hat{\psi}^2 \cdot (X_{BC}' \cdot X_{BC})$.
  • We have23 $(\tilde{X}^{*\prime} \cdot \tilde{X}^*) \succ (X_W' \cdot X_W)$ and, in turn24, $(X_W' \cdot X_W)^{-1} \succ (\tilde{X}^{*\prime} \cdot \tilde{X}^*)^{-1}$. It follows that $\Gamma$ is a symmetric positive definite (SPD) matrix.
As defined in the main text, $H^* \equiv I_K + \hat{\psi}^2 \cdot (X_{BC}' \cdot X_{BC}) \cdot (X_W' \cdot X_W)^{-1}$, while $\tilde{X}^{*\prime} \cdot \tilde{X}^* = H^* \cdot (X_W' \cdot X_W)$. Then:
  • Note $R = (X_W' \cdot X_W)^{-1} \cdot H^{*-1}$ and $M = H^*$. By construction, $R$ and $(R \cdot M)$ are two real symmetric positive definite matrices (this is because $R \cdot M = (X_W' \cdot X_W)^{-1}$ and $R$ can be written as $R = [(X_W' \cdot X_W) + \hat{\psi}^2 \cdot (X_{BC}' \cdot X_{BC})]^{-1}$, which is the inverse of the sum of two symmetric positive definite matrices).
    We then deduce from Lemma A1 that $H^*$ can be diagonalized and that there exists a non-singular matrix $P$ such that:
    $$diag[\lambda_i(H^*)] = P' \cdot (X_W' \cdot X_W)^{-1} \cdot H^{*-1} \cdot H^* \cdot P = P' \cdot (X_W' \cdot X_W)^{-1} \cdot P$$
    As $[P' \cdot (X_W' \cdot X_W)^{-1} \cdot P]$ is SPD (given that $(X_W' \cdot X_W)^{-1}$ is itself SPD), this ensures that the spectrum of $H^*$ is composed only of strictly positive elements.
  • Let $R = (X_W' \cdot X_W)^{-1}$ and $M = H^{*-1}$. By construction, $R$ and $R \cdot M$ are again two real symmetric positive definite matrices. We deduce from Lemma A1 that $H^{*-1}$ can be diagonalized and that there exists a non-singular matrix $P$ such that:
    $$P' \cdot (X_W' \cdot X_W)^{-1} \cdot P = I, \qquad P' \cdot (X_W' \cdot X_W)^{-1} \cdot H^{*-1} \cdot P = diag[\lambda_i(H^{*-1})] = diag[\lambda_i^{-1}(H^*)]$$
  • Noticing that $\Gamma = (X_W' \cdot X_W)^{-1} \cdot (H^* - I_K) \cdot H^{*-1} = (X_W' \cdot X_W)^{-1} - (X_W' \cdot X_W)^{-1} \cdot H^{*-1}$, then, on the basis of the results above, we can write $P' \cdot \Gamma \cdot P$ as:
    $$P' \cdot \Gamma \cdot P = diag\left[1 - \frac{1}{\lambda_i(H^*)}\right]$$
    Then, let $x$ denote a non-null vector; setting $y \equiv P^{-1} \cdot x$, we obtain the spectral decomposition of $\Gamma$ as:
    $$x' \cdot \Gamma \cdot x = y' \cdot (P' \cdot \Gamma \cdot P) \cdot y = \sum_i \left(1 - \frac{1}{\lambda_i(H^*)}\right) \cdot y_i^2$$
    Since we know that $\Gamma$ is SPD, we obtain from the latter decomposition that $\forall i$, $1 - 1/\lambda_i(H^*) > 0$, which implies that $h_{min}^* > 1$.
  • Further, noticing that $\tilde{\Gamma} = (X_W' \cdot X_W)^{-1} \cdot (H^* - h \cdot I_K) \cdot H^{*-1} = (X_W' \cdot X_W)^{-1} - h \cdot (X_W' \cdot X_W)^{-1} \cdot H^{*-1}$ and proceeding as above, we can write $P' \cdot \tilde{\Gamma} \cdot P$ as:
    $$P' \cdot \tilde{\Gamma} \cdot P = diag\left[1 - \frac{h}{\lambda_i(H^*)}\right]$$
    Let $x$ denote a non-null vector; setting $y \equiv P^{-1} \cdot x$, we obtain the spectral decomposition of $\tilde{\Gamma}$ as:
    $$x' \cdot \tilde{\Gamma} \cdot x = y' \cdot (P' \cdot \tilde{\Gamma} \cdot P) \cdot y = \sum_i \left(1 - \frac{h}{\lambda_i(H^*)}\right) \cdot y_i^2$$
    From this spectral decomposition, we conclude that:
    $\tilde{\Gamma}$ will be SPD if and only if $\forall i$, $1 - h/\lambda_i(H^*) > 0$, that is, if and only if $h < h_{min}^*$ is fulfilled.
    $\tilde{\Gamma}$ will be SND if and only if $\forall i$, $1 - h/\lambda_i(H^*) < 0$, that is, if and only if $h > h_{max}^*$ is fulfilled.
We now use the previous results to compare $HMO_1$ and $HMO_2$. For that purpose, we must distinguish according to whether $HMO_2$ is (for sure) a positive or (for sure) a non-positive quadratic form. This, in turn, depends on whether $\tilde{\Gamma}$ is SPD or SND.
  • If $\tilde{\Gamma}$ is SPD, the relevant comparison can be built on the magnitude $HMO_1 - HMO_2$, which can be written as:
    $$HMO_1 - HMO_2 = \hat{\beta}_{\Delta WRE}' \cdot (\hat{\sigma}_{\varepsilon w}^2)^{-1} \cdot \left[\Gamma^{-1} - \tilde{\Gamma}^{-1}\right] \cdot \hat{\beta}_{\Delta WRE} = (\hat{\sigma}_{\varepsilon w}^2)^{-1} \cdot \hat{\beta}_{\Delta WRE}' \cdot \Xi \cdot \hat{\beta}_{\Delta WRE}$$
    with $\Xi \equiv \Gamma^{-1} - \tilde{\Gamma}^{-1}$.
    Thus, $HMO_1 \gtrless HMO_2$ according to whether $\Xi$ is SPD or SND. Given the definition of $\Xi$, and since $\tilde{\Gamma}$ is SPD, this, in turn, depends on whether $\tilde{\Gamma} - \Gamma$ is SPD or SND.
    Then, observe that $\tilde{\Gamma} - \Gamma = (1 - h) \cdot (X_W' \cdot X_W)^{-1} \cdot H^{*-1}$.
    As $(X_W' \cdot X_W)^{-1} \cdot H^{*-1}$ is SPD, it follows that whether $\tilde{\Gamma} - \Gamma$ is SPD or SND depends on whether $h \lessgtr 1$.
  • If $\tilde{\Gamma}$ is SND, the relevant comparison is between $HMO_1$ and $-HMO_2$, and can be built on $HMO_1 - (-HMO_2)$, which can be written as:
    $$HMO_1 - (-HMO_2) = \hat{\beta}_{\Delta WRE}' \cdot (\hat{\sigma}_{\varepsilon w}^2)^{-1} \cdot \left[\Gamma^{-1} - \tilde{\Gamma}^{*-1}\right] \cdot \hat{\beta}_{\Delta WRE} = (\hat{\sigma}_{\varepsilon w}^2)^{-1} \cdot \hat{\beta}_{\Delta WRE}' \cdot \Xi^* \cdot \hat{\beta}_{\Delta WRE}$$
    where $\tilde{\Gamma}^* = -\tilde{\Gamma}$ and $\Xi^* \equiv \Gamma^{-1} - \tilde{\Gamma}^{*-1}$.
    Thus, $HMO_1 \gtrless -HMO_2$ according to whether $\Xi^*$ is SPD or SND. Given the definition of $\Xi^*$, and as $\tilde{\Gamma}^*$ is SPD (since $\tilde{\Gamma}$ is SND), this, in turn, depends on whether $\Gamma - \tilde{\Gamma}^*$ is SND or SPD.
    Then, observe that $\Gamma - \tilde{\Gamma}^* = \Gamma + \tilde{\Gamma} = 2 \cdot (X_W' \cdot X_W)^{-1} - (1 + h) \cdot (X_W' \cdot X_W)^{-1} \cdot H^{*-1}$.
    Proceeding as for $\tilde{\Gamma}$ above, we can write $P' \cdot (\Gamma + \tilde{\Gamma}) \cdot P$ as:
    $$P' \cdot (\Gamma + \tilde{\Gamma}) \cdot P = diag\left[2 - \frac{1 + h}{\lambda_i(H^*)}\right]$$
    Denote by $x$ a non-null vector; setting $y \equiv P^{-1} \cdot x$, we obtain the spectral decomposition of $(\Gamma + \tilde{\Gamma})$ as:
    $$x' \cdot (\Gamma + \tilde{\Gamma}) \cdot x = y' \cdot (P' \cdot (\Gamma + \tilde{\Gamma}) \cdot P) \cdot y = \sum_i \left(2 - \frac{1 + h}{\lambda_i(H^*)}\right) \cdot y_i^2$$
    From this spectral decomposition, we conclude that:
    $(\Gamma + \tilde{\Gamma})$ will be SPD if and only if $\forall i$, $2 - (1 + h)/\lambda_i(H^*) > 0$, that is, if and only if $h < 2 \cdot h_{min}^* - 1$.
    Conversely, $(\Gamma + \tilde{\Gamma})$ will be SND if and only if $\forall i$, $2 - (1 + h)/\lambda_i(H^*) < 0$, that is, if and only if $h > 2 \cdot h_{max}^* - 1$.
Hence, we obtain the following table covering all cases:
Table A1. Review of all cases.

| Range of $h$ | Definiteness | Comparison |
|---|---|---|
| $h < h_{min}^*$ ($\tilde{\Gamma}$ is SPD and $HMO_2 > 0$) | | |
| $\quad 0 < h < 1 < h_{min}^*$ | $\tilde{\Gamma} \succ \Gamma$ | $HMO_1 > HMO_2$ |
| $\quad 1 < h < h_{min}^*$ | $\Gamma \succ \tilde{\Gamma}$ | $HMO_2 > HMO_1$ |
| $h_{min}^* < h < h_{max}^*$ ($\tilde{\Gamma}$ and sign of $HMO_2$ a priori indefinite) | | |
| $\quad 1 < h_{min}^* < h < h_{max}^*$ | $\tilde{\Gamma}$ indefinite | $HMO_1 \gtrless HMO_2$ |
| $h_{max}^* < h$ ($\tilde{\Gamma}$ is SND and $HMO_2 < 0$) | | |
| $\quad 1 < h_{max}^* < h < 2 \cdot h_{min}^* - 1$ | $\Gamma \succ -\tilde{\Gamma}$ | $-HMO_2 > HMO_1$ |
| $\quad 2 \cdot h_{min}^* - 1 < h < 2 \cdot h_{max}^* - 1$ | $\Gamma + \tilde{\Gamma}$ indefinite | $-HMO_2 \gtrless HMO_1$ |
| $\quad 2 \cdot h_{max}^* - 1 < h$ | $-\tilde{\Gamma} \succ \Gamma$ | $HMO_1 > -HMO_2$ |
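The thresholds in Table A1 can be checked numerically: since a congruence preserves the signature, the number of positive eigenvalues of $\tilde{\Gamma}$ equals the number of eigenvalues of $H^*$ exceeding $h$. The sketch below is an illustration with random within and between moment matrices and an arbitrarily chosen $\hat{\psi}^2$:

```python
import numpy as np

rng = np.random.default_rng(4)
K, psi2 = 3, 0.4

Aw = rng.normal(size=(K, K)); XWXW = Aw @ Aw.T + np.eye(K)   # X_W' X_W (SPD)
Ab = rng.normal(size=(K, K)); XBXB = Ab @ Ab.T + np.eye(K)   # X_BC' X_BC (SPD)

XtXt = XWXW + psi2 * XBXB                       # X~*' X~*
Hstar = np.eye(K) + psi2 * XBXB @ np.linalg.inv(XWXW)
lam = np.sort(np.linalg.eigvals(Hstar).real)    # real eigenvalues, all > 1
h_min, h_max = lam[0], lam[-1]

def gamma_tilde(h):
    # Gamma~ = (X_W' X_W)^{-1} - h (X~*' X~*)^{-1}, a symmetric matrix
    return np.linalg.inv(XWXW) - h * np.linalg.inv(XtXt)

eig_lo = np.linalg.eigvalsh(gamma_tilde(0.99 * h_min))            # h < h_min*
eig_mid = np.linalg.eigvalsh(gamma_tilde(0.5 * (h_min + h_max)))  # in between
eig_hi = np.linalg.eigvalsh(gamma_tilde(1.01 * h_max))            # h > h_max*

print(eig_lo.min() > 0, eig_mid.min() < 0 < eig_mid.max(), eig_hi.max() < 0)
```

As expected, $\tilde{\Gamma}$ is positive definite below $h_{min}^*$, indefinite between the two bounds, and negative definite above $h_{max}^*$.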

Appendix B. Relation between Within Residuals and Quasi-Demeaned Residuals

Preliminary Result: We first show that:
$$\hat{\tilde{u}}^{**} = \hat{u}_W + \hat{\psi} \cdot \hat{u}_B + G \cdot (\hat{\beta}_W - \hat{\beta}_B) \tag{A7}$$
Proof. 
We start with the definition of $\hat{\tilde{u}}^{**}$, the $NT$ vector of the OLS residuals in the FQDM regression model. We have:
$$\hat{\tilde{u}}^{**} = \tilde{y}^{**} - \tilde{\omega}^{**} \cdot \hat{\alpha}_{FGLS} - \tilde{X}^{**} \cdot \hat{\beta}_{FGLS} \tag{A8}$$
with $\tilde{\omega}^{**} \equiv \hat{\psi} \cdot \iota_{NT}$, $\tilde{X}^{**} \equiv \hat{\Omega}^{*-\frac{1}{2}} \cdot X$ and $\tilde{y}^{**} \equiv \hat{\Omega}^{*-\frac{1}{2}} \cdot y$. Using the definition $\hat{\Omega}^{*-\frac{1}{2}} \equiv W + \hat{\psi} \cdot B$, we can rewrite (A8) as:
$$\hat{\tilde{u}}^{**} = \hat{u}_W + \hat{\psi} \cdot \hat{u}_B + X_W \cdot (\hat{\beta}_W - \hat{\beta}_{FGLS}) + \hat{\psi} \cdot \iota_{NT} \cdot (\hat{\alpha}_B - \hat{\alpha}_{FGLS}) + \hat{\psi} \cdot X_B \cdot (\hat{\beta}_B - \hat{\beta}_{FGLS}) \tag{A9}$$
where $\hat{u}_B$ denotes the $NT$ vector of the OLS regression residuals for the Between (transformed) regression model; $X_B \equiv B \cdot X$, and $\hat{\beta}_B$ is the OLS estimator of $\beta$ in the Between transformed model, which can be expressed as $\hat{\beta}_B = (X_{BC}' \cdot X_{BC})^{-1} \cdot (X_{BC}' \cdot y)$, with $X_{BC} \equiv B_C \cdot X$ and (as a reminder) $B_C$ denoting the centered Between operator defined as $B_C \equiv B - B_{NT}$ with $B_{NT} \equiv \iota_{NT} \cdot (\iota_{NT}' \cdot \iota_{NT})^{-1} \cdot \iota_{NT}'$.
Then, noting that $\hat{\alpha}_J = (\iota_{NT}' \cdot \iota_{NT})^{-1} \cdot \iota_{NT}' \cdot (y - X \cdot \hat{\beta}_J)$ for $J = B, FGLS$, which leads to $\iota_{NT} \cdot (\hat{\alpha}_B - \hat{\alpha}_{FGLS}) = B_{NT} \cdot X \cdot (\hat{\beta}_{FGLS} - \hat{\beta}_B)$, and using the following relationship between $\hat{\beta}_W$, $\hat{\beta}_B$ and $\hat{\beta}_{FGLS}$:
$$\hat{\beta}_{FGLS} = \Pi \cdot \hat{\beta}_B + (I_K - \Pi) \cdot \hat{\beta}_W \tag{A10}$$
with $\Pi \equiv \hat{\psi}^2 \cdot [X' \cdot \hat{\Omega}_C^{*-1} \cdot X]^{-1} \cdot (X_{BC}' \cdot X_{BC}) = \hat{\psi}^2 \cdot [\tilde{X}^{*\prime} \cdot \tilde{X}^*]^{-1} \cdot (X_{BC}' \cdot X_{BC})$ and, as a reminder, $\hat{\Omega}_C^{*-1} \equiv (W + \hat{\psi}^2 \cdot B_C)$, we can rewrite Equation (A9) as:
$$\hat{\tilde{u}}^{**} = \hat{u}_W + \hat{\psi} \cdot \hat{u}_B + G \cdot (\hat{\beta}_W - \hat{\beta}_B) \tag{A11}$$
with:
$$G \equiv X_W \cdot \Pi - \hat{\psi} \cdot (B - B_{NT}) \cdot X \cdot (I_K - \Pi) = X_W \cdot \Pi - \hat{\psi} \cdot X_{BC} \cdot (I_K - \Pi)$$
Other results: Define $\Pi^*$ and $G^*$ such that $\Pi = \hat{\psi} \cdot \Pi^*$ and $G = \hat{\psi} \cdot G^*$. We have:
$$G^* = X_W \cdot \Pi^* - X_{BC} \cdot (I_K - \hat{\psi} \cdot \Pi^*)$$
and, replacing in (A11):
$$\hat{\tilde{u}}^{**} = \hat{u}_W + \hat{\psi} \cdot \left[\hat{u}_B + G^* \cdot (\hat{\beta}_W - \hat{\beta}_B)\right] \tag{A12}$$
Using the property according to which $B_C \cdot W = W \cdot B_C = 0$, we deduce from (A12), after some manipulations, the following relationship between the two residual sums of squares $\hat{u}_W' \cdot \hat{u}_W$ and $\hat{\tilde{u}}^{**\prime} \cdot \hat{\tilde{u}}^{**}$:
$$\hat{\tilde{u}}^{**\prime} \cdot \hat{\tilde{u}}^{**} = \hat{u}_W' \cdot \hat{u}_W + \hat{\psi}^2 \cdot \hat{u}_B' \cdot \hat{u}_B + \Delta^* \tag{A13}$$
which corresponds to the equation provided in the main text in Section 3.4.1 and where:
$$\Delta^* \equiv \hat{\psi}^2 \cdot (\hat{\beta}_W - \hat{\beta}_B)' \cdot \Sigma^* \cdot (\hat{\beta}_W - \hat{\beta}_B)$$
and25:
$$\Sigma^* \equiv G^{*\prime} \cdot G^* = (X_{BC}' \cdot X_{BC}) \cdot (\tilde{X}^{*\prime} \cdot \tilde{X}^*)^{-1} \cdot (X_W' \cdot X_W)$$
Finally, noting from (A10) that $\hat{\beta}_W - \hat{\beta}_B = \Pi^{-1} \cdot [\hat{\beta}_W - \hat{\beta}_{FGLS}]$, we can rewrite (A13) as:
$$\Delta^* = \hat{\psi}^2 \cdot (\hat{\beta}_W - \hat{\beta}_{FGLS})' \cdot (\Pi^{-1})' \cdot \Sigma^* \cdot \Pi^{-1} \cdot (\hat{\beta}_W - \hat{\beta}_{FGLS}) \tag{A14}$$
Define $\tilde{\Sigma}^* \equiv (\Pi^{-1})' \cdot \Sigma^* \cdot \Pi^{-1}$. Using the definitions of $\Sigma^*$ and $\Pi$, this can be written as:
$$\tilde{\Sigma}^* = (\hat{\psi}^4)^{-1} \cdot (X_W' \cdot X_W) \cdot (X_{BC}' \cdot X_{BC})^{-1} \cdot (\tilde{X}^{*\prime} \cdot \tilde{X}^*) = (\hat{\psi}^2)^{-1} \cdot \Gamma^{-1}$$
with $\Gamma^{-1}$ being written as $\Gamma^{-1} = (\hat{\psi}^2)^{-1} \cdot (X_W' \cdot X_W) \cdot (X_{BC}' \cdot X_{BC})^{-1} \cdot (\tilde{X}^{*\prime} \cdot \tilde{X}^*)$.
Replacing in (A14), we have:
$$\Delta^* = (\hat{\beta}_W - \hat{\beta}_{FGLS})' \cdot \Gamma^{-1} \cdot (\hat{\beta}_W - \hat{\beta}_{FGLS})$$
which is the expression provided in the main text for $\Delta^*$.
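The chain of identities above is purely algebraic and holds for any value of $\hat{\psi} > 0$, which makes it easy to verify numerically. The sketch below is an illustration on a simulated single-regressor panel with an arbitrarily fixed $\hat{\psi}$; it checks (A13) together with the final expression for $\Delta^*$:

```python
import numpy as np

rng = np.random.default_rng(5)
N, T, psi = 20, 6, 0.6       # psi fixed arbitrarily; the identity holds for any psi > 0

alpha = rng.normal(size=N)
x = (alpha[:, None] + rng.normal(size=(N, T))).ravel()
y = 1.0 + x + 0.8 * np.repeat(alpha, T) + rng.normal(size=N * T)

xbar, ybar = x.reshape(N, T).mean(1), y.reshape(N, T).mean(1)
xw, yw = x - np.repeat(xbar, T), y - np.repeat(ybar, T)
xb, yb = xbar - xbar.mean(), ybar - ybar.mean()

# Within and Between OLS, with NT-metric residual sums of squares
beta_w = (xw @ yw) / (xw @ xw)
rss_w = ((yw - beta_w * xw) ** 2).sum()
beta_b = (xb @ yb) / (xb @ xb)
rss_b = T * ((yb - beta_b * xb) ** 2).sum()          # u_B' u_B

# Quasi-demeaned (FQDM) OLS with the same psi
xs = x - (1 - psi) * np.repeat(xbar, T)
ys = y - (1 - psi) * np.repeat(ybar, T)
Z = np.column_stack([np.ones(N * T), xs])
coef, *_ = np.linalg.lstsq(Z, ys, rcond=None)
beta_fgls = coef[1]
rss_star = ((ys - Z @ coef) ** 2).sum()              # u~**' u~**

# Delta* via Gamma^{-1} and the Within-FGLS contrast (K = 1)
w, b = xw @ xw, T * (xb @ xb)
s = w + psi ** 2 * b
gamma_inv = w * s / (psi ** 2 * b)
delta = (beta_w - beta_fgls) ** 2 * gamma_inv

print(rss_star, rss_w + psi ** 2 * rss_b + delta)
```

The two printed numbers coincide up to numerical precision, confirming the residual sum-of-squares decomposition.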

Appendix C. Simulations in the Single Regressor Case ( K = 1 )

In the simulations for $K = 1$, we focus on the role played by the structure in $X$ and in $u$ on the values taken by $h$ and $h^*$, as well as by the Hausman statistics $HMO_1$ and $HMO_2$. Those structural features are captured, respectively, by $s_x^2$ and $\theta_W$ on the one hand, and by $\sigma_u^2$ and $\rho_u$ on the other hand, where $\sigma_u^2 \equiv \sigma_\alpha^2 + \sigma_\varepsilon^2$ and $\rho_u \equiv \sigma_\alpha^2 / \sigma_u^2$ ($\rho_u$ is called the intra-class correlation coefficient in the variance component literature). We also let the degree of correlation between the regressor and the (composite) error term on the cross-sectional dimension, which we call $\rho_{xu}$, vary across the experiments.
Accordingly, for a given value of $\{s_x^2; \theta_W; \sigma_u^2; \rho_u; \rho_{xu}\}$, as well as for a given size of the sample ($N$ and $T$ given), we generate (with replications) one series for $u = (u_{it})$, one for $x = (x_{it})$ and, consequently, one for $y = (y_{it})$ $(i = 1, \ldots, N; \; t = 1, \ldots, T)$, on the basis of which the different estimation procedures and the computation of the statistics of interest can be implemented.

Appendix C.1. Generating u it and x it

Each series for x and u is built such that the (sample) values of their total variances and Within variance shares in the total variance correspond to the ones that we set beforehand. To compute those series we adopt Nerlove’s approach (Nerlove 1971) and proceed in two steps.
  • First, we draw $N$ pairs for the random vector $Z_b \equiv (Z_b^x \; Z_b^u)'$ from the bivariate normal distribution $N(0, \Sigma_{Z_b})$ with:
$$\Sigma_{Z_b} \equiv \begin{pmatrix} \sigma_{Z_b^x}^2 & \sigma_{Z_b^{xu}} \\ \sigma_{Z_b^{xu}} & \sigma_{Z_b^u}^2 \end{pmatrix}, \quad \sigma_{Z_b^{xu}} = \rho_{xu} \cdot \sqrt{\sigma_{Z_b^x}^2 \cdot \sigma_{Z_b^u}^2}, \quad \sigma_{Z_b^x}^2 = (1 - \theta_W) \cdot \sigma_X^2 \quad \text{and} \quad \sigma_{Z_b^u}^2 = \rho_u \cdot \sigma_u^2$$
  • Second, for each $i = 1, \ldots, N$, we draw $T$ pairs for the random vector $Z_w \equiv (Z_w^x \; Z_w^u)'$ from the bivariate normal distribution $N(0, \Sigma_{Z_w})$ with:
$$\Sigma_{Z_w} \equiv \begin{pmatrix} \sigma_{Z_w^x}^2 & 0 \\ 0 & \sigma_{Z_w^u}^2 \end{pmatrix}, \quad \sigma_{Z_w^x}^2 = \theta_W \cdot \sigma_X^2 \quad \text{and} \quad \sigma_{Z_w^u}^2 = (1 - \rho_u) \cdot \sigma_u^2$$
Then, the following variables are built:
$$x_{Bi} = \sqrt{\sigma_{Z_b^x}^2} \cdot \frac{z_{bi}^x - \bar{z}_b^x}{\sqrt{var(z_b^x)}}; \quad u_{Bi} = \sqrt{\sigma_{Z_b^u}^2} \cdot \frac{z_{bi}^u - \bar{z}_b^u}{\sqrt{var(z_b^u)}}; \quad x_{Wit} = \sqrt{\sigma_{Z_w^x}^2} \cdot \frac{z_{wit}^x - \bar{z}_{wi}^x}{\sqrt{var(z_{wi}^x)}}; \quad u_{Wit} = \sqrt{\sigma_{Z_w^u}^2} \cdot \frac{z_{wit}^u - \bar{z}_{wi}^u}{\sqrt{var(z_{wi}^u)}}, \quad i = 1, \ldots, N$$
with, for $j = x, u$: $\bar{z}_b^j = \frac{1}{N} \cdot \sum_{i=1}^{N} z_{bi}^j$; $\bar{z}_{wi}^j = \frac{1}{T} \cdot \sum_{t=1}^{T} z_{wit}^j$; $var(z_b^j) = \frac{1}{N} \cdot \sum_{i=1}^{N} (z_{bi}^j - \bar{z}_b^j)^2$; $var(z_{wi}^j) = \frac{1}{T} \cdot \sum_{t=1}^{T} (z_{wit}^j - \bar{z}_{wi}^j)^2$.
Finally, we compute $x_{it}$ as $x_{it} = x_{Bi} + x_{Wit}$ and $u_{it}$ as $u_{it} = u_{Bi} + u_{Wit}$.
This design ensures that $s_x^2 = \sigma_X^2$, $s_u^2 = \sigma_u^2$, $\theta_W = x_W^2 / x_T^2$ and $\rho_u = 1 - \frac{u' \cdot W \cdot u}{u' \cdot u}$, with $s_x^2$ (resp. $s_u^2$) denoting the sample variance of $x$ (resp. $u$).
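A compact implementation of this two-step design is sketched below (variable names are ours). By construction, the realized sample variance of $x$, its Within share, and the intra-class coefficient of $u$ match the targets exactly, while the correlation $\rho_{xu}$ is matched only in expectation:

```python
import numpy as np

def gen_panel(N, T, s2x, theta_w, s2u, rho_u, rho_xu, seed=0):
    rng = np.random.default_rng(seed)
    # Step 1: between draws (one pair per individual), correlated across x and u
    cov = rho_xu * np.sqrt((1 - theta_w) * s2x * rho_u * s2u)
    Sb = np.array([[(1 - theta_w) * s2x, cov], [cov, rho_u * s2u]])
    zb = rng.multivariate_normal(np.zeros(2), Sb, size=N)
    # Step 2: within draws (T per individual), uncorrelated
    zwx, zwu = rng.normal(size=(N, T)), rng.normal(size=(N, T))

    def scale(z, target):
        # recentre and rescale so that the sample variance equals the target exactly
        z = z - z.mean()
        return np.sqrt(target) * z / z.std()

    xB = scale(zb[:, 0], (1 - theta_w) * s2x)
    uB = scale(zb[:, 1], rho_u * s2u)
    xW = np.vstack([scale(zwx[i], theta_w * s2x) for i in range(N)])
    uW = np.vstack([scale(zwu[i], (1 - rho_u) * s2u) for i in range(N)])
    return xB[:, None] + xW, uB[:, None] + uW

x, u = gen_panel(N=40, T=20, s2x=1.0, theta_w=0.7, s2u=2.0, rho_u=0.5, rho_xu=0.6)
print(x.var(), (x - x.mean(1, keepdims=True)).var() / x.var())
```

Rescaling each within block to its exact target variance is what makes the realized $\theta_W$ and $\rho_u$ coincide with the parameters set beforehand, rather than only in expectation.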

Appendix C.2. Shaping the Experiment

We generate the different series for y i t i = 1 , , N t = 1 , , T according to the following relationship:
$$y_{it} = \alpha + \beta \cdot x_{it} + u_{it}$$
In the simulations, we set $\alpha = \beta = 1$. We let the other parameters of the simulation vary: $N = (20, 40, 80)$, $T = (20, 40, 80)$, $s_x^2 = (0.01, 1, 100)$, $\theta_W = (0.1, 0.5, 0.9)$, $\sigma_u^2 = (0.01, 1, 100)$, $\rho_{xu} = (-0.99, -0.9, -0.5, 0, 0.5, 0.9, 0.99)$ and $\rho_u = (0.1, 0.5, 0.9)$.

Appendix C.3. ANCOVA Results

Table A2. ANCOVA on simulation results.
Table A2. ANCOVA on simulation results.
Dependent variable:

| | Mean $HM_O^1$ (1) | Mean $HM_O^2$ (2) | Median $HM_O^1$ (3) | Median $HM_O^2$ (4) |
|---|---|---|---|---|
| $N = 40$ (ref = 20) | −24.158 | 134.826 ** | −24.132 | 136.294 ** |
| | (37.475) | (56.029) | (36.677) | (55.435) |
| $N = 80$ (ref = 20) | −72.354 * | 405.306 *** | −72.281 ** | 409.057 *** |
| | (37.475) | (56.029) | (36.677) | (55.435) |
| $T = 40$ (ref = 20) | 63.206 *** | 111.042 *** | 60.953 *** | 110.473 *** |
| | (16.759) | (25.057) | (16.402) | (24.791) |
| $T = 80$ (ref = 20) | 126.892 *** | 306.754 *** | 122.040 *** | 303.316 *** |
| | (16.759) | (25.057) | (16.402) | (24.791) |
| $\sigma_X^2 = 1$ (ref = 0.01) | 0.039 | −1.105 | 0.322 | −0.300 |
| | (16.759) | (25.057) | (16.402) | (24.791) |
| $\sigma_X^2 = 100$ (ref = 0.01) | 0.277 | 2.784 | 0.803 | −0.171 |
| | (16.759) | (25.057) | (16.402) | (24.791) |
| $\theta_W = 0.5$ (ref = 0.1) | 90.452 *** | −18.134 | 83.656 *** | −16.453 |
| | (29.028) | (43.400) | (28.410) | (42.940) |
| $\theta_W = 0.9$ (ref = 0.1) | 184.362 *** | −35.323 | 167.161 *** | −30.374 |
| | (29.028) | (43.400) | (28.410) | (42.940) |
| $\rho_{xu} = -0.99$ (ref = 0) | 1083.156 *** | 640.633 *** | 1057.507 *** | 643.296 *** |
| | (25.600) | (38.275) | (25.055) | (37.869) |
| $\rho_{xu} = -0.9$ (ref = 0) | 147.213 *** | 214.549 *** | 141.792 *** | 202.157 *** |
| | (25.600) | (38.275) | (25.055) | (37.869) |
| $\rho_{xu} = -0.5$ (ref = 0) | 13.481 | 14.882 | 12.514 | 5.921 |
| | (25.600) | (38.275) | (25.055) | (37.869) |
| $\rho_{xu} = 0.5$ (ref = 0) | 13.436 | 3.664 | 12.469 | 5.878 |
| | (25.600) | (38.275) | (25.055) | (37.869) |
| $\rho_{xu} = 0.9$ (ref = 0) | 147.237 *** | 215.027 *** | 141.974 *** | 201.985 *** |
| | (25.600) | (38.275) | (25.055) | (37.869) |
| $\rho_{xu} = 0.99$ (ref = 0) | 1083.935 *** | 641.078 *** | 1058.341 *** | 643.762 *** |
| | (25.600) | (38.275) | (25.055) | (37.869) |
| $\sigma_u^2 = 1$ (ref = 0.01) | −0.265 | 3.723 | 0.343 | 0.056 |
| | (16.759) | (25.057) | (16.402) | (24.791) |
| $\sigma_u^2 = 100$ (ref = 0.01) | 0.400 | 0.780 | 1.234 | −0.119 |
| | (16.759) | (25.057) | (16.402) | (24.791) |
| $\rho_u = 0.5$ (ref = 0.1) | 90.544 *** | −20.710 | 83.885 *** | −15.796 |
| | (29.028) | (43.400) | (28.410) | (42.940) |
| $\rho_u = 0.9$ (ref = 0.1) | 184.478 *** | −31.878 | 167.171 *** | −29.092 |
| | (29.028) | (43.400) | (28.410) | (42.940) |
| $N = 40 : \theta_W = 0.5$ | 86.977 ** | −16.080 | 87.205 ** | −15.864 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 80 : \theta_W = 0.5$ | 261.702 *** | −41.165 | 262.250 *** | −48.630 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 40 : \theta_W = 0.9$ | 175.791 *** | −27.656 | 176.433 *** | −30.434 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 80 : \theta_W = 0.9$ | 525.795 *** | −89.808 | 526.255 *** | −92.123 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 40 : \rho_u = 0.5$ | 86.826 ** | −13.059 | 86.837 ** | −16.015 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 80 : \rho_u = 0.5$ | 261.064 *** | −45.820 | 261.054 *** | −48.512 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 40 : \rho_u = 0.9$ | 175.736 *** | −30.548 | 176.043 *** | −31.087 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| $N = 80 : \rho_u = 0.9$ | 526.388 *** | −84.972 | 526.704 *** | −92.531 |
| | (41.052) | (61.376) | (40.177) | (60.726) |
| Constant | −446.938 *** | −246.347 *** | −430.229 *** | −246.886 *** |
| | (35.552) | (53.154) | (34.795) | (52.590) |
| Observations | 5103 | 5103 | 5103 | 5103 |
| R² | 0.580 | 0.165 | 0.579 | 0.169 |
| Adjusted R² | 0.578 | 0.161 | 0.577 | 0.165 |
| Residual Std. Error (df = 5076) | 488.758 | 730.740 | 478.346 | 722.994 |
| F Statistic (df = 26; 5076) | 269.238 *** | 38.691 *** | 268.675 *** | 39.743 *** |

Note: * p < 0.1; ** p < 0.05; *** p < 0.01.
Figure A1. Distribution of the proportion of negative values of $HM_O^2$, $N = 80$, $T = 80$, $s_x^2 = \sigma_u^2 = 1$.
Figure A2. Power of $HM_O^2$, $T = 80$, $s_x^2 = \sigma_u^2 = 1$.

Appendix D. Regression Outcomes in the Single Regressor Case for the Motivating Example 2 [Airline]

Table A3. Estimation results for Motivating example 2, Airline, single covariate.

| Specification | | Intercept | log fuelprice |
|---|---|---|---|
| Within | Coef. | | 0.7785 |
| | Std Err. | | 0.0279 |
| Between | Coef. | 476.260 | −36.248 |
| | Std Err. | 136.439 | 10.684 |
| Random effects | Coef. | 3.4264 | 0.7783 |
| | Std Err. 1 | 0.4245 | 0.0279 |
| | Std Err. 2 | 0.4502 | 0.0296 |

$HM_O^1$ = 12.0100; $HM_O^2$ = 0.0006; $\hat{\sigma}^2_{\varepsilon_w}$ = 0.0456; $\hat{\sigma}^2_{\varepsilon^*}$ = 0.0513; $\hat{\psi}^2$ = 0.0095; $h$ = 1.1251; $h^*$ = 1.0000; $2 \cdot h^* - 1$ = 1.0000.

| | log fuelprice |
|---|---|
| Within variance | 0.6521 |
| Between variance | 0.0005 |
| Within variance/Total variance (in %), $\theta_W$ | 99.9282 |
| Between variance/Total variance (in %), $\theta_B$ | 0.0718 |

Notes

1. Schreiber (2008), based on Holly (1982), examines cases in which this problem can arise even asymptotically, when the alternative hypothesis is true.
2. In particular, this issue is generally not addressed in the leading textbooks in panel data econometrics. An exception is Wooldridge (2010, chap. 10, pp. 289–90), but he merely mentions the possibility of obtaining a non-positive definite covariance matrix if different estimates of the error term variance are used, suggesting a way out, which we further discuss below.
3. See Nerlove (1971) and Fuller and Battese (1973) for an original exposition of the transformed, quasi-demeaned model and the related approach.
4. Schreiber (2008) also considers a panel data framework as an illustration for the asymptotic results and highlights cases where the matrix is not SPD in a context where the error term variance estimates differ. He does not, however, focus on the comparison of the different estimation approaches in the random effects model and their implications for the computation of the Hausman test statistic, as we do.
5. We assume in what follows that there is no time-invariant regressor in $X_{it}$, so that it is possible to compute the $\beta$-estimator with the Within transformation of $X_{it}$ (see below).
6. See the initial study by Baltagi and Griffin (1983). The dataset is available at: https://www.wiley.com/legacy/wileychi/baltagi/datasets.html, accessed on 2 May 2023.
7. Interestingly, in an updated version of his textbook, Baltagi (2021) modified the presentation of the Gasoline case study compared to the one provided in 2005 and presented here. In this update, there is no longer a disconnect between the two versions of the statistic, and only one version is considered, $HM_O^1$. While, in both presentations, the estimations are drawn from the STATA software package, the second presentation benefits from the use of the sigmaless option, which fixes the computation of the estimator for the idiosyncratic component of the error term. See infra, Section 4.2.
8. The original study is Greene (1999). The dataset is available at: http://pages.stern.nyu.edu/~wgreene/Text/tables/tablelist5.htm, accessed on 2 May 2023.
9. The dataset is available at: http://pages.stern.nyu.edu/~wgreene/Text/Edition7/tablelist8new.htm, accessed on 2 May 2023.
10. A typical $it$-observation for $y^*$ is given by $y^*_{it} = y_{it} - \hat{\theta} \cdot \bar{y}_{i.}$, where $\hat{\theta} \equiv 1 - \hat{\psi}$ and $\bar{y}_{i.} \equiv \frac{1}{T} \sum_{t=1}^{T} y_{it}$. A similar transformation is applied to each of the components of $X$; hence the "quasi-demeaning" label for the transformed model obtained in that way.
11. Indeed, $\hat{\beta}_{OLS} = \left( X^{**\prime} (I_{NT} - B_{NT}) X^{**} \right)^{-1} \left( X^{**\prime} (I_{NT} - B_{NT}) y^{**} \right)$, which is equivalent to (4) given the definition of $X^{**}$, $y^{**}$ and $\Omega_C^{*-1}$.
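The equivalence invoked in notes 10 and 11 — OLS on the quasi-demeaned data reproduces the GLS estimator exactly when $\hat{\theta} = 1 - \hat{\psi}$ with $\hat{\psi}^2 = \hat{\sigma}^2_\varepsilon / (\hat{\sigma}^2_\varepsilon + T \hat{\sigma}^2_\alpha)$ — can be checked numerically. The sketch below is our own (variable names and the toy variance components are assumptions, not the paper's notation):

```python
import numpy as np

rng = np.random.default_rng(0)
N, T, K = 30, 6, 2
sig2_eps, sig2_alpha = 1.0, 2.0          # assumed variance components

# Arbitrary balanced panel with an intercept column
X = np.column_stack([np.ones(N * T), rng.normal(size=(N * T, K))])
y = rng.normal(size=N * T)
groups = np.repeat(np.arange(N), T)      # individual index of each row

# GLS with the random-effects covariance: Omega_i = sig2_eps*I_T + sig2_alpha*J_T
J = np.ones((T, T))
Omega_i = sig2_eps * np.eye(T) + sig2_alpha * J
Omega_inv = np.kron(np.eye(N), np.linalg.inv(Omega_i))
beta_gls = np.linalg.solve(X.T @ Omega_inv @ X, X.T @ Omega_inv @ y)

# Quasi-demeaning with theta = 1 - psi, psi^2 = sig2_eps/(sig2_eps + T*sig2_alpha)
psi = np.sqrt(sig2_eps / (sig2_eps + T * sig2_alpha))
theta = 1 - psi

def qd(v):
    """Subtract theta times the individual mean from each observation."""
    means = np.array([v[groups == i].mean(axis=0) for i in range(N)])
    return v - theta * means[groups]

# OLS on the transformed data coincides with GLS (up to floating point)
beta_ols = np.linalg.lstsq(qd(X), qd(y), rcond=None)[0]
```

The two estimates agree to machine precision, since the per-individual transformation matrix $P_i = I_T - \theta B_T$ satisfies $P_i' P_i = \sigma^2_\varepsilon \, \Omega_i^{-1}$.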
12. Given the definition of those matrices, this replacement relies on the use of consistent estimators for the variance components, i.e., $\sigma^2_\varepsilon$, $\sigma^2_\alpha$ and/or $\sigma^2_{\alpha\varepsilon}$. For a discussion of these variance component estimators, see, among others, Amemiya (1971); Fuller and Battese (1974); Maddala (1971); Nerlove (1971); Swamy and Arora (1972); Wallace and Hussain (1969).
13. In particular, we assume that $\hat{\Omega}^*$ (resp. $\hat{\Omega}^*_C$) is a consistent estimator for $\Omega^*$ (resp. $\Omega^*_C$). See Wooldridge (2010) for a discussion of these conditions.
14. The estimator for $\sigma^2_{\alpha\varepsilon}$ is usually computed from the sum of squares of the OLS residuals, $\hat{u}_B' \hat{u}_B$, of the Between (transformed) regression model, with $\hat{\sigma}^2_{\alpha\varepsilon} = \frac{\hat{u}_B' \hat{u}_B}{N - (K + 1)}$ (Swamy–Arora approach); see infra.
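A toy version of this variance-components computation can be sketched as follows (our own illustration; the unscaled-means convention for the Between regression, and hence the interpretation of the estimated quantity as $\sigma^2_\alpha + \sigma^2_\varepsilon / T$, is an assumption of this sketch):

```python
import numpy as np

rng = np.random.default_rng(1)
N, T, K = 40, 8, 2

# Toy panel: slopes equal to 1, unit-variance individual effect and noise
x = rng.normal(size=(N, T, K))
alpha_i = rng.normal(size=N)
y = x @ np.ones(K) + alpha_i[:, None] + rng.normal(size=(N, T))

# Within regression -> sigma2_eps from within residuals, df = N(T-1) - K
xw = (x - x.mean(axis=1, keepdims=True)).reshape(N * T, K)
yw = (y - y.mean(axis=1, keepdims=True)).ravel()
bw = np.linalg.lstsq(xw, yw, rcond=None)[0]
sig2_eps = ((yw - xw @ bw) ** 2).sum() / (N * (T - 1) - K)

# Between regression on individual means (with intercept), df = N - (K+1)
xb = np.column_stack([np.ones(N), x.mean(axis=1)])
yb = y.mean(axis=1)
bb = np.linalg.lstsq(xb, yb, rcond=None)[0]
sig2_ae = ((yb - xb @ bb) ** 2).sum() / (N - (K + 1))   # ~ sigma2_alpha + sigma2_eps/T

# Implied individual-effect variance, truncated at zero as usual
sig2_alpha = max(sig2_ae - sig2_eps / T, 0.0)
```

With both true variances equal to 1, the two estimates land near 1 and $1 + 1/T$ respectively; the truncation at zero is what software packages apply when the implied $\hat{\sigma}^2_\alpha$ turns negative.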
15. It can easily be checked that the two approaches give rise to the same (RE) estimators for the parameters $\alpha$ and $\beta$.
16. The emphasis is ours. The notation used by Hausman for the fixed-effects estimate of $\beta$ ($\hat{\beta}_{FE}$) corresponds to our $\hat{\beta}_W$.
17. The intra-class coefficient drives the value of $\psi^2$. It can be shown, indeed, that $\psi^2 = \frac{1 - \rho_u}{1 + (T - 1) \cdot \rho_u}$.
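The intra-class expression in note 17 is an algebraic rewriting of the usual variance-components formula $\psi^2 = \sigma^2_\varepsilon / (\sigma^2_\varepsilon + T \cdot \sigma^2_\alpha)$ under $\rho_u = \sigma^2_\alpha / (\sigma^2_\alpha + \sigma^2_\varepsilon)$; a quick numerical sweep confirms the identity (our own check):

```python
import numpy as np

# psi^2 written with variance components vs. with the intra-class coefficient
for T in (5, 15, 50):
    for s2_eps, s2_alpha in [(1.0, 0.5), (0.3, 2.0), (4.0, 0.1)]:
        rho_u = s2_alpha / (s2_alpha + s2_eps)          # intra-class coefficient
        psi2_vc = s2_eps / (s2_eps + T * s2_alpha)      # variance-components form
        psi2_rho = (1 - rho_u) / (1 + (T - 1) * rho_u)  # intra-class form (note 17)
        assert np.isclose(psi2_vc, psi2_rho)
```

Both forms shrink toward zero as $T$ or $\rho_u$ grows, which is why the quasi-demeaning weight $\hat{\theta} = 1 - \hat{\psi}$ approaches the full Within transformation in long panels with strong individual effects.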
18. Note that this estimator is also used to compute $\hat{\psi}^2$ and, in turn, $\hat{\Omega}^*_C$, which makes it fully consistent with the FGLS regression model framework.
19. This is, e.g., recommended by Cameron and Trivedi (2009, p. 360).
20. We thank both referees for highlighting those approaches and suggesting that we account for them in this discussion subsection.
21. In this case, indeed, $\hat{\gamma} = \hat{q}^{**}$.
22. Baltagi and Liu (2007) show that the Hausman test can be obtained equivalently from other artificial regressions, involving the use of the set of Between-transformed regressor variables, $X_B$, or even the set of the initial regressor variables, $X$. They also discuss the case where the auxiliary regression can accommodate the presence of potentially endogenous regressors. With respect to the issue of weak instruments in this context, see also Staiger and Stock (1997).
23. Take two symmetric matrices $A$ and $B$. We denote by $A \succ B$ the property according to which $A - B$ is a positive definite matrix (which we can also write as $A - B \succ 0$).
24. If $A$ and $B$ are two symmetric positive definite (hence non-singular) matrices, then $A \succ B \Rightarrow B^{-1} \succ A^{-1}$.
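The inversion rule in note 24 — matrix inversion reverses the positive-definite ordering — is easy to verify numerically on an arbitrary SPD pair (our own example):

```python
import numpy as np

def is_spd(M):
    """Symmetric positive definite: all eigenvalues of the symmetrized M positive."""
    return bool(np.all(np.linalg.eigvalsh((M + M.T) / 2) > 0))

rng = np.random.default_rng(3)
# Build B SPD, then A = B + C with C SPD, so that A - B is positive definite
Q = rng.normal(size=(4, 4))
B = Q @ Q.T + 4 * np.eye(4)
R = rng.normal(size=(4, 4))
A = B + R @ R.T + np.eye(4)

assert is_spd(A - B)                              # A > B in the Loewner order
assert is_spd(np.linalg.inv(B) - np.linalg.inv(A))  # hence B^{-1} > A^{-1}
```

This ordering is exactly what connects the two covariance matrices underlying $HM_O^1$ and $HM_O^2$: whichever matrix dominates produces the smaller quadratic form, and hence the smaller test statistic.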
25. As a reminder, note that $\tilde{X}^{*\prime} \tilde{X}^{*} = X_W' X_W + \hat{\psi}^2 \cdot X_{BC}' X_{BC}$.

References

  1. Alvarez, Inmaculada C., Javier Barbero, and José L. Zofío. 2017. A Panel Data Toolbox for MATLAB. Journal of Statistical Software 76: 1–17. [Google Scholar] [CrossRef]
  2. Amemiya, Takeshi. 1971. The Estimation of the Variances in a Variance-Components Model. International Economic Review 12: 1–13. [Google Scholar] [CrossRef]
  3. Arellano, Manuel. 1993. On the testing of correlated effects with panel data. Journal of Econometrics 59: 87–97. [Google Scholar] [CrossRef]
  4. Axler, Sheldon. 2014. Linear Algebra Done Right. New York: Springer. [Google Scholar]
  5. Baltagi, Badi H. 2005. Econometric Analysis of Panel Data, 3rd ed. Chichester and Hoboken: J. Wiley & Sons. [Google Scholar]
  6. Baltagi, Badi H. 2021. Econometric Analysis of Panel Data, 6th ed. Springer texts in Business and Economics. Cham: Springer. [Google Scholar] [CrossRef]
  7. Baltagi, Badi H., and James M. Griffin. 1983. Gasoline demand in the OECD: An application of pooling and testing procedures. European Economic Review 22: 117–37. [Google Scholar] [CrossRef]
  8. Baltagi, Badi H., and Long Liu. 2007. Alternative ways of obtaining Hausman’s test using artificial regressions. Statistics & Probability Letters 77: 1413–17. [Google Scholar] [CrossRef]
  9. Baum, Christopher F., Mark E. Schaffer, and Steven Stillman. 2003. Instrumental Variables and GMM: Estimation and Testing. The Stata Journal: Promoting Communications on Statistics and Stata 3: 1–31. [Google Scholar] [CrossRef]
  10. Cameron, Adrian Colin, and Pravin K. Trivedi. 2009. Microeconometrics Using Stata. College Station: Stata Press. [Google Scholar]
  11. Cornwell, Christopher, and Peter Rupert. 1988. Efficient estimation with panel data: An empirical comparison of instrumental variable estimators. Journal of Applied Econometrics 3: 149–55. [Google Scholar]
  12. Croissant, Yves, and Giovanni Millo. 2008. Panel Data Econometrics in R: The plm Package. Journal of Statistical Software 27: 1–43. [Google Scholar] [CrossRef]
  13. Fuller, Wayne A., and George E. Battese. 1973. Transformations for Estimation of Linear Models with Nested-Error Structure. Journal of the American Statistical Association 68: 626–32. [Google Scholar] [CrossRef]
  14. Fuller, Wayne A., and George E. Battese. 1974. Estimation of linear models with crossed-error structure. Journal of Econometrics 2: 67–78. [Google Scholar] [CrossRef]
  15. Greene, William. 1999. Frontier Production Functions. In Handbook of Applied Econometrics Volume II: Microeconomics. Edited by Pesaran M. Hashem and Schmidt Peter. Oxford: Blackwell Publishing Ltd., pp. 75–153. [Google Scholar] [CrossRef]
  16. Greene, William. 2000. Econometric Analysis, 4th ed. Upper Saddle River: Prentice Hall Internat. [Google Scholar]
  17. Greene, William. 2012. Econometric Analysis, 7th ed. Pearson Series in Economics; Boston and Munich: Pearson. [Google Scholar]
  18. Hausman, Jerry A. 1978. Specification Tests in Econometrics. Econometrica 46: 1251–71. [Google Scholar] [CrossRef]
  19. Hausman, Jerry A., and William E. Taylor. 1981. Panel data and unobservable individual effects. Econometrica 49: 1377–98. [Google Scholar]
  20. Hayashi, Fumio. 2000. Econometrics. Princeton: Princeton University Press. [Google Scholar]
  21. Holly, Alberto. 1982. A Remark on Hausman’s Specification Test. Econometrica 50: 749–60. [Google Scholar] [CrossRef]
  22. Maddala, Gangadharrao Soundalyara. 1971. The Use of Variance Components Models in Pooling Cross-Section and Time Series Data. Econometrica 39: 341–57. [Google Scholar]
  23. Mundlak, Yair. 1978. On the pooling of time series and cross section data. Econometrica 46: 69–85. [Google Scholar]
  24. Nerlove, Marc. 1971. A Note on Error Components Models. Econometrica 39: 383–96. [Google Scholar] [CrossRef]
  25. Schreiber, Sven. 2008. The Hausman Test Statistic Can Be Negative even Asymptotically. Jahrbücher für Nationalökonomie und Statistik 228: 394–405. [Google Scholar] [CrossRef]
  26. Staiger, Douglas, and James H. Stock. 1997. Instrumental Variables Regression with Weak Instruments. Econometrica 65: 557–86. [Google Scholar] [CrossRef]
  27. Swamy, Paravastu Aananta Venkata Bhattandha, and Swarnjit S. Arora. 1972. The Exact Finite Sample Properties of the Estimators of Coefficients in the Error Components Regression Models. Econometrica 40: 261–75. [Google Scholar] [CrossRef]
  28. Wallace, T. Dudley, and Ashiq Hussain. 1969. The Use of Error Components Models in Combining Cross Section with Time Series Data. Econometrica 37: 55–72. [Google Scholar] [CrossRef]
  29. White, Halbert. 1982. Maximum Likelihood Estimation of Misspecified Models. Econometrica 50: 1–25. [Google Scholar] [CrossRef]
  30. Wooldridge, Jeffrey M. 2010. Econometric Analysis of Cross Section and Panel Data, 2nd ed. Cambridge, MA: MIT Press. [Google Scholar]
Figure 1. Distribution of the median of $HM_O^1$ and $HM_O^2$, $N = 80$, $T = 80$, $s_x^2 = \sigma_u^2 = 1$ and $\rho_u = 0.5$.
Table 1. Estimation results for motivating example 1, Gasoline.

| Specification | | Intercept | log Y/N | log P_MG/P_GDP | log Car/N |
|---|---|---|---|---|---|
| Within/fixed effects | Coef. | | 0.6622 | −0.3217 | −0.6405 |
| | Std Err. | | 0.0734 | 0.0441 | 0.0297 |
| Between | Coef. | 2.5416 | 0.9676 | −0.9635 | −0.7953 |
| | Std Err. | 0.5268 | 0.1557 | 0.1329 | 0.0825 |
| Random effects | Coef. | 1.9970 | 0.5550 | −0.4204 | −0.6068 |
| | Std Err. 1 | 0.1782 | 0.0572 | 0.0387 | 0.0247 |
| | Std Err. 2 | 0.1843 | 0.0591 | 0.0400 | 0.0255 |

$HM_O^1$ = 26.49505; $HM_O^2$ = 302.8037; $\hat{\sigma}^2_{\varepsilon_w}$ = 0.0085; $\hat{\sigma}^2_{\varepsilon^*}$ = 0.0091; $\hat{\psi}^2$ = 0.0116; $h$ = 1.069; $h^*_{min}$ = 1.0409; $h^*_{max}$ = 2.0837.

| | log Y/N | log P_MG/P_GDP | log Car/N |
|---|---|---|---|
| Within variance | 0.0507 | 0.0162 | 0.3089 |
| Between variance | 0.3508 | 0.4424 | 1.1725 |
| With. var. share (in %), $\theta_W$ | 12.6255 | 3.5325 | 20.8518 |
| Betw. var. share (in %), $\theta_B$ | 87.3745 | 96.4675 | 79.1482 |
Table 2. Estimation results for motivating example 2, Airline.

| Specification | | Intercept | log Q | log fuelprice | loadfactor |
|---|---|---|---|---|---|
| Within | Coef. | | 0.9193 | 0.4175 | −1.0704 |
| | Std Err. | | 0.0299 | 0.0152 | 0.2017 |
| Between | Coef. | 85.8087 | 0.7825 | −5.5239 | −1.7510 |
| | Std Err. | 56.4830 | 0.1088 | 4.4788 | 2.7432 |
| Random effects | Coef. | 9.6279 | 0.9067 | 0.4228 | −1.0645 |
| | Std Err. 1 | 0.2098 | 0.0256 | 0.0140 | 0.1998 |
| | Std Err. 2 | 0.2102 | 0.0256 | 0.0140 | 0.2000 |

$HM_O^1$ = 3.249; $HM_O^2$ = 2.1247; $\hat{\sigma}^2_{\varepsilon_w}$ = 0.0036; $\hat{\sigma}^2_{\varepsilon^*}$ = 0.0036; $\hat{\psi}^2$ = 0.0152; $h$ = 1.0029; $h^*_{min}$ = 1.000; $h^*_{max}$ = 1.3690.

| | log Q | log fuelprice | loadfactor |
|---|---|---|---|
| Within variance | 0.1751 | 0.6521 | 0.0021 |
| Between variance | 1.1340 | 0.0005 | 0.0007 |
| With. var. share (in %), $\theta_W$ | 13.3777 | 99.9282 | 76.0391 |
| Betw. var. share (in %), $\theta_B$ | 86.6223 | 0.0718 | 23.9609 |
Table 3. Estimation results for Motivating example 2, Airline, two covariates.

| Specification | | Intercept | log fuelprice | loadfactor |
|---|---|---|---|---|
| Within | Coef. | | 0.7445 | 0.8684 |
| | Std Err. | | 0.0384 | 0.6780 |
| Between | Coef. | 419.760 | −32.304 | 10.964 |
| | Std Err. | 136.203 | 10.541 | 8.880 |
| Random effects | Coef. | 3.3673 | 0.7393 | 0.9931 |
| | Std Err. 1 | 0.4179 | 0.0384 | 0.6758 |
| | Std Err. 2 | 0.4471 | 0.0411 | 0.7230 |

$HM_O^1$ = 14.5905; $HM_O^2$ = 0.2470; $\hat{\sigma}^2_{\varepsilon_w}$ = 0.0452; $\hat{\sigma}^2_{\varepsilon^*}$ = 0.0518; $\hat{\psi}^2$ = 0.0106; $h$ = 1.1447; $h^*_{min}$ = 1.0000; $h^*_{max}$ = 1.0066.

| | log fuelprice | loadfactor |
|---|---|---|
| Within variance | 0.6522 | 0.0021 |
| Between variance | 0.00047 | 0.0007 |
| With. var. share (in %), $\theta_W$ | 99.9282 | 76.0391 |
| Betw. var. share (in %), $\theta_B$ | 0.0718 | 23.9609 |
Table 4. Estimation results for Motivating example 3, Wage.

| Specification | | Intercept | Exp | Exp² | WKS | OCC | IND | SOUTH | SMSA | MS | UNION |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Within | Coef. | | 0.1132 | 0.0004 | 0.0008 | 0.0215 | 0.0192 | 0.0019 | 0.0425 | 0.0297 | 0.0328 |
| | Std Err. | | 0.0025 | 0.00005 | 0.0006 | 0.0138 | 0.0154 | 0.0343 | 0.0194 | 0.0190 | 0.0149 |
| Between | Coef. | 5.7222 | 0.0275 | 0.0005 | 0.0089 | 0.3536 | 0.0460 | 0.1082 | 0.1815 | 0.3837 | 0.0891 |
| | Std Err. | 0.1918 | 0.0053 | 0.0001 | 0.0040 | 0.0309 | 0.0282 | 0.0284 | 0.0283 | 0.0352 | 0.0324 |
| Random effects | Coef. | 5.4668 | 0.0838 | 0.0008 | 0.0012 | 0.1270 | 0.0194 | 0.0822 | 0.0030 | 0.0092 | 0.0374 |
| | Std Err. 1 | 0.0417 | 0.0022 | 0.0001 | 0.0006 | 0.0123 | 0.0134 | 0.0214 | 0.0157 | 0.0165 | 0.0133 |
| | Std Err. 2 | 0.0554 | 0.0029 | 0.0001 | 0.0008 | 0.0164 | 0.0178 | 0.0284 | 0.0208 | 0.0219 | 0.0176 |

$\hat{\sigma}^2_{\varepsilon_w}$ = 0.0231; $\hat{\sigma}^2_{\varepsilon^*}$ = 0.0407; $HM_O^1$ = 3177.583; $HM_O^2$ = 7569.713; $\hat{\psi}^2$ = 0.0368; $h$ = 1.7626; $h^*_{min}$ = 1.0221; $h^*_{max}$ = 2.6757; $\tilde{h}$ = 2.0566.

| | Exp | Exp² | WKS | OCC | IND | SOUTH | SMSA | MS | UNION |
|---|---|---|---|---|---|---|---|---|---|
| Within variance | 4 | 0.0087 | 15.5347 | 0.0300 | 0.0233 | 0.0048 | 0.0149 | 0.0155 | 0.0254 |
| Between variance | 116.2324 | 0.0001 | 10.7666 | 0.2199 | 0.2157 | 0.2012 | 0.2114 | 0.1356 | 0.2061 |
| With. var. share (in %), $\theta_W$ | 3.3269 | 3.3118 | 59.0643 | 11.9970 | 9.7561 | 2.3308 | 6.6068 | 10.2570 | 10.9640 |
| Betw. var. share (in %), $\theta_B$ | 96.6731 | 96.6881 | 40.9357 | 88.0029 | 90.2439 | 97.6691 | 93.3932 | 89.7430 | 89.0360 |
Table 5. Review of all cases.

| Range of $h$ | Sub-case | Ordering | Consequence |
|---|---|---|---|
| $h < h^*_{min}$ ($\tilde{\Gamma}$ is SPD and $HM_O^2 > 0$) | $0 < h < 1 < h^*_{min}$ | $\tilde{\Gamma} \succ \Gamma$ | $HM_O^1 > HM_O^2 > 0$ |
| | $1 < h < h^*_{min}$ | $\Gamma \succ \tilde{\Gamma}$ | $HM_O^2 > HM_O^1 > 0$ |
| $h^*_{min} < h < h^*_{max}$ ($\tilde{\Gamma}$ and sign of $HM_O^2$ a priori indefinite) | $1 < h^*_{min} < h < h^*_{max}$ | $\tilde{\Gamma}$ indefinite | $HM_O^1 \gtrless HM_O^2$ |
| $h^*_{max} < h$ ($\tilde{\Gamma}$ is SND and $HM_O^2 < 0$) | $1 < h^*_{max} < h$ | $\tilde{\Gamma}$ negative definite | $HM_O^1 > 0 > HM_O^2$ |