# Estimation Methods of the Multiple-Group One-Dimensional Factor Model: Implied Identification Constraints in the Violation of Measurement Invariance

1. Department of Educational Measurement, IPN, Leibniz Institute for Science and Mathematics Education, 24118 Kiel, Germany
2. Centre for International Student Assessment (ZIB), 24118 Kiel, Germany
Axioms 2022, 11(3), 119; https://doi.org/10.3390/axioms11030119
Submission received: 20 February 2022 / Revised: 2 March 2022 / Accepted: 7 March 2022 / Published: 9 March 2022
(This article belongs to the Section Mathematical Analysis)

## Abstract

Factor analysis is one of the most important statistical tools for analyzing multivariate data (i.e., items) in the social sciences. An essential case is the comparison of multiple groups on a one-dimensional factor variable that can be interpreted as a summary of the items. Measurement invariance is a frequently employed assumption that enables the comparison of the factor variable across groups. This article discusses different estimation methods of the multiple-group one-dimensional factor model under violations of measurement invariance (i.e., measurement noninvariance). In detail, joint estimation, linking methods, and regularized estimation approaches are treated. It is argued that linking approaches and regularization approaches can be equivalent to joint estimation approaches if appropriate (robust) loss functions are employed. Each of the estimation approaches defines identification constraints of parameters that quantify violations of measurement invariance. We argue in the discussion section that the fitted multiple-group one-dimensional factor analysis will likely be misspecified due to the violation of measurement invariance. Hence, because there is always indeterminacy in determining group comparisons of the factor variable under noninvariance, the preference of particular fitting strategies such as partial invariance over alternatives is unjustified. Instead, researchers should purposely define fitting functions such that the chosen (robust) loss function minimizes the extent of model misspecification.

## 1. Introduction

Factor analysis is one of the most important statistical tools for analyzing multivariate data (i.e., items) in the social sciences [1,2]. An important case is the comparison of multiple groups on a one-dimensional factor variable that can be interpreted as a summary of the multivariate input data. To enable a comparison on the factor variable, identification constraints for model estimation must be posed [3,4,5].
A popular and heavily discussed identification constraint is the assumption of measurement invariance (MI, [6,7]) that assumes the existence of invariant (i.e., equal) item parameters across groups. Noninvariant item parameters occur if not all parameters are equal across groups. Practitioners and applied methodologists frequently claim that MI or only weak violations of MI (i.e., partial invariance) are necessary to enable group comparisons on the factor variable [8,9,10]. In this article, we discuss different estimation methods of the one-dimensional factor model and their implied identification constraints in the violation of MI. In more detail, we focus on joint estimation (maximum likelihood estimation), linking approaches (Haberman linking, invariance alignment), and regularized estimation (lasso-type regularization, fused regularization, Bayesian approximate invariance). We derive identification constraints on parameters that quantify violations of MI under different estimation methods. By doing so, it turns out that joint estimation, linking, and regularization can be interpreted quite similarly under certain specifications. Therefore, this work discusses competing approaches for handling violations of measurement invariance in a unified framework and provides conditions under which these approaches provide similar results. We also try to convince the reader that there cannot be a single right choice of an approach for handling violations of measurement invariance because an overidentified minimization problem is only made identifiable by selecting a fitting function. We derive the identification constraints implied by the different fitting functions and discuss their similarities in the remainder of this article.
The article is structured as follows: In Section 2, we discuss two important one-dimensional factor models and their estimation: the tau-equivalent and the tau-congeneric measurement model. Section 3 treats the tau-equivalent model with noninvariant item intercepts. Section 4 treats the tau-congeneric model with noninvariant item intercepts and invariant item loadings, while Section 5 also allows noninvariant item loadings. Finally, Section 6 closes with a discussion.

## 2. One-Dimensional Factor Model

Assume that there are I random variables $X 1 , … , X I$. These variables are also referred to as items. Denote by $X = ( X 1 , … , X I )$ the vector of all items. Denote by $μ = E ( X )$ the vector of means containing the entries $E ( X i )$ and by $Σ = Var ( X )$ the covariance matrix containing entries $σ i j = Cov ( X i , X j )$ for $i ≠ j$. In one-dimensional factor analysis, we represent the I items by a one-dimensional factor variable F. Hence, the covariances among items are presented by a rank-one matrix. In the following, we discuss two main measurement models of one-dimensional factor analysis: the tau-equivalent and the tau-congeneric models [11,12,13,14].

#### 2.1. Tau-Equivalent Model

We now assume a one-dimensional factor F in the tau-equivalent model [14]:
$X i = ν i + F + ε i , Var ( ε i ) = ϕ ,$
where the residuals $ε i$ are uncorrelated. Note that all item loadings are assumed to be equal (and fixed to one in this parametrization) and that all residual variances are equal to a common value $ϕ$, while the item intercepts $ν i$ are item-specific parameters. For identification, we assume $E ( F ) = 0$, and $ψ = Var ( F )$ is estimated. Denote by $I$ the $I × I$ identity matrix and by $1$ an $I × 1$ vector of ones. Then, the covariance matrix $Σ$ of the items $X$ is represented by a model-implied covariance matrix $Σ 0$
$Σ 0 = ψ 11 ⊤ + ϕ I .$
Note that the covariance matrix is parsimoniously represented by only two parameters. The mean vector $μ = E ( X ) = ν$ is estimated without constraints.
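As a small numerical illustration (not part of the article; the parameter values are assumed for the example), the model-implied covariance structure can be constructed directly:

```python
import numpy as np

# Illustrative sketch (psi and phi are assumed example values): the
# model-implied covariance matrix Sigma_0 = psi * 11^T + phi * I for I = 4 items.
I_items, psi, phi = 4, 0.8, 0.5
ones = np.ones((I_items, 1))
Sigma0 = psi * (ones @ ones.T) + phi * np.eye(I_items)

# Every off-diagonal covariance equals psi; every variance equals psi + phi,
# and subtracting phi * I leaves a rank-one matrix.
print(Sigma0[0, 1], Sigma0[0, 0])
```

Only the two parameters $ψ$ and $ϕ$ generate the whole covariance matrix, which is the parsimony noted above.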

#### 2.2. Tau-Congeneric Model

In the tau-congeneric measurement model [11], item-specific loadings and item-specific residual variances are allowed:
$X i = ν i + λ i F + ε i , Var ( ε i ) = ϕ i ,$
where residuals $ε i$ are uncorrelated across items. Denote by $λ = ( λ 1 , … , λ I )$ the vector of item loadings $λ i$ and $Φ = diag ( ϕ 1 , … , ϕ I )$ the diagonal matrix containing residual variances $ϕ i$. For reasons of identification, we set $E ( F ) = 0$ and $Var ( F ) = 1$. The covariance matrix $Σ$ is modelled as
$Σ 0 = λ λ ⊤ + Φ .$

#### 2.3. Overview of Estimation Methods

The tau-equivalent and the tau-congeneric model are special cases of structural equation models that impose restrictions on the mean vector and the covariance matrix [15]. In maximum likelihood (ML) estimation assuming multivariate normality of $X$, the empirical mean vector $x ¯$ and empirical covariance matrix $S$ are sufficient statistics. We denote by $θ$ the vector of all estimated parameters and define the fitting function
$F ML ( θ ; x ¯ , S ) = − N 2 [ I log ( 2 π ) + log | Σ 0 ( θ ) | + tr ( S Σ 0 ( θ ) − 1 ) + ( x ¯ − μ 0 ( θ ) ) ⊤ Σ 0 ( θ ) − 1 ( x ¯ − μ 0 ( θ ) ) ] .$
In this article, we are only concerned with the statistical behavior of parameter estimates in the population (i.e., infinitely large sample sizes). Then, the sample quantities $x ¯$ and $S$ are replaced by the population parameters $μ$ and $Σ$. The fitting function in Equation (1) can then be rewritten as
$F ML ( θ ; μ , Σ ) = − log | Σ 0 ( θ ) | − tr ( Σ Σ 0 ( θ ) − 1 ) − ( μ − μ 0 ( θ ) ) ⊤ Σ 0 ( θ ) − 1 ( μ − μ 0 ( θ ) ) .$
In practice, the model-implied covariance matrix will be misspecified [16], and $θ$ is a pseudo-true parameter that is defined as the maximizer of the fitting function $F ML$.
A more general class of fitting functions is weighted least squares (WLS) estimation [15]. The parameter vector $θ$ is determined as the minimizer of
$F WLS ( θ ; μ , σ ) = ( μ − μ 0 ( θ ) ) ⊤ W 1 ( μ − μ 0 ( θ ) ) + ( σ − σ 0 ( θ ) ) ⊤ W 2 ( σ − σ 0 ( θ ) )$
with known weight matrices $W 1$ and $W 2$. The vectors $σ$ and $σ 0$ contain the nonduplicated elements from covariance matrices $Σ$ and $Σ 0 ( θ )$. Diagonally weighted least squares (DWLS) estimation results by choosing diagonal weight matrices $W 1$ and $W 2$. If these matrices are identity matrices, unweighted least squares (ULS) estimation is obtained. Interestingly, the minimization in (2) can be interpreted as a nonlinear least squares estimation problem with sufficient statistics $μ$ and $Σ$ as input data [17].
It has been shown that ML estimation can be approximately written as DWLS estimation [18] with particular weight matrices. DWLS can be generally written as
$F DWLS ( θ ; μ , σ ) = ∑ i = 1 I w 1 i ( μ i − μ 0 , i ( θ ) ) 2 + ∑ i = 1 I ∑ j = i I w 2 i j ( σ i j − σ 0 , i j ( θ ) ) 2 ,$
where $μ i$ etc. indicate the corresponding elements of the vectors defined in (3). In ML estimation, the weights are approximately determined by $w 1 i = 1 / u i 2$ and $w 2 i j = 1 / ( u i 2 u j 2 )$, where the $u i 2$ are standardized unique variances given by $u i 2 = ϕ i / σ i i$. With smaller residual variances $ϕ i$, more trust is put on a mean $μ i$ or a covariance $σ i j$ in the fitting function. This kind of weighting seems questionable in the case of misspecified models [18].
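The following sketch illustrates ULS estimation (identity weight matrices) for the tau-equivalent model at the population level; the true parameter values and the use of a generic numerical optimizer are assumptions made purely for the illustration:

```python
import numpy as np
from scipy.optimize import minimize

# Illustrative sketch (all parameter values assumed): recover the
# tau-equivalent parameters (psi, phi, nu_1, ..., nu_I) from population
# moments by minimizing the least-squares fitting function with identity
# weight matrices (ULS).
I_items = 4
psi_true, phi_true = 0.9, 0.4
nu_true = np.array([0.0, 0.3, -0.2, 0.5])
mu = nu_true.copy()                          # E(F) = 0 implies mu = nu
Sigma = psi_true * np.ones((I_items, I_items)) + phi_true * np.eye(I_items)
iu = np.triu_indices(I_items)                # nonduplicated covariance elements

def f_uls(theta):
    psi, phi, nu = theta[0], theta[1], theta[2:]
    Sigma0 = psi * np.ones((I_items, I_items)) + phi * np.eye(I_items)
    return np.sum((mu - nu) ** 2) + np.sum((Sigma[iu] - Sigma0[iu]) ** 2)

res = minimize(f_uls, x0=np.concatenate(([1.0, 1.0], np.zeros(I_items))))
psi_hat, phi_hat = res.x[0], res.x[1]
print(round(psi_hat, 3), round(phi_hat, 3))  # close to psi_true and phi_true
```

Because the weights are identity matrices, all mean and covariance residuals contribute equally; multiplying each term by weights $w 1 i$ and $w 2 i j$ would give the DWLS variant described above.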
The model deviations $μ i − μ 0 , i ( θ )$ and $σ i j − σ 0 , i j ( θ )$ can be differently weighted by replacing the least squares functions with robust fitting functions $ρ$ [19,20]:
$F rob ( θ ; μ , σ ) = ∑ i = 1 I w 1 i ρ ( μ i − μ 0 , i ( θ ) ) + ∑ i = 1 I ∑ j = i I w 2 i j ρ ( σ i j − σ 0 , i j ( θ ) ) .$
Siemsen and Bollen [19] proposed the absolute value function $ρ ( x ) = | x |$ for fitting the factor analysis model. This fitting function is robust to a few model violations such as unmodelled correlations of residuals $ε i$. Alternative robust loss functions such as $ρ ( x ) = | x | p$ with $0 < p < 1$ can ensure even more model-robust estimates [21].
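A minimal sketch of such a robust loss, using the differentiable approximation $ρ ( x ) = ( x 2 + ε ) p / 2$ with assumed values for p and $ε$, shows how large deviations are penalized much less severely than under the square loss:

```python
import numpy as np

# Differentiable approximation of the robust loss rho(x) = |x|^p
# (p and eps are assumed example values, not taken from the article).
def rho(x, p=0.5, eps=1e-6):
    return (x ** 2 + eps) ** (p / 2)

# Compared with the square loss x**2, a large model deviation (x = 5)
# contributes far less to the fitting function, limiting its influence.
for x in (0.1, 1.0, 5.0):
    print(x, round(rho(x), 4), round(x ** 2, 4))
```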

#### 2.4. Estimation in the Presence of Slight Model Misspecifications

We now study the behavior of the estimate $θ$ as the minimizer of $F rob$ in (4) (see [16,22]). A nondifferentiable loss function $ρ$ is substituted by a differentiable approximation (e.g., $ρ ( x ) = | x |$ is replaced by $ρ ( x ) = ( x 2 + ε ) 1 / 2$ for a small $ε > 0$; see [21]) in the following derivation.
We investigate slight model misspecifications for the mean and the covariance structures. For the mean structure, we define residuals $γ i ( θ ) = μ i − μ 0 , i ( θ )$ for $i = 1 , … , I$. Furthermore, we define $δ i j ( θ ) = σ i j − σ 0 , i j ( θ )$ for model deviations in the covariance structure. The estimate $θ$ is obtained by taking the derivative of $F rob$ with respect to $θ$ and setting it to zero:
$H ( μ , σ , θ ) = ∑ i = 1 I w 1 i ρ ′ ( μ i − μ 0 , i ( θ ) ) a i ( θ ) + ∑ i = 1 I ∑ j = i I w 2 i j ρ ′ ( σ i j − σ 0 , i j ( θ ) ) b i j ( θ ) = 0 ,$
where $a i ( θ ) = ∂ μ 0 , i ∂ θ$ and $b i j ( θ ) = ∂ σ 0 , i j ∂ θ$. A parameter estimate $θ$ is obtained by computing the root of $H$ in Equation (5). Note that $θ$ is a function of the mean vector $μ$ and the stacked covariance matrix $σ$.
Assume that there is a true parameter $θ 0$ that would be obtained if all model deviations $γ i$ and $δ i j$ were zero. That is, we assume
$H ( μ 0 ( θ 0 ) , σ 0 ( θ 0 ) , θ 0 ) = 0 .$
Now, use the notation $γ = μ − μ 0 ( θ 0 )$ and $δ = σ − σ 0 ( θ 0 )$. A first-order Taylor expansion of $H$ using (6) provides
$H ( μ 0 ( θ 0 ) + γ , σ 0 ( θ 0 ) + δ , θ ) ≃ H μ ( μ 0 ( θ 0 ) ) γ + H σ ( σ 0 ( θ 0 ) ) δ + H θ ( θ 0 ) ( θ − θ 0 ) = 0 ,$
where $H μ ( μ 0 ( θ 0 ) ) = ∂ H ∂ μ | μ = μ 0 ( θ 0 )$,$H σ ( σ 0 ( θ 0 ) ) = ∂ H ∂ σ | σ = σ 0 ( θ 0 )$ and $H θ ( θ 0 ) = ∂ H ∂ θ | θ = θ 0$. Note that we suppress arguments in $H μ$, $H σ$ and $H θ$ in our abbreviated notation. Using the approximation (7), we obtain
$θ = θ 0 − H θ − 1 ( θ 0 ) [ H μ ( μ 0 ( θ 0 ) ) γ + H σ ( σ 0 ( θ 0 ) ) δ ] .$
Model deviations $γ$ and $δ$ enter the computation according to
$H μ ( μ 0 ( θ 0 ) ) γ + H σ ( σ 0 ( θ 0 ) ) δ = ∑ i = 1 I w 1 i ρ ″ ( γ i ) a i ( θ 0 ) γ i + ∑ i = 1 I ∑ j = i I w 2 i j ρ ″ ( δ i j ) b i j ( θ 0 ) δ i j .$
When using the square loss function $ρ ( x ) = x 2$ in ULS estimation, $ρ ″ ( x ) = 2$ and all model deviations contribute equally in the adapted parameter estimate $θ$. In contrast, when using a robust loss function $ρ ( x ) = | x | p$ for $p ≤ 1$, model deviations $γ i$ and $δ i j$ are differentially weighted according to $ρ ″ ( γ i )$ and $ρ ″ ( δ i j )$ in Equation (9), respectively [21].
The result in Equation (8) highlights that the model deviations $γ$ and $δ$ enter the computation of the model parameter $θ$. With a suitable loss function $ρ$, the influence of model deviations can be reduced if the second derivative $ρ ″$ is sufficiently small for gross model misspecifications.
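A one-parameter analog with assumed toy numbers illustrates this influence reduction: minimizing $∑ i ρ ( x i − c )$ over a location parameter c yields the mean under the square loss, which is pulled toward a gross deviation, whereas a robust loss with p < 1 essentially ignores it:

```python
import numpy as np

# Assumed toy residuals; the entry 3.0 plays the role of a gross
# model misspecification among otherwise small deviations.
x = np.array([0.0, 0.1, -0.1, 0.05, -0.05, 3.0])

def fit_location(p, eps=1e-6):
    # minimize sum_i rho(x_i - c) over a grid of candidate values c
    grid = np.linspace(-1.0, 4.0, 5001)
    loss = [np.sum(((x - c) ** 2 + eps) ** (p / 2)) for c in grid]
    return grid[int(np.argmin(loss))]

print(fit_location(2.0))   # square loss: pulled to the mean 0.5
print(fit_location(0.5))   # robust loss: stays near the bulk around 0
```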

## 3. Group Comparisons in the Tau-Equivalent Model with Noninvariant Item Intercepts

Suppose that there are a fixed number of G groups. In each of the G groups, there exists a mean vector $μ g = ( μ 1 g , … , μ I g )$ and a covariance matrix $Σ g = ( σ i j g ) i j$ for items $X g = ( X 1 g , … , X I g )$. A latent variable $F g$ in a one-dimensional factor model summarizes the distribution of items in each group $g = 1 , … , G$. Define $α g = E ( F g )$ and $ψ g = Var ( F g )$. In the sequel, we discuss the identification of $α g$ and $ψ g$ for various fitting functions (i.e., estimation methods).
We now model the mean structure $μ g$ and the covariance structure $Σ g$ with the tau-equivalent one-dimensional factor model in each group g. Assuming the identification constraint $E ( F g ) = 0$ and estimating $ψ g = Var ( F g )$ poses the restrictions
$μ g = ν g and Σ g = ψ g 11 ⊤ + ϕ g I .$
It can be seen from (10) that the mean structure is estimated without constraints, while severe constraints on the covariance structure are imposed.
If the mean $α g$ and the variance $ψ g$ of the factor variable $F g$ are also to be determined in each group to enable a comparison of the factor variable across groups, additional identification constraints have to be posed. This can be seen by including these parameters in Equation (10):
$μ g = α g 1 + ν g and Σ g = ψ g 11 ⊤ + ϕ g I .$
A popular identification constraint is to assume invariant item intercepts $ν 0$ across groups. In this case of MI, (11) simplifies to
$μ g = α g 1 + ν 0 and Σ g = ψ g 11 ⊤ + ϕ g I .$
The group means and group variances can be identified by assuming $α 1 = 0$. The condition (12) can be characterized as scalar invariance [7]. The MI assumption (12) can be statistically tested [7]. If the MI hypothesis (12) is not rejected, $α g$ and $ψ g$ can be uniquely determined. In the violation of MI (i.e., measurement noninvariance; MNI), there is indeterminacy in defining group means and group variances. Identification constraints are implicitly posed by assuming particular fitting functions. We discuss several alternative fitting functions and draw relations among the different approaches below. In the following treatment, we allow group-specific item intercepts in the data-generating model:
$μ g = α g 1 + ν 0 + ν g ∗ and Σ g = ψ g 11 ⊤ + ϕ g I .$
The group-specific item intercepts $ν g ∗$ are residuals that describe differences from the common item intercepts $ν 0$. Hence, violation of MI (i.e., MNI) is represented in $ν g ∗$. Condition (13) is also characterized as metric invariance [7].

#### 3.1. Joint Estimation

In joint estimation, group means, group variances and common item parameters are estimated. However, MNI effects are not explicitly modeled as additional parameters. Group-specific means $μ g$ and covariances $Σ g$ are used for determining the vector of model parameters $θ = ( ν 0 , α 2 , … , α G , ψ 1 , … , ψ G , ϕ 1 , … , ϕ G )$. In Section 2.3, it was argued that many estimation methods like ML estimation could be (approximately) characterized as DWLS estimation. DWLS uses the square loss function, but one can stick to the more general case of a loss function $ρ$ (see Equation (4)). Using a set of known weights $w 1 i g$ and $w 2 i j g$ for $i , j = 1 , … , I$ and $g = 1 , … , G$, the following fitting function is minimized:
$F ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ρ ( μ i g − μ 0 , i g ( θ ) ) + ∑ g = 1 G ∑ i = 1 I ∑ j = i I w 2 i j g ρ ( σ i j g − σ 0 , i j g ( θ ) ) .$
Note that the order in the summation in (14) across groups (index g) and items (indices i and j) can be swapped. The model assumption (13) for $μ 0 , i g ( θ )$ and $σ 0 , i j g ( θ )$ can be included in (14), and we obtain
$F ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ρ ( μ i g − α g − ν 0 i ) + ∑ g = 1 G ∑ i = 1 I ∑ j = i I w 2 i j g ρ ( σ i j g − ψ g − ϕ g 1 { i = j } ) ,$
where $1 A$ denotes the indicator function for a set A. Because of the invariance assumption for item loadings in (13), the second term in (15) will be exactly zero, and $ψ 1 , … , ψ G$ and $ϕ 1 , … , ϕ G$ can be uniquely determined from the data. One can choose $ψ g = σ i j g$ for any $i ≠ j$. Then, $ϕ g = σ i i g − ψ g$ for any i. The group means $α g$ and common item intercepts $ν i 0$ can be estimated by minimizing
$F 1 ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ρ ( μ i g − α g − ν i 0 ) .$
It can be seen that (16) corresponds to an analysis of variance model in which the two-way data $μ i g$ for items $i = 1 , … , I$ and groups $g = 1 , … , G$ are represented by two sets of main effects $α g$ and $ν i 0$ [23]. For the item intercepts $ν i 0$, we obtain the estimating equations
$∂ F 1 ∂ ν i 0 = − ∑ g = 1 G w 1 i g ρ ′ ( μ i g − α g − ν i 0 ) = 0 for i = 1 , … , I .$
For group means $α g$, we similarly obtain
$∂ F 1 ∂ α g = − ∑ i = 1 I w 1 i g ρ ′ ( μ i g − α g − ν i 0 ) = 0 for g = 2 , … , G .$
Due to the assumption (13), we have $μ i g − α g − ν i 0 = ν i g ∗$ for the group-specific item intercepts. Hence, it follows from (18) that
$∑ i = 1 I w 1 i g ρ ′ ( ν i g ∗ ) = 0 for all g = 2 , … , G .$
From (17), we get
$∑ g = 1 G ∑ i = 1 I w 1 i g ρ ′ ( μ i g − α g − ν i 0 ) = 0 .$
From (19) and (20), we finally obtain
$∑ i = 1 I w 1 i g ρ ′ ( ν i g ∗ ) = 0 for all g = 1 , … , G .$
The finding in Equation (21) demonstrates that an assumption about the group-specific residual item intercepts $ν i g ∗$ is implicitly made when fitting a multiple-group factor model under violation of MI. Hence, the group means $α g$ depend on the chosen set of weights $w 1 i g$ and the loss function $ρ$.
When choosing $w 1 i g ≡ 1$, the condition of partial invariance (PI; [24]) is obtained for the loss function $ρ ( x ) = | x | p$ with $p → 0$, which takes the value 0 iff $x = 0$ and 1 otherwise. In PI, it is typically assumed that there is a small subset of items for which $ν i g ∗ ≠ 0$ (i.e., $ρ ( ν i g ∗ ) = 1$), while for the majority of items it holds that $ν i g ∗ = 0$ (i.e., $ρ ( ν i g ∗ ) = 0$). The loss function in (16) then minimizes the number of group-specific residual item intercepts that differ from zero [25].
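The consequence of the identification constraint (21) can be illustrated with a toy example (all numbers assumed): two groups, five items, and one noninvariant item intercept. Under the square loss, the estimated group mean absorbs part of the noninvariance, whereas a near-L0 loss reproduces the partial-invariance behavior:

```python
import numpy as np

# Hypothetical example: two groups, five items, one noninvariant intercept
# (nu*_{52} = 1).  We minimize F_1 = sum_{g,i} rho(mu_ig - alpha_g - nu_i0)
# with alpha_1 = 0, profiling the nu_i0 out on a grid.
nu0 = np.array([0.0, 0.2, -0.1, 0.4, 0.1])
alpha2_true = 0.3
mu = np.vstack([nu0, alpha2_true + nu0 + np.array([0.0, 0.0, 0.0, 0.0, 1.0])])

def rho(x, p, eps=1e-8):
    return (x ** 2 + eps) ** (p / 2)

def fit_alpha2(p):
    a_grid = np.linspace(0.0, 1.0, 101)        # candidates for alpha_2
    n_grid = np.linspace(-0.5, 1.5, 401)       # candidates for each nu_i0
    best_a, best_loss = None, np.inf
    for a2 in a_grid:
        loss = sum(np.min(rho(mu[0, i] - n_grid, p)
                          + rho(mu[1, i] - a2 - n_grid, p)) for i in range(5))
        if loss < best_loss:
            best_a, best_loss = a2, loss
    return best_a

print(round(fit_alpha2(2.0), 2))   # square loss: biased toward 0.5
print(round(fit_alpha2(0.1), 2))   # near-L0 loss: close to the true 0.3
```

With the square loss, the single noninvariant item shifts the group mean by the average of the residual intercepts; with $p → 0$, the noninvariant item is effectively discarded, as in PI.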

#### 3.2. Linking

In linking methods [26], the one-dimensional factor model is firstly estimated in each of the groups. In the tau-equivalent model, the group variance $ψ g$ can be identified, but the group-specific estimation only provides identified item intercepts $ν i g$ that are given as (see (13))
$ν i g = α g + ν i 0 + ν i g ∗ .$
Note that item intercepts $ν i g$ coincide with group-specific item means $μ i g = E ( X i g )$.
In a second step of the linking approach, the identified intercepts are used to determine group means $α g$ and common item intercepts $ν i 0$ [21]. By defining $θ = ( α 2 , … , α G , ν 10 , … , ν I 0 )$, a linking function H is defined by
$H ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ρ ( ν i g − α g − ν i 0 )$
using some set of weights $w 1 i g$ that are chosen equal to one in many applications; H is minimized with respect to $θ$. Again, the order of summation in (22) across groups (index g) and items (index i) can be swapped. The linking function in (22) can be considered as Haberman linking (HL; [21,27,28]). In [28], the loss function $ρ ( x ) = x 2$ was used, while [21] treated the robust loss function $ρ ( x ) = | x | p$ for $p ∈ [ 0 , 2 ]$. Note that for the tau-equivalent model, the minimization problem (22) is exactly the same as the minimization problem (16) in joint estimation if the covariance structure is correctly specified, because the item intercepts then coincide with the observed group-specific item means. Hence, the same condition as in (21) holds for the group-specific residual item intercepts $ν i g ∗$.
An alternative linking approach has been proposed that avoids estimating common item intercepts $ν i 0$. In invariance alignment (IA; [29]), the following function G is minimized for determining $θ = ( α 2 , … , α G )$ while setting $α 1 = 0$:
$G ( θ ) = ∑ g = 1 G ∑ h = 1 G ∑ i = 1 I w 1 i g w 1 i h ρ ( ν i g − ν i h − α g + α h ) ,$
where the loss function $ρ ( x ) = | x | p$ for $p ≥ 0$ is utilized [21,30]. The power $p = 0.5$ is most frequently chosen because it is the default in the software package Mplus [30]. It was empirically found that the IA minimization in (23) provides very similar group mean estimates as the minimization in (22) that also involves the estimation of common item intercepts [21]. Indeed, the loss function $ρ ( x ) = | x | p$ is a subadditive function for $p ≤ 1$ [31] which means that
$ρ ( x + y ) ≤ ρ ( x ) + ρ ( y ) for all x , y ∈ R .$
By defining $x = ν i g − α g − ν i 0$ and $y = − ( ν i h − α h − ν i 0 )$, we get from (23) by using (24)
$G ( θ ) ≤ ∑ g = 1 G ∑ i = 1 I w ˜ 1 i g ρ ( ν i g − α g − ν i 0 ) = H ˜ ( θ ) ,$
where the weights $w ˜ 1 i g$ are defined as
$w ˜ 1 i g = 2 w 1 i g ∑ h = 1 G w 1 i h .$
Hence, the majorizing function $H ˜$ in (25) is exactly given by the minimization function H in (22) when using properly defined weights $w ˜ 1 i g$ but the same loss function $ρ$. As a conclusion, joint estimation and linking methods can be regarded as exchangeable in the tau-equivalent model. They pose the same identification constraints on group-specific residual item intercepts.
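The majorization $G ( θ ) ≤ H ˜ ( θ )$ can be checked numerically; the sketch below uses randomly generated inputs (an assumption made purely for illustration) together with the weights $w ˜ 1 i g$ from (26):

```python
import numpy as np

# Numerical check of G(theta) <= H_tilde(theta) for rho(x) = |x|^p with
# p = 0.5; all inputs are randomly generated (illustrative assumption).
rng = np.random.default_rng(0)
n_groups, n_items, p = 3, 6, 0.5
nu = rng.normal(size=(n_groups, n_items))      # identified intercepts nu_ig
w = rng.uniform(0.5, 1.5, size=(n_groups, n_items))
alpha = rng.normal(size=n_groups)              # arbitrary group means
nu0 = rng.normal(size=n_items)                 # arbitrary common intercepts

rho = lambda x: np.abs(x) ** p
G_val = sum(w[g, i] * w[h, i] * rho(nu[g, i] - nu[h, i] - alpha[g] + alpha[h])
            for g in range(n_groups) for h in range(n_groups)
            for i in range(n_items))
w_tilde = 2.0 * w * w.sum(axis=0)              # w~_1ig = 2 w_1ig sum_h w_1ih
H_val = sum(w_tilde[g, i] * rho(nu[g, i] - alpha[g] - nu0[i])
            for g in range(n_groups) for i in range(n_items))
print(G_val <= H_val)                          # True, by subadditivity of rho
```

Because subadditivity holds for every pair of arguments, the bound holds for arbitrary $α g$ and $ν i 0$, not only at the minimizer.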
Note that the two-step linking approach can be rewritten as a one-step estimation approach with overidentified parameters. Identification is ensured by posing side constraints implied by the linking function [32]. A joint optimization problem can be formulated by using Lagrange multipliers. Assume that there are weights $w 1 i g ∗$ associated with group-wise ML estimation and different weights $w 1 i g$ in the linking method. Suppose that we parametrize $ν i g = ν i 0 + α g + ν ˜ i g$. The reformulated one-step fitting function $F Lagrange$ of the two-step linking approach, using Lagrange multipliers $ℓ 1 g$ and $ℓ 2 i$, is given by (see [32])
$F Lagrange ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ∗ ( μ i g − α g − ν i 0 − ν ˜ i g ) 2 + ∑ g = 2 G ℓ 1 g ∑ i = 1 I w 1 i g ρ ′ ( ν ˜ i g ) + ∑ i = 1 I ℓ 2 i ∑ g = 1 G w 1 i g ρ ′ ( ν ˜ i g ) ,$
where $θ$ now also includes the $G − 1 + I$ Lagrange multipliers $ℓ 1 g$ and $ℓ 2 i$. Note that the second and third terms in (27) correspond to the estimating equations obtained from the linking method for determining group means $α g$ and common item intercepts $ν i 0$.

#### 3.3. Regularization

MNI has also been tackled by regularization methods [33,34,35]. The main idea is to introduce nonidentified group-specific residual item intercepts $ν ˜ i g$ in ML estimation. The estimation problem becomes identifiable by adding a penalty function $Ƥ$ to the negative log-likelihood function [36]. Again, the covariance structure is assumed to be correctly specified. The estimated model parameters are collected in the vector $θ = ( ν 10 , … , ν I 0 , α 2 , … , α G , … , ν ˜ i g , … )$. Using the result (3), the fitting function can be approximately written as
$F reg ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ( μ i g − α g − ν i 0 − ν ˜ i g ) 2 + ∑ g = 1 G ∑ i = 1 I Ƥ ( κ , ν ˜ i g ) ,$
where $κ > 0$ is a tuning parameter. A popular penalty function is the lasso penalty $Ƥ ( κ , x ) = κ | x |$, but alternative lasso-type penalty functions with similar behavior around $x = 0$ but more desirable statistical properties have been proposed [36]. Alternatively, the ridge penalty $Ƥ ( κ , x ) = κ x 2$ can be used, which controls the variability of MNI effects.
When using lasso-type penalty functions, the residual intercepts $ν ˜ i g$ can be interpreted as outliers. Indeed, for the lasso penalty, it has been shown that the minimization of $F reg$ in (28) using regularization is equivalent to robust regression with outlier detection [37,38]. By defining $θ ˜ = ( ν 10 , … , ν I 0 , α 2 , … , α G )$ and an appropriate loss function $ρ ˜$, the minimization problem (28) can be rewritten as
$F rob ( θ ˜ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ρ ˜ ( μ i g − α g − ν i 0 ) .$
Hence, regularized ML estimation can be equivalently recognized as joint estimation using a particular loss function that enables the efficient detection of outliers.
We now characterize the solution of (28) in more detail. From the behavior of the lasso penalty, it is reasonable to assume that the estimated residual item intercepts $ν ˜ i g$ equal zero iff $| ν i g ∗ | < κ$ holds for the true residual item intercepts $ν i g ∗$. On the other hand, we can assume that $ν ˜ i g = ν i g ∗$ iff $| ν i g ∗ | > κ$.
$∂ F reg ∂ α g = − 2 ∑ i = 1 I w 1 i g ( μ i g − α g − ν i 0 − ν ˜ i g ) = 0 .$
By relying on the just mentioned properties for estimated residual item intercepts $ν ˜ i g$, we obtain the condition
$∑ i = 1 I w 1 i g ν i g ∗ 1 { | ν i g ∗ | < κ } = 0 for all g = 1 , … , G .$
The result in (30) indicates that MNI cancels out on average for small effects $ν i g ∗$ that fulfill $| ν i g ∗ | < κ$. Note that this set of effects is implicitly estimated in regularized ML estimation.
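For a single residual intercept, the lasso-penalized problem in (28) has a closed-form soft-thresholding solution; the sketch below (a standalone scalar problem with assumed values, using the exact threshold $κ / ( 2 w )$ rather than the rough threshold $κ$ used in the text) verifies it against a brute-force grid search:

```python
import numpy as np

# Scalar lasso problem:  minimize over t:  w * (nu_star - t)^2 + kappa * |t|.
# Its closed-form solution is soft thresholding at kappa / (2 * w); small
# residual intercepts are therefore set exactly to zero.
def soft_threshold(nu_star, w, kappa):
    thr = kappa / (2.0 * w)
    return np.sign(nu_star) * max(abs(nu_star) - thr, 0.0)

def brute_force(nu_star, w, kappa):
    t = np.linspace(-2.0, 2.0, 400001)
    return t[np.argmin(w * (nu_star - t) ** 2 + kappa * np.abs(t))]

w, kappa = 1.0, 0.4
for nu_star in (0.1, -0.15, 0.8):
    print(nu_star, soft_threshold(nu_star, w, kappa),
          round(brute_force(nu_star, w, kappa), 5))
```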
In the case of a general penalty function $Ƥ$, define $Ƥ 1 = ∂ Ƥ ∂ x$. Note that we replace a nondifferentiable penalty function with a differentiable approximation $Ƥ$ [33,39]. For determining the group mean $α g$, the condition (29) does not change. For determining $ν ˜ i g$, we get the estimating equation
$− w 1 i g ( μ i g − α g − ν i 0 − ν ˜ i g ) + Ƥ 1 ( κ , ν ˜ i g ) = − w 1 i g ( ν i g ∗ − ν ˜ i g ) + Ƥ 1 ( κ , ν ˜ i g ) = 0 .$
By using (31), there exists a function $Q$ such that $ν ˜ i g = Q ( ν i g ∗ )$. Moreover, by summing (31) across items $i = 1 , … , I$, we receive
$∑ i = 1 I [ − w 1 i g ( ν i g ∗ − ν ˜ i g ) + Ƥ 1 ( κ , ν ˜ i g ) ] = 0 .$
Because (29) holds, we get
$∑ i = 1 I w 1 i g ( ν i g ∗ − ν ˜ i g ) = 0 and ∑ i = 1 I Ƥ 1 ( κ , ν ˜ i g ) = 0 .$
This means that estimated effects $ν ˜ i g = Q ( ν i g ∗ )$ somehow vanish on average according to their contribution in $Ƥ 1$.
Instead of introducing group-specific residual item intercepts $ν i g ∗$ in regularized ML estimation, one can employ fused regularized ML estimation [40] that relies on overidentified group-specific item intercepts $ν ˘ i g$. The fitting function of fused regularized ML estimation is defined as
$F fusedreg ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ( μ i g − α g − ν ˘ i g ) 2 + ∑ g = 1 G − 1 ∑ h = g + 1 G ∑ i = 1 I Ƥ ( κ , ν ˘ i g − ν ˘ i h ) .$
In this case, the parameter vector $θ$ does not include common item intercepts $ν 0$. The nonidentification issue of ML estimation is solved by defining a penalty function $Ƥ$ on deviations $ν ˘ i g − ν ˘ i h$. By using lasso-type penalty functions in (32), clusters of $ν ˘ i g$ coefficients will be obtained. If there are only a few outlying parameters for each item, estimated group means $α g$ from fused regularized ML using the fitting function in (32) will often be similar to those in regularized ML estimation using the fitting function in (28).
Another popular approach to handling MNI is the Bayesian approximate measurement invariance model (BAMI; [41,42,43]). The tau-equivalent model is estimated with an overidentified parameter vector that includes all item intercepts $ν i g$. To ensure the identification of the model, a normal prior with known variance on all pairwise deviations $ν i g − ν i h$ is posed. The normal prior distribution on $ν i g − ν i h$ can be regarded as a ridge penalty function of the form $κ ˜ ( ν i g − ν i h ) 2$ (see, e.g., [44]). Hence, BAMI can be recognized as fused regularized ML estimation with a particular penalty function in (32).
Interestingly, Battauz [39] showed for regularized estimation in the four-parameter item response model that the ridge penalty on differences $ν i g − ν i h$ can be rewritten as a penalty for residual item intercepts $ν ˜ i g = ν i g − ν i 0$:
$κ ˜ ∑ g = 1 G ∑ h = 1 G ∑ i = 1 I ( ν i g − ν i h ) 2 = κ ∑ g = 1 G ∑ i = 1 I ν ˜ i g 2$
using an appropriate tuning parameter $κ$. By replacing the Markov chain Monte Carlo estimation method of the BAMI model with regularized ML estimation, we obtain the fitting function
$F BAMI ( θ ) = ∑ g = 1 G ∑ i = 1 I w 1 i g ( μ i g − α g − ν i 0 − ν ˜ i g ) 2 + κ ∑ g = 1 G ∑ i = 1 I ν ˜ i g 2 ,$
which is regularized ML estimation using a ridge penalty function. For determining group means $α g$, we get the estimating equation
$∑ i = 1 I w 1 i g ( μ i g − α g − ν i 0 − ν ˜ i g ) = 0 .$
For determining $ν ˜ i g$, we get
$− w 1 i g ( μ i g − α g − ν i 0 − ν ˜ i g ) + κ ν ˜ i g = 0 .$
Hence, the estimated group-specific residual item intercepts are shrunken estimates of $ν i g ∗$
$ν ˜ i g = w 1 i g / ( w 1 i g + κ ) ( μ i g − α g − ν i 0 ) = w 1 i g / ( w 1 i g + κ ) ν i g ∗$
that fulfill $∑ i = 1 I ν ˜ i g = 0$. By using (34), we get from (33)
$∑ i = 1 I w 1 i g ( ν i g ∗ − ν ˜ i g ) = ∑ i = 1 I κ w 1 i g / ( w 1 i g + κ ) ν i g ∗ = 0 .$
With equal weights $w 1 i g$ for all items within a group g, this shows that BAMI and ML pose the same identification constraints on $ν i g ∗$; that is, $∑ i = 1 I ν i g ∗ = 0$.
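The shrinkage formula (34) can be verified directly; the following sketch with assumed toy values confirms that $ν ˜ i g = w 1 i g / ( w 1 i g + κ ) ν i g ∗$ solves the estimating equation:

```python
import numpy as np

# Toy check (assumed values) that nu_tilde = w / (w + kappa) * nu_star solves
# -w * (nu_star - nu_tilde) + kappa * nu_tilde = 0 for each item.
w, kappa = 1.2, 0.5
nu_star = np.array([0.3, -0.1, 0.7, -0.9])

nu_tilde = w / (w + kappa) * nu_star
residual = -w * (nu_star - nu_tilde) + kappa * nu_tilde
print(np.allclose(residual, 0.0))       # the closed form satisfies the equation
print(nu_tilde / nu_star)               # uniform shrinkage factor w / (w + kappa)
```

The shrinkage factor is the same for all items within a group, which is why the ridge penalty leaves the weighted identification constraint on $ν i g ∗$ unchanged.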
A variant of IA has been proposed that uses the output of BAMI for determining the alignment solution [45,46,47]. BAMI produces adjusted group-specific item means $μ ˜ i g = α g + ν i 0 + ν ˜ i g$, which are subsequently used as input data for Bayesian IA. Note that
$μ ˜ i g = α g + ν i 0 + ν ˜ i g = μ i g + ν ˜ i g − ν i g ∗ = μ i g − κ / ( w 1 i g + κ ) ν i g ∗ .$
We now substitute the quantity $μ ˜ i g$ obtained in (35) in the IA fitting function (see (25) and weights $w ˜ 1 i g$ defined in (26)):
$H ˜ ( θ ) = ∑ g = 1 G ∑ i = 1 I w ˜ 1 i g ρ ( μ ˜ i g − α g − ν i 0 ) = ∑ g = 1 G ∑ i = 1 I w ˜ 1 i g ρ ( μ i g − κ / ( w 1 i g + κ ) ν i g ∗ − α g − ν i 0 ) .$
The estimating equation for $α g$ is then given by
$∑ i = 1 I w ˜ 1 i g ρ ′ ( μ i g − κ / ( w 1 i g + κ ) ν i g ∗ − α g − ν i 0 ) = 0 .$
Using the definition $μ i g = α g + ν i 0 + ν i g ∗$, we get the identification constraint
$∑ i = 1 I w ˜ 1 i g ρ ′ ( w 1 i g / ( w 1 i g + κ ) ν i g ∗ ) = ∑ i = 1 I w ˜ 1 i g ρ ′ ( ν ˜ i g ) = 0 .$
When equal weights $w 1 i g$ (and $w ˜ 1 i g$) are used in (36), the identification constraints $∑ i = 1 I ρ ′ ( ω ν i g ∗ ) = 0$ with a scaling factor $ω > 0$ are obtained. For $ρ ( x ) = x 2$, it holds that $ρ ′ ( x ) = 2 x$, and we receive the same identification constraints as those obtained with ULS estimation or DWLS estimation with equal weights.

## 4. Group Comparisons in the Tau-Congeneric Model with Noninvariant Item Intercepts

In the following, we discuss group comparisons in the tau-congeneric model. We investigate the consequences of MNI in item intercepts but assume MI in item loadings; that is, item loadings are invariant, and metric invariance holds [6,7]. It will be shown that the derivations for the tau-equivalent model only have to be slightly modified.
For the following examinations, we assume common loadings $\lambda_0$, where the first loading $\lambda_{10}$ is fixed to 1 for reasons of identification in the data-generating model
$\mu_g = \alpha_g \lambda_0 + \nu_0 + \nu_g^\ast \quad \text{and} \quad \Sigma_g = \psi_g \lambda_0 \lambda_0^\top + \Phi_g ,$
where $\Phi_g$ is a diagonal matrix of group-specific residual variances. The violation of MI only pertains to the mean structure due to the presence of group-specific residual item intercepts $\nu_g^\ast$.

#### 4.1. Joint Estimation

In the joint estimation of the tau-congeneric measurement model, the parameter of interest is given as $\theta = ( \nu_0, \lambda_0, \alpha_2, \ldots, \alpha_G, \psi_1, \ldots, \psi_G, \phi_1, \ldots, \phi_G )$, where $\phi_g$ contains the diagonal entries of $\Phi_g$. The general fitting function is defined by
$F(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig}\, \rho( \mu_{ig} - \alpha_g \lambda_{i0} - \nu_{i0} ) + \sum_{g=1}^{G} \sum_{i=1}^{I} \sum_{j=i}^{I} w_{2ijg}\, \rho( \sigma_{ijg} - \psi_g \lambda_{i0} \lambda_{j0} - \phi_{ig} \mathbf{1}\{ i = j \} ) .$
Because the covariance structure is correctly specified, common item loadings $\lambda_0$ and group-specific residual variance matrices $\Phi_g$ can be uniquely determined by minimizing the second term in (37). Hence, for determining group means $\alpha_g$, only the first term in (37) must be considered, which results in the fitting function
$F_1(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig}\, \rho( \mu_{ig} - \alpha_g \lambda_{i0} - \nu_{i0} ) .$
Taking the same steps as in Section 3.1 (see Equation (21)), we get the identification constraints
$\sum_{i=1}^{I} w_{1ig}\, \lambda_{i0}\, \rho'( \nu_{ig}^\ast ) = 0 \quad \text{for all } g = 1, \ldots, G .$
Comparing (39) with (21), it is important to note that the common item loadings $\lambda_{i0}$ now also enter the identification constraint (see also [48]).
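For $\rho(x) = x^2$, the constraint (39) reduces to a weighted least-squares normal equation in which the loadings $\lambda_{i0}$ act as additional weights. A small Python sketch (all parameter values hypothetical) verifies that the fitted residual intercepts satisfy the loading-weighted zero-sum constraint:

```python
# Sketch with hypothetical values: square loss => weighted least squares.
# Data follow mu_ig = alpha_g * lam_i0 + nu_i0 + nu*_ig (invariant loadings).

def alpha_wls(mu, lam, nu0, w):
    """Closed-form minimizer of sum_i w_i (mu_i - alpha*lam_i - nu0_i)^2."""
    num = sum(wi * li * (m - n) for wi, li, m, n in zip(w, lam, mu, nu0))
    den = sum(wi * li * li for wi, li in zip(w, lam))
    return num / den

lam = [1.0, 0.8, 1.2]
nu0 = [0.0, 0.2, -0.1]
nu_star = [0.3, -0.1, 0.0]        # residual intercepts (MNI)
alpha_gen = 0.5                   # generating group mean
mu = [alpha_gen * l + n + s for l, n, s in zip(lam, nu0, nu_star)]
w = [1.0, 1.0, 1.0]

alpha_hat = alpha_wls(mu, lam, nu0, w)
resid = [m - alpha_hat * l - n for m, l, n in zip(mu, lam, nu0)]
# implied constraint (39) with rho'(x) = 2x: loading-weighted residuals vanish
constraint = sum(wi * li * r for wi, li, r in zip(w, lam, resid))
```

The estimated mean absorbs part of the noninvariance ($\hat{\alpha}_g \neq 0.5$ here), while the implied constraint holds by construction.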

#### 4.2. Linking

We now investigate HL in the tau-congeneric model [21] (see [48] for a similar approach). Again, identified parameters (i.e., item intercepts $\nu_{ig}$ and item loadings $\lambda_{ig}$) are first obtained in group-wise estimations
$\nu_{ig} = \alpha_g \lambda_{i0} + \nu_{i0} + \nu_{ig}^\ast \quad \text{and} \quad \lambda_{ig} = \psi_g^{1/2} \lambda_{i0} .$
From (40) and assuming $\lambda_{10} = 1$ for the first item for reasons of identification, it can be seen that all group variances $\psi_g$ can be uniquely identified; we directly get $\psi_g = \lambda_{1g}^2$. For determining group means $\alpha_g$, we define a linking function $H$ for the parameter $\theta = ( \nu_{10}, \ldots, \nu_{I0}, \alpha_2, \ldots, \alpha_G )$ of interest:
$H(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig}\, \rho( \nu_{ig} - \alpha_g \lambda_{i0} - \nu_{i0} ) .$
As in Section 3.2, it can be seen that the minimization problem in (41) corresponds to the problem in (38). Hence, the same identification constraints as in (39) are obtained.
As also discussed in Section 3.2, one can argue that IA is very similar to the linking problem in (41) and therefore resembles joint estimation when an appropriate loss function $\rho$ is used.

#### 4.3. Regularization

The fitting function for the mean structure of regularized ML estimation for the tau-congeneric model with invariant item loadings can be written as
$F_{\mathrm{reg}}(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig} ( \mu_{ig} - \alpha_g \lambda_{i0} - \nu_{i0} - \tilde{\nu}_{ig} )^2 + \sum_{g=1}^{G} \sum_{i=1}^{I} P( \kappa, \tilde{\nu}_{ig} ) .$
Arguments similar to those in Section 3.3 can be made for demonstrating the equivalence of regularized ML estimation using (42) and joint estimation using a robust loss function $\rho$ (see (38)).
Findings similar to those in Section 3.3 regarding the equivalence of fused regularized ML estimation and BAMI can also be obtained. The similarity of linking methods and regularized ML estimation was also pointed out by [48] in item response models for dichotomous items.
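The mechanism behind this equivalence can be sketched concretely: under a lasso penalty $P(\kappa, \tilde{\nu}) = \kappa |\tilde{\nu}|$, the update of each residual intercept for fixed $\alpha_g$ is a one-dimensional soft-thresholding step, so only strongly noninvariant items receive a nonzero $\tilde{\nu}_{ig}$, mimicking a robust loss [37]. A schematic Python sketch with made-up residuals:

```python
# Schematic sketch (residuals invented): with the lasso penalty
# P(kappa, v) = kappa * |v|, the residual intercept nu~_ig given alpha_g
# is the soft-threshold of the raw residual mu_ig - alpha_g - nu_i0.

def soft_threshold(x, kappa):
    """argmin_v of (x - v)^2 / 2 + kappa * |v| (unit weight)."""
    if x > kappa:
        return x - kappa
    if x < -kappa:
        return x + kappa
    return 0.0

residuals = [0.05, -0.02, 0.90]   # mu_ig - alpha_g - nu_i0 per item
kappa = 0.2
nu_tilde = [soft_threshold(r, kappa) for r in residuals]
# small residuals are set exactly to zero; the large one is shrunk by kappa
```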

## 5. Group Comparisons in the Tau-Congeneric Model with Noninvariant Item Intercepts and Noninvariant Item Loadings

Finally, we consider the estimation of the tau-congeneric measurement model. The data-generating model allows for noninvariant item intercepts and noninvariant item loadings.
Let $x \circ y$ denote the Hadamard product (i.e., the element-wise multiplication of vectors $x$ and $y$). We assume
$\mu_g = \alpha_g\, \lambda_0 \circ \lambda_g^\ast + \nu_0 + \nu_g^\ast \quad \text{and} \quad \Sigma_g = \psi_g ( \lambda_0 \circ \lambda_g^\ast ) ( \lambda_0 \circ \lambda_g^\ast )^\top + \Phi_g .$
For the entries of $\mu_g$ and $\Sigma_g$, we get from (43)
$\mu_{ig} = \alpha_g \lambda_{i0} \lambda_{ig}^\ast + \nu_{i0} + \nu_{ig}^\ast \quad \text{and} \quad \sigma_{ijg} = \psi_g \lambda_{i0} \lambda_{j0} \lambda_{ig}^\ast \lambda_{jg}^\ast + \phi_{ig} \mathbf{1}\{ i = j \} .$
Then, MNI is represented in $\lambda_g^\ast$ and $\nu_g^\ast$. Note that a value of $\lambda_{ig}^\ast$ equal to 1 indicates MI of a particular item parameter, while MNI is represented by values different from 1.
It is important to emphasize that deviations from MI in item loadings are modeled as multiplicative effects. This assumption implies an additive representation for the logarithmized identified group-specific item loadings $l_{ig} = \log \lambda_{ig}$:
$l_{ig} = f_g + l_{i0} + l_{ig}^\ast ,$
where $l_{i0} = \log \lambda_{i0}$, $l_{ig}^\ast = \log \lambda_{ig}^\ast$, and $f_g = \frac{1}{2} \log \psi_g$. Instead of treating deviations as multiplicative errors, one could also assume additive deviations ([49]; see also [50]) such that $\lambda_{ig} = \psi_g^{1/2} ( \lambda_{i0} + \lambda_{ig}^\ast )$. In this case, the group-specific means and covariances can be written as
$\mu_{ig} = \alpha_g ( \lambda_{i0} + \lambda_{ig}^\ast ) + \nu_{i0} + \nu_{ig}^\ast \quad \text{and} \quad \sigma_{ijg} = \psi_g ( \lambda_{i0} + \lambda_{ig}^\ast ) ( \lambda_{j0} + \lambda_{jg}^\ast ) + \phi_{ig} \mathbf{1}\{ i = j \} .$
Note the difference to the parameterization in (44).
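The two parameterizations are connected by $\lambda^\ast_{\mathrm{add}} = \lambda_{i0} ( \lambda^\ast_{\mathrm{mult}} - 1 )$, so both reproduce the same identified loading. A quick numeric check with hypothetical values:

```python
# Numeric check (hypothetical values): multiplicative and additive
# residual loadings describe the same identified loading when
# lam_add = lam_i0 * (lam_mult - 1).
lam_i0 = 0.8
lam_mult = 1.25                       # multiplicative MNI effect
lam_add = lam_i0 * (lam_mult - 1.0)   # equivalent additive MNI effect
psi_g = 1.44

lam_ig_mult = psi_g ** 0.5 * lam_i0 * lam_mult   # multiplicative form (cf. (44))
lam_ig_add = psi_g ** 0.5 * (lam_i0 + lam_add)   # additive form (cf. (45))
```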
We now study the effects of MNI in intercepts and loadings in different estimation approaches for determining group means and group variances in the tau-congeneric model.

#### 5.1. Joint Estimation

In joint estimation, the parameter of interest is given by $\theta = ( \nu_0, \lambda_0, \alpha_2, \ldots, \alpha_G, \psi_2, \ldots, \psi_G, \phi_1, \ldots, \phi_G )$, while we set $\alpha_1 = 0$ and $\psi_1 = 1$ in the estimation. We consider the general fitting function
$F(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig}\, \rho( \mu_{ig} - \alpha_g \lambda_{i0} - \nu_{i0} ) + \sum_{g=1}^{G} \sum_{i=1}^{I} \sum_{j=i}^{I} w_{2ijg}\, \rho( \sigma_{ijg} - \psi_g \lambda_{i0} \lambda_{j0} - \phi_{ig} \mathbf{1}\{ i = j \} ) .$
We first investigate the determination of group variances $\psi_g$. Assume that common item loadings $\lambda_0$ have already been determined. Then, we get the estimating equation by taking $\partial F / \partial \psi_g$:
$\sum_{i=1}^{I-1} \sum_{j=i+1}^{I} w_{2ijg}\, \lambda_{i0} \lambda_{j0}\, \rho'( \sigma_{ijg} - \psi_g \lambda_{i0} \lambda_{j0} ) = 0 .$
Note that $\phi_{ig}$ can be uniquely determined, and there is a vanishing contribution of the terms for $i = j$. Using (44), this implies the identification constraint
$\sum_{i=1}^{I-1} \sum_{j=i+1}^{I} w_{2ijg}\, \lambda_{i0} \lambda_{j0}\, \rho'\!\left( \psi_g \lambda_{i0} \lambda_{j0} ( \lambda_{ig}^\ast \lambda_{jg}^\ast - 1 ) \right) = 0 .$
For the loss function $\rho(x) = |x|^p$, it holds that $\rho'( \lambda x ) \propto \rho'( \lambda )\, \rho'( x )$ for any $\lambda \geq 0$, and we thus get from (46)
$\sum_{i=1}^{I-1} \sum_{j=i+1}^{I} w_{2ijg}\, \lambda_{i0} \lambda_{j0}\, \rho'( \psi_g \lambda_{i0} \lambda_{j0} )\, \rho'( \lambda_{ig}^\ast \lambda_{jg}^\ast - 1 ) = 0 .$
Group-specific residual item loadings $\lambda_{ig}^\ast$ vanish on average according to the identification constraint (47). We can further specialize (47) for DWLS estimation (which can also approximate ML estimation), which employs the square loss function $\rho(x) = x^2$:
$\sum_{i=1}^{I-1} \sum_{j=i+1}^{I} w_{2ijg}\, \lambda_{i0}^2 \lambda_{j0}^2\, \psi_g ( \lambda_{ig}^\ast \lambda_{jg}^\ast - 1 ) = 0 .$
Now, we determine group means $\alpha_g$. Assume that common item loadings $\lambda_0$ and intercepts $\nu_0$ have already been determined. By taking $\partial F / \partial \alpha_g$, we get the identification constraints
$\sum_{i=1}^{I} w_{1ig}\, \lambda_{i0}\, \rho'\!\left( \alpha_g \lambda_{i0} ( \lambda_{ig}^\ast - 1 ) + \nu_{ig}^\ast \right) = 0 .$
For DWLS estimation, we get the identification constraint
$\sum_{i=1}^{I} w_{1ig}\, \lambda_{i0} \left( \alpha_g \lambda_{i0} ( \lambda_{ig}^\ast - 1 ) + \nu_{ig}^\ast \right) = 0 .$
It can be seen that both MNI in loadings due to $\lambda_{ig}^\ast$ and MNI in intercepts due to $\nu_{ig}^\ast$ determine the group mean $\alpha_g$.
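This can be made concrete for the square loss: solving the estimating equation for $\hat{\alpha}_g$ with data generated under one noninvariant loading, but fully invariant intercepts, already shifts the estimated group mean away from the generating value. A Python sketch with assumed values:

```python
# Sketch with assumed values: under the square loss, the estimating
# equation sum_i w_i lam_i0 (mu_ig - alpha*lam_i0 - nu_i0) = 0 has the
# closed-form solution below. One noninvariant loading (lam*_2 = 1.3)
# biases alpha_hat even though all intercepts are invariant (nu* = 0).

lam_i0 = [1.0, 0.8, 1.2]
lam_star = [1.0, 1.3, 1.0]     # multiplicative loading MNI for item 2
nu_i0 = [0.1, -0.2, 0.0]
alpha_gen = 1.0                # generating group mean
mu = [alpha_gen * l0 * ls + n0 for l0, ls, n0 in zip(lam_i0, lam_star, nu_i0)]

num = sum(l0 * (m - n0) for l0, m, n0 in zip(lam_i0, mu, nu_i0))
den = sum(l0 * l0 for l0 in lam_i0)
alpha_hat = num / den          # larger than alpha_gen because lam*_2 > 1
```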
For additive deviations from MI that follow (45), the condition (46) is replaced by
$\sum_{i=1}^{I-1} \sum_{j=i+1}^{I} w_{2ijg}\, \lambda_{i0} \lambda_{j0}\, \rho'\!\left( \psi_g ( \lambda_{i0} \lambda_{jg}^\ast + \lambda_{ig}^\ast \lambda_{j0} + \lambda_{ig}^\ast \lambda_{jg}^\ast ) \right) = 0 .$
The condition (49) is replaced by
$\sum_{i=1}^{I} w_{1ig}\, \lambda_{i0}\, \rho'( \alpha_g \lambda_{ig}^\ast + \nu_{ig}^\ast ) = 0 .$
Due to the arbitrariness of using either the multiplicative or the additive representation of MNI effects $\lambda_{ig}^\ast$, the different identification conditions obtained should not be interpreted as conflicting findings but rather as different ways of representing the same identification condition.

#### 5.2. Linking

This subsection discusses the estimation of group means and group variances for linking approaches. We assume that item loadings follow the multiplicative representation of MNI in (44). This corresponds to an additivity assumption for logarithmized item loadings.
At first, the tau-congeneric measurement model is estimated separately in each group. By assuming $\mathrm{E}( F_g ) = 0$ and $\mathrm{Var}( F_g ) = 1$, identified group-specific item intercepts $\nu_{ig}$ and group-specific item loadings $\lambda_{ig}$ are given as
$\nu_{ig} = \alpha_g \lambda_{i0} \lambda_{ig}^\ast + \nu_{i0} + \nu_{ig}^\ast \quad \text{and} \quad \lambda_{ig} = \psi_g^{1/2} \lambda_{i0} \lambda_{ig}^\ast .$
Note that (50) can be equivalently written as
$\nu_{ig} = \alpha_g \lambda_{i0} \lambda_{ig}^\ast + \nu_{i0} + \nu_{ig}^\ast \quad \text{and} \quad \log \lambda_{ig} = \frac{1}{2} \log \psi_g + \log \lambda_{i0} + \log \lambda_{ig}^\ast .$
In HL [21,28], common logarithmized item loadings $l_0 = ( l_{10}, \ldots, l_{I0} ) = \log \lambda_0$ and logarithmized group variances $f_g = \frac{1}{2} \log \psi_g$ are computed in the first step. Note that $\psi_g = \exp( 2 f_g )$. The linking function $H_2$ for $\theta = ( f_2, \ldots, f_G, l_0 )$ under the identification constraint $f_1 = 0$ (i.e., $\psi_1 = 1$) is defined as
$H_2(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{2ig}\, \rho( l_{ig} - f_g - l_{i0} ) ,$
where $l_{ig} = \log \lambda_{ig}$. For determining $f_g$ (and, hence, the group variance $\psi_g$), applying $\partial H_2 / \partial f_g$ and considering (51) provides the identification constraint
$\sum_{i=1}^{I} w_{2ig}\, \rho'( \log \lambda_{ig}^\ast ) = 0 .$
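For $\rho(x) = x^2$, this HL step reduces to averaging $l_{ig} - l_{i0}$ across items; when the log-residual loadings average to zero, $\psi_g$ is recovered exactly. An illustrative Python sketch with assumed values:

```python
# Illustrative sketch (assumed values): HL step 1 with rho(x) = x^2
# estimates f_g = (1/2) log psi_g as the mean of l_ig - l_i0; psi_g is
# then recovered as exp(2 f_g). The multiplicative residual loadings
# below have log-mean zero, so psi_g is recovered exactly.
import math

lam_i0 = [1.0, 0.8, 1.2, 0.9]          # common loadings
psi_g = 1.69                           # generating group variance
lam_star = [1.0, 1.1, 1.0 / 1.1, 1.0]  # multiplicative MNI, log-mean zero

l_ig = [math.log(psi_g ** 0.5 * l0 * ls) for l0, ls in zip(lam_i0, lam_star)]
l_i0 = [math.log(l0) for l0 in lam_i0]
f_g = sum(a - b for a, b in zip(l_ig, l_i0)) / len(lam_i0)
psi_hat = math.exp(2.0 * f_g)
```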
In the second step of HL, group means $\alpha_g$ are determined based on the identified item intercepts $\nu_{ig}$, the identified item loadings $\lambda_{ig}$ (see Equation (50)), and the group variances $\psi_g$ obtained in the first step. We discuss a variant of HL (see Equation (25) in [21])
$H_1(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig}\, \rho\!\left( \nu_{ig} - \nu_{i0} - \psi_g^{-1/2} \lambda_{ig}\, \alpha_g \right)$
for $\theta = ( \alpha_2, \ldots, \alpha_G, \nu_0 )$ and employing the identification constraint $\alpha_1 = 0$. For determining group means $\alpha_g$, we consider $\partial H_1 / \partial \alpha_g$ and get from (52) the identification constraints
$\sum_{i=1}^{I} w_{1ig}\, \psi_g^{-1/2} \lambda_{ig}\, \rho'\!\left( \nu_{ig} - \nu_{i0} - \psi_g^{-1/2} \lambda_{ig}\, \alpha_g \right) = \sum_{i=1}^{I} w_{1ig}\, \lambda_{i0} \lambda_{ig}^\ast\, \rho'( \nu_{ig}^\ast ) = 0 .$
Another popular linking approach is IA [29]. Although it was originally discussed as a simultaneous linking method for determining group means and group variances, it has been shown in [21] that alignment is equivalent to a two-step linking approach.
In the first step of IA, group standard deviations $p_g$, defined by $\psi_g = p_g^2$, are determined by minimizing the linking function
$G_2(\theta) = \sum_{g=1}^{G} \sum_{h=1}^{G} \sum_{i=1}^{I} w_{2igh}\, \rho\!\left( p_g^{-1} \lambda_{ig} - p_h^{-1} \lambda_{ih} \right) ,$
where $\theta = ( p_2, \ldots, p_G )$ and the identification constraint $p_1 = 1$ (and, hence, $\psi_1 = 1$) is used. For determining $p_g$ (and subsequently $\psi_g$), we receive the following identification condition by taking $\partial G_2 / \partial p_g$:
$\sum_{h=1}^{G} \sum_{i=1}^{I} w_{2igh}\, \lambda_{i0} \lambda_{ig}^\ast\, \rho'\!\left( \lambda_{i0} ( \lambda_{ig}^\ast - \lambda_{ih}^\ast ) \right) = 0 .$
In the second step of IA, group means $\alpha_g$ under the identification constraint $\alpha_1 = 0$ are obtained by minimizing the linking function for $\theta = ( \alpha_2, \ldots, \alpha_G )$:
$G_1(\theta) = \sum_{g=1}^{G} \sum_{h=1}^{G} \sum_{i=1}^{I} w_{1igh}\, \rho\!\left( \nu_{ig} - \nu_{ih} - \psi_g^{-1/2} \lambda_{ig}\, \alpha_g + \psi_h^{-1/2} \lambda_{ih}\, \alpha_h \right) .$
Then, we get the following identification condition by taking $\partial G_1 / \partial \alpha_g$:
$\sum_{h=1}^{G} \sum_{i=1}^{I} w_{1igh}\, \psi_g^{-1/2} \lambda_{ig}\, \rho'\!\left( \nu_{ig} - \nu_{ih} - \psi_g^{-1/2} \lambda_{ig}\, \alpha_g + \psi_h^{-1/2} \lambda_{ih}\, \alpha_h \right) = \sum_{h=1}^{G} \sum_{i=1}^{I} w_{1igh}\, \lambda_{i0} \lambda_{ig}^\ast\, \rho'( \nu_{ig}^\ast - \nu_{ih}^\ast ) = 0 .$
It has been shown in [21] that HL and IA provide very similar results for the tau-congeneric model if the same loss function $\rho(x) = |x|^p$ for $p \in [0, 1]$ is utilized. We can rely on the subadditivity property (24) for finding a majorizing function $\tilde{H}_1$ in (53):
$G_1(\theta) \leq \sum_{g=1}^{G} \sum_{i=1}^{I} \tilde{w}_{1ig}\, \rho\!\left( \nu_{ig} - \nu_{i0} - \psi_g^{-1/2} \lambda_{ig}\, \alpha_g \right) = \tilde{H}_1( \tilde{\theta} ) ,$
where $\tilde{\theta}$ now also includes common item intercepts $\nu_0$, and $\tilde{w}_{1ig}$ are appropriately defined weights. By minimizing $\tilde{H}_1$ for determining the group mean $\alpha_g$, we receive the identification constraint
$\sum_{i=1}^{I} \tilde{w}_{1ig}\, \lambda_{i0} \lambda_{ig}^\ast\, \rho'( \nu_{ig}^\ast ) = 0 .$

#### 5.3. Regularization

We now discuss identification constraints in regularized ML estimation of the multiple-group tau-congeneric measurement model. Deviations from MI in item loadings can be modeled either in a multiplicative (see (44)) or an additive (see (45)) form. In the following, we assume additive effects because this specification appears more frequently in practical applications.
In regularized ML estimation, we collect in the parameter vector $\theta$ the parameters $\alpha_g$, $\nu_{i0}$, $\tilde{\nu}_{ig}$, $\psi_g$, $\lambda_{i0}$, and $\tilde{\lambda}_{ig}$. Moreover, we set $\alpha_1 = 0$ and $\psi_1 = 1$ for reasons of identification and define the fitting function
$F_{\mathrm{reg}}(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig} \left( \mu_{ig} - \alpha_g ( \lambda_{i0} + \tilde{\lambda}_{ig} ) - \nu_{i0} - \tilde{\nu}_{ig} \right)^2 + \sum_{g=1}^{G} \sum_{i=1}^{I} \sum_{j=i}^{I} w_{2ijg} \left( \sigma_{ijg} - \psi_g ( \lambda_{i0} + \tilde{\lambda}_{ig} ) ( \lambda_{j0} + \tilde{\lambda}_{jg} ) - \phi_{ig} \mathbf{1}\{ i = j \} \right)^2 + \sum_{g=1}^{G} \sum_{i=1}^{I} P( \kappa_1, \tilde{\nu}_{ig} ) + \sum_{g=1}^{G} \sum_{i=1}^{I} P( \kappa_2, \tilde{\lambda}_{ig} ) ,$
where $\kappa_1$ and $\kappa_2$ are regularization parameters for group-specific residual intercepts $\tilde{\nu}_{ig}$ and residual loadings $\tilde{\lambda}_{ig}$, respectively. Using the additive representation of MNI (45), we get the following identification constraint for determining the group mean $\alpha_g$:
$\sum_{i=1}^{I} w_{1ig} \left( \nu_{ig}^\ast - \tilde{\nu}_{ig} + \alpha_g ( \lambda_{ig}^\ast - \tilde{\lambda}_{ig} ) \right) = 0 .$
The condition (55) means that the MNI effects $\nu_{ig}^\ast$ and $\lambda_{ig}^\ast$ cancel out on average, where the average is mainly computed over those effects that are set to zero in regularized ML.
As an alternative approach, fused regularized ML estimation can be employed. In this approach, all item intercepts and item loadings are estimated group-wise, and identification is ensured using a fused penalty function. The fitting function is defined as
$F_{\mathrm{fusedreg}}(\theta) = \sum_{g=1}^{G} \sum_{i=1}^{I} w_{1ig} \left( \mu_{ig} - \alpha_g \breve{\lambda}_{ig} - \breve{\nu}_{ig} \right)^2 + \sum_{g=1}^{G} \sum_{i=1}^{I} \sum_{j=i}^{I} w_{2ijg} \left( \sigma_{ijg} - \psi_g \breve{\lambda}_{ig} \breve{\lambda}_{jg} - \phi_{ig} \mathbf{1}\{ i = j \} \right)^2 + \sum_{g=1}^{G} \sum_{h=g+1}^{G} \sum_{i=1}^{I} P( \kappa_1, \breve{\nu}_{ig} - \breve{\nu}_{ih} ) + \sum_{g=1}^{G} \sum_{h=g+1}^{G} \sum_{i=1}^{I} P( \kappa_2, \breve{\lambda}_{ig} - \breve{\lambda}_{ih} ) .$
With additive MNI effects (see (45)), the identification constraint for the group mean is given by
$\sum_{i=1}^{I} w_{1ig} \left( \alpha_g ( \lambda_{i0} + \lambda_{ig}^\ast - \breve{\lambda}_{ig} ) + \nu_{i0} + \nu_{ig}^\ast - \breve{\nu}_{ig} \right) = 0 .$
In the tau-congeneric measurement model, BAMI [41,47] uses normal prior distributions for intercept differences $\nu_{ig} - \nu_{ih}$ and loading differences $\lambda_{ig} - \lambda_{ih}$ [51]. Therefore, BAMI can be viewed as fused regularized ML estimation with a ridge penalty. Using the finding of Battauz [39] and the same reasoning as in Section 3.3, it is evident that BAMI as fused regularization with a ridge penalty function is equivalent to regularized ML estimation in (54) that involves the estimation of common item intercepts $\nu_0$ and common item loadings $\lambda_0$. As argued in Section 3.3, regularized ML estimation with a ridge penalty can be quite close to DWLS with appropriate weights (and, hence, ML) in terms of estimated group means and estimated variances because shrinkage is only introduced for otherwise nonidentified residual MNI effects for intercepts and loadings. Consequently, BAMI will provide similar results compared to ML estimation.
As also discussed in Section 3.3 for the tau-equivalent model, the output of BAMI can be used in a subsequent IA estimation [45]. Using the same steps as in Section 3.3 for the derivations, an identification constraint similar to (36) can be obtained.
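The ridge-fused shrinkage underlying the BAMI interpretation above can be sketched for a single item intercept observed in $G$ groups. With hypothetical values, minimizing $\sum_g ( \nu_g - x_g )^2 + \kappa \sum_{g < h} ( x_g - x_h )^2$ pulls the group-specific values toward each other while preserving their sum. The toy coordinate-descent solver below is purely illustrative and is not the BAMI algorithm:

```python
# Toy sketch (not the BAMI algorithm): ridge-fused shrinkage of one
# item's group-specific intercepts. Minimizes
#   sum_g (nu_g - x_g)^2 + kappa * sum_{g<h} (x_g - x_h)^2
# by cyclic coordinate descent; the objective is strictly convex.

def ridge_fused(nu, kappa, sweeps=500):
    x = list(nu)
    G = len(nu)
    for _ in range(sweeps):
        for g in range(G):
            others = sum(x[h] for h in range(G) if h != g)
            # first-order condition in x_g given the other coordinates
            x[g] = (nu[g] + kappa * others) / (1.0 + kappa * (G - 1))
    return x

nu = [0.0, 0.1, 1.0]          # third group's intercept is noninvariant
shrunk = ridge_fused(nu, kappa=0.5)
# exact solution for these values: x_g = (nu_g + 0.55) / 2.5
```

Summing the first-order conditions shows that the pairwise penalty terms cancel, so the total $\sum_g x_g$ equals $\sum_g \nu_g$ at the optimum; only the between-group spread is shrunk.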

## 6. Discussion

In this article, we have argued that joint estimation, linking, and regularized ML estimation in the tau-equivalent and the tau-congeneric model can provide similar, if not identical, estimates under violations of MI if an appropriate loss function $\rho$ is used in joint estimation or linking. Under violations of MI, it is important to emphasize that researchers can use arbitrary identification constraints to determine group means. The resulting estimates depend on the chosen weights and loss function, or on the penalty functions used in regularized ML estimation. Therefore, by choosing a particular fitting function, researchers implicitly define identification constraints on the effects that quantify MNI. The wisdom among applied researchers that partial invariance is necessary for determining group-mean comparisons [24] is unsound because it would imply that a particular loss function should always be preferred in practice.
It is important to emphasize that the choice of a particular loss function weighs discrepancies between sample input data (i.e., means and covariances in factor analysis) and assumed population parameters. Two error types must be distinguished: sampling error and model error. Sampling error can be reduced in large samples, while model error (i.e., MNI in multiple-group factor analysis) does not vanish in large samples. Certain types of misspecification can be downweighted by utilizing a model-robust loss function $\rho$. Consequently, estimated model parameters are not influenced by some misspecifications. We think that the preference for ML estimation in factor analysis (and in structural equation modeling in general) is misguided because ML can only be the most efficient estimation method in (very) large samples and if the model of interest is correctly specified. Because MI is as rare as unicorns in applications of multiple-group factor analysis, we cannot imagine many situations in which a robust loss function should not be preferred. Note that we are discussing robustness in the sense of model misspecification in modeled means and covariances. In contrast, robust factor analysis is mainly devoted to misspecification of the multivariate normal distribution [52].
Joint estimation seems to be the most frequently used approach in the case of measurement invariance. In contrast to regularization approaches, joint estimation does not include additional parameters for model deviations. Hence, joint estimation is less computationally demanding than regularization. Linking approaches are typically implemented as a two-step method. Because the one-dimensional factor model is estimated separately for each group in the first step, the linking approach might be less prone to convergence issues or ill-defined parameter estimates. However, group-wise estimation of the one-dimensional factor model might require a sufficiently large sample size. Hence, linking methods could result in less stable estimates than joint estimation or regularization.
Our arguments in this paper are based on fitting vectors of means and covariances that are computed for factor analysis of continuous items (i.e., assuming a multivariate normal distribution). The arguments likely generalize to the fitting of vectors of thresholds and polychoric correlation matrices for factor analysis of ordinal data [53]. Future research could investigate violations of measurement invariance in the one-dimensional factor model for a continuous covariate instead of a finite number of groups $g = 1 , … , G$ [54].
We limited our discussion to the most popular estimation methods of the multiple-group one-dimensional factor model (i.e., joint estimation, linking, and regularized estimation) under violations of MI. More flexible handling of MNI has recently been proposed using deep learning methods [55]. These fitting functions also imply identification constraints for MNI effects. Because it is our conviction that all fitted models are grossly misspecified (and not only misspecified to a certain degree), there is no reason to believe that more complex models will provide more valid estimates for group comparisons. In contrast, researchers purposely choose fitting functions that describe a complex dataset and define a pseudo-true parameter through the choice of this fitting function. It is likely not reasonable to talk about true parameters (i.e., true group means and true group variances) without explicitly mentioning identification constraints.

## 7. Conclusions

This article presented a formal analysis of different estimation methods in the violation of measurement invariance. We have shown how different fitting functions result in implied identification constraints on parameters that characterize the extent of measurement invariance. In our view, the choice of fitting functions should be mainly made regarding the weighing of model deviations because it is unlikely in practical applications that the doctrine of measurement invariance exactly holds.

## Funding

This research received no external funding.


## Conflicts of Interest

The author declares no conflict of interest.

## Abbreviations

The following abbreviations are used in this manuscript:
- BAMI: Bayesian approximate measurement invariance
- DWLS: diagonally weighted least squares
- HL: Haberman linking
- IA: invariance alignment
- ML: maximum likelihood
- MI: measurement invariance
- MNI: measurement noninvariance
- PI: partial invariance
- ULS: unweighted least squares
- WLS: weighted least squares

## References

1. Bartholomew, D.J. The foundations of factor analysis. Biometrika 1984, 71, 221–232. [Google Scholar] [CrossRef]
2. Jöreskog, K.G. A general approach to confirmatory maximum likelihood factor analysis. Psychometrika 1969, 34, 183–202. [Google Scholar] [CrossRef]
3. Bechger, T.M.; Maris, G. A statistical test for differential item pair functioning. Psychometrika 2015, 80, 317–340. [Google Scholar] [CrossRef] [PubMed]
4. Schulze, D.; Pohl, S. Finding clusters of measurement invariant items for continuous covariates. Struct. Equ. Model. A Multidiscip. J. 2021, 28, 219–228. [Google Scholar] [CrossRef]
5. Robitzsch, A. Robust and nonrobust linking of two groups for the Rasch model with balanced and unbalanced random DIF: A comparative simulation study and the simultaneous assessment of standard errors and linking errors with resampling techniques. Symmetry 2021, 13, 2198. [Google Scholar] [CrossRef]
6. Meredith, W. Measurement invariance, factor analysis and factorial invariance. Psychometrika 1993, 58, 525–543. [Google Scholar] [CrossRef]
7. Millsap, R.E. Statistical Approaches to Measurement Invariance; Routledge: New York, NY, USA, 2011. [Google Scholar] [CrossRef]
8. Davidov, E.; Meuleman, B. Measurement invariance analysis using multiple group confirmatory factor analysis and alignment optimisation. In Invariance Analyses in Large-Scale Studies; van de Vijver, F.J.R., Ed.; OECD: Paris, France, 2019; pp. 13–20. [Google Scholar]
9. Vandenberg, R.J.; Lance, C.E. A review and synthesis of the measurement invariance literature: Suggestions, practices, and recommendations for organizational research. Organ. Res. Methods 2000, 3, 4–70. [Google Scholar] [CrossRef]
10. Wicherts, J.M.; Dolan, C.V. Measurement invariance in confirmatory factor analysis: An illustration using IQ test performance of minorities. Educ. Meas. Issues Pract. 2010, 29, 39–47. [Google Scholar] [CrossRef]
11. Jöreskog, K.G. Statistical analysis of sets of congeneric tests. Psychometrika 1971, 36, 109–133. [Google Scholar] [CrossRef]
12. Lewis, C. Selected topics in classical test theory. In Handbook of Statistics; Rao, C.R., Sinharay, S., Eds.; Elsevier: Amsterdam, The Netherlands, 2006; Volume 26, pp. 29–43. [Google Scholar] [CrossRef]
13. Mellenbergh, G.J. A unidimensional latent trait model for continuous item responses. Multivariate Behav. Res. 1994, 29, 223–236. [Google Scholar] [CrossRef]
14. Steyer, R. Models of classical psychometric test theory as stochastic measurement models: Representation, uniqueness, meaningfulness, identifiability, and testability. Methodika 1989, 3, 25–60. Available online: https://bit.ly/3Js7N3S (accessed on 20 February 2022).
15. Jöreskog, K.G.; Olsson, U.H.; Wallentin, F.Y. Multivariate Analysis with LISREL; Springer: Basel, Switzerland, 2016. [Google Scholar] [CrossRef]
16. Kolenikov, S. Biases of parameter estimates in misspecified structural equation models. Sociol. Methodol. 2011, 41, 119–157. [Google Scholar] [CrossRef]
17. Savalei, V. Understanding robust corrections in structural equation modeling. Struct. Equ. Model. A Multidiscip. J. 2014, 21, 149–160. [Google Scholar] [CrossRef]
18. MacCallum, R.C.; Browne, M.W.; Cai, L. Factor analysis models as approximations. In Factor Analysis at 100; Cudeck, R., MacCallum, R.C., Eds.; Lawrence Erlbaum: Mahwah, NJ, USA, 2007; pp. 153–175. [Google Scholar] [CrossRef]
19. Siemsen, E.; Bollen, K.A. Least absolute deviation estimation in structural equation modeling. Sociol. Methods Res. 2007, 36, 227–265. [Google Scholar] [CrossRef]
20. Van Kesteren, E.J.; Oberski, D.L. Flexible extensions to structural equation models using computation graphs. Struct. Equ. Model. A Multidiscip. J. 2021. [Google Scholar] [CrossRef]
21. Robitzsch, A. Lp loss functions in invariance alignment and Haberman linking with few or many groups. Stats 2020, 3, 246–283. [Google Scholar] [CrossRef]
22. Yuan, K.H.; Marshall, L.L.; Bentler, P.M. Assessing the effect of model misspecifications on parameter estimates in structural equation models. Sociol. Methodol. 2003, 33, 241–265. [Google Scholar] [CrossRef]
23. Davies, P.L. Data Analysis and Approximate Models; CRC Press: Boca Raton, FL, USA, 2014. [Google Scholar] [CrossRef]
24. Byrne, B.M.; Shavelson, R.J.; Muthén, B. Testing for the equivalence of factor covariance and mean structures: The issue of partial measurement invariance. Psychol. Bull. 1989, 105, 456–466. [Google Scholar] [CrossRef]
25. Davies, P.L.; Terbeck, W. Interactions and outliers in the two-way analysis of variance. Ann. Statist. 1998, 26, 1279–1305. [Google Scholar] [CrossRef]
26. Kolen, M.J.; Brennan, R.L. Test Equating, Scaling, and Linking; Springer: New York, NY, USA, 2014. [Google Scholar] [CrossRef]
27. Battauz, M. Multiple equating of separate IRT calibrations. Psychometrika 2017, 82, 610–636. [Google Scholar] [CrossRef]
28. Haberman, S.J. Linking Parameter Estimates Derived from An Item Response Model through Separate Calibrations; Research Report No. RR-09-40; Educational Testing Service: Princeton, NJ, USA, 2009. [Google Scholar] [CrossRef]
29. Asparouhov, T.; Muthén, B. Multiple-group factor analysis alignment. Struct. Equ. Model. A Multidiscip. J. 2014, 21, 495–508. [Google Scholar] [CrossRef]
30. Pokropek, A.; Lüdtke, O.; Robitzsch, A. An extension of the invariance alignment method for scale linking. Psychol. Test Assess. Model. 2020, 62, 303–334. Available online: https://bit.ly/2UEp9GH (accessed on 20 February 2020).
31. Schechter, E. Handbook of Analysis and Its Foundations; Academic Press: San Diego, CA, USA, 1996. [Google Scholar] [CrossRef]
32. Von Davier, M.; von Davier, A.A. A unified approach to IRT scale linking and scale transformations. Methodology 2007, 3, 115–124. [Google Scholar] [CrossRef]
33. Geminiani, E.; Marra, G.; Moustaki, I. Single- and multiple-group penalized factor analysis: A trust-region algorithm approach with integrated automatic multiple tuning parameter selection. Psychometrika 2021, 86, 65–95. [Google Scholar] [CrossRef]
34. Huang, P.H. A penalized likelihood method for multi-group structural equation modelling. Br. J. Math. Stat. Psychol. 2018, 71, 499–522. [Google Scholar] [CrossRef]
35. Li, X.; Jacobucci, R.; Ammerman, B.A. Tutorial on the use of the regsem package in R. Psych 2021, 3, 579–592. [Google Scholar] [CrossRef]
36. Hastie, T.; Tibshirani, R.; Wainwright, M. Statistical Learning with Sparsity: The Lasso and Generalizations; CRC Press: Boca Raton, FL, USA, 2015. [Google Scholar] [CrossRef]
37. She, Y.; Owen, A.B. Outlier detection using nonconvex penalized regression. J. Am. Stat. Assoc. 2011, 106, 626–639. [Google Scholar] [CrossRef]
38. Yu, C.; Yao, W. Robust linear regression: A review and comparison. Commun. Stat. Simul. Comput. 2017, 46, 6261–6282. [Google Scholar] [CrossRef]
39. Battauz, M. Regularized estimation of the four-parameter logistic model. Psych 2020, 2, 269–278. [Google Scholar] [CrossRef]
40. Tibshirani, R.; Saunders, M.; Rosset, S.; Zhu, J.; Knight, K. Sparsity and smoothness via the fused lasso. J. R. Stat. Soc. Ser. B 2005, 67, 91–108. [Google Scholar] [CrossRef]
41. Muthén, B.; Asparouhov, T. Bayesian structural equation modeling: A more flexible representation of substantive theory. Psychol. Methods 2012, 17, 313–335. [Google Scholar] [CrossRef]
42. Pokropek, A.; Schmidt, P.; Davidov, E. Choosing priors in Bayesian measurement invariance modeling: A Monte Carlo simulation study. Struct. Equ. Model. 2020, 27, 750–764. [Google Scholar] [CrossRef]
43. Van de Schoot, R.; Kluytmans, A.; Tummers, L.; Lugtig, P.; Hox, J.; Muthén, B. Facing off with scylla and charybdis: A comparison of scalar, partial, and the novel possibility of approximate measurement invariance. Front. Psychol. 2013, 4, 770. [Google Scholar] [CrossRef]
44. Van Erp, S.; Oberski, D.L.; Mulder, J. Shrinkage priors for Bayesian penalized regression. J. Math. Psychol. 2019, 89, 31–50. [Google Scholar] [CrossRef]
45. Arts, I.; Fang, Q.; Meitinger, K.; van de Schoot, R. Approximate measurement invariance of willingness to sacrifice for the environment across 30 countries: The importance of prior distributions and their visualization. Front. Psychol. 2021, 12, 624032. [Google Scholar] [CrossRef]
46. De Bondt, N.; Van Petegem, P. Psychometric evaluation of the overexcitability questionnaire-two applying Bayesian structural equation modeling (BSEM) and multiple-group BSEM-based alignment with approximate measurement invariance. Front. Psychol. 2015, 6, 1963. [Google Scholar] [CrossRef]
47. Muthén, B.; Asparouhov, T. Recent methods for the study of measurement invariance with many groups: Alignment and random effects. Sociol. Methods Res. 2018, 47, 637–664. [Google Scholar] [CrossRef]
48. Chen, Y.; Li, C.; Xu, G. DIF statistical inference and detection without knowing anchoring items. arXiv 2021, arXiv:2110.11112. [Google Scholar]
49. Carroll, R.J.; Ruppert, D.; Stefanski, L.A.; Crainiceanu, C.M. Measurement Error in Nonlinear Models: A Modern Perspective; Chapman and Hall: New York, NY, USA; CRC: Boca Raton, FL, USA, 2006. [Google Scholar] [CrossRef]
50. Robitzsch, A. A comparison of linking methods for two groups for the two-parameter logistic item response model in the presence and absence of random differential item functioning. Foundations 2021, 1, 116–144. [Google Scholar] [CrossRef]
51. Lek, K.; van de Schoot, R. Bayesian approximate measurement invariance. In Invariance Analyses in Large-Scale Studies; van de Vijver, F.J.R., Ed.; OECD: Paris, France, 2019; pp. 21–35. [Google Scholar]
52. Yuan, K.H.; Bentler, P.M. Robust procedures in structural equation modeling. In Handbook of Latent Variable and Related Models; Lee, S.Y., Ed.; Elsevier: Amsterdam, The Netherlands, 2007; pp. 367–397. [Google Scholar] [CrossRef]
53. Cai, L.; Moustaki, I. Estimation methods in latent variable models for categorical outcome variables. In The Wiley Handbook of Psychometric Testing: A Multidisciplinary Reference on Survey, Scale and Test; Irwing, P., Booth, T., Hughes, D.J., Eds.; Wiley: New York, NY, USA, 2018; pp. 253–277. [Google Scholar] [CrossRef]
54. Hildebrandt, A.; Wilhelm, O.; Robitzsch, A. Complementary and competing factor analytic approaches for the investigation of measurement invariance. Sociol. Methods Res. 2009, 16, 87–102. [Google Scholar]
55. Pokropek, A.; Pokropek, E. Deep neural networks for detecting statistical model misspecifications. The case of measurement invariance. arXiv 2022, arXiv:2107.12757. [Google Scholar] [CrossRef]
 Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
