# A Comparison of Linking Methods for Two Groups for the Two-Parameter Logistic Item Response Model in the Presence and Absence of Random Differential Item Functioning

## Abstract

## 1. Introduction

## 2. Linking Two Groups with the 2PL Model

#### 2.1. 2PL Model

#### 2.2. Linking Design

#### 2.3. Random Differential Item Functioning

#### 2.3.1. Identified Item Parameters in Separate Calibrations in the Two Groups

#### 2.3.2. The Role of Normally Distributed Random DIF in Educational Assessment

## 3. Linking Methods

#### 3.1. Log-Mean-Mean Linking

**Proposition 1.**

**Proof.**

#### 3.2. Mean-Mean Linking

**Proposition 2.**

**Proof.**

#### 3.3. Haberman Linking (HAB and HAB-nolog)

#### 3.4. Invariance Alignment with $p=2$

#### 3.5. Haebara Linking Methods (HAE-asymm, HAE-symm, HAE-joint)

#### 3.6. Recalibration Linking (RC1, RC2, and RC3)

#### 3.7. Anchored Item Parameters

#### 3.8. Concurrent Calibration

## 4. Simulation Study

#### 4.1. Purpose

#### 4.2. Design

#### 4.3. Analysis Methods

#### 4.4. Results

## 5. Empirical Example: Linking PISA 2006 and PISA 2009 for Austria

#### 5.1. Method

#### 5.2. Results

## 6. Discussion

## Funding

## Data Availability Statement

## Conflicts of Interest

## Abbreviations

Abbreviation | Description |
---|---|
1PL | one-parameter logistic model |
2PL | two-parameter logistic model |
ANCH | anchored item parameters |
CC | concurrent calibration |
DIF | differential item functioning |
HAB | Haberman linking with logarithmized item discriminations |
HAB-nolog | Haberman linking with untransformed item discriminations |
HAE | Haebara linking |
HAE-asymm | asymmetric Haebara linking |
HAE-joint | Haebara linking with joint item parameters |
HAE-symm | symmetric Haebara linking |
IA2 | invariance alignment with power $p=2$ |
IRF | item response function |
IRT | item response theory |
logMM | log-mean-mean linking |
LSA | large-scale assessment |
MM | mean-mean linking |
MML | marginal maximum likelihood |
MSE | mean-squared error |
NUDIF | nonuniform differential item functioning |
PIRLS | Progress in International Reading Literacy Study |
PISA | Programme for International Student Assessment |
RC | recalibration linking |
RMSE | root-mean-squared error |
SD | standard deviation |
TIMSS | Trends in International Mathematics and Science Study |
UDIF | uniform differential item functioning |

## Appendix A. Nonidentifiability of DIF Effects Distributions

#### Appendix A.1. DIF Effects for Item Difficulties

#### Appendix A.2. DIF Effects for Item Discriminations

## Appendix B. Proof of Proposition 1

#### Appendix B.1. Consistency for Additive DIF Effects $f_i$ with Condition (I)

#### Appendix B.2. Consistency for Multiplicative DIF Effects $f_i$ with Condition (II)

## Appendix C. Proof of Proposition 2

#### Appendix C.1. Consistency for Additive DIF Effects $f_i$ with Condition (I)

#### Appendix C.2. Consistency for Multiplicative DIF Effects $f_i$ with Condition (II)

## Appendix D. Estimates in Haberman Linking

## Appendix E. Estimates in Invariance Alignment

## Appendix F. Item Parameters Used in the Simulation Study

Item | $a_i$ | $b_i$ |
---|---|---|
1 | 0.95 | −0.97 |
2 | 0.88 | 0.59 |
3 | 0.75 | 0.75 |
4 | 1.29 | −0.79 |
5 | 1.28 | 1.23 |
6 | 1.29 | −1.10 |
7 | 1.25 | −0.67 |
8 | 0.97 | 0.20 |
9 | 0.73 | 1.26 |
10 | 1.27 | 0.05 |
11 | 1.42 | 1.22 |
12 | 0.75 | −0.01 |
13 | 0.50 | 0.20 |
14 | 0.81 | 1.39 |
15 | 1.12 | 0.61 |
16 | 0.78 | −1.00 |
17 | 1.30 | −1.58 |
18 | 0.70 | −1.62 |
19 | 1.29 | 1.06 |
20 | 0.74 | −0.81 |
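As an illustration of how the item parameters above feed into mean-mean (MM) and log-mean-mean (logMM) linking, the following is a minimal sketch rather than the article's implementation. It assumes the 2PL parametrization $P(X_i=1\mid\theta)=\Psi(a_i(\theta-b_i))$, no DIF, and error-free item parameters, so that a separate calibration with a standard normal ability in a group with ability $N(\mu_2,\sigma_2^2)$ identifies $a_{i2}=a_i\sigma_2$ and $b_{i2}=(b_i-\mu_2)/\sigma_2$. The function names and the values $\mu_2=0.3$, $\sigma_2=1.2$ are hypothetical and chosen only for this example.

```python
import math
from statistics import fmean

# First five items of the Appendix F table (a_i, b_i), treated here as the
# group-1 parameters of the common items.
a1 = [0.95, 0.88, 0.75, 1.29, 1.28]
b1 = [-0.97, 0.59, 0.75, -0.79, 1.23]


def identified_in_group2(a, b, mu, sigma):
    """Parameters a separate N(0, 1) calibration would identify in group 2.

    Assumption (not taken from the article text): P = logistic(a * (theta - b)),
    no DIF, no sampling error.
    """
    a2 = [ai * sigma for ai in a]
    b2 = [(bi - mu) / sigma for bi in b]
    return a2, b2


def mean_mean(a1, b1, a2, b2):
    """MM linking: SD from the ratio of mean discriminations."""
    sigma = fmean(a2) / fmean(a1)
    mu = fmean(b1) - sigma * fmean(b2)
    return mu, sigma


def log_mean_mean(a1, b1, a2, b2):
    """logMM linking: SD from the difference of mean log discriminations."""
    log_diff = fmean([math.log(x) for x in a2]) - fmean([math.log(x) for x in a1])
    sigma = math.exp(log_diff)
    mu = fmean(b1) - sigma * fmean(b2)
    return mu, sigma


# Hypothetical group-2 distribution used only for this illustration.
a2, b2 = identified_in_group2(a1, b1, mu=0.3, sigma=1.2)
mm_mu, mm_sigma = mean_mean(a1, b1, a2, b2)
lmm_mu, lmm_sigma = log_mean_mean(a1, b1, a2, b2)
print(mm_mu, mm_sigma, lmm_mu, lmm_sigma)
```

Without DIF and estimation error, both estimators recover the same group-2 mean and SD up to floating-point rounding. The two can differ once the discriminations in the second group carry item-specific DIF effects, because the arithmetic and geometric means of the discriminations then react differently.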

**Figure 1.** Linking design for two groups with common items $I_0$ and group-specific unique items $I_1$ and $I_2$.

**Table 1.** Variance proportions of different factors in the simulation study for the bias and RMSE for the estimated mean $\widehat{\mu}_2$ and estimated SD $\widehat{\sigma}_2$ for the second group.

Source | $\widehat{\mu}_2$ Bias | $\widehat{\mu}_2$ RMSE | $\widehat{\sigma}_2$ Bias | $\widehat{\sigma}_2$ RMSE |
---|---|---|---|---|
N | 0.3 | 1.1 | 0.6 | 3.9 |
I | 0.0 | 0.3 | 0.0 | 0.0 |
Meth | 10.2 | 14.9 | 19.1 | 0.0 |
$\tau_b$ | 13.0 | 0.0 | 0.8 | 1.8 |
$\tau_a$ | 4.3 | 9.0 | 12.3 | 0.0 |
N × I | 0.0 | 0.0 | 0.0 | 0.0 |
N × Meth | 0.0 | 3.7 | 0.8 | 0.0 |
N × $\tau_b$ | 0.0 | 2.4 | 0.0 | 0.0 |
N × $\tau_a$ | 0.0 | 0.6 | 0.0 | 5.0 |
I × Meth | 0.4 | 0.1 | 0.1 | 0.0 |
I × $\tau_b$ | 0.0 | 0.1 | 0.0 | 0.0 |
I × $\tau_a$ | 0.0 | 0.0 | 0.0 | 0.6 |
Meth × $\tau_b$ | 58.1 | 13.1 | 17.5 | 14.2 |
Meth × $\tau_a$ | 8.2 | 12.1 | 47.7 | 13.2 |
$\tau_a$ × $\tau_b$ | 0.0 | 4.1 | 0.0 | 17.7 |
N × I × Meth | 0.0 | 0.0 | 0.1 | 0.0 |
N × I × $\tau_b$ | 0.0 | 0.2 | 0.0 | 0.0 |
N × I × $\tau_a$ | 0.0 | 0.1 | 0.0 | 0.4 |
N × Meth × $\tau_b$ | 0.2 | 7.5 | 0.0 | 4.2 |
N × Meth × $\tau_a$ | 0.0 | 4.0 | 0.0 | 9.1 |
N × $\tau_a$ × $\tau_b$ | 0.1 | 10.0 | 0.0 | 13.8 |
I × Meth × $\tau_b$ | 0.5 | 0.0 | 0.0 | 0.4 |
I × Meth × $\tau_a$ | 0.1 | 0.0 | 0.2 | 1.1 |
I × $\tau_a$ × $\tau_b$ | 0.1 | 0.3 | 0.0 | 0.7 |
Meth × $\tau_a$ × $\tau_b$ | 1.0 | 10.1 | 0.1 | 8.2 |
Residual | 3.7 | 6.4 | 0.6 | 5.7 |

**Table 2.** Summary of the satisfactory performance of linking methods for the absolute bias and RMSE across parameters (mean $\widehat{\mu}_2$ and standard deviation $\widehat{\sigma}_2$) and conditions.

Method | Bias: NODIF | Bias: UDIF | Bias: NUDIF | RMSE: NODIF | RMSE: UDIF | RMSE: NUDIF |
---|---|---|---|---|---|---|
logMM | 100 | 97 | 94 | 100 | 100 | 45 |
HAB | 100 | 97 | 94 | 100 | 100 | 44 |
MM | 100 | 94 | 95 | 92 | 100 | 72 |
HAB-nolog | 100 | 94 | 96 | 100 | 100 | 78 |
IA2 | 75 | 78 | 8 | 100 | 100 | 4 |
HAE-asymm | 100 | 42 | 42 | 100 | 61 | 78 |
HAE-symm | 100 | 97 | 94 | 100 | 61 | 81 |
HAE-joint | 100 | 42 | 60 | 100 | 42 | 61 |
RC1 | 83 | 78 | 16 | 100 | 61 | 29 |
RC2 | 83 | 78 | 8 | 100 | 61 | 48 |
RC3 | 100 | 94 | 96 | 100 | 61 | 79 |
ANCH | 83 | 78 | 13 | 100 | 61 | 48 |
CC | 100 | 50 | 45 | 100 | 33 | 46 |

**Table 3.** Bias and RMSE for mean $\widehat{\mu}_2$ and standard deviation $\widehat{\sigma}_2$ for the second group for a sample size $N=1000$ and $I=40$ items as a function of the type of differential item functioning and linking method.

Method | Bias: NODIF ($\tau_b=0$, $\tau_a=0$) | Bias: UDIF ($\tau_b=0.5$, $\tau_a=0$) | Bias: NUDIF ($\tau_b=0.5$, $\tau_a=0.25$) | RMSE: NODIF ($\tau_b=0$, $\tau_a=0$) | RMSE: UDIF ($\tau_b=0.5$, $\tau_a=0$) | RMSE: NUDIF ($\tau_b=0.5$, $\tau_a=0.25$) |
---|---|---|---|---|---|---|
**Mean $\widehat{\mu}_2$** | | | | | | |
logMM | 0.000 | 0.007 | 0.008 | 108.2 | 104.4 | 106.1 |
HAB | 0.000 | 0.007 | 0.008 | 108.2 | 104.4 | 106.1 |
MM | 0.000 | 0.007 | 0.007 | 108.1 | 103.7 | 104.7 |
HAB-nolog | 0.001 | 0.007 | 0.007 | 108.5 | 103.5 | 104.5 |
IA2 | −0.001 | 0.001 | 0.045 | 103.2 | 107.5 | 133.3 |
HAE-asymm | −0.002 | −0.030 | −0.032 | 102.3 | 100.0 | 100.0 |
HAE-symm | −0.001 | 0.002 | 0.005 | 102.7 | 105.0 | 105.2 |
HAE-joint | −0.002 | 0.067 | 0.064 | 100.9 | 136.1 | 132.4 |
RC1 | −0.001 | 0.001 | 0.028 | 100.2 | 104.8 | 120.5 |
RC2 | −0.006 | −0.004 | −0.022 | 100.0 | 104.0 | 100.1 |
RC3 | −0.003 | −0.001 | 0.002 | 100.1 | 103.9 | 109.4 |
ANCH | −0.003 | −0.004 | −0.021 | 101.4 | 104.2 | 103.9 |
CC | −0.002 | 0.095 | 0.109 | 101.3 | 149.2 | 157.7 |
**Standard Deviation $\widehat{\sigma}_2$** | | | | | | |
logMM | 0.000 | 0.003 | 0.008 | 110.2 | 112.6 | 128.9 |
HAB | 0.000 | 0.003 | 0.008 | 110.2 | 112.6 | 129.4 |
MM | −0.001 | 0.001 | 0.005 | 108.5 | 109.4 | 107.7 |
HAB-nolog | 0.001 | 0.002 | 0.007 | 100.0 | 100.0 | 100.0 |
IA2 | 0.009 | 0.009 | 0.147 | 113.2 | 111.6 | 197.9 |
HAE-asymm | −0.002 | −0.120 | −0.134 | 107.2 | 378.8 | 185.6 |
HAE-symm | 0.001 | −0.003 | 0.003 | 108.3 | 233.7 | 119.9 |
HAE-joint | −0.001 | 0.020 | 0.029 | 107.5 | 317.0 | 146.6 |
RC1 | 0.006 | 0.008 | 0.105 | 109.8 | 243.8 | 174.5 |
RC2 | −0.009 | −0.008 | −0.097 | 108.5 | 217.2 | 148.3 |
RC3 | −0.002 | 0.000 | 0.002 | 106.6 | 228.3 | 110.2 |
ANCH | −0.009 | −0.008 | −0.097 | 108.5 | 217.2 | 148.3 |
CC | −0.001 | 0.015 | 0.029 | 107.4 | 220.4 | 129.0 |

Domain | N (P06) | N (P09) | I (P06) | I (P09) | M (P06) | M (P09) | SD (P06) | SD (P09) |
---|---|---|---|---|---|---|---|---|
Mathematics | 3784 | 4575 | 48 | 35 | 506.8 | 495.9 | 96.8 | 96.1 |
Reading | 2646 | 6585 | 27 | 99 | 491.2 | 470.3 | 107.7 | 100.1 |
Science | 4927 | 4577 | 103 | 53 | 511.7 | 494.3 | 97.3 | 101.8 |

Method | Mathematics 1PL | Mathematics 2PL | Reading 1PL | Reading 2PL | Science 1PL | Science 2PL |
---|---|---|---|---|---|---|
logMM | −15.5 | −12.4 | −5.8 | −6.3 | −14.7 | −16.8 |
HAB | −15.5 | −12.4 | −5.8 | −6.3 | −14.7 | −16.8 |
MM | −15.5 | −12.4 | −5.8 | −6.3 | −14.7 | −16.7 |
HAB-nolog | −15.5 | −12.3 | −6.0 | −6.3 | −14.5 | −16.6 |
IA2 | −15.5 | −15.9 | −5.8 | −6.1 | −14.7 | −11.6 |
HAE-asymm | −14.4 | −14.6 | −4.9 | −6.4 | −14.2 | −15.9 |
HAE-symm | −14.6 | −15.0 | −5.0 | −6.6 | −14.2 | −15.7 |
HAE-joint | −13.5 | −14.1 | −4.1 | −5.0 | −13.9 | −14.0 |
RC1 | −14.3 | −14.5 | −4.4 | −5.1 | −14.0 | −13.2 |
RC2 | −14.3 | −14.3 | −4.3 | −5.0 | −14.2 | −12.9 |
RC3 | −14.3 | −14.4 | −4.4 | −5.0 | −14.1 | −13.1 |
ANCH | −14.4 | −15.7 | −4.5 | −5.4 | −14.5 | −14.1 |
CC | −14.3 | −14.9 | −4.3 | −5.3 | −14.2 | −13.6 |
M | −14.8 | −14.1 | −5.0 | −5.8 | −14.3 | −14.7 |
SD | 0.7 | 1.3 | 0.7 | 0.6 | 0.3 | 1.8 |
Min | −15.5 | −15.9 | −6.0 | −6.6 | −14.7 | −16.8 |
Max | −13.5 | −12.3 | −4.1 | −5.0 | −13.9 | −11.6 |

**Table 6.** Standard deviation for Austrian students in PISA 2009 for the domains Mathematics, Reading, and Science for the 1PL and the 2PL model as a function of the linking method.

Method | Mathematics 1PL | Mathematics 2PL | Reading 1PL | Reading 2PL | Science 1PL | Science 2PL |
---|---|---|---|---|---|---|
logMM | 97.7 | 98.3 | 98.6 | 103.2 | 103.2 | 106.8 |
HAB | 97.7 | 98.3 | 98.6 | 103.2 | 103.2 | 106.8 |
MM | 97.7 | 98.7 | 98.6 | 103.8 | 103.2 | 106.9 |
HAB-nolog | 97.9 | 99.3 | 94.6 | 102.0 | 103.9 | 108.1 |
IA2 | 97.7 | 99.5 | 98.6 | 104.6 | 103.2 | 109.2 |
HAE-asymm | 94.1 | 95.0 | 102.6 | 105.4 | 105.0 | 107.5 |
HAE-symm | 95.0 | 96.2 | 103.1 | 105.9 | 105.3 | 107.8 |
HAE-joint | 95.0 | 95.7 | 105.1 | 107.5 | 104.7 | 107.4 |
RC1 | 96.0 | 96.9 | 103.1 | 107.2 | 103.9 | 108.6 |
RC2 | 96.0 | 95.6 | 99.9 | 106.2 | 104.7 | 105.9 |
RC3 | 96.0 | 96.3 | 101.5 | 106.7 | 104.3 | 107.2 |
ANCH | 96.0 | 95.6 | 99.9 | 106.2 | 104.7 | 105.9 |
CC | 95.9 | 96.7 | 101.3 | 106.4 | 104.1 | 107.5 |
M | 96.3 | 97.1 | 100.4 | 105.2 | 104.1 | 107.4 |
SD | 1.2 | 1.5 | 2.7 | 1.7 | 0.7 | 0.9 |
Min | 94.1 | 95.0 | 94.6 | 102.0 | 103.2 | 105.9 |
Max | 97.9 | 99.5 | 105.1 | 107.5 | 105.3 | 109.2 |

