# A Comparative Study of Item Response Theory Models for Mixed Discrete-Continuous Responses

## Abstract

## 1. Introduction

## 2. Materials and Methods

#### 2.1. Data

#### 2.2. Zero-and-One-Inflated Item Response Models for Bounded Continuous Data

$S_B$ distribution, the Continuous Response Model is a special case within Samejima’s Graded Response Model framework. The Simplex IRT model uses the simplex distribution, which, although less common, offers an alternative approach to modeling bounded continuous data. This model is useful in contexts such as response time analysis, where the data are naturally bounded within a specific range. This section introduces the general structure shared by all three models; they differ only in the model-specific density function used for the continuous part of the response distribution.
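The shared structure described above can be illustrated with a minimal sketch of a zero-and-one-inflated log-likelihood in which the continuous density is a pluggable function. Everything named here is an assumption made for illustration: responses are taken to be scaled to [0, 1], the probabilities `p0` and `p1` are passed in as fixed numbers rather than derived from item and person parameters, and the Beta density uses a mean-precision parameterization (`mu`, `phi`). It is not the exact specification fitted in this study.

```python
import math


def beta_logpdf(x, mu, phi):
    """Log density of a Beta(mu * phi, (1 - mu) * phi) variable at x in (0, 1).

    The mean-precision parameterization is an illustrative assumption.
    """
    a = mu * phi
    b = (1.0 - mu) * phi
    return (math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
            + (a - 1.0) * math.log(x) + (b - 1.0) * math.log(1.0 - x))


def zoi_loglik(x, p0, p1, cont_logpdf):
    """Log-likelihood of one response under a zero-and-one-inflated model.

    p0 and p1 are the (hypothetical, fixed) probabilities of a response of
    exactly 0 and exactly 1; cont_logpdf is the log density used on (0, 1).
    Swapping cont_logpdf is the only change needed to move between models
    that share this structure.
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("response must lie in [0, 1]")
    if p0 <= 0.0 or p1 <= 0.0 or p0 + p1 >= 1.0:
        raise ValueError("need p0 > 0, p1 > 0, and p0 + p1 < 1")
    if x == 0.0:
        return math.log(p0)
    if x == 1.0:
        return math.log(p1)
    # Continuous part: weight the density by the remaining probability mass.
    return math.log(1.0 - p0 - p1) + cont_logpdf(x)


# Usage: a Beta continuous part with mean 0.8 and precision 5.
ll = zoi_loglik(0.65, p0=0.05, p1=0.10,
                cont_logpdf=lambda x: beta_logpdf(x, mu=0.8, phi=5.0))
```

Passing the density as a function mirrors the point made in the text: the discrete components stay the same, and only the continuous component changes across the three models.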

#### 2.3. Incorporating Collateral Information

#### 2.4. Model Fitting in Stan

#### 2.5. Disclosure of the Use of AI or AI-Assisted Technologies

## 3. Results

#### 3.1. Model Comparison and Prediction Error

#### 3.2. Model Fit

#### 3.3. Parameter Estimates

## 4. Discussion

## Author Contributions

## Funding

## Institutional Review Board Statement

## Informed Consent Statement

## Data Availability Statement

## Acknowledgments

## Conflicts of Interest

## Appendix A. Model-Specific Probability Density Functions

**SB IRT Model**

**Beta IRT Model**

**Simplex IRT Model**

**Figure 1.** Comparison of model-generated response distributions for the Beta, SB, and Simplex IRT models. Latent proficiency is assumed to follow a standard normal distribution. All item parameters except the dispersion parameter were the same across models.

**Figure 2.** Comparison of the sum of the squared error of predictions across six folds for the Beta, SB, and Simplex IRT models with and without latent regression. The horizontal line for each fold represents the baseline prediction error when an average response is used. A smaller sum of squared error indicates better performance.

**Figure 3.** Density plots of observed sum score distribution (dashed line) and distributions of sum scores from 3000 posterior samples (gray area) for each model.

**Figure 4.** Comparison of average item scores from observed data and posterior predictive distributions of model-generated data.

**Figure 5.** Comparison of standard deviations of item scores from observed data and posterior predictive distributions of model-generated data.

**Figure 6.** The relationships among the item parameter estimates obtained from Beta, SB, and Simplex IRT models.

**Figure 7.** The relationships among the person parameter estimates obtained from Beta, SB, and Simplex IRT models.

**Table 1.** Descriptive statistics for the sum scores from observed data and the average of posterior distribution of sum scores.

| | Mean | SD | Skewness | Kurtosis |
|---|---|---|---|---|
| Beta IRT | 5.56 | 0.36 | −1.96 | 7.76 |
| SB IRT | 5.56 | 0.36 | −1.87 | 6.72 |
| Simplex IRT | 5.56 | 0.37 | −1.93 | 6.90 |
| Observed Data | 5.56 | 0.38 | −2.90 | 22.63 |
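As a rough illustration of how summaries like those in Table 1 could be produced, the sketch below computes the four descriptive statistics for each replicated set of sum scores and then averages them across posterior draws. The function names, the toy data, and the moment definitions are assumptions: the sketch uses the sample SD with an n − 1 denominator, moment-based skewness, and non-excess (Pearson) kurtosis, which may differ from the estimators used for the reported values.

```python
import math


def describe(values):
    """Mean, SD, skewness, and kurtosis of one set of sum scores.

    Assumed definitions: sample SD (n - 1 denominator), moment-based
    skewness m3 / m2**1.5, and non-excess kurtosis m4 / m2**2.
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    m2 = sum(d ** 2 for d in dev) / n
    m3 = sum(d ** 3 for d in dev) / n
    m4 = sum(d ** 4 for d in dev) / n
    return {
        "mean": mean,
        "sd": math.sqrt(m2 * n / (n - 1)),
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2,
    }


def average_over_draws(replicated):
    """Average each statistic over posterior predictive replicates.

    `replicated` is a list with one list of sum scores per posterior draw.
    """
    stats = [describe(draw) for draw in replicated]
    return {key: sum(s[key] for s in stats) / len(stats) for key in stats[0]}


# Usage with two toy replicates (hypothetical numbers, not study data).
summary = average_over_draws([[4.9, 5.5, 5.8, 5.9], [5.1, 5.4, 5.7, 6.0]])
```

Comparing these averaged statistics with the same statistics computed on the observed sum scores gives the kind of side-by-side check shown in the table.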

**Table 2.** Descriptive statistics for item and person parameters estimated from the Beta, SB, and Simplex IRT models with latent regression.

| | Beta IRT Model with Latent Regression | | | | SB IRT Model with Latent Regression | | | | Simplex IRT Model with Latent Regression | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Parameters | Mean | SD | Min | Max | Mean | SD | Min | Max | Mean | SD | Min | Max |
| $\theta$ | 0.00 | 0.86 | −11.86 | 3.05 | 0.00 | 0.85 | −6.99 | 3.17 | 0.00 | 0.85 | −10.17 | 3.20 |
| $\beta$ | 2.18 | 0.53 | −0.09 | 5.15 | 2.41 | 0.54 | −0.16 | 5.27 | 2.20 | 0.51 | 0.12 | 4.52 |
| $\alpha$ | 0.61 | 0.23 | 0.02 | 1.69 | 0.62 | 0.25 | 0.03 | 1.72 | 0.59 | 0.22 | 0.00 | 1.77 |
| $\gamma_{0}$ | −11.62 | 1.97 | −14.51 | −4.77 | −11.65 | 1.95 | −15.05 | −4.81 | −11.64 | 1.96 | −14.56 | −4.77 |
| $\gamma_{1}$ | 0.10 | 1.27 | −4.47 | 4.62 | 0.09 | 1.28 | −4.64 | 4.35 | 0.08 | 1.25 | −4.93 | 4.38 |
| $\delta$ * | 3.53 | 0.97 | −0.05 | 13.21 | 0.66 | 0.28 | 0.04 | 3.30 | 8.41 | 5.16 | 0.26 | 36.45 |

| | **Writing** ($\xi_{1}$) | | **Speaking** ($\xi_{2}$) | |
|---|---|---|---|---|
| | Posterior Mean | 95% Credible Interval | Posterior Mean | 95% Credible Interval |
| Beta IRT | 0.351 | (0.346, 0.354) | 0.345 | (0.340, 0.349) |
| SB IRT | 0.347 | (0.342, 0.351) | 0.352 | (0.347, 0.356) |
| Simplex IRT | 0.330 | (0.325, 0.334) | 0.346 | (0.342, 0.351) |

