A Bayesian nonparametric approach for handling item and examinee heterogeneity in assessment data

Br J Math Stat Psychol. 2024 Feb;77(1):196-211. doi: 10.1111/bmsp.12322. Epub 2023 Sep 20.

Abstract

We propose a novel nonparametric Bayesian item response theory model that estimates clusters at the question level, while simultaneously allowing for heterogeneity at the examinee level under each question cluster, characterized by a mixture of binomial distributions. The main contribution of this work is threefold. First, we present our new model and demonstrate that it is identifiable under a set of conditions. Second, we show that our model can correctly identify question-level clusters asymptotically, and the parameters of interest that measure the proficiency of examinees in solving certain questions can be estimated at a n rate (up to a log term). Third, we present a tractable sampling algorithm to obtain valid posterior samples from our proposed model. Compared to the existing methods, our model manages to reveal the multi-dimensionality of the examinees' proficiency level in handling different types of questions parsimoniously by imposing a nested clustering structure. The proposed model is evaluated via a series of simulations as well as apply it to an English proficiency assessment data set. This data analysis example nicely illustrates how our model can be used by test makers to distinguish different types of students and aid in the design of future tests.

Keywords: IRT model; Rasch model; model averaging; nonparametric Bayesian method; posterior contraction rate.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Cluster Analysis
  • Humans
  • Students*