Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2019 Sep;84(3):749-771.
doi: 10.1007/s11336-018-9644-7. Epub 2018 Dec 3.

Variable-Length Stopping Rules for Multidimensional Computerized Adaptive Testing

Affiliations
Comparative Study

Variable-Length Stopping Rules for Multidimensional Computerized Adaptive Testing

Chun Wang et al. Psychometrika. 2019 Sep.

Abstract

In computerized adaptive testing (CAT), a variable-length stopping rule refers to ending item administration after a pre-specified measurement precision standard has been satisfied. The goal is to provide equal measurement precision for all examinees regardless of their true latent trait level. Several stopping rules have been proposed in unidimensional CAT, such as the minimum information rule or the maximum standard error rule. These rules have also been extended to multidimensional CAT and cognitive diagnostic CAT, and they all share the same idea of monitoring measurement error. Recently, Babcock and Weiss (J Comput Adapt Test 2012. https://doi.org/10.7333/1212-0101001) proposed an "absolute change in theta" (CT) rule, which is useful when an item bank is exhaustive of good items for one or more ranges of the trait continuum. Choi, Grady and Dodd (Educ Psychol Meas 70:1-17, 2010) also argued that a CAT should stop when the standard error does not change, implying that the item bank is likely exhausted. Although these stopping rules have been evaluated and compared in different simulation studies, the relationships among the various rules remain unclear, and therefore there lacks a clear guideline regarding when to use which rule. This paper presents analytic results to show the connections among various stopping rules within both unidimensional and multidimensional CAT. In particular, it is argued that the CT-rule alone can be unstable and it can end the test prematurely. However, the CT-rule can be a useful secondary rule to monitor the point of diminished returns. To further provide empirical evidence, three simulation studies are reported using both the 2PL model and the multidimensional graded response model.

Keywords: computerized adaptive testing; information; multidimensional models; standard error; stopping rules; variable-length adaptive testing.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Multivariate Behav Res. 2018 May-Jun;53(3):403-418 - PubMed
    1. BMC Psychiatry. 2004 May 06;4:13 - PubMed
    1. Psychometrika. 2015 Jun;80(2):428-49 - PubMed
    1. J Clin Epidemiol. 2006 Mar;59(3):290-8 - PubMed
    1. Psychometrika. 2009 Jun;74(2):273-296 - PubMed

Publication types

LinkOut - more resources