The importance of measurement invariance in neurocognitive ability testing

Jelte M Wicherts

doi:10.1080/13854046.2016.1205136

The importance of measurement invariance in neurocognitive ability testing

Clin Neuropsychol. 2016 Oct;30(7):1006-16. doi: 10.1080/13854046.2016.1205136. Epub 2016 Jun 30.

Author

Jelte M Wicherts¹

Affiliation

¹ a Department of Methodology and Statistics , Tilburg University , Tilburg , The Netherlands.

PMID: 27356958
DOI: 10.1080/13854046.2016.1205136

Abstract

Objective: Neurocognitive test batteries such as recent editions of the Wechsler's Adult Intelligence Scale (WAIS-III/WAIS-IV) typically use nation-level population-based norms. The question is whether these batteries function in the same manner across different subgroups based on gender, age, educational background, socioeconomic status, ethnicity, mother tongue, or race. Here, the author argues that measurement invariance is a core issue in determining whether population-based norms are valid for different subgroups.

Method: The author introduces measurement invariance, argues why it is an important topic of study, discusses why invariance might fail in cognitive ability testing, and reviews a dozen studies of invariance of commonly used neurocognitive test batteries.

Results: In over half of the reviewed studies, IQ batteries were not found to be measurement invariant across groups based on ethnicity, gender, educational background, cohort, or age. Apart from age and cohort, test manuals do not take such lack of invariance into account in computing full-scale IQ scores or normed domain scores.

Conclusions: Measurement invariance is crucial for valid use of neurocognitive tests in clinical, educational, and professional practice. The appropriateness of population-based norms to particular subgroups should depend also on whether measurement invariance holds with respect to important subgroups.

Keywords: IQ tests; Measurement invariance; differential item functioning; measurement equivalence; test fairness.

Publication types

Review

MeSH terms

Adult
Cognition*
Ethnicity / psychology
Female
Humans
Intelligence Tests / standards*
Male
Neuropsychological Tests / standards*
Reproducibility of Results
Wechsler Scales / standards