What sample sizes for reliability and validity studies in neurology?

J Neurol. 2012 Dec;259(12):2681-94. doi: 10.1007/s00415-012-6570-y. Epub 2012 Jun 24.

Abstract

Rating scales are increasingly used in neurologic research and trials. A key question relating to their use across the range of neurologic diseases, both common and rare, is what sample sizes provide meaningful estimates of reliability and validity. Here, we address two questions: (1) to what extent does sample size influence the stability of reliability and validity estimates; and (2) to what extent does sample size influence the inferences made from reliability and validity testing? We examined data from two studies. In Study 1, we retrospectively reduced the total sample randomly and nonrandomly by decrements of approximately 50 % to generate sub-samples from n = 713-20. In Study 2, we prospectively generated sub-samples from n = 20-320, by entry time into study. In all samples we estimated reliability (internal consistency, item total correlations, test-retest) and validity (within scale correlations, convergent and discriminant construct validity). Reliability estimates were stable in magnitude and interpretation in all sub-samples of both studies. Validity estimates were stable in samples of n ≥ 80, for 75 % of scales in samples of n = 40, and for 50 % of scales in samples of n = 20. In this study, sample sizes of a minimum of 20 for reliability and 80 for validity provided estimates highly representative of the main study samples. These findings should be considered provisional and more work is needed to determine if these estimates are generalisable, consistent, and useful.

Publication types

  • Randomized Controlled Trial

MeSH terms

  • Adult
  • Aged
  • Female
  • Humans
  • Male
  • Middle Aged
  • Multiple Sclerosis / diagnosis
  • Multiple Sclerosis / epidemiology*
  • Neurology / methods
  • Neurology / standards*
  • Prospective Studies
  • Reproducibility of Results
  • Retrospective Studies
  • Sample Size*