Allowing for imprecision of the intracluster correlation coefficient in the design of cluster randomized trials

Stat Med. 2004 Apr 30;23(8):1195-214. doi: 10.1002/sim.1721.

Abstract

The sample size required for a cluster randomized trial depends on the magnitude of the intracluster correlation coefficient (ICC). The usual sample size calculation makes no allowance for the fact that the ICC is not known precisely in advance. We develop methods which allow for the uncertainty in a previously observed ICC, using a variety of distributional assumptions. Distributions for the power are derived, reflecting this uncertainty. Further, the observed ICC in a future study will not equal its true value, and we consider the impact of this on power. We implement calculations within a Bayesian simulation approach, and provide one simplification that can be performed using simple simulation within spreadsheet software. In our examples, recognizing the uncertainty in a previous ICC estimate decreases expected power, especially when the power calculated naively from the ICC estimate is high. To protect against the possibility of low power, sample sizes may need to be very substantially increased. Recognizing the variability in the future observed ICC has little effect if prior uncertainty has already been taken into account. We show how our method can be extended to the case in which multiple prior ICC estimates are available. The methods presented in this paper can be used by applied researchers to protect against loss of power, or to choose a design which reduces the impact of uncertainty in the ICC.
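The idea of a distribution for the power, induced by uncertainty in the ICC, can be illustrated with a small simulation. The following is a minimal sketch and not the authors' implementation. It assumes a two-arm trial comparing means with equal cluster sizes, the standard design effect 1 + (m - 1) × ICC, a normal-approximation power formula, and a Beta distribution for the ICC chosen purely for illustration (the abstract does not state which distributions the paper uses). The function names and all numerical inputs are hypothetical.

```python
import random
from statistics import NormalDist, mean

_STD_NORMAL = NormalDist()  # standard normal, used for quantiles and the CDF


def power_given_icc(icc, clusters_per_arm, cluster_size, effect_size, alpha=0.05):
    """Approximate power of a two-sided test for a difference in means.

    Assumes equal cluster sizes and the usual design effect
    1 + (cluster_size - 1) * icc. effect_size is the standardized difference.
    """
    design_effect = 1.0 + (cluster_size - 1) * icc
    # Effective number of independent observations per arm
    n_eff = clusters_per_arm * cluster_size / design_effect
    z_crit = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    return _STD_NORMAL.cdf(effect_size * (n_eff / 2.0) ** 0.5 - z_crit)


def simulate_power_distribution(beta_a, beta_b, clusters_per_arm, cluster_size,
                                effect_size, n_draws=10_000, seed=1):
    """Draw ICC values from an assumed Beta(beta_a, beta_b) distribution and
    return the power implied by each draw."""
    rng = random.Random(seed)
    return [
        power_given_icc(rng.betavariate(beta_a, beta_b),
                        clusters_per_arm, cluster_size, effect_size)
        for _ in range(n_draws)
    ]


# Hypothetical inputs: previous ICC estimate 0.05, with uncertainty
# represented by Beta(2, 38), which has mean 0.05.
naive_power = power_given_icc(0.05, clusters_per_arm=20, cluster_size=30,
                              effect_size=0.3)
powers = simulate_power_distribution(2, 38, clusters_per_arm=20,
                                     cluster_size=30, effect_size=0.3)
expected_power = mean(powers)
# Share of simulated ICC values for which power falls below 80%
prob_low_power = sum(p < 0.8 for p in powers) / len(powers)

print(f"naive power:      {naive_power:.3f}")
print(f"expected power:   {expected_power:.3f}")
print(f"P(power < 0.80):  {prob_low_power:.3f}")
```

Comparing the naive power with the mean and lower tail of the simulated powers shows how much protection a given design offers. Increasing `clusters_per_arm` until `prob_low_power` is acceptably small is one way to choose a more robust sample size under these assumptions.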

Publication types

  • Comparative Study

MeSH terms

  • Bayes Theorem
  • Cluster Analysis*
  • Data Interpretation, Statistical
  • Humans
  • Models, Statistical
  • Randomized Controlled Trials as Topic / methods*
  • Randomized Controlled Trials as Topic / statistics & numerical data
  • Sample Size*
  • Software
  • Uncertainty