Sample size and power calculations with correlated binary data

Control Clin Trials. 2001 Jun;22(3):211-27. doi: 10.1016/s0197-2456(01)00131-3.


Correlated binary data are common in biomedical studies. Such data can be analyzed using Liang and Zeger's generalized estimating equations (GEE) approach. An attractive point of the GEE approach is that one can use a misspecified working correlation matrix, such as the working independence model (i.e., the identity matrix), and draw (asymptotically) valid statistical inference by using the so-called robust or sandwich variance estimator. In this article we derive some explicit formulas for sample size and power calculations under various common situations. The given formulas are based on using the robust variance estimator in GEE. We believe that these formulas will facilitate the practice in planning two-arm clinical trials with correlated binary outcome data.

MeSH terms

  • Clinical Trials as Topic*
  • Humans
  • Logistic Models*
  • Sample Size