Including known covariates can reduce power to detect genetic effects in case-control studies

Nat Genet. 2012 Jul 22;44(8):848-51. doi: 10.1038/ng.2346.


Genome-wide association studies (GWAS) search for associations between genetic variants and disease status, typically via logistic regression. Often there are covariates, such as sex or well-established major genetic factors, that are known to affect disease susceptibility and are independent of tested genotypes at the population level. We show theoretically and with data from recent GWAS on multiple sclerosis, psoriasis and ankylosing spondylitis that inclusion of known covariates can substantially reduce power for the identification of associated variants when the disease prevalence is lower than a few percent. Whether the inclusion of such covariates reduces or increases power to detect genetic effects depends on various factors, including the prevalence of the disease studied. When the disease is common (prevalence of >20%), the inclusion of covariates typically increases power, whereas, for rarer diseases, it can often decrease power to detect new genetic associations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Case-Control Studies
  • Genome-Wide Association Study / statistics & numerical data*
  • Humans
  • Logistic Models
  • Models, Statistical
  • Multiple Sclerosis / genetics
  • Multivariate Analysis
  • Psoriasis / genetics
  • Sample Size
  • Spondylitis, Ankylosing / genetics