Testing for rare variant associations in the presence of missing data

Genet Epidemiol. 2013 Sep;37(6):529-38. doi: 10.1002/gepi.21736. Epub 2013 Jun 11.

Abstract

For studies of genetically complex diseases, many association methods have been developed to analyze rare variants. When variant calls are missing, naïve implementation of rare variant association (RVA) methods may lead to inflated type I error rates as well as a reduction in power. To overcome these problems, we developed extensions for four commonly used RVA tests. Data from the National Heart Lung and Blood Institute-Exome Sequencing Project were used to demonstrate that missing variant calls can lead to increased false-positive rates and that the extended RVA methods control type I error without reducing power. We suggest a combined strategy of data filtering based on variant and sample level missing genotypes along with implementation of these extended RVA tests.

Keywords: complex disease; next-generation sequencing; rare variant association studies.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Anaplastic Lymphoma Kinase
  • Computer Simulation
  • Exome
  • Genetic Association Studies
  • Genetic Variation*
  • Genotype
  • Humans
  • Models, Genetic*
  • Receptor Protein-Tyrosine Kinases / genetics
  • Receptor, Melanocortin, Type 4 / genetics

Substances

  • MC4R protein, human
  • Receptor, Melanocortin, Type 4
  • Anaplastic Lymphoma Kinase
  • Receptor Protein-Tyrosine Kinases