Evolution and functional impact of rare coding variation from deep sequencing of human exomes

Science. 2012 Jul 6;337(6090):64-9. doi: 10.1126/science.1219240. Epub 2012 May 17.

Abstract

As a first step toward understanding how rare variants contribute to risk for complex diseases, we sequenced 15,585 human protein-coding genes to an average median depth of 111× in 2440 individuals of European (n = 1351) and African (n = 1088) ancestry. We identified over 500,000 single-nucleotide variants (SNVs), the majority of which were rare (86% with a minor allele frequency less than 0.5%), previously unknown (82%), and population-specific (82%). On average, 2.3% of the 13,595 SNVs each person carried were predicted to affect protein function of ~313 genes per genome, and ~95.7% of SNVs predicted to be functionally important were rare. This excess of rare functional variants is due to the combined effects of explosive, recent accelerated population growth and weak purifying selection. Furthermore, we show that large sample sizes will be required to associate rare variants with complex traits.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • African Americans / genetics*
  • Disease / genetics
  • European Continental Ancestry Group / genetics*
  • Evolution, Molecular*
  • Exome*
  • Female
  • Gene Frequency
  • Genetic Association Studies
  • Genetic Predisposition to Disease
  • Genetic Variation*
  • Genome, Human*
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Male
  • Polymorphism, Single Nucleotide*
  • Population Growth
  • Selection, Genetic