Targeted capture and massively parallel sequencing of 12 human exomes

Nature. 2009 Sep 10;461(7261):272-6. doi: 10.1038/nature08250. Epub 2009 Aug 16.

Abstract

Genome-wide association studies suggest that common genetic variants explain only a modest fraction of heritable risk for common diseases, raising the question of whether rare variants account for a significant fraction of unexplained heritability. Although DNA sequencing costs have fallen markedly, they remain far from what is necessary for rare and novel variants to be routinely identified at a genome-wide scale in large cohorts. We have therefore sought to develop second-generation methods for targeted sequencing of all protein-coding regions ('exomes'), to reduce costs while enriching for discovery of highly penetrant variants. Here we report on the targeted capture and massively parallel sequencing of the exomes of 12 humans. These include eight HapMap individuals representing three populations, and four unrelated individuals with a rare dominantly inherited disorder, Freeman-Sheldon syndrome (FSS). We demonstrate the sensitive and specific identification of rare and common variants in over 300 megabases of coding sequence. Using FSS as a proof-of-concept, we show that candidate genes for Mendelian disorders can be identified by exome sequencing of a small number of unrelated, affected individuals. This strategy may be extendable to diseases with more complex genetics through larger sample sizes and appropriate weighting of non-synonymous variants by predicted functional impact.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Exons / genetics*
  • Gene Frequency / genetics
  • Gene Library
  • Genes, Dominant / genetics
  • Genetic Predisposition to Disease / genetics*
  • Genetic Testing / methods*
  • Genetic Variation / genetics*
  • Genome, Human / genetics*
  • Haplotypes / genetics
  • Humans
  • INDEL Mutation / genetics
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide / genetics
  • RNA Splice Sites / genetics
  • Sample Size
  • Sensitivity and Specificity
  • Sequence Analysis, DNA / methods*
  • Syndrome

Substances

  • RNA Splice Sites