Leveraging ancestry to improve causal variant identification in exome sequencing for monogenic disorders

Eur J Hum Genet. 2016 Jan;24(1):113-9. doi: 10.1038/ejhg.2015.68. Epub 2015 Apr 22.

Abstract

Recent breakthroughs in exome-sequencing technology have made possible the identification of many causal variants of monogenic disorders. Although extremely powerful when closely related individuals (eg, child and parents) are simultaneously sequenced, sequencing of a single case is often unsuccessful due to the large number of variants that need to be followed up for functional validation. Many approaches filter out common variants above a given frequency threshold (eg, 1%), and then prioritize the remaining variants according to their functional, structural and conservation properties. Here we present methods that leverage the genetic structure across different populations to improve filtering performance while accounting for the finite sample size of the reference panels. We show that leveraging genetic structure reduces the number of variants that need to be followed up by 16% in simulations and by up to 38% in empirical data of 20 exomes from individuals with monogenic disorders for which the causal variants are known.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Computer Simulation
  • Exome*
  • Female
  • Genetic Diseases, Inborn / diagnosis
  • Genetic Diseases, Inborn / ethnology
  • Genetic Diseases, Inborn / genetics*
  • Genetic Variation
  • Genome, Human
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Inheritance Patterns
  • Male
  • Models, Statistical*
  • Pedigree
  • Polymorphism, Single Nucleotide*
  • Racial Groups
  • Sequence Analysis, DNA