Predicting the molecular complexity of sequencing libraries

Nat Methods. 2013 Apr;10(4):325-7. doi: 10.1038/nmeth.2375. Epub 2013 Feb 24.


Predicting the molecular complexity of a genomic sequencing library is a critical but difficult problem in modern sequencing applications. Methods to determine how deeply to sequence to achieve complete coverage or to predict the benefits of additional sequencing are lacking. We introduce an empirical Bayesian method to accurately characterize the molecular complexity of a DNA sample for almost any sequencing application on the basis of limited preliminary sequencing.
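
To make the prediction task concrete, the sketch below estimates library size from a shallow preliminary run and extrapolates the expected yield of distinct molecules at greater depth. It is not the empirical Bayesian method described in the abstract. It is a much simpler baseline that assumes every molecule in the library is equally likely to be sampled (a homogeneous Poisson model), an assumption that real libraries often violate. The function names and the example read counts are hypothetical and chosen only for illustration.

```python
import math


def expected_distinct(library_size, reads):
    """Expected number of distinct molecules seen after `reads` reads,
    assuming all `library_size` molecules are sampled with equal probability."""
    return library_size * (1.0 - math.exp(-reads / library_size))


def estimate_library_size(reads, distinct, tol=1e-6):
    """Solve expected_distinct(C, reads) == distinct for C by bisection.

    Requires at least one duplicate (distinct < reads); otherwise the
    simple model gives no finite estimate.
    """
    if not 0 < distinct < reads:
        raise ValueError("need 0 < distinct < reads")
    lo = float(distinct)          # the library has at least `distinct` molecules
    hi = 2.0 * float(distinct)
    # Grow the upper bound until it brackets the solution.
    while expected_distinct(hi, reads) < distinct:
        hi *= 2.0
    # expected_distinct is increasing in library size, so bisection applies.
    while hi - lo > tol * lo:
        mid = 0.5 * (lo + hi)
        if expected_distinct(mid, reads) < distinct:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical preliminary run: 1,000,000 reads, 900,000 of them distinct.
prelim_reads = 1_000_000
prelim_distinct = 900_000
size = estimate_library_size(prelim_reads, prelim_distinct)
print(f"estimated library size: {size:,.0f} molecules")
for fold in (2, 5, 10):
    yield_at_depth = expected_distinct(size, fold * prelim_reads)
    print(f"{fold}x depth: about {yield_at_depth:,.0f} distinct molecules expected")
```

Under this baseline the gain from additional sequencing flattens as the number of reads approaches the estimated library size. Because the equal-probability assumption ignores amplification and other sampling biases, its estimates should be read as a rough reference point only.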

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Bayes Theorem
  • Cloning, Molecular
  • Databases, Genetic
  • Gene Library*
  • Genomics / methods*
  • Humans
  • Models, Statistical*
  • Pan troglodytes / genetics
  • Sequence Analysis, DNA / methods*