Correcting for ascertainment biases when analyzing SNP data: applications to the estimation of linkage disequilibrium

Theor Popul Biol. 2003 May;63(3):245-55. doi: 10.1016/s0040-5809(03)00005-4.


As large-scale sequencing efforts turn from single genome sequencing to polymorphism discovery, single nucleotide polymorphisms (SNPs) are becoming an increasingly important class of population genetic data. But because of the ascertainment biases introduced by many methods of SNP discovery, most SNP data cannot be analyzed using classical population genetic methods. Statistical methods must instead be developed that can explicitly take into account each method of SNP discovery. Here we review some of the current methods for analyzing SNPs and derive sampling distributions for single SNPs and pairs of SNPs for some common SNP discovery schemes. We also show that the ascertainment scheme has a large effect on the estimation of linkage disequilibrium and recombination, and describe some methods of correcting for ascertainment biases when estimating recombination rates from SNP data.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bias*
  • Genetics, Population / statistics & numerical data*
  • Humans
  • Linkage Disequilibrium*
  • Polymorphism, Single Nucleotide / genetics*
  • United States