Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data

Genetics. 2003 Dec;165(4):2213-33. doi: 10.1093/genetics/165.4.2213.

Abstract

We introduce a new statistical model for patterns of linkage disequilibrium (LD) among multiple SNPs in a population sample. The model overcomes limitations of existing approaches to understanding, summarizing, and interpreting LD by (i) relating patterns of LD directly to the underlying recombination process; (ii) considering all loci simultaneously, rather than pairwise; (iii) avoiding the assumption that LD necessarily has a "block-like" structure; and (iv) being computationally tractable for huge genomic regions (up to complete chromosomes). We examine in detail one natural application of the model: estimation of underlying recombination rates from population data. Using simulation, we show that in the case where recombination is assumed constant across the region of interest, recombination rate estimates based on our model are competitive with the very best of current available methods. More importantly, we demonstrate, on real and simulated data, the potential of the model to help identify and quantify fine-scale variation in recombination rate from population data. We also outline how the model could be useful in other contexts, such as in the development of more efficient haplotype-based methods for LD mapping.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • ATP Binding Cassette Transporter, Subfamily B, Member 3
  • ATP-Binding Cassette Transporters / genetics
  • Algorithms
  • Chromosome Mapping
  • Computer Simulation
  • Genetic Heterogeneity
  • Genetics, Population
  • Haplotypes*
  • Humans
  • Linkage Disequilibrium*
  • Models, Genetic*
  • Polymorphism, Single Nucleotide*
  • Population Density
  • Recombination, Genetic*

Substances

  • ATP Binding Cassette Transporter, Subfamily B, Member 3
  • ATP-Binding Cassette Transporters
  • TAP2 protein, human