The architecture of long-range haplotypes shared within and across populations

Mol Biol Evol. 2012 Feb;29(2):473-86. doi: 10.1093/molbev/msr133. Epub 2011 Oct 6.


Homologous long segments along the genomes of close or remote relatives that are identical by descent (IBD) from a common ancestor provide clues for recent events in human genetics. We set out to extensively map such IBD segments in large cohorts and investigate their distribution within and across different populations. We report analysis of several data sets, demonstrating that IBD is more common than expected by naïve models of population genetics. We show that the frequency of IBD pairs is population dependent and can be used to cluster individuals into populations, detect a homogeneous subpopulation within a larger cohort, and infer bottleneck events in such a subpopulation. Specifically, we show that Ashkenazi Jewish individuals are all connected through transitive remote family ties evident by sharing of 50 cM IBD to a publicly available data set of less than 400 individuals. We further expose regions where long-range haplotypes are shared significantly more often than elsewhere in the genome, observed across multiple populations, and enriched for common long structural variation. These are inconsistent with recent relatedness and suggest ancient common ancestry, with limited recombination between haplotypes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Evolution, Molecular*
  • Gene Frequency
  • Genetic Variation*
  • Genetics, Population / methods*
  • Genome, Human
  • Haplotypes
  • Humans
  • Jews / genetics
  • Pedigree
  • Polymorphism, Single Nucleotide