A novel measure of genetic distance for highly polymorphic tandem repeat loci

Mol Biol Evol. 1995 Sep;12(5):914-20. doi: 10.1093/oxfordjournals.molbev.a040268.


Genetic distance measures are indicators of relatedness among populations or species and are useful for reconstructing the historic and phylogenetic relationships among such groups. Classical measures of genetic distance were developed to analyze biochemical and serological polymorphisms, systems which generally show limited variability. However, these traditional measures of genetic distance are inadequate for the analysis of certain classes of variable number tandem repeat (VNTR) loci, which have a larger number of alleles and higher levels of heterozygosity than traditional genetic markers. At the higher levels of heterozygosity observed at these loci, the standard measures of genetic distance are nonlinear and do not account for the mutational mechanisms of hypervariable loci. We have developed a measure of genetic distance, DSW, which is appropriate for the analysis of highly polymorphic DNA loci. Using computer simulations of diverging populations, we show that DSW conforms to linearity and that the variance is similar in magnitude to traditional measures of genetic distance. Comparisons of phylogenetic trees derived from the simulated divergence of human racial groups demonstrate that the branch lengths of trees prepared using DSW are more similar to the model tree than those generated using other measures. Finally, we demonstrate the applicability of DSW to evolutionary analysis by reconstructing the relationships among eight human populations using 14 microsatellite and STR loci. The phylogenetic trees generated using DSW are different from trees constructed with traditional measures and better reflect the well-documented ancient divergence of African and non-African populations.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Computer Simulation*
  • Confidence Intervals
  • Continental Population Groups / genetics
  • DNA / genetics*
  • Ethnic Groups / genetics*
  • Genetic Variation*
  • Hominidae / genetics*
  • Humans
  • Models, Genetic*
  • Models, Statistical
  • Phylogeny*
  • Polymorphism, Genetic*
  • Repetitive Sequences, Nucleic Acid*
  • Time


  • DNA