Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph

PLoS Genet. 2008 Oct;4(10):e1000212. doi: 10.1371/journal.pgen.1000212. Epub 2008 Oct 10.

Abstract

Genetic linkage maps are cornerstones of a wide spectrum of biotechnology applications, including map-assisted breeding, association genetics, and map-assisted gene cloning. During the past several years, the adoption of high-throughput genotyping technologies has been paralleled by a substantial increase in the density and diversity of genetic markers. New genetic mapping algorithms are needed in order to efficiently process these large datasets and accurately construct high-density genetic maps. In this paper, we introduce a novel algorithm to order markers on a genetic linkage map. Our method is based on a simple yet fundamental mathematical property that we prove under rather general assumptions. The validity of this property allows one to determine efficiently the correct order of markers by computing the minimum spanning tree of an associated graph. Our empirical studies obtained on genotyping data for three mapping populations of barley (Hordeum vulgare), as well as extensive simulations on synthetic data, show that our algorithm consistently outperforms the best available methods in the literature, particularly when the input data are noisy or incomplete. The software implementing our algorithm is available in the public domain as a web tool under the name MSTmap.

Publication types

  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Chromosome Mapping / statistics & numerical data*
  • Cluster Analysis
  • Computer Simulation
  • Databases, Genetic
  • Genes, Plant
  • Genetic Markers
  • Genotype
  • Hordeum / genetics
  • Models, Genetic
  • Multigene Family
  • Polymorphism, Single Nucleotide
  • Software

Substances

  • Genetic Markers