Estimating genealogies from linked marker data: a Bayesian approach

BMC Bioinformatics. 2007 Oct 25;8:411. doi: 10.1186/1471-2105-8-411.

Abstract

Background: Answers to several fundamental questions in statistical genetics would ideally require knowledge of the ancestral pedigree and of the gene flow therein. A few examples of such questions are haplotype estimation, relatedness and relationship estimation, gene mapping by combining pedigree and linkage disequilibrium information, and estimation of population structure.

Results: We present a probabilistic method for genealogy reconstruction. Starting with a group of genotyped individuals from some population isolate, we explore the state space of their possible ancestral histories under our Bayesian model by using Markov chain Monte Carlo (MCMC) sampling techniques. The main contribution of our work is the development of sampling algorithms in the resulting vast state space with highly dependent variables. The main drawback is the computational complexity that limits the time horizon within which explicit reconstructions can be carried out in practice.

Conclusion: The estimates for IBD (identity-by-descent) and haplotype distributions are tested in several settings using simulated data. The results appear to be promising for a further development of the method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alleles
  • Bayes Theorem*
  • Chromosome Mapping
  • Data Interpretation, Statistical
  • Female
  • Founder Effect
  • Gene Frequency
  • Genetic Linkage
  • Genetic Markers
  • Genetics, Population / methods*
  • Haplotypes
  • Humans
  • Male
  • Markov Chains
  • Models, Genetic
  • Monte Carlo Method
  • Pedigree*
  • Quantitative Trait Loci / genetics*

Substances

  • Genetic Markers