A Bayesian hierarchical model for allele frequencies

Genet Epidemiol. 2001 Jan;20(1):17-33. doi: 10.1002/1098-2272(200101)20:1<17::AID-GEPI3>3.0.CO;2-Q.

Abstract

Genetic epidemiological methodologies, such as linkage analysis, often require accurate estimates of allele frequencies. When studies involve multiple sub-populations with different evolutionary histories, accurate estimates can be difficult to obtain because the number of subjects per sub-population tends to be limited. Given allele counts for a collection of loci and sub-populations, we propose a Bayesian hierarchical model that extends existing empirical Bayesian approaches by allowing for explicit inclusion of prior information about both allele frequencies and inter-population divergence. We describe how such information can be derived from published data and then incorporated into the model via prior distributions for model parameters. By analysis of simulated data, we highlight how the hierarchical model, as implemented in the publicly available program AllDist, combines prior information with the observed data to refine allele frequency estimates.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alleles
  • Bayes Theorem*
  • Computer Simulation
  • Gene Frequency*
  • Genetic Linkage*
  • Humans
  • Models, Genetic*