Bayesian methods for examining Hardy-Weinberg equilibrium

Biometrics. 2010 Mar;66(1):257-65. doi: 10.1111/j.1541-0420.2009.01267.x. Epub 2009 May 12.

Abstract

Testing for Hardy-Weinberg equilibrium is ubiquitous and has traditionally been carried out via frequentist approaches. However, the discreteness of the sample space means that uniformity of p-values under the null cannot be assumed, and enumerating all possible counts, conditional on the minor allele count, is a computationally expensive way to calibrate p-values. In addition, the interpretation of the resulting p-values and the choice of significance threshold depend critically on sample size, because equilibrium will always be rejected at conventional levels with large sample sizes. We argue for a Bayesian approach using both Bayes factors and the examination of posterior distributions. We describe simple conjugate approaches and methods based on importance sampling Monte Carlo. The former are convenient because they yield closed-form expressions for Bayes factors, which allows their application to a large number of single nucleotide polymorphisms (SNPs), in particular in genome-wide contexts. We also describe straightforward direct sampling methods for examining posterior distributions of parameters of interest. For large numbers of alleles at a locus we resort to Markov chain Monte Carlo. We discuss a number of possibilities for prior specification and apply the suggested methods to a number of real datasets.
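
To illustrate why a conjugate approach gives a closed-form Bayes factor, the following is a minimal sketch for a single biallelic SNP. It assumes a Beta prior on the allele frequency under equilibrium and a Dirichlet prior on the three genotype frequencies under the unrestricted alternative. These priors, their default values, and the function names are illustrative assumptions; the abstract does not state which priors the authors use, so this should not be read as their exact method.

```python
from math import lgamma, log


def log_beta(*alphas):
    """Log of the multivariate beta function B(alpha_1, ..., alpha_k)."""
    return sum(lgamma(a) for a in alphas) - lgamma(sum(alphas))


def log_bf_hwe(n_aa, n_ab, n_bb,
               allele_prior=(1.0, 1.0),
               genotype_prior=(1.0, 1.0, 1.0)):
    """Log Bayes factor of H0 (equilibrium) versus H1 (unrestricted).

    Hypothetical helper: both priors are assumptions of this sketch.
    Positive values favor equilibrium, negative values favor departure.
    The multinomial coefficient is common to both marginal likelihoods
    and cancels in the ratio, so it is omitted.
    """
    a, b = allele_prior
    # H0: genotype probabilities (p^2, 2p(1-p), (1-p)^2), p ~ Beta(a, b).
    # Integrating p out leaves a ratio of beta functions times 2^n_ab.
    log_m0 = (n_ab * log(2.0)
              + log_beta(a + 2 * n_aa + n_ab, b + 2 * n_bb + n_ab)
              - log_beta(a, b))
    # H1: genotype probabilities ~ Dirichlet(genotype_prior).
    g = genotype_prior
    log_m1 = (log_beta(g[0] + n_aa, g[1] + n_ab, g[2] + n_bb)
              - log_beta(*g))
    return log_m0 - log_m1


if __name__ == "__main__":
    # Counts in exact equilibrium proportions versus counts with no heterozygotes.
    print(log_bf_hwe(25, 50, 25))
    print(log_bf_hwe(50, 0, 50))
```

Because the expression only involves log-gamma evaluations, it costs a handful of arithmetic operations per SNP, which is what makes this kind of calculation practical across a genome-wide panel.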

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Bayes Theorem*
  • Biometry / methods*
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Epidemiologic Methods
  • Genetic Markers / genetics*
  • Genetic Predisposition to Disease / epidemiology*
  • Genetic Predisposition to Disease / genetics*
  • Humans
  • Models, Genetic*

Substances

  • Genetic Markers