Assessing population structure: F(ST) and related measures

Mol Ecol Resour. 2011 Jan;11(1):5-18. doi: 10.1111/j.1755-0998.2010.02927.x. Epub 2010 Oct 26.

Abstract

Although F(ST) is widely used as a measure of population structure, it has been criticized recently because of its dependency on within-population diversity. This dependency can lead to difficulties in interpretation and in the comparison of estimates among species or among loci and has led to the development of two replacement statistics, F'(ST) and D. F'(ST) is the normal F(ST) standardized by the maximum value it can obtain, given the observed within-population diversity. D uses a multiplicative partitioning of diversity, based on the effective number of alleles rather than on the expected heterozygosity. In this study, we review the relationships between the three classes of statistics (F(ST), F'(ST) and D), their estimation and their properties. We illustrate the relationships between the statistics using a data set of estimates from 84 species taken from the last 4 years of Molecular Ecology. As with F(ST), unbiased estimators are available for the two new statistics D and F'(ST). Here, we develop a new unbiased F'(ST) estimator based on G(ST), which we call G''(ST). However, F'(ST) can be calculated using any F(ST) estimator for which the maximum value can be obtained. As all three statistics have their advantages and their drawbacks, we recommend continued use of F(ST) in combination with either F'(ST) or D. In most cases, F'(ST) would be the best choice among the latter two as it is most suited for inferences of the influence of demographic processes such as genetic drift and migration on genetic population structure.
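The relationships among the three classes of statistics can be illustrated with a minimal Python sketch. It is not the estimation procedure of the paper: it assumes known allele frequencies, equally weighted populations and no sampling-bias correction, so it returns parameter-style values rather than the unbiased estimators mentioned above. The function names `diversity_components` and `structure_stats` are hypothetical. The formulas assumed are the commonly cited heterozygosity-based forms: G(ST) = (H(T) - H(S)) / H(T), its maximum for k populations (k - 1)(1 - H(S)) / (k - 1 + H(S)), and D = [k / (k - 1)] (H(T) - H(S)) / (1 - H(S)).

```python
from typing import Dict, Sequence


def diversity_components(freqs: Sequence[Sequence[float]]) -> tuple:
    """Return (H_S, H_T) for equally weighted populations.

    freqs[i][a] is the frequency of allele a in population i.
    Assumption: frequencies are known exactly (no sample-size correction).
    """
    k = len(freqs)
    n_alleles = len(freqs[0])
    # H_S: mean expected heterozygosity within populations
    h_s = sum(1.0 - sum(p * p for p in pop) for pop in freqs) / k
    # H_T: expected heterozygosity of the pooled (mean) allele frequencies
    mean_p = [sum(pop[a] for pop in freqs) / k for a in range(n_alleles)]
    h_t = 1.0 - sum(p * p for p in mean_p)
    return h_s, h_t


def structure_stats(freqs: Sequence[Sequence[float]]) -> Dict[str, float]:
    """Compute G_ST, its maximum given H_S, the standardized G'_ST, and D.

    Requires at least two populations and a polymorphic locus (H_T > 0).
    """
    k = len(freqs)
    h_s, h_t = diversity_components(freqs)
    g_st = (h_t - h_s) / h_t
    # Largest G_ST attainable given the observed within-population diversity
    g_st_max = (k - 1) * (1.0 - h_s) / (k - 1 + h_s)
    # Standardized statistic: G_ST divided by its maximum
    g_st_std = g_st / g_st_max
    # Heterozygosity-based expression of D
    d = (k / (k - 1)) * (h_t - h_s) / (1.0 - h_s)
    return {"G_ST": g_st, "G_ST_max": g_st_max, "G'_ST": g_st_std, "D": d}


# Two populations that share no alleles but are each diverse (H_S = 0.5)
diverse = structure_stats([[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]])

# Two populations fixed for different alleles (H_S = 0)
fixed = structure_stats([[1.0, 0.0], [0.0, 1.0]])
```

In both examples the populations share no alleles, yet G(ST) is about 1/3 in the first case and 1 in the second, which is the dependency on within-population diversity described above. The standardized statistic and D are 1 (up to floating-point rounding) in both cases.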

Publication types

  • Review

MeSH terms

  • Factor Analysis, Statistical
  • Genetic Variation
  • Genetics, Population / methods*
  • Genetics, Population / statistics & numerical data*
  • Models, Genetic
  • Models, Statistical
  • Polymorphism, Single Nucleotide