Inferring Balancing Selection From Genome-Scale Data

Genome Biol Evol. 2023 Mar 3;15(3):evad032. doi: 10.1093/gbe/evad032.

Abstract

The identification of genomic regions and genes that have evolved under natural selection is a fundamental objective in the field of evolutionary genetics. While various approaches have been established for the detection of targets of positive selection, methods for identifying targets of balancing selection, a form of natural selection that preserves genetic and phenotypic diversity within populations, have yet to be fully developed. Despite this, balancing selection is increasingly acknowledged as a significant driver of diversity within populations, and the identification of its signatures in genomes is essential for understanding its role in evolution. In recent years, a plethora of sophisticated methods has been developed for the detection of patterns of linked variation produced by balancing selection, such as high levels of polymorphism, altered allele-frequency distributions, and polymorphism sharing across divergent populations. In this review, we provide a comprehensive overview of classical and contemporary methods, offer guidance on the choice of appropriate methods, and discuss the importance of avoiding artifacts and of considering alternative evolutionary processes. The increasing availability of genome-scale datasets holds the potential to assist in the identification of new targets and the quantification of the prevalence of balancing selection, thus enhancing our understanding of its role in natural populations.

Keywords: composite likelihood ratio tests; genetic variation; heterozygote advantage; natural selection; population genomics; summary statistics.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Frequency
  • Genetic Variation*
  • Genetics, Population
  • Genome
  • Polymorphism, Genetic*
  • Selection, Genetic