Inferring orthology and paralogy

Methods Mol Biol. 2012;855:259-79. doi: 10.1007/978-1-61779-582-4_9.

Abstract

The distinction between orthologs and paralogs (genes that started diverging through speciation and through duplication, respectively) is relevant in a wide range of contexts, most notably phylogenetic tree inference and protein function annotation. In this chapter, we provide an overview of the methods used to infer orthology and paralogy. We survey both graph-based approaches (and their various grouping strategies) and tree-based approaches, which solve the more general problem of gene/species tree reconciliation. We discuss conceptual differences among the various orthology inference methods and databases, and examine the difficult issue of verifying and benchmarking orthology predictions. Finally, we review typical applications of orthologous genes, groups, and reconciled trees and conclude with thoughts on future methodological developments.
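
As a rough illustration of the tree-based view, the sketch below labels gene pairs in a rooted binary gene tree using the species-overlap heuristic: an internal node is treated as a duplication if its two subtrees share a species, and as a speciation otherwise. This is a simplification and not a full gene/species tree reconciliation. The toy tree, the `species|gene` leaf naming, and all function names are hypothetical and chosen only for this example.

```python
from itertools import product

# Hypothetical toy gene tree as nested tuples; leaves are "species|gene" strings.
# It encodes a duplication (A1 vs. A2) that predates the human/mouse split.
TOY_TREE = (("human|A1", "mouse|A1"), ("human|A2", "mouse|A2"))


def leaves(node):
    """Return all leaf labels below a node."""
    if isinstance(node, str):
        return [node]
    left, right = node
    return leaves(left) + leaves(right)


def species_of(leaf):
    """Extract the species part of a 'species|gene' label (assumed format)."""
    return leaf.split("|")[0]


def classify_pairs(node, pairs=None):
    """Label every pair of leaf genes as 'ortholog' or 'paralog'.

    A pair is labeled at the node where its two genes diverge: if the
    child subtrees share a species, the node is assumed to be a
    duplication (paralogs); otherwise it is assumed to be a speciation
    (orthologs).
    """
    if pairs is None:
        pairs = {}
    if isinstance(node, str):
        return pairs
    left, right = node
    left_leaves, right_leaves = leaves(left), leaves(right)
    shared = {species_of(x) for x in left_leaves} & {
        species_of(x) for x in right_leaves
    }
    label = "paralog" if shared else "ortholog"
    for a, b in product(left_leaves, right_leaves):
        pairs[frozenset((a, b))] = label
    classify_pairs(left, pairs)
    classify_pairs(right, pairs)
    return pairs


RELATIONS = classify_pairs(TOY_TREE)
for pair, label in sorted(RELATIONS.items(), key=lambda kv: sorted(kv[0])):
    print(" / ".join(sorted(pair)), "->", label)
```

In this toy case, human|A1 and mouse|A1 diverge at a node with no shared species and are labeled orthologs, whereas human|A1 and mouse|A2 diverge at the root, where both subtrees contain human and mouse, and are labeled paralogs even though they come from different species.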

Publication types

  • Review

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Evolution, Molecular*
  • Humans
  • Phylogeny