Constrained models of evolution lead to improved prediction of functional linkage from correlated gain and loss of genes

Bioinformatics. 2007 Jan 1;23(1):14-20. doi: 10.1093/bioinformatics/btl558. Epub 2006 Nov 7.

Abstract

Motivation: We compare phylogenetic approaches for inferring functional gene links. The approaches detect independent instances of the correlated gain and loss of pairs of genes from species' genomes. We investigate the effect on results of basing evidence of correlations on two phylogenetic approaches, Dollo parsminony and maximum likelihood (ML). We further examine the effect of constraining the ML model by fixing the rate of gene gain at a low value, rather than estimating it from the data.

Results: We detect correlated evolution among a test set of pairs of yeast (Saccharomyces cerevisiae) genes, with a case study of 21 eukaryotic genomes and test data derived from known yeast protein complexes. If the rate at which genes are gained is constrained to be low, ML achieves by far the best results at detecting known functional links. The model then has fewer parameters but it is more realistic by preventing genes from being gained more than once.

Availability: BayesTraits by M. Pagel and A. Meade, and a script to configure and repeatedly launch it by D. Barker and M. Pagel, are available at http://www.evolution.reading.ac.uk

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Allelic Imbalance
  • Animals
  • Computational Biology
  • Evolution, Molecular*
  • Genetic Linkage*
  • Likelihood Functions
  • Models, Genetic*
  • Models, Statistical*
  • Phylogeny
  • Saccharomyces cerevisiae / genetics
  • Sequence Homology
  • Species Specificity