Using variable rate models to identify genes under selection in sequence pairs: their validity and limitations for EST sequences

J Mol Evol. 2007 Feb;64(2):171-80. doi: 10.1007/s00239-005-0299-5. Epub 2007 Jan 2.

Abstract

Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Base Pairing / genetics*
  • Base Sequence
  • Codon / genetics
  • DNA, Plant / genetics
  • Evolution, Molecular
  • Expressed Sequence Tags*
  • Genome, Plant
  • Helianthus / genetics*
  • Lactuca / genetics*
  • Likelihood Functions
  • Reproducibility of Results
  • Selection, Genetic

Substances

  • Codon
  • DNA, Plant