Evolution and selection in yeast promoters: analyzing the combined effect of diverse transcription factor binding sites

PLoS Comput Biol. 2008 Jan;4(1):e7. doi: 10.1371/journal.pcbi.0040007.

Abstract

In comparative genomics one analyzes jointly evolutionarily related species in order to identify conserved and diverged sequences and to infer their function. While such studies enabled the detection of conserved sequences in large genomes, the evolutionary dynamics of regulatory regions as a whole remain poorly understood. Here we present a probabilistic model for the evolution of promoter regions in yeast, combining the effects of regulatory interactions of many different transcription factors. The model expresses explicitly the selection forces acting on transcription factor binding sites in the context of a dynamic evolutionary process. We develop algorithms to compute likelihood and to learn de novo collections of transcription factor binding motifs and their selection parameters from alignments. Using the new techniques, we examine the evolutionary dynamics in Saccharomyces species promoters. Analyses of an evolutionary model constructed using all known transcription factor binding motifs and of a model learned from the data automatically reveal relatively weak selection on most binding sites. Moreover, according to our estimates, strong binding sites are constraining only a fraction of the yeast promoter sequence that is under selection. Our study demonstrates how complex evolutionary dynamics in noncoding regions emerges from formalization of the evolutionary consequences of known regulatory mechanisms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Binding Sites
  • Computer Simulation
  • Evolution, Molecular*
  • Genetic Variation / genetics*
  • Genome, Fungal / genetics*
  • Models, Genetic*
  • Molecular Sequence Data
  • Promoter Regions, Genetic / genetics*
  • Protein Binding
  • Saccharomyces cerevisiae / genetics*
  • Sequence Analysis, DNA / methods
  • Transcription Factors / genetics*

Substances

  • Transcription Factors