Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation

Nat Biotechnol. 1998 Oct;16(10):939-45. doi: 10.1038/nbt1098-939.


Whole-genome mRNA quantitation can be used to identify the genes that are most responsive to environmental or genotypic change. By searching for mutually similar DNA elements among the upstream non-coding DNA sequences of these genes, we can identify candidate regulatory motifs and corresponding candidate sets of coregulated genes. We have tested this strategy by applying it to three extensively studied regulatory systems in the yeast Saccharomyces cerevisiae: galactose response, heat shock, and mating type. Galactose-response data yielded the known binding site of Gal4, and six of nine genes known to be induced by galactose. Heat shock data yielded the cell-cycle activation motif, which is known to mediate cell-cycle dependent activation, and a set of genes coding for all four nucleosomal proteins. Mating type alpha and a data yielded all of the four relevant DNA motifs and most of the known a- and alpha-specific genes.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Cloning, Molecular
  • DNA / genetics*
  • Galactose / metabolism
  • Gene Expression Regulation, Fungal
  • Glucose / metabolism
  • Mating Factor
  • Peptides / genetics
  • RNA, Messenger / genetics*
  • Regulatory Sequences, Nucleic Acid*
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / growth & development


  • Peptides
  • RNA, Messenger
  • Mating Factor
  • DNA
  • Glucose
  • Galactose