Conserved noncoding sequences in the grasses

Genome Res. 2003 Sep;13(9):2030-41. doi: 10.1101/gr.1280703.

Abstract

As orthologous genes from related species diverge over time, some sequences are conserved in noncoding regions. In mammals, large phylogenetic footprints, or conserved noncoding sequences (CNSs), are known to be common features of genes. Here we present the first large-scale analysis of plant genes for CNSs. We used maize and rice, maximally diverged members of the grass family of monocots. Using a local sequence alignment set to deliver only significant alignments, we found one or more CNSs in the noncoding regions of the majority of genes studied. Grass genes have dramatically fewer and much smaller CNSs than mammalian genes. Twenty-seven percent of grass gene comparisons revealed no CNSs. Genes functioning in upstream regulatory roles, such as transcription factors, are greatly enriched for CNSs relative to genes encoding enzymes or structural proteins. Further, we show that a CNS cluster in an intron of the knotted1 homeobox gene serves as a site of negative regulation. We showthat CNSs in the adh1 gene do not correlate with known cis-acting sites. We discuss the potential meanings of CNSs and their value as analytical tools and evolutionary characters. We advance the idea that many CNSs function to lock-in gene regulatory decisions.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • 5' Flanking Region / genetics
  • Animals
  • Binding Sites / genetics
  • Computational Biology / methods
  • Conserved Sequence / genetics*
  • DNA Transposable Elements / genetics
  • Gene Expression Regulation, Plant / genetics
  • Genes, Regulator / genetics
  • Homeodomain Proteins / genetics
  • Humans
  • Introns / genetics
  • Multigene Family / genetics
  • Nuclear Matrix-Associated Proteins / genetics
  • Oryza / genetics*
  • Plant Proteins / genetics
  • Promoter Regions, Genetic / genetics
  • Untranslated Regions / genetics*
  • Zea mays / genetics*

Substances

  • DNA Transposable Elements
  • Homeodomain Proteins
  • Kn1 protein, plant
  • Nuclear Matrix-Associated Proteins
  • Plant Proteins
  • Untranslated Regions