Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae

Nucleic Acids Res. 2002 Mar 1;30(5):1233-9. doi: 10.1093/nar/30.5.1233.

Abstract

Patterns of similarity between genomes of related species reflect the distribution of selective constraint within DNA. We analyzed alignments of 142 orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae and found a mosaic pattern with regions of high similarity (phylogenetic footprints) interspersed with non-alignable sequences. Footprints cover approximately 20% of intergenic regions, often occur in clumps and are rare within 5' UTRs but common within 3' UTRs. The footprints have a higher ratio of transitions to transversions than expected at random and a higher GC content than the rest of the intergenic region. The number of footprints and the GC content of footprints within an intergenic region are higher when genes are oriented so that their 5' ends form the boundaries of the intergenic region. Overall, the patterns and characteristics identified here, along with other comparative and experimental studies, suggest that many footprints have a regulatory function, although other types of function are also possible. These conclusions may be quite general across eukaryotes, and the characteristics of conserved regulatory elements determined from genomic comparisons can be useful in prediction of regulation sites within individual DNA sequences.
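The two sequence statistics mentioned in the abstract, GC content and the ratio of transitions to transversions, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names `gc_content` and `ts_tv_counts`, the gap character `-`, and the example sequences are all assumptions made here for illustration. As a point of reference, each base has one possible transition and two possible transversions, so if all substitutions were equally likely the expected ratio would be 0.5; the abstract's "expected at random" baseline may have been computed differently.

```python
# Illustrative sketch only; names and inputs are hypothetical, not from the paper.
PURINES = {"A", "G"}


def gc_content(seq):
    """Fraction of G or C among the unambiguous bases (A, C, G, T) in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


def ts_tv_counts(aligned_a, aligned_b):
    """Count transitions and transversions between two aligned sequences.

    Columns containing a gap or an ambiguous symbol are skipped.
    A transition is purine<->purine (A/G) or pyrimidine<->pyrimidine (C/T);
    any other mismatch is a transversion.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    ts = tv = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x not in "ACGT" or y not in "ACGT" or x == y:
            continue
        if (x in PURINES) == (y in PURINES):
            ts += 1  # both purines or both pyrimidines
        else:
            tv += 1  # purine paired with pyrimidine
    return ts, tv


if __name__ == "__main__":
    # Made-up aligned fragment, not data from the study.
    a = "ACGT-A"
    b = "GCTTAA"
    ts, tv = ts_tv_counts(a, b)
    print("transitions:", ts, "transversions:", tv)
    print("GC content of a:", gc_content(a))
```

Applied to a footprint and to the remainder of the same intergenic region, these two functions would give the kind of comparison the abstract describes, assuming the footprint boundaries are already known from an alignment.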

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Caenorhabditis / genetics*
  • Caenorhabditis elegans / genetics*
  • DNA, Helminth / analysis
  • DNA, Intergenic*
  • GC Rich Sequence
  • Molecular Sequence Data
  • Phylogeny
  • Sequence Analysis, DNA
  • Sequence Homology, Nucleic Acid
  • Species Specificity
  • Untranslated Regions

Substances

  • DNA, Helminth
  • DNA, Intergenic
  • Untranslated Regions