Prediction of transcription regulatory sites in Archaea by a comparative genomic approach

Nucleic Acids Res. 2000 Feb 1;28(3):695-705. doi: 10.1093/nar/28.3.695.


Intragenomic and intergenomic comparisons of upstream nucleotide sequences of archaeal genes were performed with the goal of predicting transcription regulatory sites (operators) and identifying likely regulons. Learning sets for the detection of regulatory sites were constructed using the available experimental data on archaeal transcription regulation or by analogy with known bacterial regulons, and further analysis was performed using iterative profile searches. The information content of the candidate signals detected by this method is insufficient for reliable predictions to be made. Therefore, this approach has to be complemented by examination of evolutionary conservation in different archaeal genomes. This combined strategy resulted in the prediction of a conserved heat shock regulon in all euryarchaea, a nitrogen fixation regulon in the methanogens Methanococcus jannaschii and Methanobacterium thermoautotrophicum and an aromatic amino acid regulon in M.thermoautotrophicum. Unexpectedly, the heat shock regulatory site was detected not only for genes that encode known chaperone proteins but also for archaeal histone genes. This suggests a possible function for archaeal histones in stress-related changes in DNA condensation. In addition, comparative analysis of the genomes of three Pyrococcus species resulted in the prediction of their purine metabolism and transport regulon. The results demonstrate the feasibility of prediction of at least some transcription regulatory sites by comparing poorly characterized prokaryotic genomes, particularly when several closely related genome sequences are available.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaea / enzymology
  • Archaea / genetics
  • Base Sequence
  • Computational Biology* / methods
  • Conserved Sequence / genetics
  • Evolution, Molecular
  • Genes, Archaeal / genetics*
  • Genome, Archaeal*
  • Glutamate-Ammonia Ligase / genetics
  • Heat-Shock Proteins / genetics
  • Histones / genetics
  • Molecular Chaperones / genetics
  • Molecular Sequence Data
  • Nitrogen Fixation / genetics
  • Phylogeny
  • Promoter Regions, Genetic / genetics
  • Purines / metabolism
  • Regulon / genetics
  • Response Elements / genetics*
  • Sequence Alignment
  • Transcription, Genetic / genetics*
  • Tryptophan / genetics


  • Heat-Shock Proteins
  • Histones
  • Molecular Chaperones
  • Purines
  • Tryptophan
  • Glutamate-Ammonia Ligase