An in silico analysis of T-box regulated genes and T-box evolution in prokaryotes, with emphasis on prediction of substrate specificity of transporters

BMC Genomics. 2008 Jul 14:9:330. doi: 10.1186/1471-2164-9-330.

Abstract

Background: T-box anti-termination is an elegant and sensitive mechanism by which many bacteria maintain constant levels of amino acid-charged tRNAs. The amino acid specificity of the regulatory element is related to a so-called specifier codon and can in principle be used to guide the functional annotation of the genes controlled via the T-box anti-termination mechanism.

Results: Hidden Markov Models were defined to search for the T-box regulatory element and were applied to all completed prokaryotic genomes. The vast majority of the genes found downstream of the retrieved elements encoded functions related to the transport and synthesis of amino acids and the charging of tRNA. This is fully in line with findings reported in the literature and with the proposed biological role of the regulatory element. For several species, the functional annotation of a large number of genes encoding proteins involved in amino acid transport could be improved significantly on the basis of the amino acid specificity of the identified T-boxes. In addition, these annotations could be extrapolated to a larger number of orthologous systems in other species. Analysis of T-box distribution confirmed that the element is restricted predominantly to species of the phylum Firmicutes. Furthermore, the distribution appeared to be highly species-specific, and in the case of amino acid transport some T-boxes seem to have emerged only recently.

Conclusion: We have demonstrated that identifying the molecular specificity of a regulatory element can be of great help in solving notoriously difficult annotation issues, e.g. by defining the substrate specificity of genes encoding amino acid transporters on the basis of the amino acid specificity of the regulatory T-box. Furthermore, our analysis of the species-dependency of the occurrence of specific T-boxes indicated that these regulatory elements propagate semi-independently of the genes that they control.
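
The annotation idea summarized above, inferring a transporter's likely substrate from the specifier codon of its upstream T-box, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: it assumes the specifier codon has already been located in a detected T-box, it uses the standard genetic code, and the function names and the gene identifier in the usage example are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def predicted_substrate(specifier_codon):
    """Return the one-letter amino acid encoded by a specifier codon.

    Accepts DNA or RNA notation. Returns None for stop codons, which
    carry no amino acid specificity. Raises ValueError on malformed input.
    """
    codon = specifier_codon.upper().replace("U", "T")
    if len(codon) != 3 or any(base not in BASES for base in codon):
        raise ValueError(f"not a valid codon: {specifier_codon!r}")
    amino_acid = CODON_TABLE[codon]
    return None if amino_acid == "*" else amino_acid


def annotate_transporter(gene_id, specifier_codon):
    """Build a hedged annotation string for a gene downstream of a T-box.

    gene_id is a hypothetical identifier supplied by the caller.
    """
    amino_acid = predicted_substrate(specifier_codon)
    if amino_acid is None:
        return f"{gene_id}: substrate specificity unresolved"
    return f"{gene_id}: putative transporter for amino acid {amino_acid}"


if __name__ == "__main__":
    # Hypothetical gene identifier, for illustration only.
    print(annotate_transporter("gene_0001", "AUG"))
```

A prediction of this kind is only a hypothesis about substrate specificity; it depends on the specifier codon having been identified correctly in the regulatory element.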

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Transport Systems / genetics
  • Bacteria / genetics*
  • Bacterial Proteins / genetics*
  • Base Sequence
  • Codon / genetics
  • DNA, Bacterial / genetics
  • Evolution, Molecular*
  • Gene Expression Regulation, Bacterial
  • Gene Regulatory Networks
  • Genome, Bacterial
  • Multigene Family
  • RNA Ligase (ATP) / genetics
  • Regulatory Elements, Transcriptional / genetics*
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Species Specificity
  • Substrate Specificity
  • T-Box Domain Proteins / genetics*
  • Terminator Regions, Genetic

Substances

  • Amino Acid Transport Systems
  • Bacterial Proteins
  • Codon
  • DNA, Bacterial
  • T-Box Domain Proteins
  • RNA Ligase (ATP)