Comparative genomics of thiamin biosynthesis in procaryotes. New genes and regulatory mechanisms

J Biol Chem. 2002 Dec 13;277(50):48949-59. doi: 10.1074/jbc.M208965200. Epub 2002 Oct 9.


Vitamin B(1) in its active form thiamin pyrophosphate is an essential coenzyme that is synthesized by coupling of pyrimidine (hydroxymethylpyrimidine; HMP) and thiazole (hydroxyethylthiazole) moieties in bacteria. Using comparative analysis of genes, operons, and regulatory elements, we describe the thiamin biosynthetic pathway in available bacterial genomes. The previously detected thiamin-regulatory element, thi box (Miranda-Rios, J., Navarro, M., and Soberon, M. (2001) Proc. Natl. Acad. Sci. U. S. A. 98, 9736-9741), was extended, resulting in a new, highly conserved RNA secondary structure, the THI element, which is widely distributed in eubacteria and also occurs in some archaea. Search for THI elements and analysis of operon structures identified a large number of new candidate thiamin-regulated genes, mostly transporters, in various prokaryotic organisms. In particular, we assign the thiamin transporter function to yuaJ in the Bacillus/Clostridium group and the HMP transporter function to an ABC transporter thiXYZ in some proteobacteria and firmicutes. By analogy to the model of regulation of the riboflavin biosynthesis, we suggest thiamin-mediated regulation based on formation of alternative RNA structures involving the THI element. Either transcriptional or translational attenuation mechanism may operate in different taxonomic groups, dependent on the existence of putative hairpins that either act as transcriptional terminators or sequester translation initiation sites. Based on analysis of co-occurrence of the thiamin biosynthetic genes in complete genomes, we predict that eubacteria, archaea, and eukaryota have different pathways for the HMP and hydroxyethylthiazole biosynthesis.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Biological Transport
  • Genome, Bacterial*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • RNA, Bacterial
  • Sequence Homology, Nucleic Acid
  • Species Specificity
  • Thiamine / biosynthesis*


  • RNA, Bacterial
  • Thiamine