Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline

Proc Natl Acad Sci U S A. 2002 Sep 3;99(18):11664-9. doi: 10.1073/pnas.142413399. Epub 2002 Aug 22.


Structural genomics is emerging as a principal approach to define protein structure-function relationships. To apply this approach on a genomic scale, novel methods and technologies must be developed to determine large numbers of structures. We describe the design and implementation of a high-throughput structural genomics pipeline and its application to the proteome of the thermophilic bacterium Thermotoga maritima. By using this pipeline, we successfully cloned and attempted expression of 1,376 of the predicted 1,877 genes (73%) and have identified crystallization conditions for 432 proteins, comprising 23% of the T. maritima proteome. Representative structures from TM0423 glycerol dehydrogenase and TM0449 thymidylate synthase-complementing protein are presented as examples of final outputs from the pipeline.
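The percentages quoted in the abstract can be reproduced from the stated counts. The following is a minimal sketch for checking that arithmetic only. The constant and function names are illustrative assumptions and are not taken from the paper or its pipeline.

```python
# Counts as stated in the abstract; the names are hypothetical, chosen for this check.
PREDICTED_GENES = 1877                    # predicted T. maritima genes
CLONED_EXPRESSION_ATTEMPTED = 1376        # genes cloned with expression attempted
WITH_CRYSTALLIZATION_CONDITIONS = 432     # proteins with identified crystallization conditions


def percent_of_proteome(count, total=PREDICTED_GENES):
    """Return count as a percentage of the predicted proteome."""
    return 100.0 * count / total


cloned_pct = percent_of_proteome(CLONED_EXPRESSION_ATTEMPTED)
crystal_pct = percent_of_proteome(WITH_CRYSTALLIZATION_CONDITIONS)

# Rounded to whole percentages, these match the 73% and 23% reported in the abstract.
print(f"Cloned, expression attempted: {cloned_pct:.1f}%")
print(f"Crystallization conditions identified: {crystal_pct:.1f}%")
```

Both figures use the 1,877 predicted genes as the denominator, which is how the abstract expresses them.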

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Cloning, Molecular
  • Genome, Bacterial*
  • Models, Molecular
  • Open Reading Frames
  • Protein Conformation
  • Proteome*
  • Thermotoga maritima / genetics*
  • Thermotoga maritima / metabolism


Substances

  • Proteome

Associated data

  • PDB/1KQ3
  • PDB/1KQ4