Complete genome sequence of Methanobacterium thermoautotrophicum deltaH: functional analysis and comparative genomics

J Bacteriol. 1997 Nov;179(22):7135-55. doi: 10.1128/jb.179.22.7135-7155.1997.


The complete 1,751,377-bp sequence of the genome of the thermophilic archaeon Methanobacterium thermoautotrophicum deltaH has been determined by a whole-genome shotgun sequencing approach. A total of 1,855 open reading frames (ORFs) have been identified that appear to encode polypeptides, 844 (46%) of which have been assigned putative functions based on their similarities to database sequences with assigned functions. A total of 514 (28%) of the ORF-encoded polypeptides are related to sequences with unknown functions, and 496 (27%) have little or no homology to sequences in public databases. Comparisons with Eucarya-, Bacteria-, and Archaea-specific databases reveal that 1,013 of the putative gene products (54%) are most similar to polypeptide sequences described previously for other organisms in the domain Archaea. Comparisons with the Methanococcus jannaschii genome data underline the extensive divergence that has occurred between these two methanogens; only 352 (19%) of M. thermoautotrophicum ORFs encode sequences that are >50% identical to M. jannaschii polypeptides, and there is little conservation in the relative locations of orthologous genes. When the M. thermoautotrophicum ORFs are compared to sequences from only the eucaryal and bacterial domains, 786 (42%) are more similar to bacterial sequences and 241 (13%) are more similar to eucaryal sequences. The bacterial domain-like gene products include the majority of those predicted to be involved in cofactor and small molecule biosyntheses, intermediary metabolism, transport, nitrogen fixation, regulatory functions, and interactions with the environment. Most proteins predicted to be involved in DNA metabolism, transcription, and translation are more similar to eucaryal sequences. Gene structure and organization have features that are typical of the Bacteria, including genes that encode polypeptides closely related to eucaryal proteins. There are 24 polypeptides that could form two-component sensor kinase-response regulator systems and homologs of the bacterial Hsp70-response proteins DnaK and DnaJ, which are notably absent in M. jannaschii. DNA replication initiation and chromosome packaging in M. thermoautotrophicum are predicted to have eucaryal features, based on the presence of two Cdc6 homologs and three histones; however, the presence of an ftsZ gene indicates a bacterial type of cell division initiation. The DNA polymerases include an X-family repair type and an unusual archaeal B type formed by two separate polypeptides. The DNA-dependent RNA polymerase (RNAP) subunits A', A", B', B" and H are encoded in a typical archaeal RNAP operon, although a second A' subunit-encoding gene is present at a remote location. There are two rRNA operons, and 39 tRNA genes are dispersed around the genome, although most of these occur in clusters. Three of the tRNA genes have introns, including the tRNAPro (GGG) gene, which contains a second intron at an unprecedented location. There is no selenocysteinyl-tRNA gene nor evidence for classically organized IS elements, prophages, or plasmids. The genome contains one intein and two extended repeats (3.6 and 8.6 kb) that are members of a family with 18 representatives in the M. jannaschii genome.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Anaerobiosis
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Bacterial Proteins / physiology
  • Carbon / metabolism
  • Chromosome Mapping
  • DNA, Bacterial / analysis*
  • DNA, Bacterial / genetics*
  • Gene Library
  • Genes, Bacterial*
  • Genome, Bacterial*
  • Methane / metabolism
  • Methanobacterium / genetics*
  • Methanobacterium / metabolism
  • Nitrogen / metabolism
  • Open Reading Frames
  • Phylogeny
  • Plasmids
  • Protein Biosynthesis
  • RNA / metabolism
  • RNA-Directed DNA Polymerase / metabolism
  • Repetitive Sequences, Nucleic Acid
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Nucleic Acid
  • Transcription Factors / metabolism
  • Transcription, Genetic


  • Bacterial Proteins
  • DNA, Bacterial
  • Transcription Factors
  • RNA
  • Carbon
  • RNA-Directed DNA Polymerase
  • Nitrogen
  • Methane

Associated data

  • GENBANK/AE000795
  • GENBANK/AE000796
  • GENBANK/AE000797
  • GENBANK/AE000798
  • GENBANK/AE000799
  • GENBANK/AE000800
  • GENBANK/AE000801
  • GENBANK/AE000802
  • GENBANK/AE000803
  • GENBANK/AE000804
  • GENBANK/AE000805
  • GENBANK/AE000806
  • GENBANK/AE000807
  • GENBANK/AE000808
  • GENBANK/AE000809
  • GENBANK/AE000810
  • GENBANK/AE000811
  • GENBANK/AE000812
  • GENBANK/AE000813
  • GENBANK/AE000814
  • GENBANK/AE000815
  • GENBANK/AE000816
  • GENBANK/AE000817
  • GENBANK/AE000818
  • GENBANK/AE000819
  • GENBANK/AE000820
  • GENBANK/AE000821