Analysis of the Escherichia coli genome. IV. DNA sequence of the region from 89.2 to 92.8 minutes

Nucleic Acids Res. 1993 Nov 25;21(23):5408-17. doi: 10.1093/nar/21.23.5408.

Abstract

We present the sequence of 176 kilobases of the Escherichia coli K-12 genome, from katG at 89.2 to an open reading frame (ORF) of unknown function at 92.8 minutes on the genetic map. This brings the total of contiguous sequence from the E. coli genome project to 500 kb (81.5 to 92.8 minutes). This segment contains 134 putative coding genes (ORFs), of which 66 were previously identified. Eight new genes (acs, pepE, and nrfB-G) were identified, as well as the previously mapped gldA and alr genes. Still, 58 ORFs remain unidentified despite literature and similarity searches. The arrangement of proposed genes relative to possible promoters and terminators suggests 55 potential transcription units. Other features include 13 REP elements, one IRU (ERIC) repeat, 59 computer-predicted bends, 42 Chi sites and one new grey hole. Sixteen signal peptides were found, including those of lamB, btuB, and malE. Two ribosomal RNA loci, rrnB and rrnE, are located in this segment, so we have now sequenced four of the seven E. coli rRNA loci. Comparison of the rRNA loci reveals some differences in the ribosomal structural RNAs which are generally compatible with the proposed secondary structures.
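The counts and map positions quoted in the abstract can be cross-checked with a few lines of arithmetic. The Python sketch below is only an illustration and is not part of the original work. It assumes that the 66 previously identified genes, the 8 new genes, the 2 previously mapped genes (gldA and alr) and the 58 unidentified ORFs are non-overlapping groups, and that nrfB-G denotes the six genes nrfB through nrfG. The function names are hypothetical.

```python
# Figures quoted in the abstract; the grouping below is an assumed reading.
TOTAL_ORFS = 134
PREVIOUSLY_IDENTIFIED = 66
NEW_GENES = ["acs", "pepE"] + ["nrf" + c for c in "BCDEFG"]  # nrfB-G expanded
PREVIOUSLY_MAPPED = ["gldA", "alr"]
UNIDENTIFIED = 58


def orf_tally():
    """Sum the four ORF groups, assuming they do not overlap."""
    return (PREVIOUSLY_IDENTIFIED + len(NEW_GENES)
            + len(PREVIOUSLY_MAPPED) + UNIDENTIFIED)


def kb_per_minute(length_kb, start_min, end_min):
    """Average sequence length per map minute over an interval."""
    return length_kb / (end_min - start_min)


if __name__ == "__main__":
    print("ORF tally:", orf_tally(), "of", TOTAL_ORFS)
    # This segment: 176 kb between 89.2 and 92.8 minutes.
    print("This segment: %.1f kb/min" % kb_per_minute(176, 89.2, 92.8))
    # Contiguous project total: 500 kb between 81.5 and 92.8 minutes.
    print("Contiguous total: %.1f kb/min" % kb_per_minute(500, 81.5, 92.8))
```

Under these assumptions the four groups sum to the stated 134 ORFs, and both intervals come out in the range of 44 to 49 kb per map minute.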

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Base Sequence
  • DNA, Bacterial / genetics*
  • DNA, Ribosomal / genetics*
  • Escherichia coli / genetics*
  • Genes, Bacterial*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Open Reading Frames
  • Protein Sorting Signals / chemistry
  • Regulatory Sequences, Nucleic Acid
  • Repetitive Sequences, Nucleic Acid
  • Restriction Mapping

Substances

  • DNA, Bacterial
  • DNA, Ribosomal
  • Protein Sorting Signals

Associated data

  • GENBANK/L10328
  • GENBANK/L19201
  • GENBANK/M87049
  • GENBANK/U00006