Small membrane proteins found by comparative genomics and ribosome binding site models

Mol Microbiol. 2008 Dec;70(6):1487-501. doi: 10.1111/j.1365-2958.2008.06495.x.


The correct annotation of genes encoding the smallest proteins is one of the biggest challenges of genome annotation, and perhaps more importantly, few annotated short open reading frames have been confirmed to correspond to synthesized proteins. We used sequence conservation and ribosome binding site models to predict genes encoding small proteins, defined as having 16-50 amino acids, in the intergenic regions of the Escherichia coli genome. We tested expression of these predicted as well as previously annotated genes by integrating the sequential peptide affinity tag directly upstream of the stop codon on the chromosome and assaying for synthesis using immunoblot assays. This approach confirmed that 20 previously annotated and 18 newly discovered proteins of 16-50 amino acids are synthesized. We summarize the properties of these small proteins; remarkably more than half of the proteins are predicted to be single-transmembrane proteins, nine of which we show co-fractionate with cell membranes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Binding Sites
  • DNA, Intergenic
  • Escherichia coli / genetics*
  • Escherichia coli Proteins / biosynthesis
  • Escherichia coli Proteins / genetics*
  • Genome, Bacterial*
  • Genomics
  • Membrane Proteins / biosynthesis
  • Membrane Proteins / genetics*
  • Molecular Sequence Data
  • Protein Biosynthesis
  • Ribosomes / genetics
  • Ribosomes / metabolism*
  • Sequence Analysis, DNA
  • Sequence Homology


  • DNA, Intergenic
  • Escherichia coli Proteins
  • Membrane Proteins