BactPepDB: a database of predicted peptides from a exhaustive survey of complete prokaryote genomes

Database (Oxford). 2014 Nov 6;2014:bau106. doi: 10.1093/database/bau106. Print 2014.


With the recent progress in complete genome sequencing, mining the increasing amount of genomic information available should in theory provide the means to discover new classes of peptides. However, annotation pipelines often do not consider small reading frames likely to be expressed. BactPepDB, available online at, is a database that aims at providing an exhaustive re-annotation of all complete prokaryotic genomes-chromosomal and plasmid DNA-available in RefSeq for coding sequences ranging between 10 and 80 amino acids. The identified peptides are classified as (i) previously identified in RefSeq, (ii) entity-overlapping (intragenic) or intergenic, and (iii) potential pseudogenes-intergenic sequences corresponding to a portion of a previously annotated larger gene. Additional information is related to homologs within order, predicted signal sequence, transmembrane segments, disulfide bonds, secondary structure, and the existence of a related 3D structure in the Protein Databank. As a result, BactPepDB provides insights about candidate peptides, and provides information about their conservation, together with some of their expected biological/structural features. The BactPepDB interface allows to search for candidate peptides in the database, or to search for peptides similar to a query, according to the multiple properties predicted or related to genomic localization. Database URL:

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacterial Proteins / genetics*
  • Databases, Genetic*
  • Genome, Bacterial / genetics*
  • Internet
  • Molecular Sequence Annotation
  • Peptides / genetics*
  • Sequence Analysis, DNA
  • Sequence Analysis, Protein


  • Bacterial Proteins
  • Peptides