Exhaustive database searching for amino acid mutations in proteomes

Bioinformatics. 2012 Jul 15;28(14):1895-901. doi: 10.1093/bioinformatics/bts274. Epub 2012 May 10.


Motivation: Amino acid mutations in proteins can be found by searching tandem mass spectra acquired in shotgun proteomics experiments against protein sequences predicted from genomes. Traditionally, unconstrained searches for amino acid mutations have been accomplished by using a sequence tagging approach that combines de novo sequencing with database searching. However, this approach is limited by the performance of de novo sequencing.

Results: The Sipros algorithm v2.0 was developed to perform unconstrained database searching using high-resolution tandem mass spectra by exhaustively enumerating all single non-isobaric mutations for every residue in a protein database. The performance of Sipros for amino acid mutation identification exceeded that of an established sequence tagging algorithm, Inspect, based on benchmarking results from a Rhodopseudomonas palustris proteomics dataset. To demonstrate the viability of the algorithm for meta-proteomics, Sipros was used to identify amino acid mutations in a natural microbial community in acid mine drainage.

Availability: The Sipros algorithm is freely available at\newline http://code.google.com/p/sipros.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Amino Acids*
  • Computational Biology / methods
  • Databases, Protein
  • Microbial Consortia / genetics
  • Mining
  • Mutation*
  • Proteins / chemistry
  • Proteome / analysis
  • Proteomics / methods*
  • Rhodopseudomonas / genetics
  • Tandem Mass Spectrometry


  • Amino Acids
  • Proteins
  • Proteome