High-performance signal peptide prediction based on sequence alignment techniques

Bioinformatics. 2008 Oct 1;24(19):2172-6. doi: 10.1093/bioinformatics/btn422. Epub 2008 Aug 12.

Abstract

The accuracy of current signal peptide predictors is outstanding. The most successful predictors are based on neural networks and hidden Markov models, reaching a sensitivity of 99% and an accuracy of 95%. Here, we demonstrate that the popular BLASTP alignment tool can be tuned for signal peptide prediction reaching the same high level of prediction success. Alignment-based techniques provide additional benefits. In spite of high success rates signal peptide predictors yield false predictions. Simple sequences like polyvaline, for example, are predicted as signal peptides. The general architecture of learning systems makes it difficult to trace the cause of such problems. This kind of false predictions can be recognized or avoided altogether by using sequence comparison techniques. Based on these results we have implemented a public web service, called Signal-BLAST. Predictions returned by Signal-BLAST are transparent and easy to analyze.

Availability: Signal-BLAST is available online at http://sigpep.services.came.sbg.ac.at/signalblast.html.

MeSH terms

  • Computational Biology / methods
  • Databases, Protein
  • Protein Sorting Signals*
  • Sequence Alignment*
  • Sequence Analysis, Protein*

Substances

  • Protein Sorting Signals