An HMM posterior decoder for sequence feature prediction that includes homology information

Bioinformatics. 2005 Jun:21 Suppl 1:i251-7. doi: 10.1093/bioinformatics/bti1014.

Abstract

Motivation: When predicting sequence features like transmembrane topology, signal peptides, coil-coil structures, protein secondary structure or genes, extra support can be gained from homologs.

Results: We present here a general hidden Markov model (HMM) decoding algorithm that combines probabilities for sequence features of homologs by considering the average of the posterior label probability of each position in a global sequence alignment. The algorithm is an extension of the previously described 'optimal accuracy' decoder, allowing homology information to be used. It was benchmarked using an HMM for transmembrane topology and signal peptide prediction, Phobius. We found that the performance was substantially increased when incorporating information from homologs.

Availability: A prediction server for transmembrane topology and signal peptides that uses the algorithm is available at http://phobius.cgb.ki.se/poly.html. An implementation of the algorithm is available on request from the authors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Computational Biology / methods*
  • Databases, Protein
  • Internet
  • Markov Chains
  • Models, Statistical
  • Models, Theoretical
  • Probability
  • Protein Sorting Signals
  • Software

Substances

  • Protein Sorting Signals