An HMM posterior decoder for sequence feature prediction that includes homology information

Lukas Käll; Anders Krogh; Erik L L Sonnhammer

doi:10.1093/bioinformatics/bti1014

An HMM posterior decoder for sequence feature prediction that includes homology information

Bioinformatics. 2005 Jun:21 Suppl 1:i251-7. doi: 10.1093/bioinformatics/bti1014.

Authors

Lukas Käll¹, Anders Krogh, Erik L L Sonnhammer

Affiliation

¹ Center for Genomics and Bioinformatics, Karolinska Institutet SE-17 177 Stockholm, Sweden.

PMID: 15961464
DOI: 10.1093/bioinformatics/bti1014

Abstract

Motivation: When predicting sequence features like transmembrane topology, signal peptides, coil-coil structures, protein secondary structure or genes, extra support can be gained from homologs.

Results: We present here a general hidden Markov model (HMM) decoding algorithm that combines probabilities for sequence features of homologs by considering the average of the posterior label probability of each position in a global sequence alignment. The algorithm is an extension of the previously described 'optimal accuracy' decoder, allowing homology information to be used. It was benchmarked using an HMM for transmembrane topology and signal peptide prediction, Phobius. We found that the performance was substantially increased when incorporating information from homologs.

Availability: A prediction server for transmembrane topology and signal peptides that uses the algorithm is available at http://phobius.cgb.ki.se/poly.html. An implementation of the algorithm is available on request from the authors.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Artificial Intelligence
Computational Biology / methods*
Databases, Protein
Internet
Markov Chains
Models, Statistical
Models, Theoretical
Probability
Protein Sorting Signals
Software

Substances

Protein Sorting Signals