Modelling interaction sites in protein domains with interaction profile hidden Markov models

Bioinformatics. 2006 Dec 1;22(23):2851-7. doi: 10.1093/bioinformatics/btl486. Epub 2006 Sep 25.


Motivation: Due to the growing number of completely sequenced genomes, functional annotation of proteins becomes a more and more important issue. Here, we describe a method for the prediction of sites within protein domains, which are part of protein-ligand interactions. As recently demonstrated, these sites are not trivial to detect because of a varying degree of conservation of their location and type within a domain family.

Results: The developed method for the prediction of protein-ligand interaction sites is based on a newly defined interaction profile hidden Markov model (ipHMM) topology that takes structural and sequence data into account. It is based on a homology search via a posterior decoding algorithm that yields probabilities for interacting sequence positions and inherits the efficiency and the power of the profile hidden Markov model (pHMM) methodology. The algorithm enhances the quality of interaction site predictions and is a suitable tool for large scale studies, which was already demonstrated for pHMMs.

Availability: The MATLAB-files are available on request from the first author.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Artificial Intelligence
  • Binding Sites
  • Markov Chains
  • Molecular Sequence Data
  • Pattern Recognition, Automated / methods*
  • Protein Binding
  • Protein Interaction Mapping / methods*
  • Protein Structure, Tertiary
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*