An evolution based classifier for prediction of protein interfaces without using protein structures

Bioinformatics. 2005 May 15;21(10):2496-501. doi: 10.1093/bioinformatics/bti340. Epub 2005 Feb 22.


Motivation: The number of available protein structures still lags far behind the number of known protein sequences. This makes it important to predict which residues participate in protein-protein interactions using only sequence information. Few studies have tackled this problem until now.

Results: We applied support vector machines to sequences in order to generate a classification of all protein residues into those that are part of a protein interface and those that are not. For the first time evolutionary information was used as one of the attributes and this inclusion of evolutionary importance rankings improves the classification. Leave-one-out cross-validation experiments show that prediction accuracy reaches 64%.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Artificial Intelligence*
  • Binding Sites
  • Evolution, Molecular*
  • Molecular Sequence Data
  • Protein Binding
  • Protein Interaction Mapping / methods*
  • Proteins / analysis
  • Proteins / chemistry*
  • Sequence Analysis, Protein / methods*


  • Proteins