Prediction of protein surface accessibility with information theory

Proteins. 2001 Mar 1;42(4):452-9. doi: 10.1002/1097-0134(20010301)42:4<452::aid-prot40>;2-q.


A new, simple method based on information theory is introduced to predict the solvent accessibility of amino acid residues in various states defined by their different thresholds. Prediction is achieved by the application of information obtained from a single amino acid position or pair-information for a window of seventeen amino acids around the desired residue. Results obtained by pairwise information values are better than results from single amino acids. This reinforces the effect of the local environment on the accessibility of amino acid residues. The prediction accuracy of this method in a jackknife test system for two and three states is better than 70 and 60 %, respectively. A comparison of the results with those reported by others involving the same data set also testifies to a better prediction accuracy in our case.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Amino Acids / metabolism
  • Animals
  • Databases, Factual
  • Humans
  • Information Theory*
  • Linear Models
  • Models, Chemical
  • Models, Molecular
  • Protein Conformation / drug effects
  • Proteins / chemistry*
  • Proteins / metabolism
  • Solvents / chemistry*
  • Solvents / metabolism
  • Solvents / pharmacology


  • Amino Acids
  • Proteins
  • Solvents