Predicting protein fold pattern with functional domain and sequential evolution information

J Theor Biol. 2009 Feb 7;256(3):441-6. doi: 10.1016/j.jtbi.2008.10.007. Epub 2008 Oct 19.

Abstract

The fold pattern of a protein is one level deeper than its structural classification, and hence is more challenging and complicated for prediction. Many efforts have been made in this regard, but so far all the reported success rates are still under 70%, indicating that it is extremely difficult to enhance the success rate even by 1% or 2%. To address this problem, here a novel approach is proposed that is featured by combining the functional domain information and the sequential evolution information through a fusion ensemble classifier. The predictor thus developed is called PFP-FunDSeqE. Tests were performed for identifying proteins among their 27 fold patterns. Compared with the existing predictors tested by a same stringent benchmark dataset, the new predictor can, for the first time, achieve over 70% success rate. The PFP-FunDSeqE predictor is freely available to the public as a web server at http://www.csbio.sjtu.edu.cn/bioinf/PFP-FunDSeqE/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Biological Evolution*
  • Databases, Protein
  • Models, Biological
  • Models, Chemical*
  • Protein Folding*
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / classification
  • Structure-Activity Relationship

Substances

  • Proteins