Accurate prediction of protein catalytic residues by side chain orientation and residue contact density

PLoS One. 2012;7(10):e47951. doi: 10.1371/journal.pone.0047951. Epub 2012 Oct 24.

Abstract

Prediction of protein catalytic residues provides useful information for the studies of protein functions. Most of the existing methods combine both structure and sequence information but heavily rely on sequence conservation from multiple sequence alignments. The contribution of structure information is usually less than that of sequence conservation in existing methods. We found a novel structure feature, residue side chain orientation, which is the first structure-based feature that achieves prediction results comparable to that of evolutionary sequence conservation. We developed a structure-based method, Enzyme Catalytic residue SIde-chain Arrangement (EXIA), which is based on residue side chain orientations and backbone flexibility of protein structure. The prediction that uses EXIA outperforms existing structure-based features. The prediction quality of combing EXIA and sequence conservation exceeds that of the state-of-the-art prediction methods. EXIA is designed to predict catalytic residues from single protein structure without needing sequence or structure alignments. It provides invaluable information when there is no sufficient or reliable homology information for target protein. We found that catalytic residues have very special side chain orientation and designed the EXIA method based on the newly discovered feature. It was also found that EXIA performs well for a dataset of enzymes without any bounded ligand in their crystallographic structures.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acids / chemistry
  • Amino Acids / metabolism
  • Binding Sites
  • Catalytic Domain*
  • Computational Biology / methods*
  • Databases, Protein
  • Models, Molecular
  • Molecular Structure
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / metabolism
  • Reproducibility of Results

Substances

  • Amino Acids
  • Proteins

Grants and funding

The work was supported by grant (101-2218-E-320-001-) of National Science Council, Taiwan. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.