Improving prediction of burial state of residues by exploiting correlation among residues

BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):70. doi: 10.1186/s12859-017-1475-5.

Abstract

Background: Residues in a protein might be buried inside or exposed to the solvent surrounding the protein. The buried residues usually form hydrophobic cores to maintain the structural integrity of proteins while the exposed residues are tightly related to protein functions. Thus, the accurate prediction of solvent accessibility of residues will greatly facilitate our understanding of both structure and functionalities of proteins. Most of the state-of-the-art prediction approaches consider the burial state of each residue independently, thus neglecting the correlations among residues.

Results: In this study, we present a high-order conditional random field model that considers burial states of all residues in a protein simultaneously. Our approach exploits not only the correlation among adjacent residues but also the correlation among long-range residues. Experimental results showed that by exploiting the correlation among residues, our approach outperformed the state-of-the-art approaches in prediction accuracy. In-depth case studies also showed that by using the high-order statistical model, the errors committed by the bidirectional recurrent neural network and chain conditional random field models were successfully corrected.

Conclusions: Our methods enable the accurate prediction of residue burial states, which should greatly facilitate protein structure prediction and evaluation.

Keywords: Burial states of residue; Conditional random field; Protein structure; Residue correlation.

MeSH terms

  • Databases, Factual
  • Hydrophobic and Hydrophilic Interactions
  • Models, Theoretical*
  • Protein Conformation
  • Proteins / chemistry*
  • Reproducibility of Results
  • Solvents / chemistry

Substances

  • Proteins
  • Solvents