Template based protein structure modeling by global optimization in CASP11

Proteins. 2016 Sep:84 Suppl 1:221-32. doi: 10.1002/prot.24917. Epub 2015 Sep 14.

Abstract

For the template-based modeling (TBM) of CASP11 targets, we developed three new protein modeling protocols (nns for server prediction, and LEE and LEER for human prediction) by improving upon our previous CASP protocols (CASP7 through CASP10). We applied conformational space annealing, a powerful global optimization method, to three stages of optimization: multiple sequence-structure alignment, three-dimensional (3D) chain building, and side-chain remodeling. To improve fold recognition, we developed a new alignment method called CRFalign. It can incorporate sensitive positional and environmental dependence in alignment scores as well as strong nonlinear correlations among various features. The form of the energy function and the weight parameters used in the chain-building procedure were modified and adjusted. For the side-chain remodeling step, residue-type dependence was introduced into the cutoff value that determines whether a rotamer enters the side-chain modeling library. The improved performance of the nns server method is attributed to successful fold recognition, achieved by combining several methods including CRFalign, and to the current modeling formulation, which can incorporate native-like structural aspects present in multiple templates. The LEE protocol is identical to the nns one except that CASP11-released server models are used as templates. The success of LEE in utilizing CASP11 server models indicates that proper template screening and template clustering, assisted by appropriate cluster ranking, promise a new direction for enhancing protein 3D modeling. Proteins 2016; 84(Suppl 1):221-232. © 2015 Wiley Periodicals, Inc.
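
As a rough illustration of the conformational space annealing (CSA) idea named in the abstract, the Python sketch below runs a toy version of it on a two-dimensional test function. It is not the authors' protocol. The function names (`energy`, `local_minimize`, `distance`, `csa`), the Rastrigin test function, the perturbation scheme, and the cutoff schedule (shrinking from one half to one fifth of the initial average bank distance) are all assumptions made for illustration. What the sketch keeps from the general method is a bank of locally minimized solutions and a distance cutoff that is annealed so the search moves from broad coverage to refinement.

```python
import math
import random


def energy(x):
    """Toy multi-minimum energy (Rastrigin); a stand-in for a modeling energy."""
    return 10 * len(x) + sum(xi * xi - 10 * math.cos(2 * math.pi * xi) for xi in x)


def local_minimize(x, step=0.05, iters=200):
    """Crude derivative-free descent: accept coordinate moves that lower energy."""
    x = list(x)
    e = energy(x)
    for _ in range(iters):
        improved = False
        for i in range(len(x)):
            for d in (step, -step):
                y = list(x)
                y[i] += d
                ey = energy(y)
                if ey < e:
                    x, e, improved = y, ey, True
        if not improved:
            step *= 0.5          # refine the step once no move helps
            if step < 1e-6:
                break
    return x, e


def distance(a, b):
    """Euclidean distance between two solutions."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def csa(bank_size=10, n_rounds=30, seed=0):
    rng = random.Random(seed)
    # Bank of locally minimized random starting points, stored as (x, energy).
    bank = [local_minimize([rng.uniform(-5, 5) for _ in range(2)])
            for _ in range(bank_size)]
    initial_best = min(e for _, e in bank)

    # Assumed schedule: cutoff shrinks geometrically from d_avg/2 to d_avg/5.
    pairs = [(i, j) for i in range(bank_size) for j in range(i + 1, bank_size)]
    d_avg = sum(distance(bank[i][0], bank[j][0]) for i, j in pairs) / len(pairs)
    d_cut, d_final = d_avg / 2, d_avg / 5
    ratio = 0.4 ** (1.0 / n_rounds)

    for _ in range(n_rounds):
        for seed_idx in range(bank_size):
            # Trial: copy one coordinate from a random partner, add noise,
            # then locally minimize (hypothetical perturbation scheme).
            partner = bank[rng.randrange(bank_size)][0]
            trial = list(bank[seed_idx][0])
            k = rng.randrange(len(trial))
            trial[k] = partner[k] + rng.gauss(0, 0.3)
            trial, e_trial = local_minimize(trial)

            # Bank update: compete with the nearest member if it lies within
            # d_cut, otherwise with the highest-energy member.
            nearest = min(range(bank_size),
                          key=lambda i: distance(trial, bank[i][0]))
            if distance(trial, bank[nearest][0]) < d_cut:
                target = nearest
            else:
                target = max(range(bank_size), key=lambda i: bank[i][1])
            if e_trial < bank[target][1]:
                bank[target] = (trial, e_trial)
        d_cut = max(d_final, d_cut * ratio)   # anneal the distance cutoff
    return bank, initial_best


if __name__ == "__main__":
    final_bank, start_best = csa()
    best_x, best_e = min(final_bank, key=lambda item: item[1])
    print("best initial energy:", start_best)
    print("best final energy:", best_e, "at", best_x)
```

Because a bank member is only ever replaced by a lower-energy trial, the best energy in the bank cannot get worse over the run. The large initial cutoff keeps the bank diverse, and the shrinking cutoff lets nearby solutions compete more selectively later on.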

Keywords: CASP; fold recognition; global optimization; homology modeling; protein structure modeling; sequence alignment; template-based modeling.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data*
  • Computer Simulation
  • Databases, Protein
  • Humans
  • Internet
  • Models, Molecular*
  • Models, Statistical*
  • Protein Folding
  • Protein Interaction Domains and Motifs
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Sequence Alignment
  • Software*
  • Structural Homology, Protein
  • Thermodynamics

Substances

  • Proteins