Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions

J Mol Biol. 1997 Apr 25;268(1):209-25. doi: 10.1006/jmbi.1997.0959.


We explore the ability of a simple simulated annealing procedure to assemble native-like structures from fragments of unrelated protein structures with similar local sequences using Bayesian scoring functions. Environment and residue pair specific contributions to the scoring functions appear as the first two terms in a series expansion for the residue probability distributions in the protein database; the decoupling of the distance and environment dependencies of the distributions resolves the major problems with current database-derived scoring functions noted by Thomas and Dill. The simulated annealing procedure rapidly and frequently generates native-like structures for small helical proteins and better than random structures for small beta sheet containing proteins. Most of the simulated structures have native-like solvent accessibility and secondary structure patterns, and thus ensembles of these structures provide a particularly challenging set of decoys for evaluating scoring functions. We investigate the effects of multiple sequence information and different types of conformational constraints on the overall performance of the method, and the ability of a variety of recently developed scoring functions to recognize the native-like conformations in the ensembles of simulated structures.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bayes Theorem*
  • Computer Simulation
  • Databases, Factual
  • Models, Molecular*
  • Models, Statistical
  • Peptide Fragments / chemistry
  • Protein Folding
  • Protein Structure, Tertiary*
  • Proteins / chemistry*
  • Sequence Homology, Amino Acid


  • Peptide Fragments
  • Proteins