Effective scoring function for protein sequence design

Proteins. 2004 Feb 1;54(2):271-81. doi: 10.1002/prot.10560.


We have developed an effective scoring function for protein design. The atomic solvation parameters, together with the weights of energy terms, were optimized so that residues corresponding to the native sequence were predicted with low energy in the training set of 28 protein structures. The solvation energy of non-hydrogen-bonded hydrophilic atoms was considered separately and expressed in a nonlinear way. As a result, our scoring function predicted native residues as the most favorable in 59% of the total positions in 28 proteins. We then tested the scoring function by comparing the predicted stability changes for 103 T4 lysozyme mutants with the experimental values. The correlation coefficients were 0.77 for surface mutations and 0.71 for all mutations. Finally, the scoring function combined with Monte Carlo simulation was used to predict favorable sequences on a fixed backbone. The designed sequences were similar to the natural sequences of the family to which the template structure belonged. The profile of the designed sequences was helpful for identification of remote homologues of the native sequence.
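The fixed-backbone design step and the native-residue recovery measure described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not give the functional form of the scoring function, so `toy_energy` is a hypothetical stand-in that only rewards hydrophobic residues at buried positions, and the names `design_sequence`, `native_recovery`, and the `burial` pattern are likewise hypothetical. The search is a generic Metropolis Monte Carlo over single-residue substitutions, not necessarily the authors' protocol.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Hypothetical stand-in for the paper's scoring function: hydrophobic residues
# are rewarded at buried positions and penalized at exposed ones.
HYDROPHOBIC = set("AFILMVWY")


def toy_energy(sequence, burial):
    """Sum a per-position toy energy; lower is more favorable."""
    energy = 0.0
    for residue, buried in zip(sequence, burial):
        hydrophobic = residue in HYDROPHOBIC
        energy += -1.0 if hydrophobic == buried else 1.0
    return energy


def design_sequence(burial, steps=5000, temperature=0.5, seed=0):
    """Metropolis Monte Carlo over sequences for a fixed burial pattern."""
    rng = random.Random(seed)
    current = [rng.choice(AMINO_ACIDS) for _ in burial]
    current_energy = toy_energy(current, burial)
    best, best_energy = list(current), current_energy
    for _ in range(steps):
        # Propose a single-residue substitution at a random position.
        position = rng.randrange(len(current))
        old = current[position]
        current[position] = rng.choice(AMINO_ACIDS)
        new_energy = toy_energy(current, burial)
        delta = new_energy - current_energy
        # Metropolis criterion: always accept downhill moves, and accept
        # uphill moves with probability exp(-delta / temperature).
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current_energy = new_energy
            if current_energy < best_energy:
                best, best_energy = list(current), current_energy
        else:
            current[position] = old  # reject: restore the previous residue
    return "".join(best), best_energy


def native_recovery(designed, native):
    """Fraction of positions where the designed residue matches the native one."""
    matches = sum(d == n for d, n in zip(designed, native))
    return matches / len(native)


if __name__ == "__main__":
    # Hypothetical burial pattern (True = buried) for an 8-residue fragment.
    burial = [True, False, True, True, False, False, True, False]
    sequence, energy = design_sequence(burial)
    print(sequence, energy)
```

With a real scoring function, `native_recovery` applied per position across a set of structures would correspond to the kind of native-residue prediction rate the abstract reports, and a collection of low-energy designed sequences could be compiled into a profile.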

MeSH terms

  • Bacteriophage T4 / enzymology
  • Computational Biology*
  • Computer Simulation*
  • Drug Design
  • Enzyme Stability
  • Hydrogen Bonding
  • Monte Carlo Method
  • Muramidase / chemistry
  • Muramidase / genetics
  • Muramidase / metabolism
  • Mutation
  • Protein Engineering*
  • Protein Folding
  • Proteins / chemistry*
  • Proteins / genetics*
  • Proteins / metabolism
  • Static Electricity
  • Structure-Activity Relationship
  • Thermodynamics


Substances

  • Proteins
  • Muramidase