Prediction of site-specific amino acid distributions and limits of divergent evolutionary changes in protein sequences

Mol Biol Evol. 2005 Mar;22(3):630-8. doi: 10.1093/molbev/msi048. Epub 2004 Nov 10.

Abstract

We derive an analytic expression for site-specific stationary distributions of amino acids from the structurally constrained neutral (SCN) model of protein evolution with conservation of folding stability. The stationary distributions that we obtain have a Boltzmann-like shape, and their effective temperature parameter, measuring the limit of divergent evolutionary changes at a given site, can be predicted from a site-specific topological property, the principal eigenvector of the contact matrix of the native conformation of the protein. These analytic results, obtained without free parameters, are compared with simulations of the SCN model and with the site-specific amino acid distributions obtained from the Protein Data Bank. These results also provide new insights into how the topology of a protein fold influences its designability, i.e., the number of sequences compatible with that fold. The dependence of the effective temperature on the principal eigenvector decreases for longer proteins, possibly because selection for thermodynamic stability is weaker in longer proteins.
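
The two quantities named in the abstract, the principal eigenvector of the contact matrix and a Boltzmann-like site-specific distribution, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the abstract: the 5-residue contact matrix is a toy, the per-residue scores are placeholders standing in for a real amino acid property scale, the proportional map from eigenvector component to the inverse effective temperature `beta` is hypothetical, and the sign convention in the exponent is arbitrary. The abstract does not state the actual functional form used in the paper.

```python
import numpy as np


def principal_eigenvector(contact_matrix):
    """Return the eigenvector of the largest eigenvalue of a symmetric matrix."""
    c = np.asarray(contact_matrix, dtype=float)
    # eigh handles symmetric matrices and returns eigenvalues in ascending order
    _, eigenvectors = np.linalg.eigh(c)
    v = eigenvectors[:, -1]
    # The overall sign of an eigenvector is arbitrary; make components non-negative
    return -v if v.sum() < 0 else v


def boltzmann_site_distribution(beta, scores):
    """Boltzmann-like distribution p(a) proportional to exp(-beta * score(a))."""
    logits = -beta * np.asarray(scores, dtype=float)
    logits -= logits.max()  # shift for numerical stability before exponentiating
    weights = np.exp(logits)
    return weights / weights.sum()


# Toy symmetric contact matrix for 5 residues (hypothetical, not a real protein)
contacts = np.zeros((5, 5))
for i, j in [(0, 2), (0, 3), (0, 4), (1, 3), (2, 4)]:
    contacts[i, j] = contacts[j, i] = 1.0

c = principal_eigenvector(contacts)

# Placeholder scores for 20 amino acids (NOT a real hydrophobicity scale)
scores = np.linspace(-1.0, 1.0, 20)

# Hypothetical map: beta_i proportional to the normalized eigenvector component
betas = 1.0 * c / c.mean()

# One distribution per site, shape (number of sites, 20)
dists = np.array([boltzmann_site_distribution(b, scores) for b in betas])
```

In this sketch, sites with a larger eigenvector component get a larger `beta` and therefore a more sharply peaked distribution, while `beta = 0` gives a uniform distribution over the 20 residue types. This is meant only to convey how a single topological quantity per site could set the width of that site's distribution.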

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / chemistry*
  • Animals
  • Evolution, Molecular*
  • Models, Molecular*
  • Protein Folding*
  • Proteins / chemistry*

Substances

  • Amino Acids
  • Proteins