Testing similarity measures with continuous and discrete protein models

Proteins. 2003 Jan 1;50(1):144-57. doi: 10.1002/prot.10271.


There are many ways to define the distance between two protein structures, thus assessing their similarity. Here, we investigate and compare the properties of five different distance measures, including the standard root-mean-square deviation (cRMSD). The performance of these measures is studied from different perspectives with two different protein models, one continuous and the other discrete. Using the continuous model, we examine the correlation between energy and native distance, and the ability of the different measures to discriminate between the two possible topologies of a three-helix bundle. Using the discrete model, we perform fits to real protein structures by minimizing different distance measures. The properties of the fitted structures are found to depend strongly on the distance measure used and the scale considered. We find that the cRMSD measure very effectively describes long-range features but is less effective with short-range features, and it correlates weakly with energy. A stronger correlation with energy and a better description of short-range properties is obtained when we use measures based on intramolecular distances.

Publication types

  • Comparative Study

MeSH terms

  • Models, Molecular*
  • Models, Statistical*
  • Molecular Structure
  • Protein Conformation*
  • Protein Folding
  • Protein Structure, Secondary
  • Proteins / chemistry*


  • Proteins