A model for statistical significance of local similarities in structure

J Mol Biol. 2003 Mar 7;326(5):1307-16. doi: 10.1016/s0022-2836(03)00045-7.


Structural biology can provide three-dimensional structures for proteins of unknown function. When sequence or structure comparisons fail to suggest a function, insights can come from discovery of functionally important local structural patterns. Existing methods to detect such patterns lack rigorous statistics needed for widespread application. Here, we derive a formula to calculate statistical significance of the root-mean-square deviation between atoms in such patterns. When combined with a database search method, our statistics permit true functional or structural patterns in different folds to be discerned from noise. The approach is highly complementary to fold comparison for providing functional clues for new structures, and is key for the detection of recurrences of any new pattern.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Animals
  • Binding Sites
  • Databases, Factual
  • Humans
  • Models, Molecular*
  • Protein Conformation
  • Protein Folding*
  • Protein Structure, Quaternary
  • Proteins / chemistry*
  • Proteins / genetics
  • Sequence Alignment*
  • Sequence Analysis
  • Structural Homology, Protein*


  • Proteins