A simple definition of structural regions in proteins and its use in analyzing interface evolution

J Mol Biol. 2010 Nov 5;403(4):660-70. doi: 10.1016/j.jmb.2010.09.028. Epub 2010 Sep 22.


Analysis of proteins commonly requires the partition of their structure into regions such as the surface, interior, or interface. Despite the frequent use of such categorization, no consensus definition seems to exist. This study thus aims at providing a definition that is general, is simple to implement, and yields new biological insights. This analysis relies on 397, 196, and 701 protein structures from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, respectively, and the conclusions are consistent across all three species. A threshold of 25% relative accessible surface area best segregates amino acids at the interior and at the surface. This value is further used to extend the core-rim model of protein-protein interfaces and to introduce a third region called support. Interface core, rim, and support regions contain similar numbers of residues on average, but core residues contribute over two-thirds of the contact surface. The amino acid composition of each region remains similar across different organisms and interface types. The interface core composition is intermediate between the surface and the interior, but the compositions of the support and the rim are virtually identical with those of the interior and the surface, respectively. The support and rim could thus "preexist" in proteins, and evolving a new interaction could require mutations to form an interface core only. Using the interface regions defined, it is shown through simulations that only two substitutions are necessary to shift the average composition of a 1000-Å(2) surface patch involving ∼28 residues to that of an equivalent interface. This analysis and conclusions will help understand the notion of promiscuity in protein-protein interaction networks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Substitution
  • Amino Acids / analysis
  • Escherichia coli Proteins / chemistry
  • Escherichia coli Proteins / genetics
  • Evolution, Molecular*
  • Humans
  • Models, Molecular
  • Protein Conformation
  • Protein Interaction Domains and Motifs / genetics*
  • Proteins / chemistry*
  • Proteins / genetics*
  • Saccharomyces cerevisiae Proteins / chemistry
  • Saccharomyces cerevisiae Proteins / genetics
  • Species Specificity


  • Amino Acids
  • Escherichia coli Proteins
  • Proteins
  • Saccharomyces cerevisiae Proteins