A genome-wide sequence-structure analysis suggests aggregation gatekeepers constitute an evolutionary constrained functional class

J Mol Biol. 2014 Jun 12;426(12):2405-12. doi: 10.1016/j.jmb.2014.04.007. Epub 2014 Apr 13.


Protein aggregation is geared by aggregation-prone regions that self-associate by β-strand interactions. Charged residues and prolines are enriched at the flanks of aggregation-prone regions resulting in decreased aggregation. It is still unclear what drives the overrepresentation of these "aggregation gatekeepers", that is, whether their presence results from structural constraints determining protein stability or whether they constitute a bona fide functional class selectively maintained to control protein aggregation. As functional residues are typically conserved regardless of their cost to protein stability, we compared sequence conservation and thermodynamic cost of these residues in 2659 protein families in Escherichia coli. Across protein families, we find gatekeepers to be under strong selective conservation while at the same time representing a significant thermodynamic cost to protein structure. This finding supports the notion that aggregation gatekeepers are not structurally determined but evolutionary selected to control protein aggregation.

Keywords: evolution; gatekeepers; protein aggregation; thermodynamic stability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Conserved Sequence
  • Escherichia coli / chemistry*
  • Escherichia coli / genetics
  • Escherichia coli Proteins / chemistry*
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / metabolism
  • Protein Binding
  • Protein Conformation
  • Protein Denaturation*
  • Protein Multimerization*
  • Protein Stability*
  • Thermodynamics


  • Escherichia coli Proteins