Genome-wide prediction of SH2 domain targets using structural information and the FoldX algorithm

PLoS Comput Biol. 2008 Apr 4;4(4):e1000052. doi: 10.1371/journal.pcbi.1000052.


Current experiments likely cover only a fraction of all protein-protein interactions. Here, we developed a method to predict SH2-mediated protein-protein interactions using the structure of SH2-phosphopeptide complexes and the FoldX algorithm. We show that our approach performs similarly to experimentally derived consensus sequences and substitution matrices at predicting known in vitro and in vivo targets of SH2 domains. We use our method to provide a set of high-confidence interactions for human SH2 domains with known structure filtered on secondary structure and phosphorylation state. We validated the predictions using literature-derived SH2 interactions and a probabilistic score obtained from a naive Bayes integration of information on coexpression, conservation of the interaction in other species, shared interaction partners, and functions. We show how our predictions lead to a new hypothesis for the role of SH2 domains in signaling.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Binding Sites
  • Chromosome Mapping / methods*
  • Molecular Sequence Data
  • Protein Binding
  • Protein Interaction Mapping / methods*
  • Sequence Analysis, Protein / methods*
  • Software*
  • src Homology Domains*
  • src-Family Kinases / chemistry*


  • src-Family Kinases