Probabilistic assembly of human protein interaction networks from label-free quantitative proteomics

Proc Natl Acad Sci U S A. 2008 Feb 5;105(5):1454-9. doi: 10.1073/pnas.0706983105. Epub 2008 Jan 24.

Abstract

Large-scale affinity purification and mass spectrometry studies have played important roles in the assembly and analysis of comprehensive protein interaction networks for lower eukaryotes. However, the development of such networks for human proteins has been slowed by the high cost and significant technical challenges associated with systematic studies of protein interactions. To address this challenge, we have developed a method for building local and focused networks. This approach couples vector algebra and statistical methods with normalized spectral counting (NSAF) derived from the analysis of affinity purifications via chromatography-based proteomics. After mathematical removal of contaminant proteins, the core components of multiprotein complexes are determined by singular value decomposition analysis and clustering. The probability of interactions within and between complexes is computed solely based upon NSAFs using Bayes' approach. To demonstrate the application of this method to small-scale datasets, we analyzed an expanded human TIP49a and TIP49b dataset. This dataset contained proteins affinity-purified with 27 different epitope-tagged components of the chromatin remodeling SRCAP, hINO80, and TRRAP/TIP60 complexes, and the nutrient sensing complex Uri/Prefoldin. Within a core network of 65 unique proteins, we captured all known components of these complexes and novel protein associations, especially in the Uri/Prefoldin complex. Finally, we constructed a probabilistic human interaction network composed of 557 protein pairs.

MeSH terms

  • ATPases Associated with Diverse Cellular Activities
  • Bayes Theorem
  • Carrier Proteins / metabolism
  • DNA Helicases / metabolism
  • Databases, Protein
  • Humans
  • Mathematical Computing*
  • Models, Statistical*
  • Protein Interaction Mapping*
  • Proteomics*

Substances

  • Carrier Proteins
  • ATPases Associated with Diverse Cellular Activities
  • DNA Helicases
  • RUVBL1 protein, human
  • RUVBL2 protein, human