Network compression as a quality measure for protein interaction networks

PLoS One. 2012;7(6):e35729. doi: 10.1371/journal.pone.0035729. Epub 2012 Jun 18.

Abstract

With the advent of large-scale protein interaction studies, there is much debate about data quality. Can different noise levels in the measurements be assessed by analyzing network structure? Because proteomic regulation is inherently co-operative, modular and redundant, it is inherently compressible when represented as a network. Here we propose that network compression can be used to compare false positive and false negative noise levels in protein interaction networks. We validate this hypothesis by first confirming the detrimental effect of false positives and false negatives. Second, we show that gold standard networks are more compressible. Third, we show that compressibility correlates with co-expression, co-localization, and shared function. Fourth, we also observe correlation with better protein tagging methods, physiological expression in contrast to over-expression of tagged proteins, and smart pooling approaches for yeast two-hybrid screens. Overall, this new measure is a proxy for both sensitivity and specificity and gives complementary information to standard measures such as average degree and clustering coefficients.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Mass Spectrometry
  • Principal Component Analysis
  • Protein Binding
  • Proteins / metabolism*

Substances

  • Proteins