An integer programming framework for inferring disease complexes from network data

Bioinformatics. 2016 Jun 15;32(12):i271-i277. doi: 10.1093/bioinformatics/btw263.

Abstract

Motivation: Unraveling the molecular mechanisms that underlie disease calls for methods that go beyond the identification of single causal genes to inferring larger protein assemblies that take part in the disease process.

Results: Here, we develop an exact, integer-programming-based method for associating protein complexes with disease. Our approach scores proteins based on their proximity in a protein-protein interaction network to a prior set that is known to be relevant for the studied disease. These scores are combined with interaction information to infer densely interacting protein complexes that are potentially disease-associated. We show that our method outperforms previous ones and leads to predictions that are well supported by current experimental data and literature knowledge.

Availability and implementation: The datasets we used, the executables and the results are available at www.cs.tau.ac.il/roded/disease_complexes.zip

Contact: roded@post.tau.ac.il.

MeSH terms

  • Algorithms
  • Protein Interaction Maps
  • Proteins
  • Software*

Substances

  • Proteins