Identifying protein complexes based on multiple topological structures in PPI networks

IEEE Trans Nanobioscience. 2013 Sep;12(3):165-72. doi: 10.1109/TNB.2013.2264097. Epub 2013 Aug 21.

Abstract

Various computational algorithms are developed to identify protein complexes based on only one of specific topological structures in protein-protein interaction (PPI) networks, such as cliques, dense subgraphs, core-attachment structures and starlike structures. However, protein complexes exhibit intricate connections in a PPI network. They cannot be fully detected by only single topological structure. In this paper, we propose an algorithm based on multiple topological structures to identify protein complexes from PPI networks. In the proposed algorithm, four single topological structure based algorithms are first employed to identify raw predictions with specific topological structures, respectively. Those raw predictions are trimmed according to their topological information or GO annotations. Similar results are carefully merged before generating final predictions. Numerical experiments are conducted on a yeast PPI network of DIP and a human PPI network of HPRD. The predicted results show that the multiple topological structure based algorithm can not only obtain a more number of predictions, but also generate results with high accuracy in terms of f-score, matching with known protein complexes and functional enrichments with GO.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Databases, Protein
  • Humans
  • Models, Statistical
  • Protein Interaction Maps*
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Yeasts / chemistry
  • Yeasts / metabolism

Substances

  • Proteins