Identification of key player genes in gene regulatory networks

BMC Syst Biol. 2016 Sep 6;10(1):88. doi: 10.1186/s12918-016-0329-5.

Abstract

Background: Identifying the gene regulatory networks governing the workings and identity of cells is one of the main challenges in understanding processes such as cellular differentiation, reprogramming or cancerogenesis. One particular challenge is to identify the main drivers and master regulatory genes that control such cell fate transitions. In this work, we reformulate this problem as the optimization problems of computing a Minimum Dominating Set and a Minimum Connected Dominating Set for directed graphs.

Results: Both MDS and MCDS are applied to the well-studied gene regulatory networks of the model organisms E. coli and S. cerevisiae and to a pluripotency network for mouse embryonic stem cells. The results show that MCDS can capture most of the known key player genes identified so far in the model organisms. Moreover, this method suggests an additional small set of transcription factors as novel key players for governing the cell-specific gene regulatory network which can also be investigated with regard to diseases. To this aim, we investigated the ability of MCDS to define key drivers in breast cancer. The method identified many known drug targets as members of the MDS and MCDS.

Conclusions: This paper proposes a new method to identify key player genes in gene regulatory networks. The Java implementation of the heuristic algorithm explained in this paper is available as a Cytoscape plugin at http://apps.cytoscape.org/apps/mcds . The SageMath programs for solving integer linear programming formulations used in the paper are available at https://github.com/maryamNazarieh/KeyRegulatoryGenes and as supplementary material.

Keywords: Gene regulatory network; Heuristic algorithm; Integer linear programming; Minimum connected dominating set; Minimum dominating set.

MeSH terms

  • Animals
  • Breast Neoplasms / genetics
  • Cell Cycle / genetics
  • Escherichia coli / cytology
  • Escherichia coli / genetics
  • Gene Regulatory Networks*
  • Heuristics
  • Humans
  • Mice
  • Saccharomyces cerevisiae / cytology
  • Saccharomyces cerevisiae / genetics
  • Software
  • Systems Biology / methods*