F1000Res. 2018 Aug 14;7:1286.
doi: 10.12688/f1000research.15845.1. eCollection 2018.

Recursive module extraction using Louvain and PageRank

Dimitri Perrin et al. F1000Res. 2018.

Abstract

Biological networks are highly modular and contain a large number of clusters, which are often associated with a specific biological function or disease. Identifying these clusters, or modules, is therefore valuable, but it is not trivial. In this article we propose a recursive method based on the Louvain algorithm for community detection and the PageRank algorithm for authoritativeness weighting in networks. PageRank is used to initialise the weights of nodes in the biological network; the Louvain algorithm with the Newman-Girvan criterion for modularity is then applied to the network to identify modules. Any identified module with more than k nodes is further processed by recursively applying PageRank and Louvain, until no module contains more than k nodes (where k is a parameter of the method, no greater than 100). This method is evaluated on a heterogeneous set of six biological networks from the Disease Module Identification DREAM Challenge. Empirical findings suggest that the method is effective in identifying a large number of significant modules, although with substantial variability across restarts of the method.
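The recursive scheme described in the abstract can be sketched in a few dozen lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not say how the PageRank scores enter the Louvain step, so the sketch assumes each edge weight is scaled by the mean PageRank of its two endpoints. To stay dependency-free it also replaces full Louvain with its first (local-moving) phase only, a greedy optimisation of Newman-Girvan modularity without the graph-aggregation phase. The names `build_graph`, `pagerank`, `local_moving` and `recursive_modules` are hypothetical.

```python
from collections import defaultdict


def build_graph(edges):
    """Undirected weighted adjacency map {u: {v: weight}} from (u, v, w) triples."""
    adj = defaultdict(dict)
    for u, v, w in edges:
        adj[u][v] = w
        adj[v][u] = w
    return dict(adj)


def pagerank(adj, damping=0.85, iters=100):
    """Power-iteration PageRank; scores sum to 1."""
    n = len(adj)
    rank = {u: 1.0 / n for u in adj}
    strength = {u: sum(nb.values()) for u, nb in adj.items()}
    for _ in range(iters):
        # Rank held by nodes without edges is spread uniformly.
        dangling = sum(rank[u] for u in adj if strength[u] == 0)
        new = {u: (1 - damping) / n + damping * dangling / n for u in adj}
        for u, nb in adj.items():
            for v, w in nb.items():
                new[v] += damping * rank[u] * w / strength[u]
        rank = new
    return rank


def local_moving(adj):
    """Local-moving phase of Louvain only (no aggregation): greedily move each
    node to the neighbouring community with the largest modularity gain."""
    two_m = sum(sum(nb.values()) for nb in adj.values())
    if two_m == 0:
        return [{u} for u in adj]
    comm = {u: u for u in adj}                      # community label per node
    deg = {u: sum(nb.values()) for u, nb in adj.items()}
    tot = dict(deg)                                 # total degree per community
    improved = True
    while improved:
        improved = False
        for u in adj:
            old = comm[u]
            tot[old] -= deg[u]                      # take u out of its community
            links = defaultdict(float)              # weight from u to each community
            for v, w in adj[u].items():
                if v != u:
                    links[comm[v]] += w
            best = old
            best_gain = links.get(old, 0.0) - tot[old] * deg[u] / two_m
            for c, k_in in links.items():
                gain = k_in - tot[c] * deg[u] / two_m
                if gain > best_gain + 1e-12:
                    best, best_gain = c, gain
            comm[u] = best
            tot[best] += deg[u]
            improved = improved or best != old
    groups = defaultdict(set)
    for u, c in comm.items():
        groups[c].add(u)
    return list(groups.values())


def recursive_modules(adj, k):
    """Split the network, then re-split every module with more than k nodes."""
    rank = pagerank(adj)
    # ASSUMPTION (not from the source): scale each edge by the mean PageRank
    # of its endpoints before community detection.
    weighted = {u: {v: w * (rank[u] + rank[v]) / 2 for v, w in nb.items()}
                for u, nb in adj.items()}
    parts = local_moving(weighted)
    if len(parts) == 1:
        return parts        # no further split found; stop instead of looping
    modules = []
    for part in parts:
        if len(part) > k:
            sub = {u: {v: w for v, w in adj[u].items() if v in part} for u in part}
            modules.extend(recursive_modules(sub, k))
        else:
            modules.append(part)
    return modules


# Toy network: two 4-node cliques joined by a single bridge edge.
clique_a = [(u, v, 1.0) for u in range(4) for v in range(u + 1, 4)]
clique_b = [(u, v, 1.0) for u in range(4, 8) for v in range(u + 1, 8)]
graph = build_graph(clique_a + [(3, 4, 1.0)] + clique_b)
modules = recursive_modules(graph, k=4)
print(sorted(sorted(m) for m in modules))
```

The sketch visits nodes in a fixed order, so it is deterministic; Louvain implementations usually randomise that order, which is one plausible source of the restart variability the abstract reports. The guard that returns an unsplit module is a design choice of this sketch: it prevents infinite recursion but can leave a module larger than k. In practice, a complete Louvain implementation such as `louvain_communities` in NetworkX could stand in for `local_moving`.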

Keywords: Community detection; DREAM challenge; Module identification; Network biology.

Conflict of interest statement

No competing interests were disclosed.

Figures

Figure 1. Conversion of a directed network into an undirected one.
Figure 2. Overall algorithm.
Figure 3. Results on each network as a function of the value for k.
White and red dots represent the median and mean values for each configuration, respectively. The blue line indicates our performance in the challenge leaderboard for that network, and the red line that of the best submission for that network.

Grants and funding

The author(s) declared that no grants were involved in supporting this work.