Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
- PMID: 29914350
- PMCID: PMC6006753
- DOI: 10.1186/s12859-018-2217-z
Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
Abstract
Background: A fundamental fact in biology states that genes do not operate in isolation, and yet, methods that infer regulatory networks for single cell gene expression data have been slow to emerge. With single cell sequencing methods now becoming accessible, general network inference algorithms that were initially developed for data collected from bulk samples may not be suitable for single cells. Meanwhile, although methods that are specific for single cell data are now emerging, whether they have improved performance over general methods is unknown. In this study, we evaluate the applicability of five general methods and three single cell methods for inferring gene regulatory networks from both experimental single cell gene expression data and in silico simulated data.
Results: Standard evaluation metrics using ROC curves and Precision-Recall curves against reference sets sourced from the literature demonstrated that most of the methods performed poorly when they were applied to either experimental single cell data, or simulated single cell data, which demonstrates their lack of performance for this task. Using default settings, network methods were applied to the same datasets. Comparisons of the learned networks highlighted the uniqueness of some predicted edges for each method. The fact that different methods infer networks that vary substantially reflects the underlying mathematical rationale and assumptions that distinguish network methods from each other.
Conclusions: This study provides a comprehensive evaluation of network modeling algorithms applied to experimental single cell gene expression data and in silico simulated datasets where the network structure is known. Comparisons demonstrate that most of these assessed network methods are not able to predict network structures from single cell expression data accurately, even if they are specifically developed for single cell methods. Also, single cell methods, which usually depend on more elaborative algorithms, in general have less similarity to each other in the sets of edges detected. The results from this study emphasize the importance for developing more accurate optimized network modeling methods that are compatible for single cell data. Newly-developed single cell methods may uniquely capture particular features of potential gene-gene relationships, and caution should be taken when we interpret these results.
Keywords: Bayesian network; Correlation network; Gene regulatory network; Single cell genomics.
Conflict of interest statement
Ethics approval and consent to participate
No ethics approval was required for the study. All input data are publicly available through the citations supplied.
Consent for publication
Not applicable.
Competing interests
The authors declared that they have no competing interests.
Figures
Similar articles
-
MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data.BMC Syst Biol. 2018 Dec 14;12(Suppl 7):115. doi: 10.1186/s12918-018-0635-1. BMC Syst Biol. 2018. PMID: 30547796 Free PMC article.
-
A group LASSO-based method for robustly inferring gene regulatory networks from multiple time-course datasets.BMC Syst Biol. 2014;8 Suppl 3(Suppl 3):S1. doi: 10.1186/1752-0509-8-S3-S1. Epub 2014 Oct 22. BMC Syst Biol. 2014. PMID: 25350697 Free PMC article.
-
Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data.G3 (Bethesda). 2023 Mar 9;13(3):jkad004. doi: 10.1093/g3journal/jkad004. G3 (Bethesda). 2023. PMID: 36626328 Free PMC article.
-
A comprehensive survey of regulatory network inference methods using single cell RNA sequencing data.Brief Bioinform. 2021 May 20;22(3):bbaa190. doi: 10.1093/bib/bbaa190. Brief Bioinform. 2021. PMID: 34020546 Free PMC article. Review.
-
Learning Differential Module Networks Across Multiple Experimental Conditions.Methods Mol Biol. 2019;1883:303-321. doi: 10.1007/978-1-4939-8882-2_13. Methods Mol Biol. 2019. PMID: 30547406 Review.
Cited by
-
A dynamical perspective: moving towards mechanism in single-cell transcriptomics.Philos Trans R Soc Lond B Biol Sci. 2024 Apr 22;379(1900):20230049. doi: 10.1098/rstb.2023.0049. Epub 2024 Mar 4. Philos Trans R Soc Lond B Biol Sci. 2024. PMID: 38432314 Free PMC article. Review.
-
Integrating single-cell multi-omics and prior biological knowledge for a functional characterization of the immune system.Nat Immunol. 2024 Mar;25(3):405-417. doi: 10.1038/s41590-024-01768-2. Epub 2024 Feb 27. Nat Immunol. 2024. PMID: 38413722 Review.
-
Computational single cell oncology: state of the art.Front Genet. 2023 Nov 8;14:1256991. doi: 10.3389/fgene.2023.1256991. eCollection 2023. Front Genet. 2023. PMID: 38028624 Free PMC article. Review.
-
Gene Regulatory Networks in Coronary Artery Disease.Curr Atheroscler Rep. 2023 Dec;25(12):1013-1023. doi: 10.1007/s11883-023-01170-7. Epub 2023 Nov 27. Curr Atheroscler Rep. 2023. PMID: 38008808 Review.
-
MICA: a multi-omics method to predict gene regulatory networks in early human embryos.Life Sci Alliance. 2023 Oct 25;7(1):e202302415. doi: 10.26508/lsa.202302415. Print 2024 Jan. Life Sci Alliance. 2023. PMID: 37879938 Free PMC article.
References
-
- Azizi E, et al. Bayesian inference for single-cell clustering and imputing. Genomics and Computational Biology. 2017;3(1):e46. https://genomicscomputbiol.org/ojs/index.php/GCB/article/view/46.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
