Robust rank aggregation for gene list integration and meta-analysis
- PMID: 22247279
- PMCID: PMC3278763
- DOI: 10.1093/bioinformatics/btr709
Abstract
Motivation: Continued progress in technological platforms, the availability of many published experimental datasets, and the variety of statistical methods for analyzing those data make it possible to approach the same research question with several methods at once. To get the most out of these alternatives, their results need to be integrated in an unbiased manner. Prioritized gene lists are a common way of presenting results in genomic data analysis applications, so rank aggregation methods can serve as a useful and general solution to the integration task.
Results: Standard rank aggregation methods are often ill-suited for biological settings where the gene lists are inherently noisy. As a remedy, we propose a novel robust rank aggregation (RRA) method. Our method detects genes that are ranked consistently better than expected under the null hypothesis of uncorrelated inputs and assigns a significance score to each gene. The underlying probabilistic model makes the algorithm parameter-free and robust to outliers, noise and errors. Significance scores also provide a rigorous way to keep only the statistically relevant genes in the final list. These properties make our approach robust and compelling for many settings.
Availability: All the methods are implemented as a GNU R package RobustRankAggreg, freely available at the Comprehensive R Archive Network http://cran.r-project.org/.
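The scoring idea in the Results paragraph can be illustrated with a minimal Python sketch. It is not the RobustRankAggreg implementation, and the abstract does not spell out the formulas; the sketch assumes that, under the null hypothesis, each gene's normalized ranks behave like independent uniform draws on [0, 1], scores each gene by the smallest order-statistic tail probability, and applies a Bonferroni-style correction by the number of lists. The function names (`beta_scores`, `rho_score`, `aggregate`), the choice of the gene universe as the union of all lists, and the treatment of missing genes as rank 1.0 are all assumptions made for illustration.

```python
from math import comb


def beta_scores(norm_ranks):
    """Tail probabilities for the sorted normalized ranks of one gene.

    For the k-th smallest rank x among n lists, return the probability that
    at least k of n independent Uniform(0, 1) draws are <= x.
    """
    ranks = sorted(norm_ranks)
    n = len(ranks)
    scores = []
    for k, x in enumerate(ranks, start=1):
        # Binomial upper tail: P(at least k of n uniform draws fall at or below x).
        p = sum(comb(n, l) * x**l * (1 - x) ** (n - l) for l in range(k, n + 1))
        scores.append(p)
    return scores


def rho_score(norm_ranks):
    """Smallest tail probability, multiplied by n (Bonferroni-style) and capped at 1."""
    n = len(norm_ranks)
    return min(1.0, min(beta_scores(norm_ranks)) * n)


def aggregate(ranked_lists):
    """Score every gene across several ranked lists (best gene first in each list).

    Assumptions of this sketch: the gene universe is the union of all lists,
    and a gene missing from a list receives the worst normalized rank, 1.0.
    Returns (gene, score) pairs sorted from most to least significant.
    """
    universe = sorted({gene for lst in ranked_lists for gene in lst})
    size = len(universe)
    results = []
    for gene in universe:
        norm = []
        for lst in ranked_lists:
            if gene in lst:
                norm.append((lst.index(gene) + 1) / size)
            else:
                norm.append(1.0)
        results.append((gene, rho_score(norm)))
    return sorted(results, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical toy input: three rankings of the same five genes.
    toy_lists = [
        ["A", "B", "C", "D", "E"],
        ["A", "C", "B", "E", "D"],
        ["B", "A", "D", "C", "E"],
    ]
    for gene, score in aggregate(toy_lists):
        print(gene, round(score, 4))
```

A small score means the gene sits near the top of more lists than chance alone would explain. Because the score behaves like a corrected p-value under the stated assumptions, a significance cutoff can be used to trim the final list, which mirrors the filtering role of the significance scores described in the Results.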
