Over-representation of correlation analysis (ORCA): a method for identifying associations between variable sets
- PMID: 25183485
- DOI: 10.1093/bioinformatics/btu589
Over-representation of correlation analysis (ORCA): a method for identifying associations between variable sets
Abstract
Motivation: Often during the analysis of biological data, it is of importance to interpret the correlation structure that exists between variables. Such correlations may reveal patterns of co-regulation that are indicative of biochemical pathways or common mechanisms of response to a related set of treatments. However, analyses of correlations are usually conducted by either subjective interpretation of the univariate covariance matrix or by applying multivariate modeling techniques, which do not take prior biological knowledge into account. Over-representation analysis (ORA) is a simple method for objectively deciding whether a set of variables of known or suspected biological relevance, such as a gene set or pathway, is more prevalent in a set of variables of interest than we expect by chance. However, ORA is usually applied to a set of variables differentiating a single experimental variable and does not take into account correlations.
Results: Over-representation of correlation analysis (ORCA) is a novel combination of ORA and correlation analysis that provides a means to test whether more associations exist between two specific groups of variables than expected by chance. The method is exemplified by application to drug sensitivity and microRNA expression data from a panel of cancer cell lines (NCI60). ORCA highlighted a previously reported correlation between sensitivity to alkylating anticancer agents and topoisomerase inhibitors. We also used this approach to validate microRNA clusters predicted by mRNA correlations. These observations suggest that ORCA has the potential to reveal novel insights from these data, which are not readily apparent using classical ORA.
Availability and implementation: The R code of the method is available at https://github.com/ORCABioinfo/ORCAcode.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Similar articles
-
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. PMID: 26269925 Free Books & Documents. Review.
-
A global × global test for testing associations between two large sets of variables.Biom J. 2017 Jan;59(1):145-158. doi: 10.1002/bimj.201500106. Epub 2016 May 25. Biom J. 2017. PMID: 27225065
-
Identifying cancer-related microRNAs based on gene expression data.Bioinformatics. 2015 Apr 15;31(8):1226-34. doi: 10.1093/bioinformatics/btu811. Epub 2014 Dec 12. Bioinformatics. 2015. PMID: 25505085
-
Feature selection and classifier performance on diverse bio- logical datasets.BMC Bioinformatics. 2014;15 Suppl 13(Suppl 13):S4. doi: 10.1186/1471-2105-15-S13-S4. Epub 2014 Nov 13. BMC Bioinformatics. 2014. PMID: 25434802 Free PMC article.
-
Computational methods for analysis of cellular functions and pathways collectively targeted by differentially expressed microRNA.Methods. 2008 Jan;44(1):61-72. doi: 10.1016/j.ymeth.2007.10.005. Methods. 2008. PMID: 18158134 Review.
Cited by
-
A fairness assessment of mobility-based COVID-19 case prediction models.PLoS One. 2023 Oct 18;18(10):e0292090. doi: 10.1371/journal.pone.0292090. eCollection 2023. PLoS One. 2023. PMID: 37851681 Free PMC article.
-
Atherosclerotic plaque vulnerability quantification system for clinical and biological interpretability.iScience. 2023 Aug 9;26(9):107587. doi: 10.1016/j.isci.2023.107587. eCollection 2023 Sep 15. iScience. 2023. PMID: 37664595 Free PMC article.
-
Smooth muscle cell fate decisions decipher a high-resolution heterogeneity within atherosclerosis molecular subtypes.J Transl Med. 2022 Dec 6;20(1):568. doi: 10.1186/s12967-022-03795-9. J Transl Med. 2022. PMID: 36474294 Free PMC article.
-
Identification of Key Immune-Related Genes in the Progression of Septic Shock.Front Genet. 2021 Nov 3;12:668527. doi: 10.3389/fgene.2021.668527. eCollection 2021. Front Genet. 2021. PMID: 34804111 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
