Benchmarking joint multi-omics dimensionality reduction approaches for the study of cancer
- PMID: 33402734
- PMCID: PMC7785750
- DOI: 10.1038/s41467-020-20430-7
Abstract
High-dimensional multi-omics data are now standard in biology. They can greatly enhance our understanding of biological systems when effectively integrated. To achieve proper integration, joint Dimensionality Reduction (jDR) methods are among the most efficient approaches. However, several jDR methods are available, which creates the need for a comprehensive benchmark with practical guidelines. We perform a systematic evaluation of nine representative jDR methods using three complementary benchmarks. First, we evaluate their performance in retrieving ground-truth sample clustering from simulated multi-omics datasets. Second, we use TCGA cancer data to assess their strengths in predicting survival, clinical annotations and known pathways/biological processes. Finally, we assess their classification of multi-omics single-cell data. From these in-depth comparisons, we observe that intNMF performs best in clustering, while MCIA behaves effectively across many contexts. The code developed for this benchmark study is implemented in a Jupyter notebook, multi-omics mix (momix), to foster reproducibility and to support users and future developers.
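The first benchmark (retrieving a ground-truth sample clustering from simulated multi-omics data) can be illustrated with a minimal sketch. This is not the momix code and not one of the nine benchmarked methods: the joint reduction here is a plain SVD of the stacked, centered omics matrices, used only as a stand-in. The simulation parameters, the function names, the deterministic k-means, and the choice of the adjusted Rand index as the agreement score are all assumptions made for illustration.

```python
import numpy as np

def simulate_omics(n_samples=60, n_clusters=3, n_features=(50, 40), noise=0.5, seed=0):
    """Simulate omics matrices (features x samples) that share one sample clustering."""
    rng = np.random.default_rng(seed)
    labels = np.repeat(np.arange(n_clusters), n_samples // n_clusters)
    omics = []
    for p in n_features:
        centers = rng.normal(0.0, 2.0, size=(p, n_clusters))  # cluster-specific means
        omics.append(centers[:, labels] + rng.normal(0.0, noise, size=(p, labels.size)))
    return omics, labels

def joint_factors(omics, k):
    """Stand-in jDR: sample scores from an SVD of the stacked, centered matrices."""
    blocks = []
    for X in omics:
        Xc = X - X.mean(axis=1, keepdims=True)   # center each feature
        blocks.append(Xc / np.linalg.norm(Xc))   # give each omics equal total weight
    _, s, vt = np.linalg.svd(np.vstack(blocks), full_matrices=False)
    return vt[:k].T * s[:k]                      # samples x k factor matrix

def kmeans(points, k, n_iter=100):
    """Lloyd's algorithm with a deterministic farthest-point initialisation."""
    centers = [points[0]]
    for _ in range(1, k):
        d = np.min([np.linalg.norm(points - c, axis=1) for c in centers], axis=0)
        centers.append(points[np.argmax(d)])
    centers = np.array(centers)
    labels = np.full(len(points), -1)
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
        for j in range(k):
            if np.any(labels == j):              # keep the old center if a cluster empties
                centers[j] = points[labels == j].mean(axis=0)
    return labels

def adjusted_rand_index(a, b):
    """Chance-corrected agreement between two partitions of the same samples."""
    _, ai = np.unique(np.asarray(a), return_inverse=True)
    _, bi = np.unique(np.asarray(b), return_inverse=True)
    table = np.zeros((ai.max() + 1, bi.max() + 1))
    np.add.at(table, (ai, bi), 1)                # contingency table
    comb2 = lambda x: x * (x - 1) / 2.0
    sum_cells = comb2(table).sum()
    sum_rows = comb2(table.sum(axis=1)).sum()
    sum_cols = comb2(table.sum(axis=0)).sum()
    expected = sum_rows * sum_cols / comb2(len(ai))
    max_index = 0.5 * (sum_rows + sum_cols)
    if max_index == expected:
        return 1.0
    return (sum_cells - expected) / (max_index - expected)

omics, truth = simulate_omics()
scores = joint_factors(omics, k=3)
predicted = kmeans(scores, k=3)
print("ARI vs ground truth:", adjusted_rand_index(truth, predicted))
```

Swapping `joint_factors` for another method's sample-factor output keeps the rest of the evaluation unchanged, which is the general shape of a clustering benchmark of this kind.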
Conflict of interest statement
The authors declare no competing interests.
Similar articles
- A benchmark study of deep learning-based multi-omics data fusion methods for cancer. Genome Biol. 2022 Aug 9;23(1):171. doi: 10.1186/s13059-022-02739-2. PMID: 35945544.
- Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration. Brief Bioinform. 2020 Dec 1;21(6):2011-2030. doi: 10.1093/bib/bbz138. PMID: 31792509.
- Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification. BMC Genomics. 2015 Dec 1;16:1022. doi: 10.1186/s12864-015-2223-8. PMID: 26626453.
- Multi-omic and multi-view clustering algorithms: review and cancer benchmark. Nucleic Acids Res. 2018 Nov 16;46(20):10546-10562. doi: 10.1093/nar/gky889. PMID: 30295871. Review.
- Integrative Multi-Omics Approaches in Cancer Research: From Biological Networks to Clinical Subtypes. Mol Cells. 2021 Jul 31;44(7):433-443. doi: 10.14348/molcells.2021.0042. PMID: 34238766. Review.
Cited by
- Web-based multi-omics integration using the Analyst software suite. Nat Protoc. 2024 Feb 14. doi: 10.1038/s41596-023-00950-4. Online ahead of print. PMID: 38355833. Review.
- Toward the novel AI tasks in infection biology. mSphere. 2024 Feb 28;9(2):e0059123. doi: 10.1128/msphere.00591-23. Epub 2024 Feb 9. PMID: 38334404. Review.
- Joint clinical and molecular subtyping of COPD with variational autoencoders. medRxiv [Preprint]. 2024 Jan 10:2023.08.19.23294298. doi: 10.1101/2023.08.19.23294298. PMID: 38260473. Preprint.
- Genomic data integration tutorial, a plant case study. BMC Genomics. 2024 Jan 17;25(1):66. doi: 10.1186/s12864-023-09833-0. PMID: 38233804.
- Challenges and best practices in omics benchmarking. Nat Rev Genet. 2024 Jan 12. doi: 10.1038/s41576-023-00679-6. Online ahead of print. PMID: 38216661. Review.
