Machine learning for multi-omics data integration in cancer
- PMID: 35169688
- PMCID: PMC8829812
- DOI: 10.1016/j.isci.2022.103798
Machine learning for multi-omics data integration in cancer
Abstract
Multi-omics data analysis is an important aspect of cancer molecular biology studies and has led to ground-breaking discoveries. Many efforts have been made to develop machine learning methods that automatically integrate omics data. Here, we review machine learning tools categorized as either general-purpose or task-specific, covering both supervised and unsupervised learning for integrative analysis of multi-omics data. We benchmark the performance of five machine learning approaches using data from the Cancer Cell Line Encyclopedia, reporting accuracy on cancer type classification and mean absolute error on drug response prediction, and evaluating runtime efficiency. This review provides recommendations to researchers regarding suitable machine learning method selection for their specific applications. It should also promote the development of novel machine learning methodologies for data integration, which will be essential for drug discovery, clinical trial design, and personalized treatments.
Keywords: machine learning; omics; systems biology.
© 2022 The Author(s).
Conflict of interest statement
JL has received grant funding from 10.13039/100004325AstraZeneca for research unrelated to the current work.
Figures
Similar articles
-
Using machine learning approaches for multi-omics data analysis: A review.Biotechnol Adv. 2021 Jul-Aug;49:107739. doi: 10.1016/j.biotechadv.2021.107739. Epub 2021 Mar 29. Biotechnol Adv. 2021. PMID: 33794304 Review.
-
Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review.BMJ Open. 2021 Dec 6;11(12):e053674. doi: 10.1136/bmjopen-2021-053674. BMJ Open. 2021. PMID: 34873011 Free PMC article. Review.
-
A Machine Learning-Based Approach Using Multi-omics Data to Predict Metabolic Pathways.Methods Mol Biol. 2023;2553:441-452. doi: 10.1007/978-1-0716-2617-7_19. Methods Mol Biol. 2023. PMID: 36227554
-
Super.FELT: supervised feature extraction learning using triplet loss for drug response prediction with multi-omics data.BMC Bioinformatics. 2021 May 25;22(1):269. doi: 10.1186/s12859-021-04146-z. BMC Bioinformatics. 2021. PMID: 34034645 Free PMC article.
-
Deep learning assisted multi-omics integration for survival and drug-response prediction in breast cancer.BMC Genomics. 2021 Mar 24;22(1):214. doi: 10.1186/s12864-021-07524-2. BMC Genomics. 2021. PMID: 33761889 Free PMC article.
Cited by
-
Classifying breast cancer using multi-view graph neural network based on multi-omics data.Front Genet. 2024 Feb 20;15:1363896. doi: 10.3389/fgene.2024.1363896. eCollection 2024. Front Genet. 2024. PMID: 38444760 Free PMC article.
-
Performance analysis of data resampling on class imbalance and classification techniques on multi-omics data for cancer classification.PLoS One. 2024 Feb 29;19(2):e0293607. doi: 10.1371/journal.pone.0293607. eCollection 2024. PLoS One. 2024. PMID: 38422094 Free PMC article.
-
Machine Learning Methods for Gene Selection in Uveal Melanoma.Int J Mol Sci. 2024 Feb 1;25(3):1796. doi: 10.3390/ijms25031796. Int J Mol Sci. 2024. PMID: 38339073 Free PMC article.
-
Machine learning and multi-omics data in chronic lymphocytic leukemia: the future of precision medicine?Front Genet. 2024 Jan 12;14:1304661. doi: 10.3389/fgene.2023.1304661. eCollection 2023. Front Genet. 2024. PMID: 38283149 Free PMC article. Review.
-
Characterization of prevalent tyrosine kinase inhibitors and their challenges in glioblastoma treatment.Front Chem. 2024 Jan 8;11:1325214. doi: 10.3389/fchem.2023.1325214. eCollection 2023. Front Chem. 2024. PMID: 38264122 Free PMC article. Review.
References
-
- Aizerman M.A. Theoretical foundations of the potential function method in pattern recognition learning. Autom. Remote Control. 1964;25:821–837.
-
- Alcala N., Leblay N., Gabriel A.A.G., Mangiante L., Hervas D., Giffon T., Sertier A.S., Ferrari A., Derks J., Ghantous A., et al. Integrative and comparative genomic analyses identify clinically relevant pulmonary carcinoid groups and unveil the supra-carcinoids. Nat. Commun. 2019;10:3407. - PMC - PubMed
-
- Andersson R., Sandelin A. Determinants of enhancer and promoter activities of regulatory elements. Nat. Rev. Genet. 2020;21:71–87. - PubMed
-
- Andrew G., Arora R., Bilmes J., Livescu K. Deep canonical correlation analysis. Proc. 30th Int. Conf. Machine Learn. 2013;28:1247–1255.
Publication types
LinkOut - more resources
Full Text Sources
