Dimensionality reduction by UMAP reinforces sample heterogeneity analysis in bulk transcriptomic data
- PMID: 34320340
- DOI: 10.1016/j.celrep.2021.109442
Dimensionality reduction by UMAP reinforces sample heterogeneity analysis in bulk transcriptomic data
Abstract
Transcriptomic analysis plays a key role in biomedical research. Linear dimensionality reduction methods, especially principal-component analysis (PCA), are widely used in detecting sample-to-sample heterogeneity, while recently developed non-linear methods, such as t-distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP), can efficiently cluster heterogeneous samples in single-cell RNA sequencing analysis. Yet, the application of t-SNE and UMAP in bulk transcriptomic analysis and comparison with conventional methods have not been achieved. We compare four major dimensionality reduction methods (PCA, multidimensional scaling [MDS], t-SNE, and UMAP) in analyzing 71 large bulk transcriptomic datasets. UMAP is superior to PCA and MDS but shows some advantages over t-SNE in differentiating batch effects, identifying pre-defined biological groups, and revealing in-depth clusters in two-dimensional space. Importantly, UMAP generates sample clusters uncovering biological features and clinical meaning. We recommend deploying UMAP in visualizing and analyzing sizable bulk transcriptomic datasets to reinforce sample heterogeneity analysis.
Keywords: PCA; UMAP; bulk transcriptomics; clustering structure; dimensionality reduction; heterogeneity analysis; t-SNE.
Copyright © 2021 The Author(s). Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Declaration of interests The authors declare no competing interests.
Similar articles
-
A cross entropy test allows quantitative statistical comparison of t-SNE and UMAP representations.Cell Rep Methods. 2023 Jan 13;3(1):100390. doi: 10.1016/j.crmeth.2022.100390. eCollection 2023 Jan 23. Cell Rep Methods. 2023. PMID: 36814837 Free PMC article.
-
Evaluation of Distance Metrics and Spatial Autocorrelation in Uniform Manifold Approximation and Projection Applied to Mass Spectrometry Imaging Data.Anal Chem. 2019 May 7;91(9):5706-5714. doi: 10.1021/acs.analchem.8b05827. Epub 2019 Apr 25. Anal Chem. 2019. PMID: 30986042
-
Capturing discrete latent structures: choose LDs over PCs.Biostatistics. 2022 Dec 12;24(1):1-16. doi: 10.1093/biostatistics/kxab030. Biostatistics. 2022. PMID: 34467372 Free PMC article.
-
Computational solutions for spatial transcriptomics.Comput Struct Biotechnol J. 2022 Sep 1;20:4870-4884. doi: 10.1016/j.csbj.2022.08.043. eCollection 2022. Comput Struct Biotechnol J. 2022. PMID: 36147664 Free PMC article. Review.
-
Neural manifold analysis of brain circuit dynamics in health and disease.J Comput Neurosci. 2023 Feb;51(1):1-21. doi: 10.1007/s10827-022-00839-3. Epub 2022 Dec 16. J Comput Neurosci. 2023. PMID: 36522604 Free PMC article. Review.
Cited by
-
Dietary patterns associated with the incidence of hypertension among adult Japanese males: application of machine learning to a cohort study.Eur J Nutr. 2024 Feb 25. doi: 10.1007/s00394-024-03342-w. Online ahead of print. Eur J Nutr. 2024. PMID: 38403812
-
A supervised learning method for classifying methylation disorders.BMC Bioinformatics. 2024 Feb 12;25(1):66. doi: 10.1186/s12859-024-05673-1. BMC Bioinformatics. 2024. PMID: 38347515 Free PMC article.
-
Computational inference of eIF4F complex function and structure in human cancers.Proc Natl Acad Sci U S A. 2024 Jan 30;121(5):e2313589121. doi: 10.1073/pnas.2313589121. Epub 2024 Jan 24. Proc Natl Acad Sci U S A. 2024. PMID: 38266053 Free PMC article.
-
Identification and Preliminary Clinical Validation of Key Extracellular Proteins as the Potential Biomarkers in Hashimoto's Thyroiditis by Comprehensive Analysis.Biomedicines. 2023 Nov 24;11(12):3127. doi: 10.3390/biomedicines11123127. Biomedicines. 2023. PMID: 38137348 Free PMC article.
-
Depression, anxiety, and burnout in academia: topic modeling of PubMed abstracts.Front Res Metr Anal. 2023 Nov 27;8:1271385. doi: 10.3389/frma.2023.1271385. eCollection 2023. Front Res Metr Anal. 2023. PMID: 38090103 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous
