UMAP as a Dimensionality Reduction Tool for Molecular Dynamics Simulations of Biomacromolecules: A Comparison Study
- PMID: 33973773
- PMCID: PMC8356557
- DOI: 10.1021/acs.jpcb.1c02081
Abstract
Proteins are the molecular machines of life. The multitude of possible conformations that proteins can adopt determines their free-energy landscapes. However, the inherently high dimensionality of a protein free-energy landscape poses a challenge to deciphering how proteins perform their functions. For this reason, dimensionality reduction is an active field of research for molecular biologists. The uniform manifold approximation and projection (UMAP) is a dimensionality reduction method based on a fuzzy topological analysis of data. In the present study, the performance of UMAP is compared with that of other popular dimensionality reduction methods such as t-distributed stochastic neighbor embedding (t-SNE), principal component analysis (PCA), and time-structure independent components analysis (tICA) in the context of analyzing molecular dynamics simulations of the circadian clock protein VIVID. A good dimensionality reduction method should accurately represent the data structure on the projected components. The comparison of the raw high-dimensional data with the projections obtained using different dimensionality reduction methods based on various metrics showed that UMAP has superior performance when compared with linear reduction methods (PCA and tICA) and has competitive performance and scalable computational cost.
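The abstract's criterion, that a projection should represent the structure of the raw high-dimensional data, can be illustrated with a simple neighbour-preservation score. The sketch below is only an illustration under stated assumptions: the abstract does not name the metrics used in the study, so the score, the function names `knn_indices` and `neighbour_preservation`, and the synthetic "frames" are all hypothetical stand-ins. The two toy projections are not UMAP, t-SNE, PCA, or tICA; in practice the output of any of those methods would take their place.

```python
import math
import random


def knn_indices(points, k):
    """Return, for each point, the set of indices of its k nearest neighbours."""
    neighbours = []
    for i, p in enumerate(points):
        # Euclidean distance from point i to every other point.
        dists = [(math.dist(p, q), j) for j, q in enumerate(points) if j != i]
        dists.sort()
        neighbours.append({j for _, j in dists[:k]})
    return neighbours


def neighbour_preservation(high_dim, low_dim, k=5):
    """Mean fraction of each point's k nearest neighbours kept by the projection."""
    high_nn = knn_indices(high_dim, k)
    low_nn = knn_indices(low_dim, k)
    overlaps = [len(h & l) / k for h, l in zip(high_nn, low_nn)]
    return sum(overlaps) / len(overlaps)


# Synthetic stand-in for featurized MD frames: two well-separated "states"
# in 10 dimensions (hypothetical data, not from the study).
rng = random.Random(0)
frames = [[rng.gauss(0.0, 1.0) for _ in range(10)] for _ in range(40)]
frames += [[rng.gauss(8.0, 1.0) for _ in range(10)] for _ in range(40)]

# Two toy "projections": an exact copy of the data (best case) and an
# unrelated random 2-D scatter (worst case).
identity_projection = [list(f) for f in frames]
random_projection = [[rng.random(), rng.random()] for _ in frames]

score_identity = neighbour_preservation(frames, identity_projection)
score_random = neighbour_preservation(frames, random_projection)
print(f"identity: {score_identity:.2f}, random: {score_random:.2f}")
```

A score near 1 means the projection keeps each frame's local neighbourhood; a score near 0 means neighbourhoods are scrambled. The brute-force neighbour search is quadratic in the number of frames, so a real trajectory would call for a library implementation rather than this loop.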
Similar articles
- Dimensionality reduction by UMAP reinforces sample heterogeneity analysis in bulk transcriptomic data. Cell Rep. 2021 Jul 27;36(4):109442. doi: 10.1016/j.celrep.2021.109442. PMID: 34320340
- Evaluation of Distance Metrics and Spatial Autocorrelation in Uniform Manifold Approximation and Projection Applied to Mass Spectrometry Imaging Data. Anal Chem. 2019 May 7;91(9):5706-5714. doi: 10.1021/acs.analchem.8b05827. Epub 2019 Apr 25. PMID: 30986042
- Protein folding intermediates on the dimensionality reduced landscape with UMAP and native contact likelihood. J Chem Phys. 2022 Aug 21;157(7):075101. doi: 10.1063/5.0099094. PMID: 35987583
- Neural manifold analysis of brain circuit dynamics in health and disease. J Comput Neurosci. 2023 Feb;51(1):1-21. doi: 10.1007/s10827-022-00839-3. Epub 2022 Dec 16. PMID: 36522604. Free PMC article. Review.
- Supervised application of internal validation measures to benchmark dimensionality reduction methods in scRNA-seq data. Brief Bioinform. 2021 Nov 5;22(6):bbab304. doi: 10.1093/bib/bbab304. PMID: 34374742. Review.
Cited by
- Hybrid whale algorithm with evolutionary strategies and filtering for high-dimensional optimization: Application to microarray cancer data. PLoS One. 2024 Mar 11;19(3):e0295643. doi: 10.1371/journal.pone.0295643. eCollection 2024. PMID: 38466740. Free PMC article.
- Comparative Analysis of Conformational Dynamics and Systematic Characterization of Cryptic Pockets in the SARS-CoV-2 Omicron BA.2, BA.2.75 and XBB.1 Spike Complexes with the ACE2 Host Receptor: Confluence of Binding and Structural Plasticity in Mediating Networks of Conserved Allosteric Sites. Viruses. 2023 Oct 10;15(10):2073. doi: 10.3390/v15102073. PMID: 37896850. Free PMC article.
- Clustering molecular dynamics conformations of the CC'-loop of the PD-1 immuno-checkpoint receptor. Comput Struct Biotechnol J. 2023 Jul 13;21:3920-3932. doi: 10.1016/j.csbj.2023.07.004. eCollection 2023. PMID: 37602229. Free PMC article.
- Functional Protein Dynamics in a Crystal. bioRxiv [Preprint]. 2024 Jan 30:2023.07.06.548023. doi: 10.1101/2023.07.06.548023. PMID: 37461732. Free PMC article. Preprint.
- GPX4 is a key ferroptosis biomarker and correlated with immune cell populations and immune checkpoints in childhood sepsis. Sci Rep. 2023 Jul 13;13(1):11358. doi: 10.1038/s41598-023-32992-9. PMID: 37443372. Free PMC article.
