The application of Uniform Manifold Approximation and Projection (UMAP) for unconstrained ordination and classification of biological indicators in aquatic ecology
- PMID: 34963591
- DOI: 10.1016/j.scitotenv.2021.152365
Abstract
The analysis of community structure in studies of freshwater ecology often requires dimensionality reduction to process multivariate data. A high number of dimensions (number of taxa/environmental parameters × number of samples), nonlinear relationships, outliers, and high variability usually hinder the visualization and interpretation of multivariate datasets. Here, we propose a new statistical design that uses Uniform Manifold Approximation and Projection (UMAP), together with community partitioning by the Louvain algorithm, to ordinate and classify the structure of aquatic biota in two-dimensional space. We demonstrate this approach on five previously published datasets for diatoms, macrophytes, chironomids (larval and subfossil), and fish. Principal Component Analysis (PCA) and Ward's clustering were also applied to compare the UMAP approach with traditional approaches to ordination and classification. The ordination of sampling sites in two-dimensional space showed a much denser and more easily interpreted grouping with UMAP than with PCA. For data with a high number of dimensions, the classification of community structure using the Louvain algorithm in UMAP ordination space showed a higher classification strength than the cluster patterns obtained with Ward's algorithm in PCA space. Environmental gradients, presented as heat maps, were overlaid on the ordination patterns of aquatic communities, confirming that the ordinations obtained by UMAP were ecologically meaningful. This is the first study to apply a UMAP approach with Louvain classification to ecological datasets. We show that its performance in representing local and global structures, together with the fact that the number of clusters is determined by the algorithm, makes this approach more powerful than traditional approaches.
Keywords: Aquatic ecology; Classification; Community structure; Dimensionality reduction; Multivariate approach; Ordination.
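The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the coordinates are made-up stand-ins for a two-dimensional UMAP embedding of eight sampling sites, the k-nearest-neighbour graph construction is one plausible way to turn an embedding into a graph, and the names `knn_graph` and `modularity` are hypothetical. Only the modularity score that the Louvain algorithm seeks to maximize is computed here, not the Louvain search itself.

```python
import math

def knn_graph(points, k):
    """Undirected edge set linking each point to its k nearest neighbours."""
    edges = set()
    for i, p in enumerate(points):
        # Distances from point i to every other point, nearest first.
        dists = sorted((math.dist(p, q), j) for j, q in enumerate(points) if j != i)
        for _, j in dists[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges

def modularity(edges, labels):
    """Newman modularity Q of a partition of an unweighted, undirected graph."""
    m = len(edges)
    intra = {}   # number of edges falling inside each community
    total = {}   # summed node degree of each community
    for u, v in edges:
        total[labels[u]] = total.get(labels[u], 0) + 1
        total[labels[v]] = total.get(labels[v], 0) + 1
        if labels[u] == labels[v]:
            intra[labels[u]] = intra.get(labels[u], 0) + 1
    return sum(intra.get(c, 0) / m - (total[c] / (2 * m)) ** 2 for c in total)

# Hypothetical 2-D embedding: two compact groups of four sites each.
embedding = [(0, 0), (0, 1), (1, 0), (1, 1),
             (10, 10), (10, 11), (11, 10), (11, 11)]
edges = knn_graph(embedding, k=2)

matching = [0, 0, 0, 0, 1, 1, 1, 1]   # partition that follows the two groups
mixed = [0, 1, 0, 1, 0, 1, 0, 1]      # partition that cuts across both groups

print("Q (matching partition):", modularity(edges, matching))
print("Q (mixed partition):   ", modularity(edges, mixed))
```

A partition that follows the visible groups scores a higher modularity than one that cuts across them. Because a Louvain-style search looks for the partition with the highest score, the number of clusters does not have to be fixed in advance.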
Copyright © 2022 Elsevier B.V. All rights reserved.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
