dendsort: modular leaf ordering methods for dendrogram representations in R
- PMID: 25232468
- PMCID: PMC4162509
- DOI: 10.12688/f1000research.4784.1
dendsort: modular leaf ordering methods for dendrogram representations in R
Abstract
Dendrograms are graphical representations of binary tree structures resulting from agglomerative hierarchical clustering. In Life Science, a cluster heat map is a widely accepted visualization technique that utilizes the leaf order of a dendrogram to reorder the rows and columns of the data table. The derived linear order is more meaningful than a random order, because it groups similar items together. However, two consecutive items can be quite dissimilar despite proximity in the order. In addition, there are 2 (n-1) possible orderings given n input elements as the orientation of clusters at each merge can be flipped without affecting the hierarchical structure. We present two modular leaf ordering methods to encode both the monotonic order in which clusters are merged and the nested cluster relationships more faithfully in the resulting dendrogram structure. We compare dendrogram and cluster heat map visualizations created using our heuristics to the default heuristic in R and seriation-based leaf ordering methods. We find that our methods lead to a dendrogram structure with global patterns that are easier to interpret, more legible given a limited display space, and more insightful for some cases. The implementation of methods is available as an R package, named "dendsort", from the CRAN package repository. Further examples, documentations, and the source code are available at [https://bitbucket.org/biovizleuven/dendsort/].
Conflict of interest statement
Figures
Similar articles
-
MCLEAN: Multilevel Clustering Exploration As Network.PeerJ Comput Sci. 2018 Jan 29;4:e145. doi: 10.7717/peerj-cs.145. eCollection 2018. PeerJ Comput Sci. 2018. PMID: 33816801 Free PMC article.
-
DendroX: multi-level multi-cluster selection in dendrograms.BMC Genomics. 2024 Feb 2;25(1):134. doi: 10.1186/s12864-024-10048-0. BMC Genomics. 2024. PMID: 38308243 Free PMC article.
-
How frequently do clusters occur in hierarchical clustering analysis? A graph theoretical approach to studying ties in proximity.J Cheminform. 2016 Jan 25;8:4. doi: 10.1186/s13321-016-0114-x. eCollection 2016. J Cheminform. 2016. PMID: 26816532 Free PMC article.
-
InCHlib - interactive cluster heatmap for web applications.J Cheminform. 2014 Sep 17;6(1):44. doi: 10.1186/s13321-014-0044-4. eCollection 2014 Dec. J Cheminform. 2014. PMID: 25264459 Free PMC article.
-
HCsnip: An R Package for Semi-supervised Snipping of the Hierarchical Clustering Tree.Cancer Inform. 2015 Mar 22;14:1-19. doi: 10.4137/CIN.S22080. eCollection 2015. Cancer Inform. 2015. PMID: 25861213 Free PMC article. Review.
Cited by
-
Organization of an ascending circuit that conveys flight motor state in Drosophila.Curr Biol. 2024 Mar 11;34(5):1059-1075.e5. doi: 10.1016/j.cub.2024.01.071. Epub 2024 Feb 22. Curr Biol. 2024. PMID: 38402616
-
Snowflake: visualizing microbiome abundance tables as multivariate bipartite graphs.Front Bioinform. 2024 Feb 5;4:1331043. doi: 10.3389/fbinf.2024.1331043. eCollection 2024. Front Bioinform. 2024. PMID: 38375239 Free PMC article.
-
Coordinated immune dysregulation in Juvenile Dermatomyositis revealed by single-cell genomics.bioRxiv [Preprint]. 2023 Nov 10:2023.11.07.566033. doi: 10.1101/2023.11.07.566033. bioRxiv. 2023. PMID: 37986917 Free PMC article. Preprint.
-
Proteomic analysis of peripheral nerve myelin during murine aging.Front Cell Neurosci. 2023 Oct 30;17:1214003. doi: 10.3389/fncel.2023.1214003. eCollection 2023. Front Cell Neurosci. 2023. PMID: 37964793 Free PMC article.
-
Diagnostic and commensal Staphylococcus pseudintermedius genomes reveal niche adaptation through parallel selection of defense mechanisms.Nat Commun. 2023 Nov 3;14(1):7065. doi: 10.1038/s41467-023-42694-5. Nat Commun. 2023. PMID: 37923729 Free PMC article.
References
-
- Wilkinson L, Friendly M: The History of the Cluster Heat Map. Am Stat. 2009;63(2):179–184 10.1198/tas.2009.0033 - DOI
-
- Hastie T, Tibshirani R, Friedman J: The Elements of Statistical Learning. Springer Series Statistics. 2009. 10.1007/978-0-387-84858-7 - DOI
-
- Tan P, Kumar V, Steinbach M: Introduction to data mining. Boston: Pearson Addison Wesley, 1st ed edition.2005. Reference Source
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
