Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 5;37(21):3964-3965.
doi: 10.1093/bioinformatics/btab420.

HGC: fast hierarchical clustering for large-scale single-cell data

Affiliations

HGC: fast hierarchical clustering for large-scale single-cell data

Ziheng Zou et al. Bioinformatics. .

Abstract

Summary: Clustering is a key step in revealing heterogeneities in single-cell data. Most existing single-cell clustering methods output a fixed number of clusters without the hierarchical information. Classical hierarchical clustering (HC) provides dendrograms of cells, but cannot scale to large datasets due to high computational complexity. We present HGC, a fast Hierarchical Graph-based Clustering tool to address both problems. It combines the advantages of graph-based clustering and HC. On the shared nearest-neighbor graph of cells, HGC constructs the hierarchical tree with linear time complexity. Experiments showed that HGC enables multiresolution exploration of the biological hierarchy underlying the data, achieves state-of-the-art accuracy on benchmark data and can scale to large datasets.

Availability and implementation: The R package of HGC is available at https://bioconductor.org/packages/HGC/.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Similar articles

Cited by

Publication types

LinkOut - more resources