SCOTv2: Single-Cell Multiomic Alignment with Disproportionate Cell-Type Representation

J Comput Biol. 2022 Nov;29(11):1213-1228. doi: 10.1089/cmb.2022.0270. Epub 2022 Oct 12.

Abstract

Multiomic single-cell data allow us to perform integrated analysis to understand genomic regulation of biological processes. However, most single-cell sequencing assays are performed on separately sampled cell populations, as applying them to the same single-cell is challenging. Existing unsupervised single-cell alignment algorithms have been primarily benchmarked on coassay experiments. Our investigation revealed that these methods do not perform well for noncoassay single-cell experiments when there is disproportionate cell-type representation across measurement domains. Therefore, we extend our previous work-Single Cell alignment using Optimal Transport (SCOT)-by using unbalanced Gromov-Wasserstein optimal transport to handle disproportionate cell-type representation and differing sample sizes across single-cell measurements. Our method, SCOTv2, gives state-of-the-art alignment performance across five non-coassay data sets (simulated and real world). It can also integrate multiple (M2) single-cell measurements while preserving the self-tuning capabilities and computational tractability of its original version.

Keywords: data integration; manifold alignment; multiomics; single-cell sequencing; unbalanced optimal transport.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Genomics*