scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization

Brief Bioinform. 2024 Mar 27;25(3):bbae228. doi: 10.1093/bib/bbae228.

Abstract

Motivation: The technology for analyzing single-cell multi-omics data has advanced rapidly and has provided comprehensive and accurate cellular information by exploring cell heterogeneity in genomics, transcriptomics, epigenomics, metabolomics and proteomics data. However, because of the high-dimensional and sparse characteristics of single-cell multi-omics data, as well as the limitations of various analysis algorithms, the clustering performance is generally poor. Matrix factorization is an unsupervised, dimensionality reduction-based method that can cluster individuals and discover related omics variables from different blocks. Here, we present a novel algorithm that performs joint dimensionality reduction learning and cell clustering analysis on single-cell multi-omics data using non-negative matrix factorization that we named scMNMF. We formulate the objective function of joint learning as a constrained optimization problem and derive the corresponding iterative formulas through alternating iterative algorithms. The major advantage of the scMNMF algorithm remains its capability to explore hidden related features among omics data. Additionally, the feature selection for dimensionality reduction and cell clustering mutually influence each other iteratively, leading to a more effective discovery of cell types. We validated the performance of the scMNMF algorithm using two simulated and five real datasets. The results show that scMNMF outperformed seven other state-of-the-art algorithms in various measurements.

Availability and implementation: scMNMF code can be found at https://github.com/yushanqiu/scMNMF.

Keywords: joint learning; non-negative matrix factorization; single-cell multi-omics.

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Computational Biology / methods
  • Epigenomics / methods
  • Genomics / methods
  • Humans
  • Metabolomics / methods
  • Multiomics
  • Proteomics / methods
  • Single-Cell Analysis* / methods