Deep-joint-learning analysis model of single cell transcriptome and open chromatin accessibility data

Brief Bioinform. 2021 Jul 20;22(4):bbaa287. doi: 10.1093/bib/bbaa287.

Abstract

Simultaneous profiling transcriptomic and chromatin accessibility information in the same individual cells offers an unprecedented resolution to understand cell states. However, computationally effective methods for the integration of these inherent sparse and heterogeneous data are lacking. Here, we present a single-cell multimodal variational autoencoder model, which combines three types of joint-learning strategies with a probabilistic Gaussian Mixture Model to learn the joint latent features that accurately represent these multilayer profiles. Studies on both simulated datasets and real datasets demonstrate that it has more preferable capability (i) dissecting cellular heterogeneity in the joint-learning space, (ii) denoising and imputing data and (iii) constructing the association between multilayer omics data, which can be used for understanding transcriptional regulatory mechanisms.

Keywords: data integration; deep joint-learning model; multimodal variational autoencoder; single-cell multiple omics data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromatin / genetics
  • Chromatin / metabolism*
  • Databases, Factual*
  • Deep Learning*
  • Humans
  • K562 Cells
  • Models, Biological*
  • Single-Cell Analysis*
  • Transcriptome*

Substances

  • Chromatin