CITEMOXMBD: A flexible single-cell multimodal omics analysis framework to reveal the heterogeneity of immune cells

RNA Biol. 2022 Jan;19(1):290-304. doi: 10.1080/15476286.2022.2027151.

Abstract

Simultaneous measurement of multiple modalities in single-cell analysis, represented by CITE-seq, is a promising approach to link transcriptional changes to cellular phenotype and function, requiring new computational methods to define cellular subtypes and states based on multiple data types. Here, we design a flexible single-cell multimodal analysis framework, called CITEMO, to integrate the transcriptome and antibody-derived tags (ADT) data to capture cell heterogeneity from the multi omics perspective. CITEMO uses Principal Component Analysis (PCA) to obtain a low-dimensional representation of the transcriptome and ADT, respectively, and then employs PCA again to integrate these low-dimensional multimodal data for downstream analysis. To investigate the effectiveness of the CITEMO framework, we apply CITEMO to analyse the cell subtypes of Cord Blood Mononuclear Cells (CBMC) samples. Results show that the CITEMO framework can comprehensively analyse single-cell multimodal samples and accurately identify cell subtypes. Besides, we find some specific immune cells that co-express multiple ADT markers. To better describe the co-expression phenomenon, we introduce the co-expression entropy to measure the heterogeneous distribution of the ADT combinations. To further validate the robustness of the CITEMO framework, we analyse Human Bone Marrow Cell (HBMC) samples and identify different states of the same cell type. CITEMO has an excellent performance in identifying cell subtypes and states for multimodal omics data. We suggest that the flexible design idea of CITEMO can be an inspiration for other single-cell multimodal tasks. The complete source code and dataset of the CITEMO framework can be obtained from https://github.com/studentiz/CITEMO.

Keywords: CITE-seq; data integration; immune system; multi-omics; single-cell.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Lineage / genetics
  • Computational Biology / methods*
  • Gene Expression Regulation
  • Genetic Heterogeneity*
  • Genomics / methods
  • Humans
  • Immune System / cytology*
  • Immune System / immunology
  • Immune System / metabolism*
  • Single-Cell Analysis / methods*
  • Software*

Grants and funding

This work was supported by the National Natural Science Foundation of China [11874310]; National Natural Science Foundation of China [12090052]; Foundation of Education Department of Liaoning Province [LJKZ0280].