iMAP: integration of multiple single-cell datasets by adversarial paired transfer networks

Genome Biol. 2021 Feb 18;22(1):63. doi: 10.1186/s13059-021-02280-8.

Abstract

The integration of single-cell RNA-sequencing datasets from multiple sources is critical for deciphering cell-to-cell heterogeneities and interactions in complex biological systems. We present a novel unsupervised batch effect removal framework, called iMAP, based on both deep autoencoders and generative adversarial networks. Compared with current methods, iMAP shows superior, robust, and scalable performance in terms of both reliably detecting the batch-specific cells and effectively mixing distributions of the batch-shared cell types. Applying iMAP to tumor microenvironment datasets from two platforms, Smart-seq2 and 10x Genomics, we find that iMAP can leverage the powers of both platforms to discover novel cell-cell interactions.

Keywords: Data integration; Deep learning; GAN; scRNA-seq.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Deep Learning
  • Genomics / methods*
  • Humans
  • Organ Specificity / genetics
  • RNA-Seq* / methods
  • Reproducibility of Results
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis* / methods