SSBER: removing batch effect for single-cell RNA sequencing data

BMC Bioinformatics. 2021 May 14;22(1):249. doi: 10.1186/s12859-021-04165-w.

Abstract

Background: With the continuous maturity of sequencing technology, different laboratories or different sequencing platforms have generated a large amount of single-cell transcriptome sequencing data for the same or different tissues. Due to batch effects and high dimensions of scRNA data, downstream analysis often faces challenges. Although a number of algorithms and tools have been proposed for removing batch effects, the current mainstream algorithms have faced the problem of data overcorrection when the cell type composition varies greatly between batches.

Results: In this paper, we propose a novel method named SSBER by utilizing biological prior knowledge to guide the correction, aiming to solve the problem of poor batch-effect correction when the cell type composition differs greatly between batches.

Conclusions: SSBER effectively solves the above problems and outperforms other algorithms when the cell type structure among batches or distribution of cell population varies considerably, or some similar cell types exist across batches.

Keywords: Batch effect; Data integration; Supervised cell type assignment; The shared cell type.

MeSH terms

  • Algorithms*
  • Exome Sequencing
  • RNA* / genetics
  • Sequence Analysis, RNA
  • Single-Cell Analysis
  • Transcriptome

Substances

  • RNA