SMNN: batch effect correction for single-cell RNA-seq data via supervised mutual nearest neighbor detection

Brief Bioinform. 2021 May 20;22(3):bbaa097. doi: 10.1093/bib/bbaa097.

Abstract

Batch effect correction has been recognized to be indispensable when integrating single-cell RNA sequencing (scRNA-seq) data from multiple batches. State-of-the-art methods ignore single-cell cluster label information, but such information can improve the effectiveness of batch effect correction, particularly under realistic scenarios where biological differences are not orthogonal to batch effects. To address this issue, we propose SMNN for batch effect correction of scRNA-seq data via supervised mutual nearest neighbor detection. Our extensive evaluations in simulated and real datasets show that SMNN provides improved merging within the corresponding cell types across batches, leading to reduced differentiation across batches over MNN, Seurat v3 and LIGER. Furthermore, SMNN retains more cell-type-specific features, partially manifested by differentially expressed genes identified between cell types after SMNN correction being biologically more relevant, with precision improving by up to 841.0%.

Keywords: batch effect; single-cell RNA sequencing; supervised mutual nearest neighbor.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Databases, Nucleic Acid*
  • Humans
  • RNA-Seq*
  • Single-Cell Analysis*