scRMD: imputation for single cell RNA-seq data via robust matrix decomposition

Bioinformatics. 2020 May 1;36(10):3156-3161. doi: 10.1093/bioinformatics/btaa139.

Abstract

Motivation: Single cell RNA-sequencing (scRNA-seq) technology enables whole transcriptome profiling at single cell resolution and holds great promises in many biological and medical applications. Nevertheless, scRNA-seq often fails to capture expressed genes, leading to the prominent dropout problem. These dropouts cause many problems in down-stream analysis, such as significant increase of noises, power loss in differential expression analysis and obscuring of gene-to-gene or cell-to-cell relationship. Imputation of these dropout values can be beneficial in scRNA-seq data analysis.

Results: In this article, we model the dropout imputation problem as robust matrix decomposition. This model has minimal assumptions and allows us to develop a computational efficient imputation method called scRMD. Extensive data analysis shows that scRMD can accurately recover the dropout values and help to improve downstream analysis such as differential expression analysis and clustering analysis.

Availability and implementation: The R package scRMD is available at https://github.com/XiDsLab/scRMD.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Exome Sequencing
  • Gene Expression Profiling
  • RNA-Seq*
  • Sequence Analysis, RNA
  • Single-Cell Analysis
  • Software*