Inferring single cell expression profiles from overlapped pooling sequencing data with compressed sensing strategy

Nucleic Acids Res. 2021 Aug 20;49(14):7995-8006. doi: 10.1093/nar/gkab581.

Abstract

Though single cell RNA sequencing (scRNA-seq) technologies have been well developed, the acquisition of large-scale single cell expression data may still lead to high costs. Single cell expression profile has its inherent sparse properties, which makes it compressible, thus providing opportunities for solutions. Here, by computational simulation as well as experiment of 54 single cells, we propose that expression profiles can be compressed from the dimension of samples by overlapped assigning each cell into plenty of pools. And we prove that expression profiles can be inferred from these pool expression data with overlapped pooling design and compressed sensing strategy. We also show that by combining this approach with plate-based scRNA-seq measurement, it can maintain its superiorities in gene detection sensitivity and individual identity and recover the expression profile with high precision, while saving about half of the library cost. This method can inspire novel conceptions on the measurement, storage or computation improvements for other compressible signals in many biological areas.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Computer Simulation*
  • Databases, Genetic / statistics & numerical data
  • Gene Expression Profiling / methods*
  • Gene Library
  • Humans
  • Models, Theoretical*
  • Reproducibility of Results
  • Sequence Analysis, RNA / methods*
  • Single-Cell Analysis / methods*