AsaruSim: a single-cell and spatial RNA-Seq Nanopore long-reads simulation workflow

Bioinformatics. 2025 Mar 4;41(3):btaf087. doi: 10.1093/bioinformatics/btaf087.

Abstract

Motivation: The combination of long-read sequencing technologies like Oxford Nanopore with single-cell RNA sequencing (scRNAseq) assays enables the detailed exploration of transcriptomic complexity, including isoform detection and quantification, by capturing full-length cDNAs. However, challenges remain, including the lack of advanced simulation tools that can effectively mimic the unique complexities of scRNAseq long-read datasets. Such tools are essential for the evaluation and optimization of isoform detection methods dedicated to single-cell long-read studies.

Results: We developed AsaruSim, a workflow that simulates synthetic single-cell long-read Nanopore datasets, closely mimicking real experimental data. AsaruSim employs a multi-step process that includes the creation of a synthetic count matrix, generation of perfect reads, optional PCR amplification, introduction of sequencing errors, and comprehensive quality control reporting. Applied to a dataset of human peripheral blood mononuclear cells, AsaruSim accurately reproduced experimental read characteristics.

Availability and implementation: The source code and full documentation are available at https://github.com/GenomiqueENS/AsaruSim.

MeSH terms

  • Humans
  • Leukocytes, Mononuclear / metabolism
  • Nanopore Sequencing* / methods
  • Nanopores*
  • RNA-Seq* / methods
  • Sequence Analysis, RNA* / methods
  • Single-Cell Analysis* / methods
  • Software*
  • Workflow