SCSsim: an integrated tool for simulating single-cell genome sequencing data

Bioinformatics. 2020 Feb 15;36(4):1281-1282. doi: 10.1093/bioinformatics/btz713.

Abstract

Motivation: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators are free of whole-genome amplification involved in SCS technique and therefore not suited for generating SCS datasets. We develop a new software package (SCSsim) that can efficiently simulate SCS datasets in a parallel fashion with minimal user intervention. SCSsim first constructs the genome sequence of single cell by mimicking a complement of genomic variations under user-controlled manner, and then amplifies the genome according to MALBAC technique and finally yields sequencing reads from the amplified products based on inferred sequencing profiles. Comprehensive evaluation in simulating different ADO rates, variation detection efficiency and genome coverage demonstrates that SCSsim is a very useful tool in mimicking single-cell sequencing data with high efficiency.

Availability and implementation: SCSsim is freely available at https://github.com/qasimyu/scssim.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Genomics*
  • High-Throughput Nucleotide Sequencing
  • Sequence Analysis, DNA
  • Software*
  • Whole Genome Sequencing