ViralConsensus: a fast and memory-efficient tool for calling viral consensus genome sequences directly from read alignment data

Bioinformatics. 2023 May 4;39(5):btad317. doi: 10.1093/bioinformatics/btad317.

Abstract

Motivation: In viral molecular epidemiology, reconstruction of consensus genomes from sequence data is critical for tracking mutations and variants of concern. However, as the number of samples that are sequenced grows rapidly, compute resources needed to reconstruct consensus genomes can become prohibitively large.

Results: ViralConsensus is a fast and memory-efficient tool for calling viral consensus genome sequences directly from read alignment data. ViralConsensus is orders of magnitude faster and more memory-efficient than existing methods. Further, unlike existing methods, ViralConsensus can pipe data directly from a read mapper via standard input and performs viral consensus calling on-the-fly, making it an ideal tool for viral sequencing pipelines.

Availability and implementation: ViralConsensus is freely available at https://github.com/niemasd/ViralConsensus as an open-source software project.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Consensus
  • Genome, Viral
  • High-Throughput Nucleotide Sequencing*
  • Sequence Analysis, DNA / methods
  • Software*