GraphAligner: rapid and versatile sequence-to-graph alignment

Genome Biol. 2020 Sep 24;21(1):253. doi: 10.1186/s13059-020-02157-2.

Abstract

Genome graphs can represent genetic variation and sequence uncertainty. Aligning sequences to genome graphs is key to many applications, including error correction, genome assembly, and genotyping of variants in a pangenome graph. Yet, so far, this step is often prohibitively slow. We present GraphAligner, a tool for aligning long reads to genome graphs. Compared to the state-of-the-art tools, GraphAligner is 13x faster and uses 3x less memory. When employing GraphAligner for error correction, we find it to be more than twice as accurate and over 12x faster than extant tools.

Availability: Package manager: https://anaconda.org/bioconda/graphaligner and source code: https://github.com/maickrau/GraphAligner.

Keywords: Error correction; Genome graphs; Long reads; Pangenome; Sequence alignment.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Genomics / methods*
  • Humans
  • Sequence Alignment / methods*
  • Software*