RazerS--fast read mapping with sensitivity control

Genome Res. 2009 Sep;19(9):1646-54. doi: 10.1101/gr.088823.108. Epub 2009 Jul 10.

Abstract

Second-generation sequencing technologies deliver DNA sequence data at unprecedented high throughput. Common to most biological applications is a mapping of the reads to an almost identical or highly similar reference genome. Due to the large amounts of data, efficient algorithms and implementations are crucial for this task. We present an efficient read mapping tool called RazerS. It allows the user to align sequencing reads of arbitrary length using either the Hamming distance or the edit distance. Our tool can work either lossless or with a user-defined loss rate at higher speeds. Given the loss rate, we present an approach that guarantees not to lose more reads than specified. This enables the user to adapt to the problem at hand and provides a seamless tradeoff between sensitivity and running time.

MeSH terms

  • Algorithms
  • Animals
  • Chromosome Mapping / methods*
  • Drosophila melanogaster / genetics
  • Genome, Human / genetics
  • Genome, Insect / genetics
  • Humans
  • Sensitivity and Specificity
  • Sequence Alignment
  • Sequence Analysis, DNA / methods*
  • Software*
  • Time Factors
  • User-Computer Interface