Sensitive and fast mapping of di-base encoded reads

Bioinformatics. 2011 Jul 15;27(14):1915-21. doi: 10.1093/bioinformatics/btr303. Epub 2011 May 17.

Abstract

Motivation: Discovering variation among high-throughput sequenced genomes relies on efficient and effective mapping of sequence reads. The speed, sensitivity and accuracy of read mapping are crucial to determining the full spectrum of single nucleotide variants (SNVs) as well as structural variants (SVs) in the donor genomes analyzed.

Results: We present drFAST, a read mapper designed for di-base encoded 'color-space' sequences generated with the AB SOLiD platform. drFAST is specially designed for better delineation of structural variants, including segmental duplications, and is able to return all possible map locations and underlying sequence variation of short reads within a user-specified distance threshold. We show that drFAST is more sensitive in comparison to all commonly used aligners such as Bowtie, BFAST and SHRiMP. drFAST is also faster than both BFAST and SHRiMP and achieves a mapping speed comparable to Bowtie.

Availability: The source code for drFAST is available at http://drfast.sourceforge.net

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Sequence
  • Chromosome Mapping
  • Chromosomes, Human, Pair 1
  • Genetic Variation
  • Genome
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Nucleotides
  • Polymorphism, Single Nucleotide
  • Software

Substances

  • Nucleotides