FHAST: FPGA-Based Acceleration of Bowtie in Hardware

IEEE/ACM Trans Comput Biol Bioinform. 2015 Sep-Oct;12(5):973-81. doi: 10.1109/TCBB.2015.2405333.

Abstract

While the sequencing capability of modern instruments continues to increase exponentially, the computational problem of mapping short sequenced reads to a reference genome still constitutes a bottleneck in the analysis pipeline. A variety of mapping tools (e.g., Bowtie, BWA) is available for general-purpose computer architectures. These tools can take many hours or even days to deliver mapping results, depending on the number of input reads, the size of the reference genome and the number of allowed mismatches or insertion/deletions, making the mapping problem an ideal candidate for hardware acceleration. In this paper, we present FHAST (FPGA hardware accelerated sequence-matching tool), a drop-in replacement for Bowtie that uses a hardware design based on field programmable gate arrays (FPGA). Our architecture masks memory latency by executing multiple concurrent hardware threads accessing memory simultaneously. FHAST is composed by multiple parallel engines to exploit the parallelism available to us on an FPGA. We have implemented and tested FHAST on the Convey HC-1 and later ported on the Convey HC-2ex, taking advantage of the large memory bandwidth available to these systems and the shared memory image between hardware and software. A preliminary version of FHAST running on the Convey HC-1 achieved up to 70x speedup compared to Bowtie (single-threaded). An improved version of FHAST running on the Convey HC-2ex FPGAs achieved up to 12x fold speed gain compared to Bowtie running eight threads on an eight-core conventional architecture, while maintaining almost identical mapping accuracy. FHAST is a drop-in replacement for Bowtie, so it can be incorporated in any analysis pipeline that uses Bowtie (e.g., TopHat).

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Chromosome Mapping / instrumentation*
  • Chromosome Mapping / methods
  • DNA / genetics*
  • Equipment Design
  • Equipment Failure Analysis
  • High-Throughput Nucleotide Sequencing / instrumentation*
  • High-Throughput Nucleotide Sequencing / methods
  • Sequence Analysis, DNA / instrumentation*
  • Sequence Analysis, DNA / methods
  • Signal Processing, Computer-Assisted / instrumentation*
  • Software*

Substances

  • DNA