hapbin: An Efficient Program for Performing Haplotype-Based Scans for Positive Selection in Large Genomic Datasets

Mol Biol Evol. 2015 Nov;32(11):3027-9. doi: 10.1093/molbev/msv172. Epub 2015 Aug 6.

Abstract

Understanding how the genome is shaped by selective processes forms an integral part of modern biology. However, as genomic datasets continue to grow larger it is becoming increasingly difficult to apply traditional statistics for detecting signatures of selection to these cohorts. There is therefore a pressing need for the development of the next generation of computational and analytical tools for detecting signatures of selection in large genomic datasets. Here, we present hapbin, an efficient multithreaded implementation of extended haplotype homzygosity-based statistics for detecting selection, which is up to 3,400 times faster than the current fastest implementations of these algorithms.

Keywords: EHH; XP-EHH; iHS; selection; software.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Databases, Genetic
  • Genetics, Population / methods
  • Genome
  • Genomics / methods*
  • Haplotypes
  • Humans
  • Models, Genetic*
  • Models, Statistical
  • Polymorphism, Single Nucleotide
  • Selection, Genetic
  • Software*