genipe: an automated genome-wide imputation pipeline with automatic reporting and statistical tools

Bioinformatics. 2016 Dec 1;32(23):3661-3663. doi: 10.1093/bioinformatics/btw487. Epub 2016 Aug 6.


Genotype imputation is now commonly performed following genome-wide genotyping experiments. Imputation increases the density of analyzed genotypes in the dataset, enabling fine-mapping across the genome. However, the process of imputation using the most recent publicly available reference datasets can require considerable computation power and the management of hundreds of large intermediate files. We have developed genipe, a complete genome-wide imputation pipeline which includes automatic reporting, imputed data indexing and management, and a suite of statistical tests for imputed data commonly used in genetic epidemiology (Sequence Kernel Association Test, Cox proportional hazards for survival analysis, and linear mixed models for repeated measurements in longitudinal studies).

Availability and implementation: The genipe package is an open source Python software and is freely available for non-commercial use (CC BY-NC 4.0) at Documentation and tutorials are available at CONTACT: or marie-pierre.dube@statgen.orgSupplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Computational Biology / methods*
  • Electronic Data Processing
  • Genome*
  • Genomics / methods*
  • Genotype
  • Humans
  • Software*