Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jun 15;33(12):1867-1869.
doi: 10.1093/bioinformatics/btx057.

cyvcf2: fast, flexible variant analysis with Python

Affiliations

cyvcf2: fast, flexible variant analysis with Python

Brent S Pedersen et al. Bioinformatics. .

Abstract

Motivation: Variant call format (VCF) files document the genetic variation observed after DNA sequencing, alignment and variant calling of a sample cohort. Given the complexity of the VCF format as well as the diverse variant annotations and genotype metadata, there is a need for fast, flexible methods enabling intuitive analysis of the variant data within VCF and BCF files.

Results: We introduce cyvcf2 , a Python library and software package for fast parsing and querying of VCF and BCF files and illustrate its speed, simplicity and utility.

Contact: bpederse@gmail.com or aaronquinlan@gmail.com.

Availability and implementation: cyvcf2 is available from https://github.com/brentp/cyvcf2 under the MIT license and from common python package managers. Detailed documentation is available at http://brentp.github.io/cyvcf2/.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Behnel S. et al. (2011) Cython: the best of both worlds. Comput. Sci. Eng., 13, 31–39.
    1. Consortium G.P. et al. (2015) A global reference for human genetic variation. Nature, 526, 68–74. - PMC - PubMed
    1. Danecek P. et al. (2011) The variant call format and vcftools. Bioinformatics, 27, 2156–2158. - PMC - PubMed
    1. Li H. (2011) A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics, 27, 2987–2993. - PMC - PubMed
    1. Van Der Walt S. et al. (2011) The numpy array: a structure for efficient numerical computation. Comput. Sci. Eng., 13, 22–30.