YHap: a population model for probabilistic assignment of Y haplogroups from re-sequencing data

Fan Zhang; Ruoyan Chen; Dongbing Liu; Xiaotian Yao; Guoqing Li; Yabin Jin; Chang Yu; Yingrui Li; Lachlan J M Coin

doi:10.1186/1471-2105-14-331

YHap: a population model for probabilistic assignment of Y haplogroups from re-sequencing data

BMC Bioinformatics. 2013 Nov 19:14:331. doi: 10.1186/1471-2105-14-331.

Authors

Fan Zhang¹, Ruoyan Chen, Dongbing Liu, Xiaotian Yao, Guoqing Li, Yabin Jin, Chang Yu, Yingrui Li, Lachlan J M Coin

Affiliation

¹ BGI-shenzhen, Shenzhen, China. yuchang@genomics.org.cn.

Abstract

Background: Y haplogroup analyses are an important component of genealogical reconstruction, population genetic analyses, medical genetics and forensics. These fields are increasingly moving towards use of low-coverage, high throughput sequencing. While there have been methods recently proposed for assignment of Y haplogroups on the basis of high-coverage sequence data, assignment on the basis of low-coverage data remains challenging.

Results: We developed a new algorithm, YHap, which uses an imputation framework to jointly predict Y chromosome genotypes and assign Y haplogroups using low coverage population sequence data. We use data from the 1000 genomes project to demonstrate that YHap provides accurate Y haplogroup assignment with less than 2x coverage.

Conclusions: Borrowing information across multiple samples within a population using an imputation framework enables accurate Y haplogroup assignment.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Chromosomes, Human, Y / genetics*
Genetic Variation
Genetics, Population*
Genome, Human
Genotype
Haplotypes / genetics*
Humans
Male
Mutation / genetics
Predictive Value of Tests
Probability
Sequence Analysis, DNA / methods*
Tandem Repeat Sequences / genetics