Allele balance bias identifies systematic genotyping errors and false disease associations

Hum Mutat. 2019 Jan;40(1):115-126. doi: 10.1002/humu.23674. Epub 2018 Nov 23.

Abstract

In recent years, next-generation sequencing (NGS) has become a cornerstone of clinical genetics and diagnostics. Many clinical applications require high precision, especially if rare events such as somatic mutations in cancer or genetic variants causing rare diseases need to be identified. Although random sequencing errors can be modeled statistically and deep sequencing minimizes their impact, systematic errors remain a problem even at high depth of coverage. Understanding their source is crucial to increase precision of clinical NGS applications. In this work, we studied the relation between recurrent biases in allele balance (AB), systematic errors, and false positive variant calls across a large cohort of human samples analyzed by whole exome sequencing (WES). We have modeled the AB distribution for biallelic genotypes in 987 WES samples in order to identify positions recurrently deviating significantly from the expectation, a phenomenon we termed allele balance bias (ABB). Furthermore, we have developed a genotype callability score based on ABB for all positions of the human exome, which detects false positive variant calls that passed state-of-the-art filters. Finally, we demonstrate the use of ABB for detection of false associations proposed by rare variant association studies. Availability: https://github.com/Francesc-Muyas/ABB.

Keywords: allele balance; false positive variant calls; genetic variant detection; systematic NGS errors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles*
  • Bias
  • Databases, Genetic
  • Disease / genetics*
  • Genetic Association Studies
  • Genome, Human
  • Genotype
  • Genotyping Techniques*
  • Humans
  • Models, Genetic
  • Polymorphism, Single Nucleotide / genetics