A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data

Am J Hum Genet. 2000 Apr;66(4):1287-97. doi: 10.1086/302861. Epub 2000 Mar 28.

Abstract

The identification of genes contributing to complex diseases and quantitative traits requires genetic data of high fidelity, because undetected errors and mutations can profoundly affect linkage information. The recent emphasis on the sibling-pair design eliminates or reduces the chance of detecting genotyping errors and marker mutations through apparent Mendelian incompatibilities or close double recombinants. In this article, we describe a hidden Markov method for detecting genotyping errors and mutations in multilocus linkage data. Specifically, we calculate the posterior probability of genotyping error or mutation for each sibling-pair-marker combination, conditional on all marker data and an assumed genotype-error rate. The method is designed for use with sibling-pair data when parental genotypes are unavailable. Through Monte Carlo simulation, we explore the effects of map density, marker-allele frequencies, marker position, and genotype-error rate on the accuracy of our error-detection method. In addition, we examine the impact of genotyping errors, and of error detection and correction, on multipoint linkage information. We illustrate that even moderate error rates can result in substantial loss of linkage information, given efforts to fine-map a putative disease locus. Although simulations suggest that our method detects ≤50% of genotyping errors, it generally flags those errors that have the largest impact on linkage results. For high-resolution genetic maps, removal of the errors identified by our method restores most or nearly all of the lost linkage information, and the removal of incorrectly identified errors does not generate false evidence for linkage.
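
As an illustration of the kind of calculation the abstract describes, the following is a minimal forward-backward sketch in Python, not the authors' implementation. It assumes the hidden state at each marker is the number of alleles the sibling pair shares identical by descent (0, 1, or 2), that an erroneous observation is independent of that state, and that the per-marker likelihoods `lik_ok` and `lik_err` have already been computed from allele frequencies by some other routine. The function names, argument names, and the error model are all hypothetical, and no scaling is applied, so the sketch is only suitable for short marker maps.

```python
def ibd_transition(theta):
    """Sib-pair IBD (0/1/2) transition matrix between two markers.

    theta is the recombination fraction between the markers; psi is the
    probability that sharing from one parent is unchanged.
    """
    psi = theta ** 2 + (1 - theta) ** 2
    return [
        [psi ** 2, 2 * psi * (1 - psi), (1 - psi) ** 2],
        [psi * (1 - psi), psi ** 2 + (1 - psi) ** 2, psi * (1 - psi)],
        [(1 - psi) ** 2, 2 * psi * (1 - psi), psi ** 2],
    ]


def posterior_error(lik_ok, lik_err, thetas, eps):
    """Posterior probability of an error at each marker, given all markers.

    lik_ok[k][s]: P(observed genotypes at marker k | IBD = s, no error)
    lik_err[k]:   P(observed genotypes at marker k | error), assumed here
                  to be independent of the IBD state (illustrative choice)
    thetas[k]:    recombination fraction between markers k and k + 1
    eps:          assumed prior error rate per sib-pair-marker combination
    """
    n = len(lik_ok)
    states = range(3)
    prior = [0.25, 0.5, 0.25]  # stationary IBD distribution for full sibs
    trans = [ibd_transition(t) for t in thetas]
    # Emission as a mixture of the no-error and error components.
    emit = [[(1 - eps) * lik_ok[k][s] + eps * lik_err[k] for s in states]
            for k in range(n)]

    # Forward pass; pred[k] is the state distribution before marker k's data.
    pred, fwd = [prior], []
    for k in range(n):
        if k > 0:
            t = trans[k - 1]
            pred.append([sum(fwd[k - 1][r] * t[r][s] for r in states)
                         for s in states])
        fwd.append([pred[k][s] * emit[k][s] for s in states])
    total = sum(fwd[-1])  # likelihood of all marker data

    # Backward pass.
    bwd = [[1.0] * 3 for _ in range(n)]
    for k in range(n - 2, -1, -1):
        t = trans[k]
        bwd[k] = [sum(t[s][r] * emit[k + 1][r] * bwd[k + 1][r] for r in states)
                  for s in states]

    # Joint probability of (error at k, all data), divided by the likelihood.
    return [sum(pred[k][s] * eps * lik_err[k] * bwd[k][s] for s in states) / total
            for k in range(n)]


if __name__ == "__main__":
    # Five tightly linked markers; the middle one conflicts with its neighbors.
    lik_ok = [[0.01, 0.1, 1.0]] * 2 + [[1.0, 0.1, 0.01]] + [[0.01, 0.1, 1.0]] * 2
    post = posterior_error(lik_ok, [0.3] * 5, [0.01] * 4, eps=0.01)
    for k, p in enumerate(post):
        print(f"marker {k}: posterior error probability {p:.4f}")
```

In this toy input the middle marker implies a sharing state that is unlikely given its tightly linked neighbors, so it receives the largest posterior error probability; a threshold on that probability could then be used to flag genotypes for review.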

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alleles
  • Chromosome Mapping / methods*
  • Chromosome Mapping / statistics & numerical data
  • Computer Simulation
  • Gene Frequency / genetics
  • Genetic Diseases, Inborn / genetics*
  • Genetic Linkage / genetics*
  • Genetic Markers / genetics
  • Genotype
  • Haplotypes / genetics
  • Humans
  • Lod Score
  • Markov Chains
  • Matched-Pair Analysis
  • Models, Genetic
  • Mutation / genetics*
  • Nuclear Family*
  • Research Design*
  • Sensitivity and Specificity

Substances

  • Genetic Markers