Allele frequency-free inference of close familial relationships from genotypes or low-depth sequencing data

Mol Ecol. 2019 Jan;28(1):35-48. doi: 10.1111/mec.14954.

Abstract

Knowledge of how individuals are related is important in many areas of research, and numerous methods for inferring pairwise relatedness from genetic data have been developed. However, the majority of these methods were not developed for situations where data are limited. Specifically, most methods rely on the availability of population allele frequencies, the relative genomic position of variants and accurate genotype data. But in studies of non-model organisms or ancient samples, such data are not always available. Motivated by this, we present a new method for pairwise relatedness inference, which requires neither allele frequency information nor information on genomic position. Furthermore, it can be applied not only to accurate genotype data but also to low-depth sequencing data from which genotypes cannot be accurately called. We evaluate it using data from a range of human populations and show that it can be used to infer close familial relationships with a similar accuracy as a widely used method that relies on population allele frequencies. Additionally, we show that our method is robust to SNP ascertainment and applicable to low-depth sequencing data generated using different strategies, including resequencing and RADseq, which is important for application to a diverse range of populations and species.

Keywords: IBD; NGS; ascertainment bias; identity by descent; low-depth; non-model; relatedness.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Gene Frequency / genetics*
  • Genome, Human / genetics*
  • Genomics*
  • Genotype
  • High-Throughput Nucleotide Sequencing / statistics & numerical data
  • Humans
  • Models, Genetic
  • Pedigree
  • Polymorphism, Single Nucleotide / genetics
  • Software