Increased ultra-rare variant load in an isolated Scottish population impacts exonic and regulatory regions

PLoS Genet. 2019 Nov 25;15(11):e1008480. doi: 10.1371/journal.pgen.1008480. eCollection 2019 Nov.

Abstract

Human population isolates provide a snapshot of the impact of historical demographic processes on population genetics. Such data facilitate studies of the functional impact of rare sequence variants on biomedical phenotypes, as strong genetic drift can result in higher frequencies of variants that are otherwise rare. We present the first whole genome sequencing (WGS) study of the VIKING cohort, a representative collection of samples from the isolated Shetland population in northern Scotland, and explore how its genetic characteristics compare to a mainland Scottish population. Our analyses reveal the strong contributions played by the founder effect and genetic drift in shaping genomic variation in the VIKING cohort. About one tenth of all high-quality variants discovered are unique to the VIKING cohort or are seen at frequencies at least ten fold higher than in more cosmopolitan control populations. Multiple lines of evidence also suggest relaxation of purifying selection during the evolutionary history of the Shetland isolate. We demonstrate enrichment of ultra-rare VIKING variants in exonic regions and for the first time we also show that ultra-rare variants are enriched within regulatory regions, particularly promoters, suggesting that gene expression patterns may diverge relatively rapidly in human isolates.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions / genetics
  • Alleles
  • Chromatin / genetics
  • Demography*
  • Europe
  • Exons / genetics
  • Founder Effect
  • Genetic Drift
  • Genetic Variation / genetics*
  • Genetics, Population*
  • Genome-Wide Association Study
  • Genomics
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Promoter Regions, Genetic / genetics
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Scotland
  • Whole Genome Sequencing

Substances

  • 5' Untranslated Regions
  • Chromatin