Improved power and precision with whole genome sequencing data in genome-wide association studies of inflammatory biomarkers

Sci Rep. 2019 Nov 14;9(1):16844. doi: 10.1038/s41598-019-53111-7.

Abstract

Genome-wide association studies (GWAS) have identified associations between thousands of common genetic variants and human traits. However, common variants usually explain a limited fraction of the heritability of a trait. A powerful resource for identifying trait-associated variants is whole genome sequencing (WGS) data in cohorts comprised of families or individuals from a limited geographical area. To evaluate the power of WGS compared to imputations, we performed GWAS on WGS data for 72 inflammatory biomarkers, in a kinship-structured cohort. When using WGS data, we identified 18 novel associations that were not detected when analyzing the same biomarkers with genotyped or imputed SNPs. Five of the novel top variants were low frequency variants with a minor allele frequency (MAF) of <5%. Our results suggest that, even when applying a GWAS approach, we gain power and precision using WGS data, presumably due to more accurate determination of genotypes. The lack of a comparable dataset for replication of our results is a limitation in our study. However, this further highlights that there is a need for more genetic epidemiological studies based on WGS data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Blood Proteins / genetics*
  • Blood Proteins / immunology
  • Female
  • Gene Frequency
  • Genetic Markers
  • Genome-Wide Association Study
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Male
  • Middle Aged
  • Polymorphism, Single Nucleotide*
  • Whole Genome Sequencing / methods*
  • Young Adult

Substances

  • Blood Proteins
  • Genetic Markers