Genome-wide population-based association study of extremely overweight young adults--the GOYA study

PLoS One. 2011;6(9):e24303. doi: 10.1371/journal.pone.0024303. Epub 2011 Sep 15.


Background: Thirty-two common variants associated with body mass index (BMI) have been identified in genome-wide association studies, explaining ∼1.45% of BMI variation in general population cohorts. We performed a genome-wide association study in a sample of young adults enriched for extremely overweight individuals. We aimed to identify new loci associated with BMI and to ascertain whether using an extreme sampling design would identify the variants known to be associated with BMI in general populations.

Methodology/principal findings: From two large Danish cohorts we selected all extremely overweight young men and women (n = 2,633), and equal numbers of population-based controls (n = 2,740, drawn randomly from the same populations as the extremes, representing ∼212,000 individuals). We followed up novel (at the time of the study) association signals (p<0.001) from the discovery cohort in a genome-wide study of 5,846 Europeans, before attempting to replicate the most strongly associated 28 SNPs in an independent sample of Danish individuals (n = 20,917) and a population-based cohort of 15-year-old British adolescents (n = 2,418). Our discovery analysis identified SNPs at three loci known to be associated with BMI with genome-wide confidence (P<5×10(-8); FTO, MC4R and FAIM2). We also found strong evidence of association at the known TMEM18, GNPDA2, SEC16B, TFAP2B, SH2B1 and KCTD15 loci (p<0.001), and nominal association (p<0.05) at a further 8 loci known to be associated with BMI. However, meta-analyses of our discovery and replication cohorts identified no novel associations.

Significance: Our results indicate that the detectable genetic variation associated with extreme overweight is very similar to that previously found for general BMI. This suggests that population-based study designs with enriched sampling of individuals with the extreme phenotype may be an efficient method for identifying common variants that influence quantitative traits and a valid alternative to genotyping all individuals in large population-based studies, which may require tens of thousands of subjects to achieve similar power.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Alpha-Ketoglutarate-Dependent Dioxygenase FTO
  • Apoptosis Regulatory Proteins / genetics
  • Body Mass Index
  • DNA-Binding Proteins / genetics
  • Genetic Predisposition to Disease / genetics
  • Genome-Wide Association Study*
  • Genotype
  • Humans
  • Membrane Proteins / genetics
  • Overweight / genetics*
  • Polymorphism, Single Nucleotide / genetics
  • Potassium Channels / genetics
  • Proteins / genetics
  • Receptor, Melanocortin, Type 4 / genetics
  • Transcription Factor AP-2 / genetics
  • Young Adult


  • Apoptosis Regulatory Proteins
  • DNA-Binding Proteins
  • FAIM2 protein, human
  • KCTD15 protein, human
  • MC4R protein, human
  • Membrane Proteins
  • Potassium Channels
  • Proteins
  • Receptor, Melanocortin, Type 4
  • SEC16B protein, human
  • TMEM18 protein, human
  • Transcription Factor AP-2
  • Alpha-Ketoglutarate-Dependent Dioxygenase FTO
  • FTO protein, human