Comprehensive population-based genome sequencing provides insight into hematopoietic regulatory mechanisms

Proc Natl Acad Sci U S A. 2017 Jan 17;114(3):E327-E336. doi: 10.1073/pnas.1619052114. Epub 2016 Dec 28.


Genetic variants affecting hematopoiesis can influence commonly measured blood cell traits. To identify factors that affect hematopoiesis, we performed association studies for blood cell traits in the population-based Estonian Biobank using high-coverage whole-genome sequencing (WGS) in 2,284 samples and SNP genotyping in an additional 14,904 samples. Using up to 7,134 samples with available phenotype data, our analyses identified 17 associations across 14 blood cell traits. Integration of WGS-based fine-mapping and complementary epigenomic datasets provided evidence for causal mechanisms at several loci, including at a previously undiscovered basophil count-associated locus near the master hematopoietic transcription factor CEBPA The fine-mapped variant at this basophil count association near CEBPA overlapped an enhancer active in common myeloid progenitors and influenced its activity. In situ perturbation of this enhancer by CRISPR/Cas9 mutagenesis in hematopoietic stem and progenitor cells demonstrated that it is necessary for and specifically regulates CEBPA expression during basophil differentiation. We additionally identified basophil count-associated variation at another more pleiotropic myeloid enhancer near GATA2, highlighting regulatory mechanisms for ordered expression of master hematopoietic regulators during lineage specification. Our study illustrates how population-based genetic studies can provide key insights into poorly understood cell differentiation processes of considerable physiologic relevance.

Keywords: CEBPA; GWAS; basophils; genome sequencing; hematopoiesis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Basophils / cytology
  • CCAAT-Enhancer-Binding Proteins / genetics*
  • Cell Differentiation / genetics
  • Cell Lineage / genetics
  • Chromosome Mapping
  • Databases, Nucleic Acid
  • Enhancer Elements, Genetic
  • Epigenesis, Genetic
  • Estonia
  • Female
  • GATA2 Transcription Factor / genetics
  • Gene Expression Regulation, Developmental
  • Genome-Wide Association Study
  • Hematopoiesis / genetics*
  • Humans
  • Leukocyte Count
  • Male
  • Polymorphism, Single Nucleotide
  • Whole Genome Sequencing


  • CCAAT-Enhancer-Binding Proteins
  • CEBPA protein, human
  • GATA2 Transcription Factor
  • GATA2 protein, human