DNA methylation arrays as surrogate measures of cell mixture distribution

BMC Bioinformatics. 2012 May 8:13:86. doi: 10.1186/1471-2105-13-86.


Background: There has been a long-standing need in biomedical research for a method that quantifies the normally mixed composition of leukocytes beyond what is possible by simple histological or flow cytometric assessments. The latter is restricted by the labile nature of protein epitopes, requirements for cell processing, and timely cell analysis. In a diverse array of diseases and following numerous immune-toxic exposures, leukocyte composition will critically inform the underlying immuno-biology to most chronic medical conditions. Emerging research demonstrates that DNA methylation is responsible for cellular differentiation, and when measured in whole peripheral blood, serves to distinguish cancer cases from controls.

Results: Here we present a method, similar to regression calibration, for inferring changes in the distribution of white blood cells between different subpopulations (e.g. cases and controls) using DNA methylation signatures, in combination with a previously obtained external validation set consisting of signatures from purified leukocyte samples. We validate the fundamental idea in a cell mixture reconstruction experiment, then demonstrate our method on DNA methylation data sets from several studies, including data from a Head and Neck Squamous Cell Carcinoma (HNSCC) study and an ovarian cancer study. Our method produces results consistent with prior biological findings, thereby validating the approach.

Conclusions: Our method, in combination with an appropriate external validation set, promises new opportunities for large-scale immunological studies of both disease states and noxious exposures.

Publication types

  • Research Support, N.I.H., Extramural
  • Validation Study

MeSH terms

  • Computer Simulation
  • DNA Methylation*
  • Data Interpretation, Statistical
  • Down Syndrome / blood
  • Down Syndrome / diagnosis
  • Down Syndrome / immunology
  • Epigenesis, Genetic*
  • Female
  • Gene Expression Profiling*
  • Head and Neck Neoplasms / blood
  • Head and Neck Neoplasms / diagnosis
  • Head and Neck Neoplasms / immunology
  • Humans
  • Leukocyte Count / methods*
  • Leukocytes / immunology*
  • Obesity / blood
  • Obesity / genetics
  • Obesity / immunology
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Ovarian Neoplasms / blood
  • Ovarian Neoplasms / diagnosis
  • Ovarian Neoplasms / immunology