Probe Lasso: a novel method to rope in differentially methylated regions with 450K DNA methylation data

Methods. 2015 Jan 15:72:21-8. doi: 10.1016/j.ymeth.2014.10.036. Epub 2014 Nov 24.

Abstract

The speed and resolution at which we can scour the genome for DNA methylation changes has improved immeasurably in the last 10years and the advent of the Illumina 450K BeadChip has made epigenome-wide association studies (EWAS) a reality. The resulting datasets are conveniently formatted to allow easy alignment of significant hits to genes and genetic features, however; methods that parse significant hits into discreet differentially methylated regions (DMRs) remain a challenge to implement. In this paper we present details of a novel DMR caller, the Probe Lasso: a flexible window based approach that gathers neighbouring significant-signals to define clear DMR boundaries for subsequent in-depth analysis. The method is implemented in the R package ChAMP (Morris et al., 2014) and returns sets of DMRs according to user-tuned levels of probe filtering (e.g., inclusion of sex chromosomes, polymorphisms) and probe-lasso size distribution. Using a sub-sample of colon cancer- and healthy colon-samples from TCGA we show that Probe Lasso shifts DMR calling away from just probe-dense regions, and calls a range of DMR sizes ranging from tens-of-bases to tens-of-kilobases in scale. Moreover, using TCGA data we show that Probe Lasso leverages more information from the array and highlights a potential role of hypomethylated transcription factor binding motifs not discoverable using a basic, fixed-window approach.

Keywords: DNA methylation; Differentially methylated regions; EWAS; Epigenetics; Illumina 450K BeadChip.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • CpG Islands
  • DNA Methylation*
  • Data Interpretation, Statistical
  • Datasets as Topic
  • Epigenomics / methods
  • Oligonucleotide Array Sequence Analysis / methods*
  • Software