A multifactorial signature of DNA sequence and polycomb binding predicts aberrant CpG island methylation

Cancer Res. 2009 Jan 1;69(1):282-91. doi: 10.1158/0008-5472.CAN-08-3274.

Abstract

Aberrant CpG island methylation is associated with transcriptional silencing of regulatory genes in human cancer. Although most CpG islands remain unmethylated, a subset accrues aberrant methylation in cancer via unknown mechanisms. Previously, we showed that CpG islands differ in their intrinsic propensity towards hypermethylation. We developed a classifier (PatMAn) based on the frequencies of seven DNA sequence patterns that discriminated methylation-prone (MP) and methylation-resistant (MR) CpG islands. Here, we report on the genome-wide application and direct testing of PatMAn in cancer. Although trained on data from a cell culture model of de novo methylation involving the overexpression of DNMT1, PatMAn accurately predicted CpG islands at increased risk of hypermethylation in cancer cell lines and primary tumors. Analysis of CpG islands predicted to be MP revealed a strong association with embryonic targets of polycomb-repressive complex 2 (PRC2), indicating that PatMAn predicts not only aberrant methylation, but also PRC2 binding. A second classifier (SUPER-PatMAn) that integrates the seven PatMAn DNA patterns with SUZ12 enriched regions as a marker of PRC2 occupancy showed improved performance (prediction accuracy, 81-88%). In addition to many non-PRC2 targets, SUPER-PatMAn identified a subset of PRC2 targets that were more likely to be hypermethylated in cancer. Genome-wide, CpG islands predicted to be MP were enriched in genes known to undergo hypermethylation in cancer, genes functioning in transcriptional regulation, and components of developmental pathways. These findings show that hypermethylation of certain gene loci is controlled in part by an underlying susceptibility influenced by both local sequence context and trans-acting factors.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / genetics
  • Base Sequence
  • Breast Neoplasms / genetics*
  • Carcinoma, Squamous Cell / genetics
  • Cell Line
  • Cell Line, Tumor
  • CpG Islands*
  • DNA (Cytosine-5-)-Methyltransferase 1
  • DNA (Cytosine-5-)-Methyltransferases / biosynthesis
  • DNA (Cytosine-5-)-Methyltransferases / genetics
  • DNA Methylation*
  • Genome, Human
  • Humans
  • Lung Neoplasms / genetics*

Substances

  • DNA (Cytosine-5-)-Methyltransferase 1
  • DNA (Cytosine-5-)-Methyltransferases
  • DNMT1 protein, human