Genome-wide mapping of somatic mutation rates uncovers drivers of cancer

Nat Biotechnol. 2022 Nov;40(11):1634-1643. doi: 10.1038/s41587-022-01353-8. Epub 2022 Jun 20.

Abstract

Identification of cancer driver mutations that confer a proliferative advantage is central to understanding cancer; however, searches have often been limited to protein-coding sequences and specific non-coding elements (for example, promoters) because of the challenge of modeling the highly variable somatic mutation rates observed across tumor genomes. Here we present Dig, a method to search for driver elements and mutations anywhere in the genome. We use deep neural networks to map cancer-specific mutation rates genome-wide at kilobase-scale resolution. These estimates are then refined to search for evidence of driver mutations under positive selection throughout the genome by comparing observed to expected mutation counts. We mapped mutation rates for 37 cancer types and applied these maps to identify putative drivers within intronic cryptic splice regions, 5' untranslated regions and infrequently mutated genes. Our high-resolution mutation rate maps, available for web-based exploration, are a resource to enable driver discovery genome-wide.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Mutation / genetics
  • Mutation Rate*
  • Neoplasms* / genetics
  • Neoplasms* / pathology
  • Open Reading Frames
  • Promoter Regions, Genetic