A framework for mutational signature analysis based on DNA shape parameters

PLoS One. 2022 Jan 11;17(1):e0262495. doi: 10.1371/journal.pone.0262495. eCollection 2022.

Abstract

The mutation risk of a DNA locus depends on its oligonucleotide context. In turn, mutability of oligonucleotides varies across individuals, due to exposure to mutagenic agents or due to variable efficiency and/or accuracy of DNA repair. Such variability is captured by mutational signatures, mathematical constructs obtained by deconvolving mutation frequency spectra across individuals. There is a need to enhance methods for inferring mutational signatures to make better use of sparse mutation data (e.g., resulting from exome sequencing of cancers), to facilitate insight into underlying biological mechanisms, and to provide more accurate mutation rate baselines for inferring positive and negative selection. We propose a conceptualization of mutational signatures that represents oligonucleotides via descriptors of DNA conformation: base pair, base pair step, and minor groove width parameters. We demonstrate how such DNA structural parameters can accurately predict mutation occurrence due to DNA repair failures or due to exposure to diverse mutagens such as radiation, chemicals, and the APOBEC cytosine deaminase enzymes. Furthermore, the mutation frequency of DNA oligomers classed by structural features can accurately capture systematic variability in mutagenesis of >1,000 tumors originating from diverse human tissues. We applied nonnegative matrix factorization to mutation spectra stratified by DNA structural features, thereby extracting novel mutational signatures. Moreover, many of the known trinucleotide signatures were associated with an additional spectrum in the DNA structural descriptor space, which may aid interpretation and provide mechanistic insight. Overall, we suggest that the power of DNA sequence motif-based mutational signature analysis can be enhanced by drawing on DNA shape features.
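The factorization step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses the standard Lee-Seung multiplicative updates for nonnegative matrix factorization in dependency-free Python, and the count matrix, the eight "shape feature bins", the two planted signatures, and all function names are hypothetical choices made for illustration only.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]

def frobenius_error(v, w, h):
    """Frobenius norm of the residual V - WH."""
    wh = matmul(w, h)
    return sum((x - y) ** 2 for rv, rw in zip(v, wh) for x, y in zip(rv, rw)) ** 0.5

def nmf(v, k, n_iter=500, seed=0, eps=1e-12):
    """Factor V (samples x features) into exposures W and signatures H.

    Uses Lee-Seung multiplicative updates, which keep W and H nonnegative.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    # Strictly positive random initialization.
    w = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        # W <- W * (V H^T) / (W H H^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    # Rescale so each signature sums to 1; the scale moves into the exposures,
    # leaving the product WH unchanged.
    for r in range(k):
        s = sum(h[r]) or 1.0
        h[r] = [x / s for x in h[r]]
        for i in range(n):
            w[i][r] *= s
    return w, h

# Hypothetical toy data: 6 tumors x 8 shape-feature bins, built from two
# planted signatures (assumed values, not taken from the paper).
true_h = [[0.4, 0.3, 0.2, 0.1, 0.0, 0.0, 0.0, 0.0],
          [0.0, 0.0, 0.0, 0.0, 0.1, 0.2, 0.3, 0.4]]
true_w = [[100, 0], [80, 20], [50, 50], [20, 80], [0, 100], [60, 10]]
v = matmul(true_w, true_h)

w, h = nmf(v, k=2)
print("residual:", frobenius_error(v, w, h))
for signature in h:
    print([round(x, 3) for x in signature])
```

Each row of `h` is a candidate signature over the feature bins, and each row of `w` gives one tumor's exposure to those signatures. A real analysis would more likely rely on an established numerical library and would need a principled way of choosing the number of signatures, which this sketch fixes at two.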

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • APOBEC Deaminases / metabolism
  • DNA / chemistry*
  • DNA / genetics*
  • DNA Damage
  • DNA Mutational Analysis / methods*
  • DNA Repair
  • Genome, Human*
  • Humans
  • Mutation*
  • Neoplasms / genetics
  • Neoplasms / pathology*
  • Nucleic Acid Conformation*
  • Transcriptome

Substances

  • DNA
  • APOBEC Deaminases