Computational analysis of mutation spectra

Brief Bioinform. 2003 Sep;4(3):210-27. doi: 10.1093/bib/4.3.210.

Abstract

Mutation frequencies vary along a nucleotide sequence, and nucleotide positions with an exceptionally high mutation frequency are called hotspots. Mutation hotspots in DNA often reflect intrinsic properties of the mutation process, such as the specificity with which mutagens interact with nucleic acids and the sequence-specificity of DNA repair/replication enzymes. They might also reflect structural and functional features of target protein or RNA sequences in which they occur. The determinants of mutation frequency and specificity are complex and there are many analytical methods for their study. This paper discusses computational approaches to analysing mutation spectra (distribution of mutations along the target genes) that include many detectable (mutable) positions. The following methods are reviewed: mutation hotspot prediction; pairwise and multiple comparisons of mutation spectra; derivation of a consensus sequence; and analysis of correlation between nucleotide sequence features and mutation spectra. Spectra of spontaneous and induced mutations are used for illustration of the complexities and pitfalls of such analyses. In general, the DNA sequence context of mutation hotspots is a fingerprint of interactions between DNA and DNA repair/replication/modification enzymes, and the analysis of hotspot context provides evidence of such interactions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Animals
  • Base Composition
  • Base Sequence
  • Computational Biology
  • DNA / genetics
  • DNA / metabolism
  • DNA Mutational Analysis*
  • DNA Repair
  • Databases, Nucleic Acid
  • Genes, Immunoglobulin
  • Humans
  • Molecular Sequence Data
  • Mutagens / metabolism
  • Nucleic Acid Conformation
  • Software
  • Statistics as Topic

Substances

  • Mutagens
  • DNA