The Effect of Local Sequence Context on Mutational Bias of Genes Encoded on the Leading and Lagging Strands

Curr Biol. 2016 Mar 7;26(5):692-7. doi: 10.1016/j.cub.2016.01.016. Epub 2016 Feb 25.

Abstract

All organisms must replicate their genetic information accurately to ensure its faithful transmission. DNA polymerase errors provide an important source of genetic variation that can drive evolution. Understanding the origins of genetic variation will inform our understanding of evolution and the development of genetic diseases. A number of factors have been proposed to influence mutagenesis [1-10]. Here, we used mutation accumulation lines, whole-genome sequencing, and whole-transcriptome analysis to study the locations and rate at which mutations arise in bacteria with as little selection bias as possible [11, 12]. Our analysis of greater than 7,000 replication errors in over 180 sequenced lines that underwent a total of more than 370,000 generations has provided new insights into how DNA polymerase errors sculpt genetic variation and drive evolution. Homopolymer run enrichment outside of genes causes insertions and deletions in these regions. Genes encoded in the lagging strand are transcribed such that RNA polymerase and DNA polymerase collide head-on. Head-on genes have been proposed to mutate at a higher rate than genes transcribed codirectionally with DNA polymerase progression due to conflicts between transcription and DNA replication [6, 10]. We did not detect associations between the number of base pair substitutions in genes and their orientation or expression. Strikingly, any higher mutation rate for head-on genes can be explained by differing sequence composition between the leading and lagging strands and the error bias for DNA polymerase in specific sequence contexts. Therefore, we find local sequence context is the major determinant of mutagenesis in bacteria.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacillus subtilis / genetics*
  • Bacillus subtilis / metabolism
  • Bacterial Proteins / genetics*
  • Bacterial Proteins / metabolism
  • DNA Damage
  • DNA Replication*
  • DNA-Directed DNA Polymerase / genetics
  • DNA-Directed DNA Polymerase / metabolism
  • DNA-Directed RNA Polymerases / genetics
  • DNA-Directed RNA Polymerases / metabolism
  • Mutagenesis*
  • Sequence Analysis, DNA

Substances

  • Bacterial Proteins
  • DNA-Directed RNA Polymerases
  • DNA-Directed DNA Polymerase