Genome Modularization Reveals Overlapped Gene Topology Is Necessary for Efficient Viral Reproduction

ACS Synth Biol. 2020 Nov 20;9(11):3079-3090. doi: 10.1021/acssynbio.0c00323. Epub 2020 Oct 12.


Sequence overlap between two genes is common across all genomes, with viruses having high proportions of these gene overlaps. Genome modularization and refactoring is the process of disrupting natural gene overlaps to separate coding sequences and enable their individual manipulation. The biological function and fitness effects of gene overlaps are not fully understood, and their effects on gene cluster and genome-level refactoring are unknown. In the bacteriophage φX174 genome, ∼26% of nucleotides are involved in encoding more than one gene. In this study, we use an engineered φX174 phage containing a genome with all gene overlaps removed to show that gene overlap is critical to maintaining optimal viral fecundity. Through detailed phenotypic measurements, we reveal that genome modularization in φX174 causes virion replication, stability, and attachment deficiencies. Quantitation of the complete phage proteome across an infection cycle reveals that 30% of proteins display abnormal expression patterns. Taken together, these results comprehensively demonstrate, for the first time, that gene modularization severely perturbs the coordinated functioning of a bacteriophage replication cycle. This work highlights the biological importance of gene overlap in natural genomes and indicates that reducing gene overlap disruption should be an integral part of future genome engineering projects.
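As a rough illustration of how a figure such as "∼26% of nucleotides involved in encoding more than one gene" can be computed, the sketch below counts positions covered by two or more coding sequences. It assumes the percentage is taken over the whole genome, treats the genome as linear for simplicity (φX174 is circular), and uses made-up toy coordinates rather than the real φX174 annotation. The function name `overlap_fraction` and the interval format are hypothetical choices for this example, not the authors' method.

```python
def overlap_fraction(genome_length, genes):
    """Return the fraction of positions covered by two or more genes.

    genes: iterable of (start, end) half-open, 0-based intervals
    on a linear genome (an assumption made for this sketch).
    """
    coverage = [0] * genome_length
    for start, end in genes:
        for pos in range(start, end):
            coverage[pos] += 1
    # A position is "shared" when at least two genes cover it.
    shared = sum(1 for depth in coverage if depth >= 2)
    return shared / genome_length


# Toy, hypothetical coordinates (NOT the φX174 annotation):
# three genes on a 100-nt genome, adjacent genes overlapping by 10 nt.
toy_genes = [(0, 40), (30, 70), (60, 100)]
toy_fraction = overlap_fraction(100, toy_genes)
print(f"{toy_fraction:.2f}")
```

A fully modularized genome, as described in the abstract, would correspond to a value of 0 under this measure, since no position would be covered by more than one gene.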

Keywords: bacteriophage; genome engineering; proteomics; refactoring; synthetic biology; virus structure.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteriophages / genetics
  • DNA, Viral / genetics
  • Genome, Viral / genetics*
  • Viral Proteins / genetics
  • Virus Replication / genetics*


Substances

  • DNA, Viral
  • Viral Proteins