Enzyme-assisted high throughput sequencing of an expanded genetic alphabet at single base resolution

Nat Commun. 2024 May 14;15(1):4057. doi: 10.1038/s41467-024-48408-9.

Abstract

With just four building blocks, low sequence information density, few functional groups, poor control over folding, and difficulties in forming compact folds, natural DNA and RNA have been disappointing platforms from which to evolve receptors, ligands, and catalysts. Accordingly, synthetic biology has created "artificially expanded genetic information systems" (AEGIS) to add nucleotides, functionality, and information density. With the expected improvements seen in AegisBodies and AegisZymes, the task for synthetic biologists shifts to developing for expanded DNA the same analytical tools available to natural DNA. Here we report one of these, an enzyme-assisted sequencing of expanded genetic alphabet (ESEGA) method to sequence six-letter AEGIS DNA. We show how ESEGA analyses this DNA at single base resolution, and applies it to optimized conditions for six-nucleotide PCR, assessing the fidelity of various DNA polymerases, and extending this to AEGIS components with functional groups. This supports the renewed exploitation of expanded DNA alphabets in biotechnology.

MeSH terms

  • Base Sequence
  • DNA* / genetics
  • DNA* / metabolism
  • DNA-Directed DNA Polymerase / genetics
  • DNA-Directed DNA Polymerase / metabolism
  • High-Throughput Nucleotide Sequencing* / methods
  • Polymerase Chain Reaction / methods
  • Sequence Analysis, DNA / methods
  • Synthetic Biology / methods