Deep learning models simultaneously trained on multiple datasets improve base-editing activity prediction

Nat Commun. 2025 Nov 7;16(1):9821. doi: 10.1038/s41467-025-65200-5.

Abstract

CRISPR-derived base editors (BE) enable precise single nucleotide substitution without introducing double-stranded DNA breaks. Apart from the base editing enzymes, efficient base editing strongly depends on both the CRISPR guide RNA (gRNA) efficiency and the edited position. Here, we show that the accuracy of BE gRNA design can be significantly improved by generating more data and by introducing deep neural networks trained on multiple different datasets simultaneously. Generating ~20,000 gRNAs for A•T to G•C and C•G to T•A conversions, we present such deep learning models, which also allow users to do dataset-aware predictions. The methods are available online and as stand-alone software.

MeSH terms

  • CRISPR-Cas Systems* / genetics
  • Deep Learning*
  • Gene Editing* / methods
  • Humans
  • Neural Networks, Computer
  • RNA, Guide, CRISPR-Cas Systems / genetics
  • Software

Substances

  • RNA, Guide, CRISPR-Cas Systems