Predicting adenine base editing efficiencies in different cellular contexts by deep learning

Genome Biol. 2025 May 8;26(1):115. doi: 10.1186/s13059-025-03586-7.

Abstract

Background: Adenine base editors (ABEs) enable the conversion of A•T to G•C base pairs. Since the sequence of the target locus influences base editing efficiency, efforts have been made to develop computational models that can predict base editing outcomes based on the targeted sequence. However, these models were trained on base editing datasets generated in cell lines and their predictive power for base editing in primary cells in vivo remains uncertain.

Results: In this study, we conduct base editing screens using SpRY-ABEmax and SpRY-ABE8e to target 2,195 pathogenic mutations with a total of 12,000 guide RNAs in cell lines and in the murine liver. We observe strong correlations between in vitro datasets generated by ABE-mRNA electroporation into HEK293T cells and in vivo datasets generated by adeno-associated virus (AAV)- or lipid nanoparticle (LNP)-mediated nucleoside-modified mRNA delivery (Spearman R = 0.83-0.92). We subsequently develop BEDICT2.0, a deep learning model that predicts adenine base editing efficiencies with high accuracy in cell lines (R = 0.60-0.94) and in the liver (R = 0.62-0.81).

Conclusions: In conclusion, our work confirms that adenine base editing holds considerable potential for correcting a large fraction of pathogenic mutations. We also provide BEDICT2.0 - a robust computational model that helps identify sgRNA-ABE combinations capable of achieving high on-target editing with minimal bystander effects in both in vitro and in vivo settings.

Keywords: CRISPR-Cas9 genome editing; Genomics; In vivo; Machine learning; Mouse.

MeSH terms

  • Adenine* / metabolism
  • Animals
  • Deep Learning*
  • Gene Editing* / methods
  • HEK293 Cells
  • Humans
  • Liver / metabolism
  • Mice
  • RNA, Messenger / genetics

Substances

  • Adenine
  • RNA, Messenger