Prediction of on-target and off-target activity of CRISPR-Cas13d guide RNAs using deep learning

Nat Biotechnol. 2024 Apr;42(4):628-637. doi: 10.1038/s41587-023-01830-8. Epub 2023 Jul 3.

Abstract

Transcriptome engineering applications in living cells with RNA-targeting CRISPR effectors depend on accurate prediction of on-target activity and off-target avoidance. Here we design and test ~200,000 RfxCas13d guide RNAs targeting essential genes in human cells with systematically designed mismatches and insertions and deletions (indels). We find that mismatches and indels have a position- and context-dependent impact on Cas13d activity, and mismatches that result in G-U wobble pairings are better tolerated than other single-base mismatches. Using this large-scale dataset, we train a convolutional neural network that we term targeted inhibition of gene expression via gRNA design (TIGER) to predict efficacy from guide sequence and context. TIGER outperforms the existing models at predicting on-target and off-target activity on our dataset and published datasets. We show that TIGER scoring combined with specific mismatches yields the first general framework to modulate transcript expression, enabling the use of RNA-targeting CRISPRs to precisely control gene dosage.

MeSH terms

  • CRISPR-Cas Systems / genetics
  • Clustered Regularly Interspaced Short Palindromic Repeats
  • Deep Learning*
  • Gene Editing
  • Humans
  • RNA
  • RNA, Guide, CRISPR-Cas Systems*

Substances

  • RNA, Guide, CRISPR-Cas Systems
  • RNA