Deepred-Mt: Deep representation learning for predicting C-to-U RNA editing in plant mitochondria

Comput Biol Med. 2021 Sep:136:104682. doi: 10.1016/j.compbiomed.2021.104682. Epub 2021 Jul 27.

Abstract

In land plant mitochondria, C-to-U RNA editing converts cytidines into uridines at highly specific RNA positions called editing sites. This editing step is essential for the correct functioning of mitochondrial proteins. When using sequence homology information, edited positions can be computationally predicted with high precision. However, predictions based on the sequence contexts of such edited positions often result in lower precision, which is limiting further advances on novel genetic engineering techniques for RNA regulation. Here, a deep convolutional neural network called Deepred-Mt is proposed. It predicts C-to-U editing events based on the 40 nucleotides flanking a given cytidine. Unlike existing methods, Deepred-Mt was optimized by using editing extent information, novel strategies of data augmentation, and a large-scale training dataset, constructed with deep RNA sequencing data of 21 plant mitochondrial genomes. In comparison to predictive methods based on sequence homology, Deepred-Mt attains significantly better predictive performance, in terms of average precision as well as F1 score. In addition, our approach is able to recognize well-known sequence motifs linked to RNA editing, and shows that the local RNA structure surrounding editing sites may be a relevant factor regulating their editing. These results demonstrate that Deepred-Mt is an effective tool for predicting C-to-U RNA editing in plant mitochondria. Source code, datasets, and detailed use cases are freely available at https://github.com/aedera/deepredmt.

Keywords: C-to-U RNA editing; Convolutional neural networks; Land plants; Mitochondrial genomes; Representation learning; Sequence classification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Mitochondria* / genetics
  • RNA Editing* / genetics