iRNAD: a computational tool for identifying D modification sites in RNA sequence

Zhao-Chun Xu; Peng-Mian Feng; Hui Yang; Wang-Ren Qiu; Wei Chen; Hao Lin

doi:10.1093/bioinformatics/btz358

iRNAD: a computational tool for identifying D modification sites in RNA sequence

Bioinformatics. 2019 Dec 1;35(23):4922-4929. doi: 10.1093/bioinformatics/btz358.

Authors

Zhao-Chun Xu^{1

2}, Peng-Mian Feng³, Hui Yang², Wang-Ren Qiu¹, Wei Chen³, Hao Lin²

Affiliations

¹ Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China.
² Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
³ Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, China.

PMID: 31077296
DOI: 10.1093/bioinformatics/btz358

Abstract

Motivation: Dihydrouridine (D) is a common RNA post-transcriptional modification found in eukaryotes, bacteria and a few archaea. The modification can promote the conformational flexibility of individual nucleotide bases. And its levels are increased in cancerous tissues. Therefore, it is necessary to detect D in RNA for further understanding its functional roles. Since wet-experimental techniques for the aim are time-consuming and laborious, it is urgent to develop computational models to identify D modification sites in RNA.

Results: We constructed a predictor, called iRNAD, for identifying D modification sites in RNA sequence. In this predictor, the RNA samples derived from five species were encoded by nucleotide chemical property and nucleotide density. Support vector machine was utilized to perform the classification. The final model could produce the overall accuracy of 96.18% with the area under the receiver operating characteristic curve of 0.9839 in jackknife cross-validation test. Furthermore, we performed a series of validations from several aspects and demonstrated the robustness and reliability of the proposed model.

Availability and implementation: A user-friendly web-server called iRNAD can be freely accessible at http://lin-group.cn/server/iRNAD, which will provide convenience and guide to users for further studying D modification.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Base Sequence
Computational Biology
Nucleotides
RNA
Reproducibility of Results
Support Vector Machine*

Substances

Nucleotides
RNA