Comparative study of bioinformatic tools for the identification of chimeric RNAs from RNA Sequencing

RNA Biol. 2021 Oct 15;18(sup1):254-267. doi: 10.1080/15476286.2021.1940047. Epub 2021 Jun 18.

Abstract

Chimeric RNAs are gaining more and more attention as they have broad implications in both cancer and normal physiology. To date, over 40 chimeric RNA prediction methods have been developed to facilitate their identification from RNA sequencing data. However, a limited number of studies have been conducted to compare the performance of these tools; additionally, previous studies have become outdated as more software tools have been developed within the last three years. In this study, we benchmarked 16 chimeric RNA prediction software, including seven top performers in previous benchmarking studies, and nine that were recently developed. We used two simulated and two real RNA-Seq datasets, compared the 16 tools for their sensitivity, positive prediction value (PPV), F-measure, and also documented the computational requirements (time and memory). We noticed that none of the tools are inclusive, and their performance varies depending on the dataset and objects. To increase the detection of true positive events, we also evaluated the pair-wise combination of these methods to suggest the best combination for sensitivity and F-measure. In addition, we compared the performance of the tools for the identification of three classes (read-through, inter-chromosomal and intra-others) of chimeric RNAs. Finally, we performed TOPSIS analyses and ranked the weighted performance of the 16 tools.

Keywords: Chimeric rna; benchmarking; datasets; fusion transcript; software tools.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Computer Simulation
  • Gene Fusion*
  • Humans
  • Neoplasms / genetics*
  • Oncogene Proteins, Fusion / analysis
  • Oncogene Proteins, Fusion / genetics*
  • RNA, Neoplasm / analysis
  • RNA, Neoplasm / genetics*
  • Sequence Analysis, RNA / methods*
  • Software*

Substances

  • Oncogene Proteins, Fusion
  • RNA, Neoplasm