A community challenge to evaluate RNA-seq, fusion detection, and isoform quantification methods for cancer discovery

Cell Syst. 2021 Aug 18;12(8):827-838.e5. doi: 10.1016/j.cels.2021.05.021. Epub 2021 Jun 18.

Abstract

The accurate identification and quantitation of RNA isoforms present in the cancer transcriptome is key for analyses ranging from the inference of the impacts of somatic variants to pathway analysis to biomarker development and subtype discovery. The ICGC-TCGA DREAM Somatic Mutation Calling in RNA (SMC-RNA) challenge was a crowd-sourced effort to benchmark methods for RNA isoform quantification and fusion detection from bulk cancer RNA sequencing (RNA-seq) data. It concluded in 2018 with a comparison of 77 fusion detection entries and 65 isoform quantification entries on 51 synthetic tumors and 32 cell lines with spiked-in fusion constructs. We report the entries used to build this benchmark, the leaderboard results, and the experimental features associated with the accurate prediction of RNA species. This challenge required submissions to be in the form of containerized workflows, meaning each of the entries described is easily reusable through CWL and Docker containers at https://github.com/SMC-RNA-challenge. A record of this paper's transparent peer review process is included in the supplemental information.

Keywords: Cancer; Cloud compute; DREAM Challenge; RNA fusion; RNA-seq; benchmark; crowd-sourced; evaluation; isoform quantification.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Humans
  • Neoplasms* / genetics
  • Protein Isoforms / genetics
  • RNA / genetics
  • RNA-Seq
  • Sequence Analysis, RNA

Substances

  • Protein Isoforms
  • RNA