PRADA: pipeline for RNA sequencing data analysis

Bioinformatics. 2014 Aug 1;30(15):2224-6. doi: 10.1093/bioinformatics/btu169. Epub 2014 Apr 1.

Abstract

Summary: Technological advances in high-throughput sequencing necessitate improved computational tools for processing and analyzing large-scale datasets in a systematic automated manner. For that purpose, we have developed PRADA (Pipeline for RNA-Sequencing Data Analysis), a flexible, modular and highly scalable software platform that provides many different types of information available by multifaceted analysis starting from raw paired-end RNA-seq data: gene expression levels, quality metrics, detection of unsupervised and supervised fusion transcripts, detection of intragenic fusion variants, homology scores and fusion frame classification. PRADA uses a dual-mapping strategy that increases sensitivity and refines the analytical endpoints. PRADA has been used extensively and successfully in the glioblastoma and renal clear cell projects of The Cancer Genome Atlas program.

Availability and implementation: http://sourceforge.net/projects/prada/

Contact: gadgetz@broadinstitute.org or rverhaak@mdanderson.org

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Gene Fusion
  • Genome, Human / genetics
  • Humans
  • Neoplasms / genetics
  • Oligonucleotide Array Sequence Analysis
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA / methods*
  • Software*
  • Statistics as Topic / methods*

Substances

  • RNA, Messenger