recount3: summaries and queries for large-scale RNA-seq expression and splicing

Genome Biol. 2021 Nov 29;22(1):323. doi: 10.1186/s13059-021-02533-6.

Abstract

We present recount3, a resource consisting of over 750,000 publicly available human and mouse RNA sequencing (RNA-seq) samples uniformly processed by our new Monorail analysis pipeline. To facilitate access to the data, we provide the recount3 and snapcount R/Bioconductor packages as well as complementary web resources. Using these tools, data can be downloaded as study-level summaries or queried for specific exon-exon junctions, genes, samples, or other features. Monorail can be used to process local and/or private data, allowing results to be directly compared to any study in recount3. Taken together, our tools help biologists maximize the utility of publicly available RNA-seq data, especially to improve their understanding of newly collected data. recount3 is available from http://rna.recount.bio .

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Computational Biology / methods
  • Exons
  • Gene Expression Regulation
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Mice
  • RNA / genetics*
  • RNA Splicing*
  • RNA-Seq / methods*
  • Sequence Analysis, RNA / methods
  • Software

Substances

  • RNA