Analysis and visualization of RNA-Seq expression data using RStudio, Bioconductor, and Integrated Genome Browser

Methods Mol Biol. 2015:1284:481-501. doi: 10.1007/978-1-4939-2444-8_24.


Sequencing costs are falling, but the cost of data analysis remains high, often because unforeseen problems arise, such as insufficient depth of sequencing or batch effects. Experimenting with data analysis methods during the planning phase of an experiment can reveal unanticipated problems and build valuable bioinformatics expertise in the organism or process being studied. This protocol describes using R Markdown and RStudio, user-friendly tools for statistical analysis and reproducible research in bioinformatics, to analyze and document the analysis of an example RNA-Seq data set from tomato pollen undergoing chronic heat stress. Also, we show how to use Integrated Genome Browser to visualize read coverage graphs for differentially expressed genes. Applying the protocol described here and using the provided data sets represent a useful first step toward building RNA-Seq data analysis expertise in a research group.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology / methods
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing*
  • RNA*
  • Software*
  • Solanum lycopersicum / genetics
  • Web Browser*


  • RNA