Front Plant Sci. 2012 Aug 28;3:202.
doi: 10.3389/fpls.2012.00202. eCollection 2012.

A High-Throughput Method for Illumina RNA-Seq Library Preparation

Ravi Kumar et al. Front Plant Sci.

Abstract

With the introduction of cost-effective, rapid, and superior-quality next-generation sequencing techniques, gene expression analysis has become viable for labs conducting small projects as well as large-scale gene expression experiments. However, the available protocols for constructing RNA-sequencing (RNA-Seq) libraries are expensive and/or difficult to scale for high-throughput applications. Also, most protocols require isolated total RNA as a starting point. We provide a cost-effective RNA-Seq library synthesis protocol that is fast, starts with tissue, and is high-throughput from tissue to synthesized library. We have also designed, and report here, a set of 96 unique barcodes for library adapters that are amenable to high-throughput sequencing under many combinations of multiplexing strategies. Our protocol has more power to detect differentially expressed genes than the standard Illumina protocol, probably owing to less technical variation among replicates. We also address the problem of gene-length biases affecting differential gene expression calls and demonstrate that such biases can be efficiently minimized during mRNA isolation for library preparation.

Keywords: Illumina; RNA-Seq; cDNA fragmentation; high-throughput; mRNA isolation; multiplexing; sequencing.
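
The abstract mentions a set of 96 unique adapter barcodes for multiplexing. As a rough illustration of why barcode sets are designed with care, the sketch below checks the minimum pairwise Hamming distance of a barcode set and assigns a read prefix to a barcode while tolerating a small number of mismatches. The four barcodes shown are hypothetical placeholders, not the published set, and the use of Hamming distance as the design criterion is an assumption; the abstract does not state how the 96 barcodes were chosen.

```python
from itertools import combinations

def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("barcodes must have equal length")
    return sum(x != y for x, y in zip(a, b))

def min_pairwise_distance(barcodes):
    """Smallest Hamming distance between any two barcodes in the set."""
    return min(hamming(a, b) for a, b in combinations(barcodes, 2))

def assign_barcode(read_prefix, barcodes, max_mismatches=1):
    """Return the single barcode within max_mismatches of the read prefix.

    Returns None when no barcode, or more than one barcode, matches.
    """
    hits = [b for b in barcodes if hamming(read_prefix, b) <= max_mismatches]
    return hits[0] if len(hits) == 1 else None

# Hypothetical 6-nt barcodes for illustration only (not from the paper).
EXAMPLE_BARCODES = ["ACGTAC", "TGCATG", "GATCCA", "CTAGGT"]

if __name__ == "__main__":
    print(min_pairwise_distance(EXAMPLE_BARCODES))
    print(assign_barcode("ACGTAA", EXAMPLE_BARCODES))
```

A set whose minimum pairwise distance is at least 3 allows any single sequencing error in the barcode to be corrected unambiguously, which is one common reason to check this value before pooling samples.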

Figures

Figure 1
Outline of the high-throughput RNA-seq (HTR) library preparation. In short, frozen tissue samples are ground in lysis buffer and mRNA is isolated from the lysate using oligo dT beads (1). The mRNA is used to make first and second strands of cDNA (2), and these double-stranded cDNA molecules are subsequently fragmented enzymatically (3). The ends of these molecules are repaired and an A nucleotide is added (4) to facilitate TA ligation of the barcoded adapters (5). The ligated samples are then enriched by amplification using adapter-specific primers (6) and purified for sequencing.
Figure 2
Quality control analysis for Illumina (IL) and high-throughput RNA-seq (HTR) library preparations. The quality control data from IL and HTR protocols using S. lycopersicum (SLY) and S. pennellii (SPE) are shown. (A) Per base sequence quality. The average of the four replicates is plotted. Error bars represent SD. (B) Sequence duplication levels. (C) Per sequence GC content. (D) Per base sequence content. In (C) and (D), SPE and SLY for the HTR protocol are plotted in the top panel and SPE and SLY for the IL protocol are plotted in the bottom panel. Graphs were made in R using ggplot2.
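
Panel (C) refers to per-sequence GC content. A minimal Python sketch of that metric follows, assuming reads are plain strings and that ambiguous bases such as N are excluded from the denominator (a convention chosen here for illustration; the caption does not say which tool or convention produced the plot).

```python
from collections import Counter

def gc_fraction(seq: str) -> float:
    """Fraction of called bases (A, C, G, T) in a read that are G or C."""
    seq = seq.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        return 0.0  # no called bases, e.g. a read consisting only of N
    return (seq.count("G") + seq.count("C")) / called

def gc_distribution(reads):
    """Count reads by GC content rounded to the nearest whole percent."""
    return Counter(round(100 * gc_fraction(read)) for read in reads)

if __name__ == "__main__":
    example_reads = ["ATGC", "GGCC", "ATAT", "ATGCNN"]  # toy reads
    print(gc_distribution(example_reads))
```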
Figure 3
Read mapping for Illumina (IL) and high-throughput RNA-seq (HTR) library preparations. (A) Total number of reads. (B) Adapter contamination. (C) rRNA contamination. (D) Percentage reads mapped. (E) Number of detected genes. The read mapping data from IL and HTR protocols using S. lycopersicum (SLY) and S. pennellii (SPE) are shown. Graphs were made in R using ggplot2. Error bars are ±SEM.
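
Figure 2 reports error bars as SD, whereas Figure 3 reports ±SEM. The two are related by SEM = SD / sqrt(n), which the sketch below computes for a set of replicate values. The numbers used are made up for illustration and are not data from the paper.

```python
import math
import statistics

def mean_sd_sem(values):
    """Return (mean, sample SD, SEM) for a list of replicate measurements."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two replicates")
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)   # sample SD (n - 1 in the denominator)
    sem = sd / math.sqrt(n)         # standard error of the mean
    return mean, sd, sem

if __name__ == "__main__":
    # Illustrative values for four replicates (not from the paper).
    mean, sd, sem = mean_sd_sem([2.0, 4.0, 4.0, 6.0])
    print(f"mean={mean:.3f} sd={sd:.3f} sem={sem:.3f}")
```

With four replicates, as in these figures, the SEM is half the SD, so error bars in the two figures are not directly comparable by eye.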
Figure 4
Detection of gene expression for Illumina (IL) and high-throughput RNA-seq (HTR) library preparations. (A) Read coverage along the whole gene length. (B) Multidimensional scaling (MDS) plot for assessing the variation among samples. The graph was made using the edgeR package in R. (C) Venn diagram comparing the IL and HTR protocols for differentially expressed genes (BH adjusted p-value < 0.01) between S. lycopersicum (SLY) and S. pennellii (SPE). The categories (a–h) are described in Table S5 in Supplementary Material. (D–G) Gene counts by gene length for IL and HTR protocols (D), for each category in (C) (E), for IL and HTR with Sera-Mag beads (F), and for IL and HTR with increasing amounts of Dynabeads (G). 0–25, 25–50, 50–75, and 75–100 are the four gene-length quartiles (genes were separated into quartiles by gene-length percentile). Graphs were made in R using ggplot2.
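
Two computations named in this caption can be sketched briefly: the Benjamini-Hochberg (BH) adjustment behind the "BH adjusted p-value < 0.01" threshold, and the assignment of genes to gene-length quartiles. The Python below is a minimal illustration, not the authors' analysis code (the caption indicates the analysis was done in R). The function names, the gene identifiers, and the handling of ties and quartile boundaries are assumptions made for this example.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

def length_quartile_labels(lengths):
    """Map each gene to a quartile label based on its rank by length.

    `lengths` is a dict of gene id -> gene length. Ties are broken by
    sort order, which is an arbitrary choice for this sketch.
    """
    labels = ["0-25", "25-50", "50-75", "75-100"]
    order = sorted(lengths, key=lengths.get)
    n = len(order)
    return {gene: labels[min(3, 4 * rank // n)] for rank, gene in enumerate(order)}

if __name__ == "__main__":
    pvals = [0.01, 0.04, 0.03, 0.005]            # illustrative p-values
    adj = bh_adjust(pvals)
    print([p < 0.01 for p in adj])               # which tests pass the cutoff
    genes = {"g1": 500, "g2": 1500, "g3": 900, "g4": 3000}  # hypothetical
    print(length_quartile_labels(genes))
```

Counting significant genes per quartile label then gives the kind of gene-count-by-length summary described for panels (D) to (G).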

References

    1. Benjamini Y., Hochberg Y. (1995). Controlling the false discovery rate – a practical and powerful approach to multiple testing. J. R. Stat. Soc. Series B Stat. Methodol. 57, 289–300.
    2. Chen H., Boutros P. C. (2011). VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics 12, 35. doi: 10.1186/1471-2105-12-199
    3. The Tomato Genome Consortium. (2012). The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641. doi: 10.1038/nature11119
    4. Craig D. W., Pearson J. V., Szelinger S., Sekar A., Redman M., Corneveaux J. J., Pawlowski T. L., Laub T., Nunn G., Stephan D. A., Homer N., Huentelman M. J. (2008). Identification of genetic variants using bar-coded multiplexed sequencing. Nat. Methods 5, 887–893. doi: 10.1038/nmeth.1251
    5. Fisher S., Barry A., Abreu J., Minie B., Nolan J., Delorey T. M., Young G., Fennell T. J., Allen A., Ambrogio L., Berlin A. M., Blumenstiel B., Cibulskis K., Friedrich D., Johnson R., Juhn F., Reilly B., Shammas R., Stalker J., Sykes S. M., Thompson J., Walsh J., Zimmer A., Zwirko Z., Gabriel S., Nicol R., Nusbaum C. (2011). A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries. Genome Biol. 12, R1. doi: 10.1186/gb-2011-12-S1-P1