Sequencing degraded RNA addressed by 3' tag counting
- PMID: 24632678
- PMCID: PMC3954844
- DOI: 10.1371/journal.pone.0091851
Sequencing degraded RNA addressed by 3' tag counting
Abstract
RNA sequencing has become widely used in gene expression profiling experiments. Prior to any RNA sequencing experiment the quality of the RNA must be measured to assess whether or not it can be used for further downstream analysis. The RNA integrity number (RIN) is a scale used to measure the quality of RNA that runs from 1 (completely degraded) to 10 (intact). Ideally, samples with high RIN (> 8) are used in RNA sequencing experiments. RNA, however, is a fragile molecule which is susceptible to degradation and obtaining high quality RNA is often hard, or even impossible when extracting RNA from certain clinical tissues. Thus, occasionally, working with low quality RNA is the only option the researcher has. Here we investigate the effects of RIN on RNA sequencing and suggest a computational method to handle data from samples with low quality RNA which also enables reanalysis of published datasets. Using RNA from a human cell line we generated and sequenced samples with varying RINs and illustrate what effect the RIN has on the basic procedure of RNA sequencing; both quality aspects and differential expression. We show that the RIN has systematic effects on gene coverage, false positives in differential expression and the quantification of duplicate reads. We introduce 3' tag counting (3TC) as a computational approach to reliably estimate differential expression for samples with low RIN. We show that using the 3TC method in differential expression analysis significantly reduces false positives when comparing samples with different RIN, while retaining reasonable sensitivity.
Conflict of interest statement
Figures
of the fold change (fold change = expr(RIN 8)/expr(RIN 10)) on the y-axis and transcript length on the x-axis. (b) The DEGs shown in (a) are split into two groups; the ones that have higher expression in RIN 10 (red) and the ones that have higher expression in RIN 8 (blue). The average transcript length in the RIN 10 group is significantly higher than the average transcript length in the RIN 8 group (Student's t-test, p
0.001). Error bars denote the standard error. The distribution of these gene lengths is shown in Figure S6. (c) Expression profile of the comparison RIN 10 vs. RiboMinus. In total there are 3778 DEGs; with 2081 upregulated in the RM group and 1697 upregulated in the RIN 10 group. Some of the genes upregulated in the RM group show markedly high fold change. Many of those, marked with a circle, are histone genes. The transcripts of histone genes lack a poly A tail which explains why they show a markedly higher expression in the samples prepared with ribosomal depletion compared to samples prepared with poly A selection. Additionally, genes that show similar trend have been marked with a triangle. These data indicate that those genes may lack or have repressed poly adenylation sites.
Similar articles
-
Effects of RNA integrity on transcript quantification by total RNA sequencing of clinically collected human placental samples.FASEB J. 2017 Aug;31(8):3298-3308. doi: 10.1096/fj.201601031RR. Epub 2017 Apr 26. FASEB J. 2017. PMID: 28446590
-
RNA quality score evaluation: A preliminary study of RNA integrity number (RIN) and RNA integrity and quality number (RNA IQ).Forensic Sci Int. 2024 Apr;357:111976. doi: 10.1016/j.forsciint.2024.111976. Epub 2024 Mar 2. Forensic Sci Int. 2024. PMID: 38447345
-
Quality assessment and data handling methods for Affymetrix Gene 1.0 ST arrays with variable RNA integrity.BMC Genomics. 2013 Jan 16;14:14. doi: 10.1186/1471-2164-14-14. BMC Genomics. 2013. PMID: 23324084 Free PMC article.
-
Misuse of RPKM or TPM normalization when comparing across samples and sequencing protocols.RNA. 2020 Aug;26(8):903-909. doi: 10.1261/rna.074922.120. Epub 2020 Apr 13. RNA. 2020. PMID: 32284352 Free PMC article. Review.
-
Differential Expression Analysis of RNA-seq Reads: Overview, Taxonomy, and Tools.IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):566-586. doi: 10.1109/TCBB.2018.2873010. Epub 2018 Oct 1. IEEE/ACM Trans Comput Biol Bioinform. 2020. PMID: 30281477 Review.
Cited by
-
RNA-Integrity and 8-Isoprostane Levels Are Stable in Prostate Tissue Samples Upon Long-Term Storage at -80°C.Biopreserv Biobank. 2021 Feb;19(1):2-10. doi: 10.1089/bio.2019.0136. Epub 2020 Aug 28. Biopreserv Biobank. 2021. PMID: 32865438 Free PMC article.
-
Analysis of public RNA-sequencing data reveals biological consequences of genetic heterogeneity in cell line populations.Sci Rep. 2018 Jul 25;8(1):11226. doi: 10.1038/s41598-018-29506-3. Sci Rep. 2018. PMID: 30046134 Free PMC article.
-
3' RNA-seq is superior to standard RNA-seq in cases of sparse data but inferior at identifying toxicity pathways in a model organism.Front Bioinform. 2023 Jul 27;3:1234218. doi: 10.3389/fbinf.2023.1234218. eCollection 2023. Front Bioinform. 2023. PMID: 37576716 Free PMC article.
-
Advantages of RNA-seq compared to RNA microarrays for transcriptome profiling of anterior cruciate ligament tears.J Orthop Res. 2018 Jan;36(1):484-497. doi: 10.1002/jor.23661. Epub 2017 Aug 29. J Orthop Res. 2018. PMID: 28749036 Free PMC article.
-
Fast prostate retrieval in robot-assisted laparoscopic prostatectomy for next-generation biobanking.J Robot Surg. 2020 Apr;14(2):271-274. doi: 10.1007/s11701-019-00974-7. Epub 2019 May 25. J Robot Surg. 2020. PMID: 31129776
References
-
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B (2008) Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat Meth 5: 621–628. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
