Measure transcript integrity using RNA-seq data
- PMID: 26842848
- PMCID: PMC4739097
- DOI: 10.1186/s12859-016-0922-z
Abstract
Background: Stored biological samples with pathology information and medical records are invaluable resources for translational medical research. However, RNAs extracted from archived clinical tissues are often substantially degraded. RNA degradation distorts RNA-seq read coverage in a gene-specific manner and profoundly influences whole-genome gene expression profiling.
Results: We developed the transcript integrity number (TIN) to measure RNA degradation. Applying it to 3 independent RNA-seq datasets, we demonstrated that TIN is a reliable and sensitive measure of RNA degradation at both the transcript and sample levels. By comparing 10 prostate cancer clinical samples with lower RNA integrity to 10 samples with higher RNA quality, we demonstrated that calibrating gene expression counts with TIN scores could effectively neutralize RNA degradation effects by reducing false positives and recovering biologically meaningful pathways. When we further evaluated TIN correction using spike-in transcripts in RNA-seq data generated by the Sequencing Quality Control consortium, TIN adjustment gave better control of false positives and false negatives (sensitivity = 0.89, specificity = 0.91, accuracy = 0.90) than gene expression analysis without TIN correction (sensitivity = 0.98, specificity = 0.50, accuracy = 0.86).
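For readers less familiar with the three metrics reported above, the following minimal sketch shows how sensitivity, specificity, and accuracy are derived from a confusion matrix of differential expression calls. The function name and the counts are hypothetical and chosen only for illustration; they are not taken from the study.

```python
def classification_metrics(tp, fp, tn, fn):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp: truly changed transcripts called as differentially expressed
    fn: truly changed transcripts that were missed
    tn: unchanged transcripts correctly left uncalled
    fp: unchanged transcripts wrongly called as differentially expressed
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one true positive class and one true negative class member")
    sensitivity = tp / (tp + fn)            # fraction of real changes detected
    specificity = tn / (tn + fp)            # fraction of non-changes correctly rejected
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return sensitivity, specificity, accuracy


# Hypothetical counts, for illustration only (not data from the paper).
sens, spec, acc = classification_metrics(tp=45, fp=10, tn=40, fn=5)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} accuracy={acc:.2f}")
```

A method can reach high sensitivity simply by calling many transcripts as changed, which is why a low specificity alongside it signals poor control of false positives.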
Conclusion: TIN is a reliable measurement of RNA integrity and a valuable approach for neutralizing the effects of in vitro RNA degradation and improving differential gene expression analysis.
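The abstract does not state how TIN is calculated. As a rough illustration of the underlying idea, that degradation makes read coverage along a transcript less uniform, the sketch below scores coverage uniformity with Shannon entropy and scales the result to 0-100. The formula, the function name `coverage_uniformity_score`, and the example coverage vectors are all assumptions made for this example and should not be read as the authors' implementation.

```python
import math


def coverage_uniformity_score(coverage):
    """Score how evenly reads cover a transcript, on a 0-100 scale.

    ASSUMPTION: this entropy-based score illustrates the concept only;
    it is not confirmed by the abstract as the TIN definition.
    coverage: per-position read depths sampled along one transcript.
    """
    k = len(coverage)
    total = sum(coverage)
    if k == 0 or total == 0:
        return 0.0  # no positions or no reads: nothing to score
    # Shannon entropy of the coverage distribution (zero-depth positions add 0).
    entropy = -sum((c / total) * math.log(c / total) for c in coverage if c > 0)
    # exp(entropy) is the "effective" number of evenly covered positions.
    return 100.0 * math.exp(entropy) / k


# Hypothetical coverage vectors, for illustration only.
print(coverage_uniformity_score([10, 10, 10, 10]))  # even coverage scores near 100
print(coverage_uniformity_score([40, 0, 0, 0]))     # coverage piled on one position scores low
```

Under this assumed score, a transcript whose reads concentrate at one end, as expected after degradation, receives a lower value than one with even coverage.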
