Data Analysis Pipeline for RNA-seq Experiments: From Differential Expression to Cryptic Splicing
- PMID: 28902396
- PMCID: PMC6373869
- DOI: 10.1002/cpbi.33
Data Analysis Pipeline for RNA-seq Experiments: From Differential Expression to Cryptic Splicing
Abstract
RNA sequencing (RNA-seq) is a high-throughput technology that provides unique insights into the transcriptome. It has a wide variety of applications in quantifying genes/isoforms and in detecting non-coding RNA, alternative splicing, and splice junctions. It is extremely important to comprehend the entire transcriptome for a thorough understanding of the cellular system. Several RNA-seq analysis pipelines have been proposed to date. However, no single analysis pipeline can capture dynamics of the entire transcriptome. Here, we compile and present a robust and commonly used analytical pipeline covering the entire spectrum of transcriptome analysis, including quality checks, alignment of reads, differential gene/transcript expression analysis, discovery of cryptic splicing events, and visualization. Challenges, critical parameters, and possible downstream functional analysis pipelines associated with each step are highlighted and discussed. This unit provides a comprehensive understanding of state-of-the-art RNA-seq analysis pipeline and a greater understanding of the transcriptome. © 2017 by John Wiley & Sons, Inc.
Keywords: RNA-seq; alternative splicing; cryptic splicing; differential gene expression; differential isoform usage.
Copyright © 2017 John Wiley & Sons, Inc.
Figures
Similar articles
-
Alternative Splicing Signatures in RNA-seq Data: Percent Spliced in (PSI).Curr Protoc Hum Genet. 2015 Oct 6;87:11.16.1-11.16.14. doi: 10.1002/0471142905.hg1116s87. Curr Protoc Hum Genet. 2015. PMID: 26439713
-
A high-resolution single-molecule sequencing-based Arabidopsis transcriptome using novel methods of Iso-seq analysis.Genome Biol. 2022 Jul 7;23(1):149. doi: 10.1186/s13059-022-02711-0. Genome Biol. 2022. PMID: 35799267 Free PMC article.
-
Discover hidden splicing variations by mapping personal transcriptomes to personal genomes.Nucleic Acids Res. 2015 Dec 15;43(22):10612-22. doi: 10.1093/nar/gkv1099. Epub 2015 Nov 17. Nucleic Acids Res. 2015. PMID: 26578562 Free PMC article.
-
A survey of best practices for RNA-seq data analysis.Genome Biol. 2016 Jan 26;17:13. doi: 10.1186/s13059-016-0881-8. Genome Biol. 2016. PMID: 26813401 Free PMC article. Review.
-
Overview of available methods for diverse RNA-Seq data analyses.Sci China Life Sci. 2011 Dec;54(12):1121-8. doi: 10.1007/s11427-011-4255-x. Epub 2012 Jan 7. Sci China Life Sci. 2011. PMID: 22227904 Review.
Cited by
-
Targeting Asparagine and Serine Metabolism in Germinal Centre-Derived B Cells Non-Hodgkin Lymphomas (B-NHL).Cells. 2021 Sep 29;10(10):2589. doi: 10.3390/cells10102589. Cells. 2021. PMID: 34685569 Free PMC article.
-
The Ability of a 3-Gene Host Signature in Blood to Distinguish Tuberculous Meningitis From Other Brain Infections.J Infect Dis. 2024 Aug 16;230(2):e268-e278. doi: 10.1093/infdis/jiad606. J Infect Dis. 2024. PMID: 38169323 Free PMC article.
-
Regulation of telomere homeostasis and genomic stability in cancer by N 6-adenosine methylation (m6A).Sci Adv. 2021 Jul 28;7(31):eabg7073. doi: 10.1126/sciadv.abg7073. Print 2021 Jul. Sci Adv. 2021. PMID: 34321211 Free PMC article.
-
Exploration of prognosis and immunometabolism landscapes in ER+ breast cancer based on a novel lipid metabolism-related signature.Front Immunol. 2023 Jul 4;14:1199465. doi: 10.3389/fimmu.2023.1199465. eCollection 2023. Front Immunol. 2023. PMID: 37469520 Free PMC article.
-
UTAP: User-friendly Transcriptome Analysis Pipeline.BMC Bioinformatics. 2019 Mar 25;20(1):154. doi: 10.1186/s12859-019-2728-2. BMC Bioinformatics. 2019. PMID: 30909881 Free PMC article.
References
-
- Bray NL, Pimentel H, Melsted P and Pachter L (2016). “Near-optimal probabilistic RNA-seq quantification.” Nat Biotechnol 34(5): 525–527. - PubMed
-
- Dou T, Xu J, Gao Y, Gu J, Ji C, Xie Y and Zhou Y (2010). “Evolution of peroxisome proliferator-activated receptor gamma alternative splicing.” Front Biosci (Elite Ed) 2: 1334–1343. - PubMed
-
- Drăghici S (2012). Statistics and data analysis for microarrays using R and Bioconductor Boca Raton, FL, CRC Press.
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
