Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene
- PMID: 23815980
- PMCID: PMC4053754
- DOI: 10.1186/gb-2013-14-7-r70
Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene
Abstract
Background: RNA sequencing has opened new avenues for the study of transcriptome composition. Significant evidence has accumulated showing that the human transcriptome contains in excess of a hundred thousand different transcripts. However, it is still not clear to what extent this diversity prevails when considering the relative abundances of different transcripts from the same gene.
Results: Here we show that, in a given condition, most protein coding genes have one major transcript expressed at significantly higher level than others, that in human tissues the major transcripts contribute almost 85 percent to the total mRNA from protein coding loci, and that often the same major transcript is expressed in many tissues. We detect a high degree of overlap between the set of major transcripts and a recently published set of alternatively spliced transcripts that are predicted to be translated utilizing proteomic data. Thus, we hypothesize that although some minor transcripts may play a functional role, the major ones are likely to be the main contributors to the proteome. However, we still detect a non-negligible fraction of protein coding genes for which the major transcript does not code a protein.
Conclusions: Overall, our findings suggest that the transcriptome from protein coding loci is dominated by one transcript per gene and that not all the transcripts that contribute to transcriptome diversity are equally likely to contribute to protein diversity. This observation can help to prioritize candidate targets in proteomics research and to predict the functional impact of the detected changes in variation studies.
Figures
Similar articles
-
Characterization of the transcriptome of Haloferax volcanii, grown under four different conditions, with mixed RNA-Seq.PLoS One. 2019 Apr 30;14(4):e0215986. doi: 10.1371/journal.pone.0215986. eCollection 2019. PLoS One. 2019. PMID: 31039177 Free PMC article.
-
PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome.BMC Genomics. 2020 Jul 13;21(1):481. doi: 10.1186/s12864-020-06894-3. BMC Genomics. 2020. PMID: 32660426 Free PMC article.
-
Top-ranked expressed gene transcripts of human protein-coding genes investigated with GTEx dataset.Sci Rep. 2020 Oct 1;10(1):16245. doi: 10.1038/s41598-020-73081-5. Sci Rep. 2020. PMID: 33004865 Free PMC article.
-
Coding, or non-coding, that is the question.Cell Res. 2024 Sep;34(9):609-629. doi: 10.1038/s41422-024-00975-8. Epub 2024 Jul 25. Cell Res. 2024. PMID: 39054345 Free PMC article. Review.
-
Differentiating protein-coding and noncoding RNA: challenges and ambiguities.PLoS Comput Biol. 2008 Nov;4(11):e1000176. doi: 10.1371/journal.pcbi.1000176. Epub 2008 Nov 28. PLoS Comput Biol. 2008. PMID: 19043537 Free PMC article. Review.
Cited by
-
RNA-Seq Analysis Reveals Localization-Associated Alternative Splicing across 13 Cell Lines.Genes (Basel). 2020 Jul 18;11(7):820. doi: 10.3390/genes11070820. Genes (Basel). 2020. PMID: 32708427 Free PMC article.
-
APPRIS principal isoforms and MANE Select transcripts define reference splice variants.Bioinformatics. 2022 Sep 16;38(Suppl_2):ii89-ii94. doi: 10.1093/bioinformatics/btac473. Bioinformatics. 2022. PMID: 36124785 Free PMC article.
-
Direct Nanopore Sequencing of mRNA Reveals Landscape of Transcript Isoforms in Apicomplexan Parasites.mSystems. 2021 Mar 9;6(2):e01081-20. doi: 10.1128/mSystems.01081-20. mSystems. 2021. PMID: 33688018 Free PMC article.
-
Alternative Splicing May Not Be the Key to Proteome Complexity.Trends Biochem Sci. 2017 Feb;42(2):98-110. doi: 10.1016/j.tibs.2016.08.008. Epub 2016 Oct 3. Trends Biochem Sci. 2017. PMID: 27712956 Free PMC article. Review.
-
Illuminating the Transcriptome through the Genome.Genes (Basel). 2014 Mar 14;5(1):235-53. doi: 10.3390/genes5010235. Genes (Basel). 2014. PMID: 24705295 Free PMC article.
References
-
- Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, Fitzgerald S, Gil L, Gordon L, Hendrix M, Hourlier T, Johnson N, Kahari AK, Keefe D, Keenan S, Kinsella R, Komorowska M, Koscielny G, Kulesha E, Larsson P, Longden I, McLaren W, Muffato M, Overduin B, Pignatelli M, Pritchard B, Riat HS. et al.Ensembl 2012. Nucleic Acids Res. 2011;14:D84–D90. - PMC - PubMed
-
- Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, Baren MJ van, Salzberg SL, Wold BJ, Pachter L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;14:511–515. doi: 10.1038/nbt.1621. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
