PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
- PMID: 31029062
- PMCID: PMC6486473
- DOI: 10.1093/gigascience/giz046
PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
Abstract
Background: Long thought "relics" of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associations can be identified through the integration of biomedical data, such as sequence homology, functional pathways, gene expression, pseudogene expression, and microRNA expression. However, not all of the information has been integrated, and almost all previous pseudogene studies relied on 1:1 pseudogene-parent gene relationships without leveraging other homologous genes/pseudogenes.
Results: We produce PGG families that expand beyond the current 1:1 paradigm. First, we construct expansive PGG databases by (i) CUDAlign graphics processing unit (GPU) accelerated local alignment of all pseudogenes to gene families (totaling 1.6 billion individual local alignments and >40,000 GPU hours) and (ii) BLAST-based assignment of pseudogenes to gene families. Second, we create an open-source web application (PseudoFuN [Pseudogene Functional Networks]) to search for integrative functional relationships of sequence homology, microRNA expression, gene expression, pseudogene expression, and gene ontology. We produce four "flavors" of CUDAlign-based databases (>462,000,000 PGG pairwise alignments and 133,770 PGG families) that can be queried and downloaded using PseudoFuN. These databases are consistent with previous 1:1 PGG annotation and also are much more powerful including millions of de novo PGG associations. For example, we find multiple known (e.g., miR-20a-PTEN-PTENP1) and novel (e.g., miR-375-SOX15-PPP4R1L) microRNA-gene-pseudogene associations in prostate cancer. PseudoFuN provides a "one stop shop" for identifying and visualizing thousands of potential regulatory relationships related to pseudogenes in The Cancer Genome Atlas cancers.
Conclusions: Thousands of new PGG associations can be explored in the context of microRNA-gene-pseudogene co-expression and differential expression with a simple-to-use online tool by bioinformaticians and oncologists alike.
Keywords: competing endogenous RNA; database; functional prediction; gene regulation; graphics processing unit; high-performance computing; network analysis; pseudogenes.
© The Author(s) 2019. Published by Oxford University Press.
Figures
Similar articles
-
Pseudogene-gene functional networks are prognostic of patient survival in breast cancer.BMC Med Genomics. 2020 Apr 3;13(Suppl 5):51. doi: 10.1186/s12920-020-0687-0. BMC Med Genomics. 2020. PMID: 32241256 Free PMC article.
-
Network analysis of pseudogene-gene relationships: from pseudogene evolution to their functional potentials.Pac Symp Biocomput. 2018;23:536-547. Pac Symp Biocomput. 2018. PMID: 29218912 Free PMC article.
-
GENCODE pseudogenes.Methods Mol Biol. 2014;1167:129-55. doi: 10.1007/978-1-4939-0835-6_10. Methods Mol Biol. 2014. PMID: 24823776
-
Pseudogene-expressed RNAs: a new frontier in cancers.Tumour Biol. 2016 Feb;37(2):1471-8. doi: 10.1007/s13277-015-4482-z. Epub 2015 Dec 10. Tumour Biol. 2016. PMID: 26662308 Review.
-
Computational Methods for Pseudogene Annotation Based on Sequence Homology.Methods Mol Biol. 2021;2324:35-48. doi: 10.1007/978-1-0716-1503-4_3. Methods Mol Biol. 2021. PMID: 34165707 Review.
Cited by
-
Bioinformatics identification and validation of maternal blood biomarkers and immune cell infiltration in preeclampsia: An observational study.Medicine (Baltimore). 2024 May 24;103(21):e38260. doi: 10.1097/MD.0000000000038260. Medicine (Baltimore). 2024. PMID: 38788026 Free PMC article.
-
Pseudo2GO: A Graph-Based Deep Learning Method for Pseudogene Function Prediction by Borrowing Information From Coding Genes.Front Genet. 2020 Aug 18;11:807. doi: 10.3389/fgene.2020.00807. eCollection 2020. Front Genet. 2020. PMID: 33014009 Free PMC article.
-
ceRNAs in Cancer: Mechanism and Functions in a Comprehensive Regulatory Network.J Oncol. 2021 Oct 7;2021:4279039. doi: 10.1155/2021/4279039. eCollection 2021. J Oncol. 2021. PMID: 34659409 Free PMC article. Review.
-
Pseudogene-gene functional networks are prognostic of patient survival in breast cancer.BMC Med Genomics. 2020 Apr 3;13(Suppl 5):51. doi: 10.1186/s12920-020-0687-0. BMC Med Genomics. 2020. PMID: 32241256 Free PMC article.
-
Acidic leucine-rich nuclear phosphoprotein-32A expression contributes to adverse outcome in acute myeloid leukemia.Ann Transl Med. 2020 Mar;8(6):345. doi: 10.21037/atm.2020.02.54. Ann Transl Med. 2020. PMID: 32355789 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
