Assessing identity, redundancy and confounds in Gene Ontology annotations over time
- PMID: 23297035
- PMCID: PMC3570208
- DOI: 10.1093/bioinformatics/bts727
Assessing identity, redundancy and confounds in Gene Ontology annotations over time
Abstract
Motivation: The Gene Ontology (GO) is heavily used in systems biology, but the potential for redundancy, confounds with other data sources and problems with stability over time have been little explored.
Results: We report that GO annotations are stable over short periods, with 3% of genes not being most semantically similar to themselves between monthly GO editions. However, we find that genes can alter their 'functional identity' over time, with 20% of genes not matching to themselves (by semantic similarity) after 2 years. We further find that annotation bias in GO, in which some genes are more characterized than others, has declined in yeast, but generally increased in humans. Finally, we discovered that many entries in protein interaction databases are owing to the same published reports that are used for GO annotations, with 66% of assessed GO groups exhibiting this confound. We provide a case study to illustrate how this information can be used in analyses of gene sets and networks.
Availability: Data available at http://chibi.ubc.ca/assessGO.
Figures
Similar articles
-
GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products.BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S40. doi: 10.1186/1471-2105-12-S1-S40. BMC Bioinformatics. 2011. PMID: 21342572 Free PMC article.
-
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2. BMC Bioinformatics. 2019. PMID: 30917779 Free PMC article.
-
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7. BMC Bioinformatics. 2015. PMID: 25886899 Free PMC article.
-
Gene Ontology annotation of the rice blast fungus, Magnaporthe oryzae.BMC Microbiol. 2009 Feb 19;9 Suppl 1(Suppl 1):S8. doi: 10.1186/1471-2180-9-S1-S8. BMC Microbiol. 2009. PMID: 19278556 Free PMC article. Review.
-
Access to immunology through the Gene Ontology.Immunology. 2008 Oct;125(2):154-60. doi: 10.1111/j.1365-2567.2008.02940.x. Immunology. 2008. PMID: 18798919 Free PMC article. Review.
Cited by
-
Genetic variants in Alzheimer disease - molecular and brain network approaches.Nat Rev Neurol. 2016 Jul;12(7):413-27. doi: 10.1038/nrneurol.2016.84. Epub 2016 Jun 10. Nat Rev Neurol. 2016. PMID: 27282653 Free PMC article. Review.
-
Proceedings of the 12th Annual UT-ORNL-KBRIN Bioinformatics Summit 2013.BMC Bioinformatics. 2013 Mar 22;14 Suppl 17(Suppl 17):A1. doi: 10.1186/1471-2105-14-s17-a1. BMC Bioinformatics. 2013. PMID: 24625056 Free PMC article. No abstract available.
-
Gene annotation bias impedes biomedical research.Sci Rep. 2018 Jan 22;8(1):1362. doi: 10.1038/s41598-018-19333-x. Sci Rep. 2018. PMID: 29358745 Free PMC article.
-
Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.Cell Syst. 2017 Jul 26;5(1):63-71.e6. doi: 10.1016/j.cels.2017.06.003. Epub 2017 Jul 12. Cell Syst. 2017. PMID: 28711280 Free PMC article.
-
Meta-Research: Understudied genes are lost in a leaky pipeline between genome-wide assays and reporting of results.Elife. 2024 Mar 28;12:RP93429. doi: 10.7554/eLife.93429. Elife. 2024. PMID: 38546716 Free PMC article.
References
-
- Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B. 1995;57:12.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases
