OrthologID: automation of genome-scale ortholog identification within a parsimony framework
- PMID: 16410324
- DOI: 10.1093/bioinformatics/btk040
OrthologID: automation of genome-scale ortholog identification within a parsimony framework
Abstract
Motivation: The determination of gene orthology is a prerequisite for mining and utilizing the rapidly increasing amount of sequence data for genome-scale phylogenetics and comparative genomic studies. Until now, most researchers use pairwise distance comparisons algorithms, such as BLAST, COG, RBH, RSD and INPARANOID, to determine gene orthology. In contrast, orthology determination within a character-based phylogenetic framework has not been utilized on a genomic scale owing to the lack of efficiency and automation.
Results: We have developed OrthologID, a Web application that automates the labor-intensive procedures of gene orthology determination within a character-based phylogenetic framework, thus making character-based orthology determination on a genomic scale possible. In addition to generating gene family trees and determining orthologous gene sets for complete genomes, OrthologID can also identify diagnostic characters that define each orthologous gene set, as well as diagnostic characters that are responsible for classifying query sequences from other genomes into specific orthology groups. The OrthologID database currently includes several complete plant genomes, including Arabidopsis thaliana, Oryza sativa, Populus trichocarpa, as well as a unicellular outgroup, Chlamydomonas reinhardtii. To improve the general utility of OrthologID beyond plant species, we plan to expand our sequence database to include the fully sequenced genomes of prokaryotes and other non-plant eukaryotes.
Availability: http://nypg.bio.nyu.edu/orthologid/
Similar articles
-
Gene orthology assessment with OrthologID.Methods Mol Biol. 2009;537:23-38. doi: 10.1007/978-1-59745-251-9_2. Methods Mol Biol. 2009. PMID: 19378138
-
PhyloPat: phylogenetic pattern analysis of eukaryotic genes.BMC Bioinformatics. 2006 Sep 1;7:398. doi: 10.1186/1471-2105-7-398. BMC Bioinformatics. 2006. PMID: 16948844 Free PMC article.
-
Automatic clustering of orthologs and inparalogs shared by multiple proteomes.Bioinformatics. 2006 Jul 15;22(14):e9-15. doi: 10.1093/bioinformatics/btl213. Bioinformatics. 2006. PMID: 16873526
-
Homology assessment and molecular sequence alignment.J Biomed Inform. 2006 Feb;39(1):18-33. doi: 10.1016/j.jbi.2005.11.005. Epub 2005 Dec 9. J Biomed Inform. 2006. PMID: 16380300 Review.
-
Advances in the Exon-Intron Database (EID).Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review.
Cited by
-
Major Revisions in Pancrustacean Phylogeny and Evidence of Sensitivity to Taxon Sampling.Mol Biol Evol. 2023 Aug 3;40(8):msad175. doi: 10.1093/molbev/msad175. Mol Biol Evol. 2023. PMID: 37552897 Review.
-
Genome-Scale Characterization of Predicted Plastid-Targeted Proteomes in Higher Plants.Sci Rep. 2020 May 19;10(1):8281. doi: 10.1038/s41598-020-64670-5. Sci Rep. 2020. PMID: 32427841 Free PMC article.
-
Surveying alignment-free features for Ortholog detection in related yeast proteomes by using supervised big data classifiers.BMC Bioinformatics. 2018 May 3;19(1):166. doi: 10.1186/s12859-018-2148-8. BMC Bioinformatics. 2018. PMID: 29724166 Free PMC article.
-
Structural complexity and functional diversity of plant NADPH oxidases.Amino Acids. 2018 Jan;50(1):79-94. doi: 10.1007/s00726-017-2491-5. Epub 2017 Oct 25. Amino Acids. 2018. PMID: 29071531 Free PMC article.
-
OrthoReD: a rapid and accurate orthology prediction tool with low computational requirement.BMC Bioinformatics. 2017 Jun 21;18(1):310. doi: 10.1186/s12859-017-1726-5. BMC Bioinformatics. 2017. PMID: 28633662 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials
