Detection of gene orthology from gene co-expression and protein interaction networks

BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S7. doi: 10.1186/1471-2105-11-S3-S7.

Abstract

Background: Ortholog detection methods present a powerful approach for finding genes that participate in similar biological processes across different organisms, extending our understanding of interactions between genes across different pathways, and understanding the evolution of gene families.

Results: We exploit features derived from the alignment of protein-protein interaction networks and gene-coexpression networks to reconstruct KEGG orthologs for Drosophila melanogaster, Saccharomyces cerevisiae, Mus musculus and Homo sapiens protein-protein interaction networks extracted from the DIP repository and Mus musculus and Homo sapiens and Sus scrofa gene coexpression networks extracted from NCBI's Gene Expression Omnibus using the decision tree, Naive-Bayes and Support Vector Machine classification algorithms.

Conclusions: The performance of our classifiers in reconstructing KEGG orthologs is compared against a basic reciprocal BLAST hit approach. We provide implementations of the resulting algorithms as part of BiNA, an open source biomolecular network alignment toolkit.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Artificial Intelligence
  • Bayes Theorem
  • Databases, Genetic*
  • Decision Trees
  • Drosophila melanogaster
  • Gene Expression*
  • Genomics / methods*
  • Humans
  • Mice
  • Protein Interaction Mapping / methods*
  • Proteins / genetics
  • ROC Curve
  • Saccharomyces cerevisiae
  • Sequence Alignment
  • Swine

Substances

  • Proteins