Protein function prediction with the shortest path in functional linkage graph and boosting

Int J Bioinform Res Appl. 2008;4(4):375-84. doi: 10.1504/IJBRA.2008.021175.

Abstract

Annotating proteins with biological functions is one of the main goals in post genomic era. Various high-throughout technologies, e.g., yeast two-hybrid systems and microarray, have provided an alternative way to protein function prediction. Despite the success obtained by high-throughout data, the errors in the data have not been handled well. In this work, a new technique for protein function prediction is presented, where a weighted functional linkage graph is generated by exploiting the existing protein-protein interaction data, complex data and gene expression data. By finding the shortest path in the functional linkage graph, the functional links among proteins can be captured. With the functional links available, the functions of unknown proteins can be predicted utilising support vector machines and the functions of those proteins that have functional links to the unknown proteins. In addition, the boosting algorithm is employed to further improve the prediction accuracy. The experiments on yeast genes show promising results and prove the efficiency of the proposed methods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Computational Biology*
  • Databases, Protein
  • Gene Expression Profiling / statistics & numerical data
  • Genes, Fungal
  • Multiprotein Complexes
  • Protein Interaction Mapping
  • Proteins / genetics*
  • Proteins / physiology*
  • Proteomics / statistics & numerical data
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / physiology
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / physiology

Substances

  • Multiprotein Complexes
  • Proteins
  • Saccharomyces cerevisiae Proteins