FFPred: an integrated feature-based function prediction server for vertebrate proteomes

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W297-302. doi: 10.1093/nar/gkn193. Epub 2008 May 7.

Abstract

One of the challenges of the post-genomic era is to provide accurate function annotations for large volumes of data resulting from genome sequencing projects. Most function prediction servers utilize methods that transfer existing database annotations between orthologous sequences. In contrast, there are few methods that are independent of homology and can annotate distant and orphan protein sequences. The FFPred server adopts a machine-learning approach to perform function prediction in protein feature space using feature characteristics predicted from amino acid sequence. The features are scanned against a library of support vector machines representing over 300 Gene Ontology (GO) classes, and probabilistic confidence scores are returned for each annotation term. The GO term library has been modelled on human protein annotations; however, benchmark performance testing showed robust performance across higher eukaryotes. FFPred offers important advantages over traditional function prediction servers in its ability to annotate distant homologues and orphan protein sequences, and achieves greater coverage and classification accuracy than other feature-based prediction servers. A user may upload an amino acid sequence and receive annotation predictions via email. Feature information is provided as easy-to-interpret graphics displayed on the sequence of interest, allowing for back-interpretation of the associations between features and function classes.
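
The scanning step described in the abstract can be illustrated with a minimal Python sketch. It is not FFPred's implementation: the three sequence features, the two-term library, every weight, and the sigmoid parameters are hypothetical placeholders, and a linear decision function with a Platt-style sigmoid stands in for the trained support vector machines and their probability calibration.

```python
import math

# Hypothetical stand-in features. FFPred uses feature characteristics
# predicted from sequence; these simple composition features are assumptions.
HYDROPHOBIC = set("AVILMFWC")
CHARGED = set("DEKR")


def extract_features(sequence):
    """Map an amino acid sequence to a small numeric feature vector."""
    seq = sequence.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    return [
        sum(aa in HYDROPHOBIC for aa in seq) / n,  # hydrophobic fraction
        sum(aa in CHARGED for aa in seq) / n,      # charged fraction
        math.log10(n),                             # log sequence length
    ]


# Hypothetical library: one classifier per GO term, stored as
# (weights, bias, sigmoid_a, sigmoid_b). All numbers are made up.
GO_LIBRARY = {
    "GO:0005215 transporter activity": ([4.0, -2.0, 0.5], -2.5, -1.5, 0.0),
    "GO:0003677 DNA binding": ([-1.0, 5.0, 0.2], -1.8, -1.5, 0.0),
}


def decision_value(weights, bias, features):
    """Linear decision function, standing in for a trained SVM."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def platt_probability(decision, a, b):
    """Platt-style sigmoid mapping a decision value to a probability."""
    return 1.0 / (1.0 + math.exp(a * decision + b))


def predict_go_terms(sequence, library=GO_LIBRARY):
    """Score one sequence against every classifier, highest score first."""
    features = extract_features(sequence)
    scores = {
        term: platt_probability(decision_value(w, bias, features), a, b)
        for term, (w, bias, a, b) in library.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for term, probability in predict_go_terms("MKTLLVAGGVALLAKRRE"):
        print(f"{term}\t{probability:.3f}")
```

Each GO class gets its own independent binary classifier, so a sequence can receive several annotation terms at once, each with its own confidence score.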

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Humans
  • Internet
  • Mice
  • Proteome / chemistry
  • Proteome / classification
  • Proteome / physiology*
  • Sequence Analysis, Protein
  • Software*
  • Systems Integration
  • User-Computer Interface
  • Vertebrates

Substances

  • Proteome