Predicting protein post-translational modifications using meta-analysis of proteome scale data sets

Mol Cell Proteomics. 2009 Feb;8(2):365-79. doi: 10.1074/mcp.M800332-MCP200. Epub 2008 Oct 28.


Protein post-translational modifications are an important biological regulatory mechanism, and the rate of their discovery using high throughput techniques is rapidly increasingly. To make use of this wealth of sequence data, we introduce a new general strategy designed to predict a variety of post-translational modifications in several organisms. We used the motif-x program to determine phosphorylation motifs in yeast, fly, mouse, and man and lysine acetylation motifs in man. These motifs were then scanned against proteomic sequence data using a newly developed tool called scan-x to globally predict other potential modification sites within these organisms. 10-fold cross-validation was used to determine the sensitivity and minimum specificity for each set of predictions, all of which showed improvement over other available tools for phosphoprediction. New motif discovery is a byproduct of this approach, and the phosphorylation motif analyses provide strong evidence of evolutionary conservation of both known and novel kinase motifs.

Publication types

  • Meta-Analysis
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Acetylation
  • Algorithms
  • Amino Acid Motifs
  • Amino Acid Sequence
  • Animals
  • Databases, Protein*
  • Drosophila
  • Humans
  • Mice
  • Molecular Sequence Data
  • Phosphorylation
  • Protein Processing, Post-Translational*
  • Proteome / chemistry
  • Proteome / metabolism*
  • ROC Curve
  • Reproducibility of Results
  • Saccharomyces cerevisiae


  • Proteome