SysPTM: a systematic resource for proteomic research on post-translational modifications

Mol Cell Proteomics. 2009 Aug;8(8):1839-49. doi: 10.1074/mcp.M900030-MCP200. Epub 2009 Apr 14.

Abstract

With the rapid expansion of protein post-translational modification (PTM) research based on large-scale proteomic work, there is an increasing demand for a suitable repository to analyze PTM data. Here we present a curated, web-accessible PTM database, SysPTM. SysPTM provides a systematic and sophisticated platform for proteomic PTM research, equipped not only with a knowledge base of manually curated multi-type modification data but also with four fully developed, in-depth data mining tools. Currently, SysPTM contains data detailing 117,349 experimentally determined PTM sites on 33,421 proteins involving nearly 50 PTM types, curated from public resources including five databases, four web servers, and more than one hundred peer-reviewed mass spectrometry papers. Protein annotations including Pfam domains, KEGG pathways, GO functional classification, and ortholog groups are integrated into the database. Four online tools have been developed and incorporated: PTMBlast, to compare a user's PTM dataset with PTM data in SysPTM; PTMPathway, to map PTM proteins to KEGG pathways; PTMPhylog, to discover potentially conserved PTM sites; and PTMCluster, to find clusters of multi-site modifications. The workflow of SysPTM was demonstrated by analyzing an in-house phosphorylation dataset identified by MS/MS. The analysis shows that, in SysPTM, the roles of single-type and multi-type modifications can be systematically investigated in a full biological context. SysPTM could be an important contribution to modificomics research. SysPTM is freely available online at www.sysbio.ac.cn/SysPTM.
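To make the idea of finding clusters of multi-site modifications concrete, the following is a minimal sketch of one simple way to group nearby modification sites on a single protein. It is not the PTMCluster algorithm, which the abstract does not describe. The function name `cluster_sites` and the parameters `max_gap` and `min_sites` are hypothetical, chosen only for illustration.

```python
from typing import List


def cluster_sites(
    positions: List[int], max_gap: int = 10, min_sites: int = 2
) -> List[List[int]]:
    """Group modification sites into clusters along one protein sequence.

    Assumption (not from the paper): two sites belong to the same cluster
    when they are at most `max_gap` residues apart, and a cluster is
    reported only if it holds at least `min_sites` sites.
    """
    clusters: List[List[int]] = []
    current: List[int] = []
    for pos in sorted(set(positions)):
        # A gap larger than max_gap closes the cluster being built.
        if current and pos - current[-1] > max_gap:
            if len(current) >= min_sites:
                clusters.append(current)
            current = []
        current.append(pos)
    # Flush the last cluster after the loop ends.
    if len(current) >= min_sites:
        clusters.append(current)
    return clusters


# Hypothetical residue positions of modified sites on one protein.
example_sites = [5, 12, 15, 80, 200, 204]
example_clusters = cluster_sites(example_sites)
```

With the hypothetical positions above, the sites at 5, 12, and 15 form one cluster and those at 200 and 204 form another, while the isolated site at 80 is dropped. A real analysis would also need to account for PTM type and the choice of window size, neither of which is specified here.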

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Computational Biology / methods
  • Database Management Systems / statistics & numerical data
  • Databases, Protein / statistics & numerical data*
  • Humans
  • Internet
  • Molecular Sequence Data
  • Phosphorylation
  • Protein Processing, Post-Translational*
  • Proteins / analysis*
  • Proteins / genetics
  • Proteins / metabolism
  • Proteomics / statistics & numerical data*
  • Research Design
  • Sequence Homology, Amino Acid
  • Signal Transduction
  • User-Computer Interface

Substances

  • Proteins