PhosSNP for systematic analysis of genetic polymorphisms that influence protein phosphorylation

Mol Cell Proteomics. 2010 Apr;9(4):623-34. doi: 10.1074/mcp.M900273-MCP200. Epub 2009 Dec 8.


We are entering the era of personalized genomics as breakthroughs in sequencing technology have made it possible to sequence or genotype an individual person in an efficient and accurate manner. Preliminary results from HapMap and other similar projects have revealed the existence of tremendous genetic variations among world populations and among individuals. It is important to delineate the functional implication of such variations, i.e. whether they affect the stability and biochemical properties of proteins. It is also generally believed that the genetic variation is the main cause for different susceptibility to certain diseases or different response to therapeutic treatments. Understanding genetic variation in the context of human diseases thus holds the promise for "personalized medicine." In this work, we carried out a genome-wide analysis of single nucleotide polymorphisms (SNPs) that could potentially influence protein phosphorylation characteristics in human. Here, we defined a phosphorylation-related SNP (phosSNP) as a non-synonymous SNP (nsSNP) that affects the protein phosphorylation status. Using an in-house developed kinase-specific phosphorylation site predictor (GPS 2.0), we computationally detected that approximately 70% of the reported nsSNPs are potential phosSNPs. More interestingly, approximately 74.6% of these potential phosSNPs might also induce changes in protein kinase types in adjacent phosphorylation sites rather than creating or removing phosphorylation sites directly. Taken together, we proposed that a large proportion of the nsSNPs might affect protein phosphorylation characteristics and play important roles in rewiring biological pathways. Finally, all phosSNPs were integrated into the PhosSNP 1.0 database, which was implemented in JAVA 1.5 (J2SE 5.0). The PhosSNP 1.0 database is freely available for academic researchers.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Binding Sites / genetics
  • Computational Biology / methods*
  • Databases, Genetic*
  • Genome-Wide Association Study / methods
  • Humans
  • Models, Biological
  • Phosphorylation / genetics*
  • Polymorphism, Single Nucleotide* / physiology
  • Protein Kinases / metabolism
  • Protein Processing, Post-Translational / genetics*
  • Proteins / genetics
  • Proteins / metabolism
  • Sequence Analysis, DNA
  • Sequence Analysis, Protein


  • Proteins
  • Protein Kinases