GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features

PLoS One. 2018 Oct 12;13(10):e0200283. doi: 10.1371/journal.pone.0200283. eCollection 2018.

Abstract

Lysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of proteins. The complicated sequence patterns of protein succinylation revealed by proteomic studies highlight the necessity of developing effective species-specific in silico strategies for global prediction succinylation sites. Here we have developed the generic and nine species-specific succinylation site classifiers through aggregating multiple complementary features. We optimized the consecutive features using the Wilcoxon-rank feature selection scheme. The final feature vectors were trained by a random forest (RF) classifier. With an integration of RF scores via logistic regression, the resulting predictor termed GPSuc achieved better performance than other existing generic and species-specific succinylation site predictors. To reveal the mechanism of succinylation and assist hypothesis-driven experimental design, our predictor serves as a valuable resource. To provide a promising performance in large-scale datasets, a web application was developed at http://kurata14.bio.kyutech.ac.jp/GPSuc/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Area Under Curve
  • Computational Biology / methods*
  • Humans
  • Logistic Models
  • Lysine / chemistry
  • Protein Processing, Post-Translational
  • Proteomics / methods*
  • ROC Curve
  • Regression Analysis
  • Species Specificity
  • Succinic Acid / chemistry

Substances

  • Succinic Acid
  • Lysine

Grants and funding

This work was supported by the Grant-in-Aid for Challenging Exploratory Research with JSPS KAKENHI Grant Number 17K20009 (HK).