Characterization of the binding profile of peptide to transporter associated with antigen processing (TAP) using Gaussian process regression

Comput Biol Med. 2011 Sep;41(9):865-70. doi: 10.1016/j.compbiomed.2011.07.004. Epub 2011 Aug 3.


Although MHC-peptide binding is the most selective event in epitope presentation process, the protein fragments generated by proteasomal cleavage require to be recognized by transporter associated with antigen processing (TAP) and translocated from cytosol to endoplasmic reticulum before they can be loaded into the ligand-binding groove of MHC. In this article, we report the use of a new and powerful machine learning tool called Gaussian process (GP) to model the linear and nonlinear relationships between the sequence pattern and binding affinity of peptide to TAP, and to explain the physicochemical properties and structural implications underlying the specific recognition and association of peptide with TAP. The resulting statistics are compared systematically with those obtained by sophisticated PLS, ANN and SVM. Results show that: (i) Nonlinear methods such as the ANN and GP perform much better than the linear PLS. (ii) GP is capable of handling both linearity- and nonlinearity-hybrid relationship and thus exhibits a good performance relative to other two nonlinear methods. (iii) Investigation of the GP model shows that the P1, P2, P3 and P9 of peptide are the most important positions that dominate TAP-peptide recognition, P5 contributes slightly to the peptide binding, whereas P4, P6, P7 and P8 can only exert very limited potency on the binding. (iv) Diverse properties cast remarkable effects on the interaction between TAP and peptide. In particular, hydrophobility, electronic property and hydrogen bond contribute most significantly to the binding affinity of TAP-peptide association.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • ATP-Binding Cassette Transporters / chemistry*
  • ATP-Binding Cassette Transporters / metabolism*
  • Amino Acid Sequence
  • Databases, Protein
  • Humans
  • Hydrogen Bonding
  • Hydrophobic and Hydrophilic Interactions
  • Models, Molecular
  • Models, Statistical
  • Neural Networks, Computer
  • Normal Distribution
  • Peptides / chemistry*
  • Peptides / metabolism*
  • Protein Binding
  • Protein Transport
  • Regression Analysis
  • Reproducibility of Results
  • Support Vector Machine


  • ATP-Binding Cassette Transporters
  • Peptides
  • transporter associated with antigen processing (TAP)