TEPITOPEpan: extending TEPITOPE for peptide binding prediction covering over 700 HLA-DR molecules

PLoS One. 2012;7(2):e30483. doi: 10.1371/journal.pone.0030483. Epub 2012 Feb 23.

Abstract

Motivation: Accurate identification of peptides binding to specific Major Histocompatibility Complex Class II (MHC-II) molecules is of great importance for elucidating the underlying mechanism of immune recognition, as well as for developing effective epitope-based vaccines and promising immunotherapies for many severe diseases. Due to extreme polymorphism of MHC-II alleles and the high cost of biochemical experiments, the development of computational methods for accurate prediction of binding peptides of MHC-II molecules, particularly for the ones with few or no experimental data, has become a topic of increasing interest. TEPITOPE is a well-used computational approach because of its good interpretability and relatively high performance. However, TEPITOPE can be applied to only 51 out of over 700 known HLA DR molecules.

Method: We have developed a new method, called TEPITOPEpan, by extrapolating from the binding specificities of HLA DR molecules characterized by TEPITOPE to those uncharacterized. First, each HLA-DR binding pocket is represented by amino acid residues that have close contact with the corresponding peptide binding core residues. Then the pocket similarity between two HLA-DR molecules is calculated as the sequence similarity of the residues. Finally, for an uncharacterized HLA-DR molecule, the binding specificity of each pocket is computed as a weighted average in pocket binding specificities over HLA-DR molecules characterized by TEPITOPE.

Result: The performance of TEPITOPEpan has been extensively evaluated using various data sets from different viewpoints: predicting MHC binding peptides, identifying HLA ligands and T-cell epitopes and recognizing binding cores. Among the four state-of-the-art competing pan-specific methods, for predicting binding specificities of unknown HLA-DR molecules, TEPITOPEpan was roughly the second best method next to NETMHCIIpan-2.0. Additionally, TEPITOPEpan achieved the best performance in recognizing binding cores. We further analyzed the motifs detected by TEPITOPEpan, examining the corresponding literature of immunology. Its online server and PSSMs therein are available at http://www.biokdd.fudan.edu.cn/Service/TEPITOPEpan/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alleles
  • Area Under Curve
  • Computational Biology / methods
  • Computer Simulation
  • Crystallography, X-Ray / methods
  • Epitopes / chemistry*
  • Gene Expression Regulation
  • HLA-DR Antigens / genetics*
  • HLA-DR Antigens / immunology*
  • Histocompatibility Antigens Class II / genetics*
  • Humans
  • Ligands
  • Models, Statistical
  • Peptide Library
  • Peptides / chemistry*
  • Polymorphism, Genetic
  • Protein Binding
  • Protein Conformation
  • Reproducibility of Results
  • T-Lymphocytes / cytology

Substances

  • Epitopes
  • HLA-DR Antigens
  • Histocompatibility Antigens Class II
  • Ligands
  • Peptide Library
  • Peptides