Parapred: antibody paratope prediction using convolutional and recurrent neural networks

Bioinformatics. 2018 Sep 1;34(17):2944-2950. doi: 10.1093/bioinformatics/bty305.

Abstract

Motivation: Antibodies play essential roles in the immune system of vertebrates and are powerful tools in research and diagnostics. While hypervariable regions of antibodies, which are responsible for binding, can be readily identified from their amino acid sequence, it remains challenging to accurately pinpoint which amino acids will be in contact with the antigen (the paratope).

Results: In this work, we present a sequence-based probabilistic machine learning algorithm for paratope prediction, named Parapred. Parapred uses a deep-learning architecture to leverage features from both local residue neighbourhoods and across the entire sequence. The method significantly improves on the current state-of-the-art methodology, and only requires a stretch of amino acid sequence corresponding to a hypervariable region as an input, without any information about the antigen. We further show that our predictions can be used to improve both speed and accuracy of a rigid docking algorithm.

Availability and implementation: The Parapred method is freely available as a webserver at http://www-mvsoftware.ch.cam.ac.uk/and for download at https://github.com/eliberis/parapred.

Supplementary information: Supplementary information is available at Bioinformatics online.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Antibodies / chemistry*
  • Antibodies / immunology
  • Binding Sites, Antibody
  • Deep Learning
  • Machine Learning
  • Models, Molecular
  • Neural Networks, Computer

Substances

  • Antibodies