A machine learning-based chemoproteomic approach to identify drug targets and binding sites in complex proteomes

Nat Commun. 2020 Aug 21;11(1):4200. doi: 10.1038/s41467-020-18071-x.

Abstract

Chemoproteomics is a key technology to characterize the mode of action of drugs, as it directly identifies the protein targets of bioactive compounds and aids in the development of optimized small-molecule compounds. Current approaches cannot identify the protein targets of a compound and also detect the interaction surfaces between ligands and protein targets without prior labeling or modification. To address this limitation, we here develop LiP-Quant, a drug target deconvolution pipeline based on limited proteolysis coupled with mass spectrometry that works across species, including in human cells. We use machine learning to discern features indicative of drug binding and integrate them into a single score to identify protein targets of small molecules and approximate their binding sites. We demonstrate drug target identification across compound classes, including drugs targeting kinases, phosphatases and membrane proteins. LiP-Quant estimates the half maximal effective concentration of compound binding sites in whole cell lysates, correctly discriminating drug binding to homologous proteins and identifying the so far unknown targets of a fungicide research compound.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Botrytis
  • Cell Survival
  • Computational Biology / methods
  • Drug Delivery Systems / methods*
  • Drug Discovery / methods
  • HeLa Cells
  • Humans
  • Ligands
  • Machine Learning*
  • Mass Spectrometry
  • Phosphotransferases / metabolism
  • Protein Binding
  • Proteolysis
  • Proteome*
  • Proteomics / methods*
  • Saccharomyces cerevisiae

Substances

  • Ligands
  • Proteome
  • Phosphotransferases