Machine Learning Classification Models to Improve the Docking-based Screening: A Case of PI3K-Tankyrase Inhibitors

Mol Inform. 2018 Nov;37(11):e1800030. doi: 10.1002/minf.201800030. Epub 2018 Jun 14.

Abstract

One of the major challenges in the current drug discovery is the improvement of the docking-based virtual screening performance. It is especially important in the rational design of compounds with desired polypharmacology or selectivity profiles. To address this problem, we present a methodology for the development of target-specific scoring functions possessing high screening power. These scoring functions were built using the machine learning methods for the dual target inhibitors of PI3Kα and tankyrase, promising targets for colorectal cancer therapy. The Deep Neural Network models achieve the external test AUC ROC values of 0.96 and 0.93 for the random split and 0.90 and 0.84 for the time-based split of the PI3Kα and tankyrase inhibitors, respectively. In addition, the impact of the training set size and the actives/decoys ratio on the model quality was assessed. The study demonstrates that the optimized scoring functions could significantly improve the docking screening power for each individual target. This is very useful in the design of multitarget or selective drugs wherein the screening filters are applied in sequence.

Keywords: Docking; drug discovery; machine learning; virtual screening.

MeSH terms

  • Binding Sites
  • Databases, Chemical
  • Drug Discovery / methods*
  • Humans
  • Machine Learning
  • Molecular Docking Simulation / methods*
  • Phosphatidylinositol 3-Kinases / chemistry*
  • Phosphatidylinositol 3-Kinases / metabolism
  • Protein Binding
  • Protein Kinase Inhibitors / chemistry*
  • Protein Kinase Inhibitors / pharmacology
  • Small Molecule Libraries / chemistry
  • Small Molecule Libraries / pharmacology
  • Tankyrases / chemistry*
  • Tankyrases / metabolism

Substances

  • Protein Kinase Inhibitors
  • Small Molecule Libraries
  • Tankyrases
  • Phosphatidylinositol 3-Kinases