Benchmarking Refined and Unrefined AlphaFold2 Structures for Hit Discovery

J Chem Inf Model. 2023 Mar 27;63(6):1656-1667. doi: 10.1021/acs.jcim.2c01219. Epub 2023 Mar 10.

Abstract

The recently developed AlphaFold2 (AF2) algorithm predicts proteins' 3D structures from amino acid sequences. The open AlphaFold protein structure database covers the complete human proteome. Using an industry-leading molecular docking method (Glide), we investigated the virtual screening performance of 37 common drug targets, each with an AF2 structure and known holo and apo structures from the DUD-E data set. In a subset of 27 targets where the AF2 structures are suitable for refinement, the AF2 structures show comparable early enrichment of known active compounds (avg. EF 1%: 13.0) to apo structures (avg. EF 1%: 11.4) while falling behind early enrichment of the holo structures (avg. EF 1%: 24.2). With an induced-fit protocol (IFD-MD), we can refine the AF2 structures using an aligned known binding ligand as the template to improve the performance in structure-based virtual screening (avg. EF 1%: 18.9). Glide-generated docking poses of known binding ligands can also be used as templates for IFD-MD, achieving similar improvements (avg. EF 1% 18.0). Thus, with proper preparation and refinement, AF2 structures show considerable promise for in silico hit identification.

MeSH terms

  • Benchmarking*
  • Binding Sites
  • Furylfuramide*
  • Humans
  • Ligands
  • Molecular Docking Simulation
  • Peptide Elongation Factor 1 / metabolism
  • Protein Binding
  • Proteins / chemistry

Substances

  • Furylfuramide
  • Peptide Elongation Factor 1
  • Proteins
  • Ligands