DiffModeler: large macromolecular structure modeling for cryo-EM maps using a diffusion model

Nat Methods. 2024 Dec;21(12):2307-2317. doi: 10.1038/s41592-024-02479-0. Epub 2024 Oct 21.

Abstract

Cryogenic electron microscopy (cryo-EM) has now been widely used for determining multichain protein complexes. However, modeling a large complex structure, such as those with more than ten chains, is challenging, particularly when the map resolution decreases. Here we present DiffModeler, a fully automated method for modeling large protein complex structures. DiffModeler employs a diffusion model for backbone tracing and integrates AlphaFold2-predicted single-chain structures for structure fitting. DiffModeler showed an average template modeling score of 0.88 and 0.91 for two datasets of cryo-EM maps of 0-5 Å resolution and 0.92 for intermediate resolution maps (5-10 Å), substantially outperforming existing methodologies. Further benchmarking at low resolutions (10-20 Å) confirms its versatility, demonstrating plausible performance.

MeSH terms

  • Algorithms
  • Cryoelectron Microscopy* / methods
  • Diffusion
  • Macromolecular Substances / chemistry
  • Models, Molecular*
  • Protein Conformation
  • Proteins / chemistry
  • Proteins / ultrastructure
  • Software

Substances

  • Macromolecular Substances
  • Proteins