Predictomes, a classifier-curated database of AlphaFold-modeled protein-protein interactions

Mol Cell. 2025 Mar 20;85(6):1216-1232.e5. doi: 10.1016/j.molcel.2025.01.034. Epub 2025 Feb 26.

Abstract

Protein-protein interactions (PPIs) are ubiquitous in biology, yet a comprehensive structural characterization of the PPIs underlying cellular processes is lacking. AlphaFold-Multimer (AF-M) has the potential to fill this knowledge gap, but standard AF-M confidence metrics do not reliably separate relevant PPIs from an abundance of false positive predictions. To address this limitation, we used machine learning on curated datasets to train a structure prediction and omics-informed classifier (SPOC) that effectively separates true and false AF-M predictions of PPIs, including in proteome-wide screens. We applied SPOC to an all-by-all matrix of nearly 300 human genome maintenance proteins, generating ∼40,000 predictions that can be viewed at predictomes.org, where users can also score their own predictions with SPOC. High-confidence PPIs discovered using our approach enable hypothesis generation in genome maintenance. Our results provide a framework for interpreting large-scale AF-M screens and help lay the foundation for a proteome-wide structural interactome.

Keywords: AF-M database; AF-M score; AlphaFold-multimer; PPI database; PPI screen; SPOC; genome maintenance; in-silico interaction screen; predictomes; protein-protein interactions.

MeSH terms

  • Computational Biology* / methods
  • Databases, Protein*
  • Humans
  • Machine Learning
  • Protein Binding
  • Protein Folding
  • Protein Interaction Mapping* / methods
  • Protein Interaction Maps
  • Proteome* / genetics
  • Proteome* / metabolism
  • Proteomics / methods

Substances

  • Proteome