DomainFit: Identification of protein domains in cryo-EM maps at intermediate resolution using AlphaFold2-predicted models

Structure. 2024 May 6:S0969-2126(24)00143-6. doi: 10.1016/j.str.2024.04.017. Online ahead of print.

Abstract

Cryoelectron microscopy (cryo-EM) has revolutionized the structural determination of macromolecular complexes. With the paradigm shift to structure determination of highly complex endogenous macromolecular complexes ex vivo and in situ structural biology, there are an increasing number of structures of native complexes. These complexes often contain unidentified proteins, related to different cellular states or processes. Identifying proteins at resolutions lower than 4 Å remains challenging because side chains cannot be visualized reliably. Here, we present DomainFit, a program for semi-automated domain-level protein identification from cryo-EM maps, particularly at resolutions lower than 4 Å. By fitting domains from AlphaFold2-predicted models into cryo-EM maps, the program performs statistical analyses and attempts to identify the domains and protein candidates forming the density. Using DomainFit, we identified two microtubule inner proteins, one of which contains a CCDC81 domain and is exclusively localized in the proximal region of the doublet microtubule in Tetrahymena thermophila.

Keywords: AlphaFold2 modeling; cilia; cryo-EM; cryo-ET; domain identification; doublet; microtubule inner protein; protein identification; subtomogram averaging.