Rapid protein assignments and structures from raw NMR spectra with the deep learning technique ARTINA
- PMID: 36257955
- PMCID: PMC9579175
- DOI: 10.1038/s41467-022-33879-5
Rapid protein assignments and structures from raw NMR spectra with the deep learning technique ARTINA
Abstract
Nuclear Magnetic Resonance (NMR) spectroscopy is a major technique in structural biology with over 11,800 protein structures deposited in the Protein Data Bank. NMR can elucidate structures and dynamics of small and medium size proteins in solution, living cells, and solids, but has been limited by the tedious data analysis process. It typically requires weeks or months of manual work of a trained expert to turn NMR measurements into a protein structure. Automation of this process is an open problem, formulated in the field over 30 years ago. We present a solution to this challenge that enables the completely automated analysis of protein NMR data within hours after completing the measurements. Using only NMR spectra and the protein sequence as input, our machine learning-based method, ARTINA, delivers signal positions, resonance assignments, and structures strictly without human intervention. Tested on a 100-protein benchmark comprising 1329 multidimensional NMR spectra, ARTINA demonstrated its ability to solve structures with 1.44 Å median RMSD to the PDB reference and to identify 91.36% correct NMR resonance assignments. ARTINA can be used by non-experts, reducing the effort for a protein assignment or structure determination by NMR essentially to the preparation of the sample and the spectra measurements.
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures
Similar articles
-
The 100-protein NMR spectra dataset: A resource for biomolecular NMR data analysis.Sci Data. 2024 Jan 4;11(1):30. doi: 10.1038/s41597-023-02879-5. Sci Data. 2024. PMID: 38177162 Free PMC article.
-
Time-optimized protein NMR assignment with an integrative deep learning approach using AlphaFold and chemical shift prediction.Sci Adv. 2023 Nov 24;9(47):eadi9323. doi: 10.1126/sciadv.adi9323. Epub 2023 Nov 22. Sci Adv. 2023. PMID: 37992167 Free PMC article.
-
Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS.J Biomol NMR. 2002 Nov;24(3):171-89. doi: 10.1023/a:1021614115432. J Biomol NMR. 2002. PMID: 12522306
-
NMR-based automated protein structure determination.Arch Biochem Biophys. 2017 Aug 15;628:24-32. doi: 10.1016/j.abb.2017.02.011. Epub 2017 Mar 2. Arch Biochem Biophys. 2017. PMID: 28263718 Review.
-
Automation of NMR structure determination of proteins.Curr Opin Struct Biol. 2004 Oct;14(5):547-53. doi: 10.1016/j.sbi.2004.09.003. Curr Opin Struct Biol. 2004. PMID: 15465314 Review.
Cited by
-
5D solid-state NMR spectroscopy for facilitated resonance assignment.J Biomol NMR. 2023 Dec;77(5-6):229-245. doi: 10.1007/s10858-023-00424-5. Epub 2023 Nov 9. J Biomol NMR. 2023. PMID: 37943392 Free PMC article.
-
Manual and automatic assignment of two different Aβ40 amyloid fibril polymorphs using MAS solid-state NMR spectroscopy.Biomol NMR Assign. 2024 Dec;18(2):201-212. doi: 10.1007/s12104-024-10189-z. Epub 2024 Aug 9. Biomol NMR Assign. 2024. PMID: 39120652 Free PMC article.
-
Requirements for efficient endosomal escape by designed mini-proteins.bioRxiv [Preprint]. 2024 Apr 6:2024.04.05.588336. doi: 10.1101/2024.04.05.588336. bioRxiv. 2024. PMID: 38617268 Free PMC article. Preprint.
-
Deep-Learning-Based Mixture Identification for Nuclear Magnetic Resonance Spectroscopy Applied to Plant Flavors.Molecules. 2023 Nov 1;28(21):7380. doi: 10.3390/molecules28217380. Molecules. 2023. PMID: 37959799 Free PMC article.
-
Unraveling dynamic protein structures by two-dimensional infrared spectra with a pretrained machine learning model.Proc Natl Acad Sci U S A. 2024 Jul 2;121(27):e2409257121. doi: 10.1073/pnas.2409257121. Epub 2024 Jun 25. Proc Natl Acad Sci U S A. 2024. PMID: 38917009 Free PMC article.
References
-
- Garrett DS, Powers R, Gronenborn AM, Clore GM. A common sense approach to peak picking two-, three- and four-dimensional spectra using automatic computer analysis of contour diagrams. J. Magn. Reson. 1991;95:214–220. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
