Structure-based protein function prediction using graph convolutional networks
- PMID: 34039967
- PMCID: PMC8155034
- DOI: 10.1038/s41467-021-23303-9
Structure-based protein function prediction using graph convolutional networks
Abstract
The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/ .
Conflict of interest statement
The authors declare no competing interests.
Figures
Similar articles
-
Accurate protein function prediction via graph attention networks with predicted structure information.Brief Bioinform. 2022 Jan 17;23(1):bbab502. doi: 10.1093/bib/bbab502. Brief Bioinform. 2022. PMID: 34882195 Free PMC article.
-
Hierarchical graph transformer with contrastive learning for protein function prediction.Bioinformatics. 2023 Jul 1;39(7):btad410. doi: 10.1093/bioinformatics/btad410. Bioinformatics. 2023. PMID: 37369035 Free PMC article.
-
Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks.J Bioinform Comput Biol. 2018 Oct;16(5):1850021. doi: 10.1142/S021972001850021X. J Bioinform Comput Biol. 2018. PMID: 30419785
-
Protein Structure Prediction: Conventional and Deep Learning Perspectives.Protein J. 2021 Aug;40(4):522-544. doi: 10.1007/s10930-021-10003-y. Epub 2021 May 28. Protein J. 2021. PMID: 34050498 Review.
-
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112. Brief Bioinform. 2020. PMID: 31714956 Review.
Cited by
-
ProTInSeq: transposon insertion tracking by ultra-deep DNA sequencing to identify translated large and small ORFs.Nat Commun. 2024 Mar 7;15(1):2091. doi: 10.1038/s41467-024-46112-2. Nat Commun. 2024. PMID: 38453908 Free PMC article.
-
Machine learning-aided design and screening of an emergent protein function in synthetic cells.Nat Commun. 2024 Mar 5;15(1):2010. doi: 10.1038/s41467-024-46203-0. Nat Commun. 2024. PMID: 38443351 Free PMC article.
-
Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering.ACS Cent Sci. 2024 Feb 5;10(2):226-241. doi: 10.1021/acscentsci.3c01275. eCollection 2024 Feb 28. ACS Cent Sci. 2024. PMID: 38435522 Free PMC article. Review.
-
A comprehensive computational benchmark for evaluating deep learning-based protein function prediction approaches.Brief Bioinform. 2024 Jan 22;25(2):bbae050. doi: 10.1093/bib/bbae050. Brief Bioinform. 2024. PMID: 38388682 Free PMC article.
-
Temporal colonization and metabolic regulation of the gut microbiome in neonatal oxen at single nucleotide resolution.ISME J. 2024 Jan 8;18(1):wrad022. doi: 10.1093/ismejo/wrad022. ISME J. 2024. PMID: 38365257 Free PMC article.
References
-
- Goodsell, D. S. The Machinery of Life (Springer Science & Business Media, 2009).
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
