Prediction of liquid-liquid phase separating proteins using machine learning
- PMID: 35168563
- PMCID: PMC8845408
- DOI: 10.1186/s12859-022-04599-w
Prediction of liquid-liquid phase separating proteins using machine learning
Abstract
Background: The liquid-liquid phase separation (LLPS) of biomolecules in cell underpins the formation of membraneless organelles, which are the condensates of protein, nucleic acid, or both, and play critical roles in cellular function. Dysregulation of LLPS is implicated in a number of diseases. Although the LLPS of biomolecules has been investigated intensively in recent years, the knowledge of the prevalence and distribution of phase separation proteins (PSPs) is still lag behind. Development of computational methods to predict PSPs is therefore of great importance for comprehensive understanding of the biological function of LLPS.
Results: Based on the PSPs collected in LLPSDB, we developed a sequence-based prediction tool for LLPS proteins (PSPredictor), which is an attempt at general purpose of PSP prediction that does not depend on specific protein types. Our method combines the componential and sequential information during the protein embedding stage, and, adopts the machine learning algorithm for final predicting. The proposed method achieves a tenfold cross-validation accuracy of 94.71%, and outperforms previously reported PSPs prediction tools. For further applications, we built a user-friendly PSPredictor web server ( http://www.pkumdl.cn/PSPredictor ), which is accessible for prediction of potential PSPs.
Conclusions: PSPredictor could identifie novel scaffold proteins for stress granules and predict PSPs candidates in the human genome for further study. For further applications, we built a user-friendly PSPredictor web server ( http://www.pkumdl.cn/PSPredictor ), which provides valuable information for potential PSPs recognition.
Keywords: Liquid–liquid phase separation (LLPS); Machine learning; Phase separation proteins (PSPs); Predictor.
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures
Similar articles
-
Evaluation of sequence-based predictors for phase-separating protein.Brief Bioinform. 2023 Jul 20;24(4):bbad213. doi: 10.1093/bib/bbad213. Brief Bioinform. 2023. PMID: 37287138
-
LLPSDB: a database of proteins undergoing liquid-liquid phase separation in vitro.Nucleic Acids Res. 2020 Jan 8;48(D1):D320-D327. doi: 10.1093/nar/gkz778. Nucleic Acids Res. 2020. PMID: 31906602 Free PMC article.
-
Protein Databases Related to Liquid-Liquid Phase Separation.Int J Mol Sci. 2020 Sep 16;21(18):6796. doi: 10.3390/ijms21186796. Int J Mol Sci. 2020. PMID: 32947964 Free PMC article. Review.
-
Seq2Phase: language model-based accurate prediction of client proteins in liquid-liquid phase separation.Bioinform Adv. 2023 Dec 22;4(1):vbad189. doi: 10.1093/bioadv/vbad189. eCollection 2024. Bioinform Adv. 2023. PMID: 38205268 Free PMC article.
-
Liquid-liquid phase separation (LLPS) in cellular physiology and tumor biology.Am J Cancer Res. 2021 Aug 15;11(8):3766-3776. eCollection 2021. Am J Cancer Res. 2021. PMID: 34522448 Free PMC article. Review.
Cited by
-
Liquid-liquid phase separation is essential for reovirus viroplasm formation and immune evasion.J Virol. 2024 Sep 17;98(9):e0102824. doi: 10.1128/jvi.01028-24. Epub 2024 Aug 28. J Virol. 2024. PMID: 39194247
-
Liquid-liquid phase separation in DNA double-strand breaks repair.Cell Death Dis. 2023 Nov 15;14(11):746. doi: 10.1038/s41419-023-06267-0. Cell Death Dis. 2023. PMID: 37968256 Free PMC article. Review.
-
Intrinsically disordered regions that drive phase separation form a robustly distinct protein class.J Biol Chem. 2023 Jan;299(1):102801. doi: 10.1016/j.jbc.2022.102801. Epub 2022 Dec 14. J Biol Chem. 2023. PMID: 36528065 Free PMC article.
-
Machine-learning analysis of intrinsically disordered proteins identifies key factors that contribute to neurodegeneration-related aggregation.Front Aging Neurosci. 2022 Aug 3;14:938117. doi: 10.3389/fnagi.2022.938117. eCollection 2022. Front Aging Neurosci. 2022. PMID: 35992603 Free PMC article.
-
Molecular features driving condensate formation and gene expression by the BRD4-NUT fusion oncoprotein are overlapping but distinct.Sci Rep. 2023 Jul 24;13(1):11907. doi: 10.1038/s41598-023-39102-9. Sci Rep. 2023. PMID: 37488172 Free PMC article.
References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
