Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 15;23(1):72.
doi: 10.1186/s12859-022-04599-w.

Prediction of liquid-liquid phase separating proteins using machine learning

Affiliations

Prediction of liquid-liquid phase separating proteins using machine learning

Xiaoquan Chu et al. BMC Bioinformatics. .

Abstract

Background: The liquid-liquid phase separation (LLPS) of biomolecules in cell underpins the formation of membraneless organelles, which are the condensates of protein, nucleic acid, or both, and play critical roles in cellular function. Dysregulation of LLPS is implicated in a number of diseases. Although the LLPS of biomolecules has been investigated intensively in recent years, the knowledge of the prevalence and distribution of phase separation proteins (PSPs) is still lag behind. Development of computational methods to predict PSPs is therefore of great importance for comprehensive understanding of the biological function of LLPS.

Results: Based on the PSPs collected in LLPSDB, we developed a sequence-based prediction tool for LLPS proteins (PSPredictor), which is an attempt at general purpose of PSP prediction that does not depend on specific protein types. Our method combines the componential and sequential information during the protein embedding stage, and, adopts the machine learning algorithm for final predicting. The proposed method achieves a tenfold cross-validation accuracy of 94.71%, and outperforms previously reported PSPs prediction tools. For further applications, we built a user-friendly PSPredictor web server ( http://www.pkumdl.cn/PSPredictor ), which is accessible for prediction of potential PSPs.

Conclusions: PSPredictor could identifie novel scaffold proteins for stress granules and predict PSPs candidates in the human genome for further study. For further applications, we built a user-friendly PSPredictor web server ( http://www.pkumdl.cn/PSPredictor ), which provides valuable information for potential PSPs recognition.

Keywords: Liquid–liquid phase separation (LLPS); Machine learning; Phase separation proteins (PSPs); Predictor.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Relationship between percent recall and total percentage of human proteins accepted at given thresholds, for Model 0 and three best first generation prediction tools
Fig. 2
Fig. 2
Fraction of proteins in each category (scaffold, regulator or client) predicted as PSPs by PSPredictor or PScore
Fig. 3
Fig. 3
A Fraction of proteins in each tier group predicted as PSPs by first generation prediction tools and PSPredictor. B The number of predicted PSPs that overlapped between two prediction tools
Fig. 4
Fig. 4
The model architectures of CBOW (A) and Skip-gram (B)
Fig. 5
Fig. 5
2D vector projection of PSPs and non-PSPs by PCA

Similar articles

Cited by

References

    1. Alberti S, Gladfelter A, Mittag T. Considerations and challenges in studying liquid–liquid phase separation and biomolecular condensates. Cell. 2019;176:419–434. - PMC - PubMed
    1. Alberti S, Halfmann R, King O, Kapila A, Lindquist S. A systematic survey identifies prions and illuminates sequence features of prionogenic proteins. Cell. 2009;137:146–158. - PMC - PubMed
    1. Ambadipudi S, Biernat J, Riedel D, Mandelkow E, Zweckstetter M. Liquid–liquid phase separation of the microtubule-binding repeats of the Alzheimer-related protein Tau. Nat Commun. 2017;8:275. - PMC - PubMed
    1. Banani SF, Lee HO, Hyman AA, Rosen MK. Biomolecular condensates: organizers of cellular biochemistry. Nat Rev Mol Cell Biol. 2017;18:285–298. - PMC - PubMed
    1. Bolognesi B, Gotor NL, Dhar R, Cirillo D, Baldrighi M, Tartaglia GG, Lehner B. A concentration-dependent liquid phase separation can cause toxicity upon increased protein expression. Cell Rep. 2016;16:222–231. - PMC - PubMed

LinkOut - more resources