Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 2;118(44):e2111696118.
doi: 10.1073/pnas.2111696118.

Accurate model of liquid-liquid phase behavior of intrinsically disordered proteins from optimization of single-chain properties

Affiliations

Accurate model of liquid-liquid phase behavior of intrinsically disordered proteins from optimization of single-chain properties

Giulio Tesei et al. Proc Natl Acad Sci U S A. .

Abstract

Many intrinsically disordered proteins (IDPs) may undergo liquid-liquid phase separation (LLPS) and participate in the formation of membraneless organelles in the cell, thereby contributing to the regulation and compartmentalization of intracellular biochemical reactions. The phase behavior of IDPs is sequence dependent, and its investigation through molecular simulations requires protein models that combine computational efficiency with an accurate description of intramolecular and intermolecular interactions. We developed a general coarse-grained model of IDPs, with residue-level detail, based on an extensive set of experimental data on single-chain properties. Ensemble-averaged experimental observables are predicted from molecular simulations, and a data-driven parameter-learning procedure is used to identify the residue-specific model parameters that minimize the discrepancy between predictions and experiments. The model accurately reproduces the experimentally observed conformational propensities of a set of IDPs. Through two-body as well as large-scale molecular simulations, we show that the optimization of the intramolecular interactions results in improved predictions of protein self-association and LLPS.

Keywords: biomolecular condensates; force field parameterization; intrinsically disordered proteins; liquid–liquid phase separation; protein interactions.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1
Fig. 1
Assessing the HPS, AVG, and HPS-Urry models using experimental data reporting on single-chain conformational properties. (A) Probability distributions of the λ parameters calculated from 87 min–max normalized hydrophobicity scales. Lines are the λ parameters of the HPS model (blue), the average over the hydrophobicity scales (orange) and the HPS-Urry model (green) (28). Intramolecular PRE intensity ratios for (B) the S42C mutant of α-Synuclein and (C) the S143C mutant of A2 LCD from simulations and experiments (22, 43) (black). (D) χ2 values quantifying the discrepancy between simulated and experimental intramolecular PRE data, scaled by the hyperparameter η=0.1 (Materials and Methods). Relative difference between simulated and experimental radii of gyration (E) for proteins that do not readily undergo phase separation alone and (F) for variants of A1 LCD, with negative values corresponding to the simulated ensembles being more compact than in experiments.
Fig. 2
Fig. 2
Flowchart illustrating the Bayesian parameter-learning procedure (Materials and Methods).
Fig. 3
Fig. 3
Selection and performance of the M1–3 models with respect to the training data. (A) Overview of the optimal λ sets with ηχPRE2<21 and χRg2<3 collected through the parameter learning procedures started from λ0= AVG (upward triangles), M1 (squares), and M2 (downward triangles). The gray gradient shows the Spearman’s correlation coefficient between experimental and simulated Rg values for the A1 LCD variants in the training set. Colored open symbols indicate the M1 (blue upward triangle), M2 (orange square), and M3 (green downward triangle) scales, whereas the adjacent values are the respective Spearman’s correlation coefficients. (B) Covariance matrix of the λ sets with ηχPRE2<21 and χRg2<3. (C) M1 (blue), M2 (orange), and M3 (green) scales. Solid lines are guides for the eye, whereas the gray shaded area shows the mean ±2 SD of the λ sets with ηχPRE2<21 and χRg2<3. Comparison between (D) ηχPRE2 and (E) χRg2 values for the HPS model (gray) and the optimized M1 (blue), M2 (orange), and M3 (green) models.
Fig. 4
Fig. 4
(A) Comparison between experimental and predicted radii of gyration (SI Appendix, Table S1), Rg, for the HPS, HPS-Urry, and M1–3 models. (B) Zoom-in on the Rg values of the A1 LCD variants, with Pearson’s r coefficients for this subset of the training data reported in the legend.
Fig. 5
Fig. 5
Testing the M1–3 models using experimental findings on protein–protein interactions. Comparison between experimental (black) intermolecular PRE rates (SI Appendix, Table S3) and predictions from the M1 (blue), M2 (orange) and M3 (green) models for (AC) FUS LCD and (D and E) A2 LCD calculated using the best-fit correlation time, τc. (F and G) Discrepancy between calculated and experimental intermolecular PRE rates χPRE2 as a function of τc. (H) Second virial coefficients, B22, of FUS LCD (circles) and A2 LCD (squares) calculated from two-chain simulations of the M1–3 models. Error bars are SEMs estimated by bootstrapping 1,000 times 40 B22 values calculated from trajectory blocks of 875 ns. (I) Probability of the bound state estimated from protein-protein interaction energies in two-chain simulations of the M1–3 models. (J) Dissociation constants, Kd, of FUS LCD (circles) and A2 LCD (squares) calculated from two-chain simulations of the M1–3 models. For pB and Kd, error bars are SDs of 10 simulation replicas. Lines in H and J are guides for the eye.
Fig. 6
Fig. 6
Protein concentrations (AC) in the condensate and (DF) in the dilute phase from slab simulations of the M1–3, HPS, and HPS-Urry models performed at 50 °C (closed symbols), 37 °C (crosses in H), and 24 °C (open symbols). Red open squares show experimental measurements at ∼ 24 °C (A, C, D, and F) and ∼ 4 °C (B and E). Correlation between log10(csat/M) from simulations and experiments for (G) diverse sequences and (H) A1 LCD variants. Solid lines show linear fits to the simulation data at 50 °C. Dashed lines show linear fits to the HPS-Urry data at 24 °C (G and H) and to the M1–3 data at 37 °C (H). Values reported in the legends are Pearson’s correlation coefficients. Error bars are SEMs of averages over blocks of 0.3 µs. We note that the correlation coefficients reported in G are associated with a substantial uncertainty as they are calculated over only three (HPS), four (HPS-Urry), and five points (M1–3).
Fig. 7
Fig. 7
Correlation between chain compaction and LLPS propensity for aromatic and charge variants of A1 LCD. log10(csat/M) vs. νsim for A1 LCD variants from simulations performed using the (A) M1, (B) M2, and (C) M3 models. Black and colored circles indicate aromatic and charge variants, respectively. Black lines are linear fits to the aromatic variants. (DF) Residuals from the linear fits of AC for the charge variants of A1 LCD as a function of the NCPR. Values reported in the legends are Pearson’s correlation coefficients. Error bars of log10(csat) values are SEMs of averages over blocks of 0.3 µs. Error bars of νsim are SDs from fits to Rij=R0|ij|νsim in the long-distance region, |ij|>10. Solid lines are linear fits to the data. Dotted lines in DF are lines of best fit to the experimental data by Bremer et al. (16).
Fig. 8
Fig. 8
Comparing residue–residue interactions in dilute solution and in the condensate. Energy maps from simulations of the M1 model of (AC) FUS LCD and (DF) A2 LCD calculated using nonelectrostatic interaction energies. The 1D projections of the energy maps for (G) FUS LCD and (H) A2 LCD, normalized by the absolute average interaction energy |E| and shifted vertically for clarity. Colors indicate that the energies were calculated within a single chain at infinite dilution (blue), between two chains in the dilute regime (orange), and between a chain located at the center of a condensate and the surrounding chains (green).

Similar articles

Cited by

References

    1. Patel A., et al. ., A liquid-to-solid phase transition of the ALS protein FUS accelerated by disease mutation. Cell 162, 1066–1077 (2015). - PubMed
    1. Wegmann S., et al. ., Tau protein liquid-liquid phase separation can initiate tau aggregation. EMBO J. 37, e98049 (2018). - PMC - PubMed
    1. Kanaan N. M., Hamel C., Grabinski T., Combs B., Liquid-liquid phase separation induces pathogenic tau conformations in vitro. Nat. Commun. 11, 2809 (2020). - PMC - PubMed
    1. Ray S., et al. ., a-Synuclein aggregation nucleates through liquid-liquid phase separation. Nat. Chem. 12, 705–716 (2020). - PubMed
    1. Hardenberg MC, et al. ., Observation of an α-synuclein liquid droplet state and its maturation into Lewy body-like assemblies. J. Mol. Cell Biol. 13, 282–294 (2021). - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources