Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 26;2(Pt 2):207-17.
doi: 10.1107/S205225251500202X. eCollection 2015 Mar 1.

Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering

Affiliations
Free PMC article

Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering

Giancarlo Tria et al. IUCrJ. .
Free PMC article

Abstract

Dynamic ensembles of macromolecules mediate essential processes in biology. Understanding the mechanisms driving the function and molecular interactions of 'unstructured' and flexible molecules requires alternative approaches to those traditionally employed in structural biology. Small-angle X-ray scattering (SAXS) is an established method for structural characterization of biological macromolecules in solution, and is directly applicable to the study of flexible systems such as intrinsically disordered proteins and multi-domain proteins with unstructured regions. The Ensemble Optimization Method (EOM) [Bernadó et al. (2007 ▶). J. Am. Chem. Soc. 129, 5656-5664] was the first approach introducing the concept of ensemble fitting of the SAXS data from flexible systems. In this approach, a large pool of macromolecules covering the available conformational space is generated and a sub-ensemble of conformers coexisting in solution is selected guided by the fit to the experimental SAXS data. This paper presents a series of new developments and advancements to the method, including significantly enhanced functionality and also quantitative metrics for the characterization of the results. Building on the original concept of ensemble optimization, the algorithms for pool generation have been redesigned to allow for the construction of partially or completely symmetric oligomeric models, and the selection procedure was improved to refine the size of the ensemble. Quantitative measures of the flexibility of the system studied, based on the characteristic integral parameters of the selected ensemble, are introduced. These improvements are implemented in the new EOM version 2.0, and the capabilities as well as inherent limitations of the ensemble approach in SAXS, and of EOM 2.0 in particular, are discussed.

Keywords: hybrid methods; macromolecular dynamics; proteins; small-angle scattering; symmetric oligomers; unstructured biology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(i) Different views of an example multi-domain protein composed of two domains [solved by MX: (a) grey and (b) yellow], connected by a disordered linker 30 amino acids long (transparent red spheres, left area). (ii) Multiple inter-domain linker reconstructions (multiple colours) computed with EOM (upper-right area). (iii) Different views of multiple inter-domain linker reconstructions computed with EOM 2.0 using the new possibility to fix domain positions in three-dimensional coordinates (bottom-right area).
Figure 2
Figure 2
Different views of a hexameric multi-domain protein with a symmetric oligomeric core. Each monomer is composed of two domains connected by a flexible linker and with disordered N- and C-termini. (a) Generated full-length hexamer where P6 symmetry is applied to the core and to the disordered regions. (b) Asymmetric modelling where the generated chains are independent of each other and the symmetry is present in the core only.
Figure 3
Figure 3
Qualitative characterization of particle flexibility from various characteristic R g distributions. (a) Pool (black), which represents the case of complete randomness; EOM(1) (purple), EOM(2) (orange), EOM(3) (pink) and EOM(4) (dark green) which represent the real outcome of independent EOM 2.0 runs in terms of R g distributions; uniform (cyan), compact (light blue), bimodal (red) which represent extreme (theoretical) cases. (b) H b(S) values computed from the distributions in (a). (c) Combination of R flex values for all the distributions (and compared to the threshold of randomness computed from the pool, in brackets, ∼89%) with the associated R σ values. The last example (red curve) indicates a potentially inconsistent result.
Figure 4
Figure 4
Distribution of end-to-end distances computed from pools containing 10 000 structures of 100 and 500 amino-acid chains and compared with the expected normal distribution having the same mean and standard deviation values.
Figure 5
Figure 5
Distributions of R g pools (blue) and selected ensembles (red) for determination of open and closed conformations of calmodulin using three different lengths of inter-domain disordered linkers (zero, six and 12 amino acids) for the pool generation. Violet and light blue triangles show R g for closed and open conformations, respectively.
Figure 6
Figure 6
Comparison of R g distributions showing that subpopulations of conformers can be identified from a large ensemble if the difference between their mean R g is greater than approximately two times the standard deviations of the original pool (bottom left). The R g values of the two subpopulations are indicated as vertical lines on each plot.
Figure 7
Figure 7
Characterization of the flexibility of uPARWT and the mutated uPARH47C-N259C using EOM 2.0. (a) Size distributions (R g) of uPARWT and uPARH47C-N259C, providing only a qualitative assessment through direct comparison of the distributions of the selected ensembles and the pool. (b) The metrics R flex and R σ enable characterization of the flexibility quantitatively, with R flex = ∼82% and R flex = ∼45%, for uPARWT and uPARH47C-N259C, respectively, reflecting a significant change in compactness of the particle upon mutation (with a threshold of randomness of ∼85% calculated from the pool).

Similar articles

Cited by

References

    1. Banavali, N. K. & Roux, B. (2009). Proteins, 74, 378–389. - PMC - PubMed
    1. Bernadó, P. (2010). Eur. Biophys. J. 39, 769–780. - PubMed
    1. Bernadó, P., Blanchard, L., Timmins, P., Marion, D., Ruigrok, R. W. H. & Blackledge, M. (2005). Proc. Natl Acad. Sci. USA, 102, 17002–17007. - PMC - PubMed
    1. Bernadó, P., Mylonas, E., Petoukhov, M. V., Blackledge, M. & Svergun, D. I. (2007). J. Am. Chem. Soc. 129, 5656–5664. - PubMed
    1. Bernadó, P. & Svergun, D. I. (2012). Methods Mol. Biol. 896, 107–122. - PubMed