Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Aug;5(8):e1000475.
doi: 10.1371/journal.pcbi.1000475. Epub 2009 Aug 21.

Accurate prediction of DnaK-peptide binding via homology modelling and experimental data

Affiliations

Accurate prediction of DnaK-peptide binding via homology modelling and experimental data

Joost Van Durme et al. PLoS Comput Biol. 2009 Aug.

Abstract

Molecular chaperones are essential elements of the protein quality control machinery that governs translocation and folding of nascent polypeptides, refolding and degradation of misfolded proteins, and activation of a wide range of client proteins. The prokaryotic heat-shock protein DnaK is the E. coli representative of the ubiquitous Hsp70 family, which specializes in the binding of exposed hydrophobic regions in unfolded polypeptides. Accurate prediction of DnaK binding sites in E. coli proteins is an essential prerequisite to understand the precise function of this chaperone and the properties of its substrate proteins. In order to map DnaK binding sites in protein sequences, we have developed an algorithm that combines sequence information from peptide binding experiments and structural parameters from homology modelling. We show that this combination significantly outperforms either single approach. The final predictor had a Matthews correlation coefficient (MCC) of 0.819 when assessed over the 144 tested peptide sequences to detect true positives and true negatives. To test the robustness of the learning set, we have conducted a simulated cross-validation, where we omit sequences from the learning sets and calculate the rate of repredicting them. This resulted in a surprisingly good MCC of 0.703. The algorithm was also able to perform equally well on a blind test set of binders and non-binders, of which there was no prior knowledge in the learning sets. The algorithm is freely available at http://limbo.vib.be.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. DnaK binding to immobilised peptides attached to a cellulose-based membrane.
A) Group 2 peptides Membrane A, B) Group 2 peptides Membrane A bound to antibody only, indicating potential false positives (see supplementary Table 1 for all groups of peptide sequences and raw binding data).
Figure 2
Figure 2. DnaK bound to a substrate peptide.
Left: Substrate binding C-terminal domain of DnaK shown in cartoon style with bound heptapeptide NRLLLTG in stick style. Right: Detailed view of substrate binding in the beta sheet sandwich of DnaK. Molecular graphics created with YASARA (http://www.yasara.org) and Povray (http://www.povray.org).
Figure 3
Figure 3. ROC curves as calculated from the PSSMs of different DnaK-substrate structures.
The curves represent structures 1DKX (closed circle), 1DKY A (open square), 1DKY B (closed triangle) and 3 NMR structures from the same ensemble (open triangle, open circle, closed square).
Figure 4
Figure 4. ROC curves representing the performance of the different predictors before learning set optimization.
The graph visualizes the not cross-validated sequence-based predictor (open square), the not cross-validated sequence and structure-based predictor (closed square), the cross-validated sequence-based predictor (open circle), the cross-validated sequence and structure-based predictor (closed circle), the sequence based predictor on the independent validation set (open triangle) and the sequence and structure-based predictor on the independent validation set (closed triangle).
Figure 5
Figure 5. ROC curves representing the performance of the different predictors generated after running the learning set training algorithm.
The graph visualizes the not cross-validated sequence-based predictor (open square), the not cross-validated sequence and structure-based predictor (closed square), the cross-validated sequence-based predictor (open circle), the cross-validated sequence and structure-based predictor (closed circle), the sequence based predictor on the independent validation set (open triangle) and the sequence and structure-based predictor on the independent validation set (closed triangle).
Figure 6
Figure 6. Graphical representation of the final DnaK heptameric binding profile over all aminoacids and all residue positions.
The axes crossing point at score zero was chosen for convenience. Position 1 and 7 respectively represent the N-and and C-terminal end of the heptameric motif.
Figure 7
Figure 7. ROC performance comparison with a previously published algorithm.
A: Comparison of our DnaK predictor (closed square) with the previously published predicting algorithm from Rüdiger et al (open square) for the extended benchmark peptide set. The data point above 90% specificity (90.7%) with the best MCC in both predictors is shown as a dashed line. B: ROC curves comparison between our predictor (closed square) and the algorithm of Rüdiger et al (open square) for the validation set of peptides.

Similar articles

Cited by

References

    1. Mayer MP, Bukau B. Hsp70 chaperones: cellular functions and molecular mechanism. Cell Mol Life Sci. 2005;62:670–684. - PMC - PubMed
    1. Hartl FU, Hayer-Hartl M. Molecular chaperones in the cytosol: from nascent chain to folded protein. Science. 2002;295:1852–1858. - PubMed
    1. Rudiger S, Germeroth L, Schneider-Mergener J, Bukau B. Substrate specificity of the DnaK chaperone determined by screening cellulose-bound peptide libraries. EMBO J. 1997;16:1501–1507. - PMC - PubMed
    1. Flaherty KM, McKay DB, Kabsch W, Holmes KC. Similarity of the three-dimensional structures of actin and the ATPase fragment of a 70-kDa heat shock cognate protein. Proc Natl Acad Sci U S A. 1991;88:5041–5045. - PMC - PubMed
    1. Zhu X, Zhao X, Burkholder WF, Gragerov A, Ogata CM, et al. Structural analysis of substrate binding by the molecular chaperone DnaK. Science. 1996;272:1606–1614. - PMC - PubMed

Publication types

MeSH terms