Linking time-series of single-molecule experiments with molecular dynamics simulations by machine learning

Elife. 2018 May 3:7:e32668. doi: 10.7554/eLife.32668.

Abstract

Single-molecule experiments and molecular dynamics (MD) simulations are indispensable tools for investigating protein conformational dynamics. The former provide time-series data, such as donor-acceptor distances, whereas the latter give atomistic information, although this information is often biased by model parameters. Here, we devise a machine-learning method to combine the complementary information from the two approaches and construct a consistent model of conformational dynamics. It is applied to the folding dynamics of the formin-binding protein WW domain. MD simulations over 400 μs led to an initial Markov state model (MSM), which was then "refined" using single-molecule Förster resonance energy transfer (FRET) data through hidden Markov modeling. The refined or data-assimilated MSM reproduces the FRET data and features hairpin one in the transition-state ensemble, consistent with mutation experiments. The folding pathway in the data-assimilated MSM suggests interplay between hydrophobic contacts and turn formation. Our method provides a general framework for investigating conformational transitions in other proteins.

Keywords: Markov state model; computational biology; molecular biophysics; molecular dynamics simulation; none; semi-supervised learning; single-molecule experiment; structural biology; systems biology; time-series analysis; transfer learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Carrier Proteins / chemistry
  • Carrier Proteins / metabolism
  • Fluorescence Resonance Energy Transfer
  • Machine Learning*
  • Molecular Dynamics Simulation*
  • Protein Conformation
  • Protein Folding
  • Single Molecule Imaging / methods*

Substances

  • Carrier Proteins

Grants and funding

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.