EncoderMap(II): Visualizing Important Molecular Motions with Improved Generation of Protein Conformations

J Chem Inf Model. 2019 Nov 25;59(11):4550-4560. doi: 10.1021/acs.jcim.9b00675. Epub 2019 Nov 5.

Abstract

Dimensionality reduction can be used to project high-dimensional molecular data into a simplified, low-dimensional map. One feature of our recently introduced dimensionality reduction technique EncoderMap, which relies on the combination of an autoencoder with multidimensional scaling, is its ability to do the reverse. It is able to generate conformations for any selected points in the low-dimensional map. This transfers the simplified, low-dimensional map back into the high-dimensional conformational space. Although the output is again high-dimensional, certain aspects of the simplification are preserved. The generated conformations only mirror the most dominant conformational differences that determine the positions of conformational states in the low-dimensional map. This allows depicting such differences and-in consequence-visualizing molecular motions and gives a unique perspective on high-dimensional conformational data. In our previous work, protein conformations described in backbone dihedral angle space were used as the input for EncoderMap, and conformations were also generated in this space. For large proteins, however, the generation of conformations is inaccurate with this approach due to the local character of backbone dihedral angles. Here, we present an improved variant of EncoderMap which is able to generate large protein conformations that are accurate in short-range and long-range orders. This is achieved by differentiable reconstruction of Cartesian coordinates from the generated dihedrals, which allows adding a contribution to the cost function that monitors the accuracy of all pairwise distances between the Cα-atoms of the generated conformations. The improved capabilities to generate conformations of large, even multidomain, proteins are demonstrated for two examples: diubiquitin and a part of the Ssa1 Hsp70 yeast chaperone. We show that the improved variant of EncoderMap can nicely visualize motions of protein domains relative to each other but is also able to highlight important conformational changes within the individual domains.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenosine Triphosphatases / chemistry
  • Algorithms
  • HSP70 Heat-Shock Proteins / chemistry
  • Humans
  • Molecular Dynamics Simulation*
  • Protein Conformation
  • Proteins / chemistry*
  • Saccharomyces cerevisiae / chemistry
  • Saccharomyces cerevisiae Proteins / chemistry
  • Software
  • Ubiquitins / chemistry

Substances

  • HSP70 Heat-Shock Proteins
  • Proteins
  • Saccharomyces cerevisiae Proteins
  • UBD protein, human
  • Ubiquitins
  • Adenosine Triphosphatases
  • SSA1 protein, S cerevisiae