Functional analysis of Rossmann-like domains reveals convergent evolution of topology and reaction pathways

PLoS Comput Biol. 2019 Dec 23;15(12):e1007569. doi: 10.1371/journal.pcbi.1007569. eCollection 2019 Dec.

Abstract

Rossmann folds are ancient, frequently diverged domains found in many biological reaction pathways where they have adapted for different functions. Consequently, discernment and classification of their homologous relations and function can be complicated. We define a minimal Rossmann-like structure motif (RLM) that corresponds for the common core of known Rossmann domains and use this motif to identify all RLM domains in the Protein Data Bank (PDB), thus finding they constitute about 20% of all known 3D structures. The Evolutionary Classification of protein structure Domains (ECOD) classifies RLM domains in a number of groups that lack evidence for homology (X-groups), which suggests that they could have evolved independently multiple times. Closely related, homologous RLM enzyme families can diverge to bind different ligands using similar binding sites and to catalyze different reactions. Conversely, non-homologous RLM domains can converge to catalyze the same reactions or to bind the same ligand with alternate binding modes. We discuss a special case of such convergent evolution that is relevant to the polypharmacology paradigm, wherein the same drug (methotrexate) binds to multiple non-homologous RLM drug targets with different topologies. Finally, assigning proteins with RLM domain to the Enzyme Commission classification suggest that RLM enzymes function mainly in metabolism (and comprise 38% of reference metabolic pathways) and are overrepresented in extant pathways that represent ancient biosynthetic routes such as nucleotide metabolism, energy metabolism, and metabolism of amino acids. In fact, RLM enzymes take part in five out of eight enzymatic reactions of the Wood-Ljungdahl metabolic pathway thought to be used by the last universal common ancestor (LUCA). The prevalence of RLM domains in this ancient metabolism might explain their wide distribution among enzymes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites / genetics
  • Catalytic Domain / genetics
  • Computational Biology
  • Databases, Protein
  • Enzymes / chemistry
  • Enzymes / genetics
  • Enzymes / metabolism
  • Evolution, Molecular*
  • Humans
  • Ligands
  • Metabolic Networks and Pathways / genetics
  • Models, Molecular
  • Protein Binding / genetics
  • Protein Domains / genetics*
  • Software
  • Structural Homology, Protein

Substances

  • Enzymes
  • Ligands