Analysis of molecular recognition features (MoRFs)

J Mol Biol. 2006 Oct 6;362(5):1043-59. doi: 10.1016/j.jmb.2006.07.087. Epub 2006 Aug 4.

Abstract

Several proteomic studies in the last decade revealed that many proteins are either completely disordered or possess long structurally flexible regions. Many such regions were shown to be of functional importance, often allowing a protein to interact with a large number of diverse partners. Parallel to these findings, during the last five years structural bioinformatics has produced an explosion of results regarding protein-protein interactions and their importance for cell signaling. We studied the occurrence of relatively short (10-70 residues), loosely structured protein regions within longer, largely disordered sequences that were characterized as bound to larger proteins. We call these regions molecular recognition features (MoRFs, also known as molecular recognition elements, MoREs). Interestingly, upon binding to their partner(s), MoRFs undergo disorder-to-order transitions. Thus, in our interpretation, MoRFs represent a class of disordered region that exhibits molecular recognition and binding functions. This work extends previous research showing the importance of flexibility and disorder for molecular recognition. We describe the development of a database of MoRFs derived from the RCSB Protein Data Bank and present preliminary results of bioinformatics analyses of these sequences. Based on the structure adopted upon binding, at least three basic types of MoRFs are found: alpha-MoRFs, beta-MoRFs, and iota-MoRFs, which form alpha-helices, beta-strands, and irregular secondary structure when bound, respectively. Our data suggest that functionally significant residual structure can exist in MoRF regions prior to the actual binding event. The contribution of intrinsic protein disorder to the nature and function of MoRFs has also been addressed. The results of this study will advance the understanding of protein-protein interactions and help towards the future development of useful protein-protein binding site predictors.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Amino Acids, Aromatic / chemistry
  • Binding Sites
  • Chemistry, Physical / methods
  • Computational Biology
  • Computer Simulation
  • Cryoelectron Microscopy
  • Crystallography, X-Ray
  • Databases, Protein
  • Kinetics
  • Molecular Sequence Data
  • Nuclear Magnetic Resonance, Biomolecular
  • Protein Binding
  • Protein Conformation
  • Protein Denaturation
  • Protein Processing, Post-Translational
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Proteins / metabolism
  • Proteins / ultrastructure
  • Software
  • Spectrum Analysis, Raman
  • Structure-Activity Relationship

Substances

  • Amino Acids, Aromatic
  • Proteins