Discovery of substrates for a SET domain lysine methyltransferase predicted by multistate computational protein design

Structure. 2015 Jan 6;23(1):206-215. doi: 10.1016/j.str.2014.11.004. Epub 2014 Dec 18.


Characterization of lysine methylation has proven challenging despite its importance in biological processes such as gene transcription, protein turnover, and cytoskeletal organization. In contrast to other key posttranslational modifications, current proteomics techniques have thus far shown limited success at characterizing methyl-lysine residues across the cellular landscape. To complement current biochemical characterization methods, we developed a multistate computational protein design procedure to probe the substrate specificity of the protein lysine methyltransferase SMYD2. Modeling of substrate-bound SMYD2 identified residues important for substrate recognition and predicted amino acids necessary for methylation. Peptide- and protein- based substrate libraries confirmed that SMYD2 activity is dictated by the motif [LFM]-1-K(∗)-[AFYMSHRK]+1-[LYK]+2 around the target lysine K(∗). Comprehensive motif-based searches and mutational analysis further established four additional substrates of SMYD2. Our methodology paves the way to systematically predict and validate posttranslational modification sites while simultaneously pairing them with their associated enzymes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Computational Biology / methods*
  • HEK293 Cells
  • Histone-Lysine N-Methyltransferase / chemistry*
  • Histone-Lysine N-Methyltransferase / genetics
  • Histone-Lysine N-Methyltransferase / metabolism*
  • Humans
  • Models, Molecular
  • Mutagenesis, Site-Directed
  • Protein Engineering / methods
  • Protein Interaction Domains and Motifs / genetics
  • Protein Interaction Maps*
  • Protein Structure, Tertiary / genetics
  • Substrate Specificity


  • Histone-Lysine N-Methyltransferase
  • SMYD2 protein, human