Systematic enhancement of protein crystallization efficiency by bulk lysine-to-arginine (KR) substitution

Protein Sci. 2024 Mar;33(3):e4898. doi: 10.1002/pro.4898.

Abstract

Structural genomics consortia established that protein crystallization is the primary obstacle to structure determination using x-ray crystallography. We previously demonstrated that crystallization propensity is systematically related to primary sequence, and we subsequently performed computational analyses showing that arginine is the most overrepresented amino acid in crystal-packing interfaces in the Protein Data Bank. Given the similar physicochemical characteristics of arginine and lysine, we hypothesized that multiple lysine-to-arginine (KR) substitutions should improve crystallization. To test this hypothesis, we developed software that ranks lysine sites in a target protein based on the redundancy-corrected KR substitution frequency in homologs. This software can be run interactively on the worldwide web at https://www.pxengineering.org/. We demonstrate that three unrelated single-domain proteins can tolerate 5-11 KR substitutions with at most minor destabilization, and, for two of these three proteins, the construct with the largest number of KR substitutions exhibits significantly enhanced crystallization propensity. This approach rapidly produced a 1.9 Å crystal structure of a human protein domain refractory to crystallization with its native sequence. Structures from Bulk KR-substituted domains show the engineered arginine residues frequently make hydrogen-bonds across crystal-packing interfaces. We thus demonstrate that Bulk KR substitution represents a rational and efficient method for probabilistic engineering of protein surface properties to improve crystallization.

Keywords: circular dichroism spectroscopy; homology analysis; protein crystallization; protein engineering; protein solubility; protein thermodynamics; x-ray crystallography.

MeSH terms

  • Amino Acids / chemistry
  • Arginine / metabolism
  • Crystallization
  • Crystallography, X-Ray
  • Humans
  • Lysine* / chemistry
  • Proteins* / genetics

Substances

  • Lysine
  • Proteins
  • Amino Acids
  • Arginine

Grants and funding