Designing repeat proteins: well-expressed, soluble and stable proteins from combinatorial libraries of consensus ankyrin repeat proteins

J Mol Biol. 2003 Sep 12;332(2):489-503. doi: 10.1016/s0022-2836(03)00896-9.


We describe an efficient way to generate combinatorial libraries of stable, soluble and well-expressed ankyrin repeat (AR) proteins. Using a combination of sequence and structure consensus analyses, we designed a 33 amino acid residue AR module with seven randomized positions having a theoretical diversity of 7.2 × 10^7. Different numbers of this module were cloned between N- and C-terminal capping repeats, i.e. ARs designed to shield the hydrophobic core of stacked AR modules. In this manner, combinatorial libraries of designed AR proteins consisting of four to six repeats were generated, thereby potentiating the theoretical diversity. All randomly chosen library members were expressed in soluble form in the cytoplasm of Escherichia coli in amounts up to 200 mg per liter of shake-flask culture. Virtually pure proteins were obtained in a single purification step. The designed AR proteins are monomeric and display CD spectra identical to those of natural AR proteins. At the same time, our AR proteins are highly thermostable, with Tm values ranging from 66 °C to well above 85 °C. Thus, our combinatorial library members possess the properties required for biotechnological applications. Moreover, the favorable biophysical properties and the modularity of the AR fold may partly account for the abundance of natural AR proteins.
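
The way stacking modules potentiates the theoretical diversity can be illustrated with a short calculation. This is a minimal sketch, not the authors' method. It assumes that the two capping repeats are fixed (not randomized), that the "four to six repeats" count includes both caps, and that each internal module contributes the stated 7.2 × 10^7 variants independently. The names `MODULE_DIVERSITY` and `library_diversity` are hypothetical and chosen only for illustration.

```python
# Per-module theoretical diversity, as stated in the abstract.
MODULE_DIVERSITY = 7.2e7


def library_diversity(total_repeats: int, capping_repeats: int = 2) -> float:
    """Theoretical diversity of a library with `total_repeats` repeats.

    Assumption (not stated in the abstract): the capping repeats carry no
    randomized positions, so only the internal modules contribute, and
    each contributes independently.
    """
    randomized_modules = total_repeats - capping_repeats
    if randomized_modules < 1:
        raise ValueError("need at least one randomized internal module")
    # Independent modules multiply, so diversity grows exponentially
    # with the number of internal modules.
    return MODULE_DIVERSITY ** randomized_modules


if __name__ == "__main__":
    for total in (4, 5, 6):
        print(f"{total} repeats: {library_diversity(total):.1e} variants")
```

Under these assumptions each additional internal module multiplies the theoretical diversity by a further 7.2 × 10^7, so even the smallest library is far larger than could be sampled exhaustively in practice.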

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Ankyrin Repeat*
  • Base Sequence
  • Circular Dichroism
  • Consensus Sequence
  • Databases, Protein*
  • Gene Library
  • Models, Molecular
  • Molecular Sequence Data
  • Protein Engineering*
  • Protein Structure, Tertiary
  • Repetitive Sequences, Amino Acid*
  • Temperature

Associated data

  • GENBANK/AY195851
  • GENBANK/AY195852
  • GENBANK/AY195853
  • GENBANK/AY195854
  • GENBANK/AY195855
  • GENBANK/AY195856
  • GENBANK/AY327140