Deep sequencing of large library selections allows computational discovery of diverse sets of zinc fingers that bind common targets

Nucleic Acids Res. 2014 Feb;42(3):1497-508. doi: 10.1093/nar/gkt1034. Epub 2013 Nov 7.

Abstract

The Cys2His2 zinc finger (ZF) is the most frequently found sequence-specific DNA-binding domain in eukaryotic proteins. The ZF's modular protein-DNA interface has also served as a platform for genome engineering applications. Despite decades of intense study, a predictive understanding of the DNA-binding specificities of either natural or engineered ZF domains remains elusive. To help fill this gap, we developed an integrated experimental-computational approach to enrich and recover distinct groups of ZFs that bind common targets. To showcase the power of our approach, we built several large ZF libraries and demonstrated their excellent diversity. As proof of principle, we used one of these ZF libraries to select and recover thousands of ZFs that bind several 3-nt targets of interest. We were then able to computationally cluster these recovered ZFs to reveal several distinct classes of proteins, all recovered from a single selection, to bind the same target. Finally, for each target studied, we confirmed that one or more representative ZFs yield the desired specificity. In sum, the described approach enables comprehensive large-scale selection and characterization of ZF specificities and should be a great aid in furthering our understanding of the ZF domain.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Binding Sites
  • Computational Biology / methods
  • DNA-Binding Proteins / chemistry*
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Gene Library
  • High-Throughput Nucleotide Sequencing
  • Mutagenesis
  • Polymerase Chain Reaction
  • Transcription Factors / chemistry*
  • Transcription Factors / genetics
  • Transcription Factors / metabolism
  • Zinc Fingers*

Substances

  • DNA-Binding Proteins
  • Transcription Factors