Structure-function analysis of the THAP zinc finger of THAP1, a large C2CH DNA-binding module linked to Rb/E2F pathways

J Biol Chem. 2008 Feb 15;283(7):4352-63. doi: 10.1074/jbc.M707537200. Epub 2007 Dec 11.


THAP1, the founding member of a previously uncharacterized large family of cellular proteins (THAP proteins), is a sequence-specific DNA-binding factor that has recently been shown to regulate cell proliferation through modulation of pRb/E2F cell cycle target genes. THAP1 shares its DNA-binding THAP zinc finger domain with Drosophila P element transposase, zebrafish E2F6, and several nematode proteins interacting genetically with the retinoblastoma protein pRb. In this study, we report the three-dimensional structure and structure-function relationships of the THAP zinc finger of human THAP1. Deletion mutagenesis and multidimensional NMR spectroscopy revealed that the THAP domain of THAP1 is an atypical zinc finger of approximately 80 residues, distinguished by the presence, between the C2CH zinc-coordinating residues, of a short antiparallel beta-sheet interrupted by a long loop-helix-loop insertion. Alanine scanning mutagenesis of this loop-helix-loop motif identified a number of residues critical for DNA recognition. NMR chemical shift perturbation analysis was used to further characterize the residues involved in DNA binding. The combination of the mutagenesis and NMR data allowed the mapping of the DNA-binding interface of the THAP zinc finger to a highly positively charged area harboring multiple lysine and arginine residues. Together, these data represent the first structure-function analysis of a functional THAP domain with demonstrated sequence-specific DNA-binding activity. They also provide a structural framework for understanding DNA recognition by this atypical zinc finger, which defines a novel family of cellular factors linked to cell proliferation and pRb/E2F cell cycle pathways in humans, fish, and nematodes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Apoptosis Regulatory Proteins / chemistry
  • Apoptosis Regulatory Proteins / metabolism
  • Apoptosis Regulatory Proteins / physiology*
  • Base Sequence
  • DNA Probes
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / metabolism
  • DNA-Binding Proteins / physiology*
  • E2F Transcription Factors / metabolism*
  • Humans
  • Models, Molecular
  • Molecular Sequence Data
  • Nuclear Magnetic Resonance, Biomolecular
  • Nuclear Proteins / chemistry
  • Nuclear Proteins / metabolism
  • Nuclear Proteins / physiology*
  • Retinoblastoma Protein / metabolism*
  • Sequence Homology, Amino Acid
  • Structure-Activity Relationship
  • Zinc Fingers*

Substances

  • Apoptosis Regulatory Proteins
  • DNA Probes
  • DNA-Binding Proteins
  • E2F Transcription Factors
  • Nuclear Proteins
  • Retinoblastoma Protein
  • THAP1 protein, human

Associated data

  • PDB/2JTG