We review the preparation of new compounds with good solution and cell uptake properties that can selectively recognize mixed A·T and G·C bp sequences of DNA. Our underlying aim is to show that these new compounds provide important new biotechnology reagents as well as a new class of therapeutic candidates with better properties and development potential than other currently available agents. In this review, entirely different ways to recognize mixed sequences of DNA by modifying AT selective heterocyclic cations are described. To selectively recognize a G·C base pair an H-bond acceptor must be incorporated with AT recognizing groups as with netropsin. We have used pyridine, azabenzimidazole and thiophene-N-methylbenzimidazole GC recognition units in modules crafted with both rational design and empirical optimization. These modules can selectively and strongly recognize a single G·C base pair in an AT sequence context. In some cases, a relatively simple change in substituents can convert a heterocyclic module from AT to GC recognition selectivity. Synthesis and DNA interaction results for initial example lead modules are described for single G·C base pair recognition compounds. The review concludes with a description of the initial efforts to prepare larger compounds to recognize sequences of DNA with more than one G·C base pairs. The challenges and initial successes are described along with future directions.
Keywords: DNA G·C base pair recognition; aza-benzimidazole; benzimidazole; biosensor; heterocyclic diamidine; mixed base pair DNA sequences; sequence selectivity; thiophene-N-methylbenzimidazole; σ-hole.