AT specific heterocyclic cations that bind in the DNA duplex minor groove have had major successes as cell and nuclear stains and as therapeutic agents which can effectively enter human cells. Expanding the DNA sequence recognition capability of the minor groove compounds could also expand their therapeutic targets and have an impact in many areas, such as modulation of transcription factor biological activity. Success in the design of mixed sequence binding compounds has been achieved with N-methylbenzimidazole (N-MeBI) thiophenes which are preorganized to fit the shape of the DNA minor groove and H-bond to the –NH of G·C base pairs that projects into the minor groove. Initial compounds bind strongly to a single G·C base pair in an AT context with a specificity ratio of 50 (KD AT-GC/KD AT) or less and this is somewhat low for biological use. We felt that modifications of compound shape could be used to probe local DNA microstructure in target mixed base pair sequences of DNA and potentially improve the compound binding selectivity. Modifications were made by increasing the size of the benzimidazole N-substituent, for example, by using N-isobutyl instead of N-Me, and by changing the molecular twist by introducing substitutions at specific positions on the aromatic core of the compounds. In both cases, we have been able to achieve a dramatic increase in binding specificity, including no detectible binding to pure AT sequences, without a significant loss in affinity to mixed base pair target sequences.
Sequence-specific binding to DNA is crucial for targeting transcription factor-DNA complexes to modulate gene expression. The heterocyclic diamidine, DB2277, specifically recognizes a single G•C base pair in the minor groove of mixed base pair sequences of the type AAAGTTT. NMR spectroscopy reveals the presence of major and minor species of the bound compound. To understand the principles that determine the binding affinity and orientation in mixed sequences of DNA, over thirty DNA hairpin substrates were examined by NMR and thermal melting. The NMR exchange dynamics between major and minor species shows that the exchange is much faster than compound dissociation determined from biosensor–surface plasmon resonance. Extensive modifications of DNA sequences resulted in a unique DNA sequence with binding site AAGATA that binds DB2277 in a single orientation. A molecular docking result agrees with the model representing rapid flipping of DB2277 between major and minor species. Imino spectral analysis of a 15N-labeled central G clearly shows the crucial role of the exocyclic amino group of G in sequence-specific recognition. Our results suggest that this approach can be expanded to additional modules for recognition of more sequence-specific DNA complexes. This approach provides substantial information about the sequence-specific, highly efficient, dynamic nature of minor groove binding agents.
The high-resolution NMR structure of the first heterocyclic, non-amide, organic cation that strongly and selectively recognizes mixed AT/GC bp (bp=base pair) sequences of DNA in a 1:1 complex is described. Compound designs of this type provide essential methods for control of functional, non-genomic DNA sequences and have broad cell uptake capability, based on studies from animals to humans. The high-resolution structural studies described in this report are essential for understanding the molecular basis for the sequence-specific binding as well as for new ideas for additional compound designs for sequence-specific recognition. The molecular features, in this report, explain the mechanism of recognition of both A⋅T and G⋅C bps and are an interesting molecular recognition story. Examination of the experimental structure and the NMR restrained molecular dynamics model suggests that recognition of the G⋅C base pair involves two specific H-bonds. The structure illustrates a wealth of information on different DNA interactions and illustrates an interfacial water molecule that is a key component of the complex.
The design and synthesis of compounds that target mixed, AT/GC, DNA sequences is described. The design concept connects two N-methyl-benzimidazole-thiophene single GC recognition units with a flexible linker that lets the compound fit the shape and twist of the DNA minor groove while covering a full turn of the double helix.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.