As part of an ongoing effort to expand the genetic alphabet for in vitro and eventual in vivo applications, we have synthesized a wide variety of predominantly hydrophobic unnatural base pairs and evaluated their replication in DNA. Collectively, the results have led us to propose that these base pairs, which lack stabilizing edge-on interactions, are replicated via a unique intercalative mechanism. Here, we report the synthesis and characterization of three novel derivatives of the nucleotide analog dMMO2, which forms an unnatural base pair with the nucleotide analog d5SICS. Replacing the para-methyl substituent of dMMO2 with a furanyl substituent (yielding dFMO) has a dramatically negative effect on replication, while replacing it with a methoxy (dDMO) or with a thiomethyl group (dTMO), improves replication in both steady-state assays and during PCR amplification. Thus, dTMO-d5SICS, and especially dDMO-d5SICS, represent significant progress toward the expansion of the genetic alphabet. To elucidate the structure-activity relationships governing unnatural base pair replication, we determined the solution structure of duplex DNA containing the parental dMMO2-d5SICS pair, and also used this structure to generate models of the derivative base pairs. The results strongly support the intercalative mechanism of replication, reveal a surprisingly high level of specificity that may be achieved by optimizing packing interactions, and should prove invaluable for the further optimization of the unnatural base pair.
Cdc13, the telomere end-binding protein from Saccharomyces cerevisiae, is a multidomain protein that specifically binds telomeric single-stranded DNA (ssDNA) with exquisitely high affinity to coordinate telomere maintenance. Recent structural and genetic data have led to the proposal that Cdc13 is the paralog of RPA70 within a telomere-specific RPA complex. Our understanding of Cdc13 structure and biochemistry has been largely restricted to studies of individual domains, precluding analysis of how each domain influences the activity of the others. To better facilitate a comparison to RPA70, we evaluated the ssDNA binding of full-length S. cerevisiae Cdc13 to its minimal substrate, Tel11. We found that, unlike RPA70 and the other known telomere end-binding proteins, the core Cdc13 ssDNA-binding activity is wholly contained within a single tight-binding oligosaccharide/oligonucleotide/oligopeptide binding (OB)-fold. Because two OB-folds are implicated in dimerization, we also evaluated the relationship between dimerization and ssDNA-binding activity and found that the two activities are independent. We also find that Cdc13 binding exhibits positive cooperativity that is independent of dimerization. This study reveals that, while Cdc13 and RPA70 share similar domain topologies, the corresponding domains have evolved different and specialized functions.
The incorporation of synthetic nucleoside analogues into DNA duplexes provides a unique opportunity to probe both structure and function of nucleic acids. We used 1H and 19F NMR and molecular dynamics calculations to determine the solution structures of two similar DNA decamer duplexes, one containing a central G-T mismatched or "wobble" base pair, and one in which the thymine in this base pair is replaced by difluorotoluene (a thymine isostere) creating a G-F pair. Here, we show that the non-hydrogen-bonding G-F pair stacks relatively well into the helix and that the distortions caused by each non-Watson-Crick G-T or G-F base pair are quite localized to a three base pair site around the mismatch. A detailed structural analysis reveals that the absence of hydrogen bonding introduces more dynamic motion into the G-F pair relative to G-T and permits the G-F pair to exhibit stacking and conformational features characteristic of both a Watson-Crick base pair (on the guanine containing strand) and a wobble base pair (on the strand containing the difluorotoluene). We used these results to posit a rationale for recognition and repair of mismatch sites in DNA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.