Organisms are defined by the information encoded in their genomes, and since the evolution of life, this information has been encoded using a two base pair genetic alphabet (A-T and G-C). In vitro, the alphabet has been expanded to include several unnatural base pairs (UBPs)1–3. We have developed a class of UBPs formed between nucleotides bearing hydrophobic nucleobases, exemplified by the pair formed between d5SICS and dNaM (d5SICS-dNaM, Fig. 1a), which is efficiently PCR amplified1 and transcribed4,5 in vitro, and whose unique mechanism of replication has been characterized6,7. However, expansion of a organism’s genetic alphabet presents new and unprecedented challenges: the unnatural nucleoside triphosphates must enter the cell; endogenous polymerases must be able to faithfully incorporate the unnatural triphosphates into DNA within the complex cellular milieu; and finally, the UBP must be stable in the presence of pathways that maintain the integrity of DNA. Here we show that an exogenously expressed algal nucleotide triphosphate transporter efficiently imports the triphosphates of both d5SICS and dNaM (d5SICSTP and dNaMTP) into E. coli, and that the endogenous replication machinery uses them to accurately replicate a plasmid containing d5SICS-dNaM. Neither the presence of the unnatural triphosphates nor the replication of the UBP introduces a significant growth burden. Lastly, we find that the UBP is not efficiently excised by DNA repair pathways. Thus, the resulting bacterium is the first organism to stably propagate an expanded genetic alphabet.
We synthesized a panel of unnatural base pairs whose pairing depends on hydrophobic and packing forces and identify dTPT3-dNaM, which is PCR amplified with a natural base pair-like efficiency and fidelity. In addition, the dTPT3 scaffold is uniquely tolerant of attaching a propargyl amine linker, resulting in the dTPT3PA-dNaM pair, which is amplified only slightly less well. The identification of dTPT3 represents significant progress towards developing an unnatural base pair for the in vivo expansion of an organism's genetic alphabet and for a variety of in vitro biotechnology applications where it is used to site-specifically label amplified DNA, and it also demonstrates for the first time that hydrophobic and packing forces are sufficient to mediate natural-like replication.
The natural four-letter genetic alphabet, comprised of just two base pairs (dA-dT and dG-dC), is conserved throughout all life, and its expansion by the development of a third, unnatural base pair has emerged as a central goal of chemical and synthetic biology. We recently developed a class of candidate unnatural base pairs, exemplified by the pair formed between d5SICS and dNaM. Here, we examine the PCR amplification of DNA containing one or more d5SICS-dNaM pairs in a wide variety of sequence contexts. Under standard conditions, we show that this DNA may be amplified with high efficiency and greater than 99.9% fidelity. To more rigorously explore potential sequence effects, we used deep sequencing to characterize a library of templates containing the unnatural base pair as a function of amplification. We found that the unnatural base pair is efficiently replicated with high fidelity in virtually all sequence contexts. The results show that, for PCR and PCR-based applications, d5SICS-dNaM is functionally equivalent to a natural base pair, and when combined with dA-dT and dG-dC, it provides a fully functional six-letter genetic alphabet.expanded genetic alphabet | hydrophobic | artificial DNA | unnatural nucleotides | bioinformatics E xpansion of the genetic alphabet to include an unnatural base pair has emerged as a central goal of chemical and synthetic biology. Success would represent a remarkable integration of orthogonal synthetic components into a fundamental biological system and build the foundation for a semisynthetic organism with increased potential for information storage and retrieval (1). Moreover, the constituent unnatural nucleotides could be used to site-specifically label DNA or RNA with different functionalities of interest (2-4) and potentially revolutionize the already ubiquitous in vitro applications of nucleic acids, such as aptamer and DNA/RNAzyme selections (5, 6), PCR-based diagnostics (7, 8), and DNA-based nanomaterials and devices (9).Although many candidate unnatural base pairs have been reported (10-21), only a few are actually replicable by DNA polymerases (10,11,13,16). Moreover, it is clear that most applications will require that the unnatural base pair not only be replicated with high efficiency and fidelity but also, that replication be at least approximately independent of sequence context. Sequence dependencies would cause biased amplification and effectively preclude many uses of the unnatural base pair. No candidate unnatural base pair has been shown to be replicated without sequence bias, and thus, none can yet claim functional equivalence to a natural base pair.In general, the most promising unnatural base pair candidates currently available have been developed by pursuing one of two different strategies. The first strategy, pioneered in the work by Benner and coworkers (22), relies on the use of nucleotide analogs bearing nucleobases that pair through complementary hydrogen bonding (H-bonding) patterns that are orthogonal to those patterns of the natural pairs. Early ef...
We have developed a family of unnatural base pairs (UBPs), which rely on hydrophobic and packing interactions for pairing and which are well replicated and transcribed. While the pair formed between d5SICS and dNaM (d5SICS-dNaM) has received the most attention, and has been used to expand the genetic alphabet of a living organism, recent efforts have identified dTPT3-dNaM, which is replicated with even higher fidelity. These efforts also resulted in more UBPs than could be independently analyzed, and thus we now report a PCR-based screen to identify the most promising. While we found that dTPT3-dNaM is generally the most promising UBP, we identified several others that are replicated nearly as well and significantly better than d5SICS-dNaM, and are thus viable candidates for the expansion of the genetic alphabet of a living organism. Moreover, the results suggest that continued optimization should be possible, and that the putatively essential hydrogen-bond acceptor at the position ortho to the glycosidic linkage may not be required. These results clearly demonstrate the generality of hydrophobic forces for the control of base pairing within DNA, provide a wealth of new structure–activity relationship data and importantly identify multiple new candidates for in vivo evaluation and further optimization.
As part of an ongoing effort to expand the genetic alphabet for in vitro and eventually in vivo applications, we have synthesized a wide variety of predominantly hydrophobic unnatural base pairs exemplified by d5SICS-dMMO2 and d5SICS-dNaM. When incorporated into DNA, the latter is replicated and transcribed with greater efficiency and fidelity than the former, however previous optimization efforts identified the para and methoxy-distal meta positions of dMMO2 as particularly promising for further optimization. Here, we report the stepwise optimization of dMMO2 via the synthesis and evaluation of eighteen novel para-derivatized analogs of dMMO2, followed by further derivatization and evaluation of the most promising analogs with meta substituents. Subject to size constraints, we find that para substituents can optimize replication via both steric and electronic effects and that meta methoxy groups are unfavorable while fluoro substituents can be beneficial or deleterious depending on the para substituent. In addition, we find that improvements in the efficiency of unnatural triphosphate insertion translate most directly into higher fidelity replication. Importantly, we identify multiple, unique base pair derivatives that when incorporated into DNA are well replicated. The most promising, d5SICS-dFEMO, is replicated under some conditions with greater efficiency and fidelity than d5SICS-dNaM. These results clearly demonstrate the generality of hydrophobic forces for the control of base pairing within DNA, provide a wealth of new SAR data, and importantly identify multiple new candidates for eventual in vivo evaluation.
We have developed a class of replicable unnatural DNA base pairs formed between d5SICS and either dMMO2, dDMO, or dNaM. To explore the use of these pairs to produce site-specifically labeled DNA, we report the synthesis of a variety of derivatives bearing propynyl groups, an analysis of their polymerase-mediated replication, and subsequent site-specific modification of the amplified DNA via Click chemistry. We find that with the d5SICS scaffold, a propynyl ether linker is accommodated better than its aliphatic analog, but not as well as the protected propargyl amine linker explored previously. We also find that with the dMMO2 and dDMO analogs, the dMMO2 position para to the glycosidic linkage is best suited for linker attachment, and that while aliphatic and ether-based linkers are similarly accommodated, the direct attachment of an ethynyl group to the nucleobase core is most well tolerated. To demonstrate the utility of these analogs, a variety of them are used to site-selectively attach a biotin tag to the amplified DNA. Finally, we use d5SICSCO-dNaM to couple one or two proteins to amplified DNA, with the double labeled product visualized by atomic force microscopy. The ability to encode the spatial relationships of arrayed molecules in PCR amplifiable DNA should have important applications, ranging from SELEX with functionalities not naturally present in DNA to the production, and perhaps “evolution” of nanomaterials.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.