Large microbial gene clusters encode useful functions, including energy utilization and natural product biosynthesis, but genetic manipulation of such systems is slow, difficult and complicated by complex regulation. We exploit the modularity of a refactored Klebsiella oxytoca nitrogen fixation (nif) gene cluster (16 genes, 103 parts) to build genetic permutations that could not be achieved by starting from the wild-type cluster. Constraint-based combinatorial design and DNA assembly are used to build libraries of radically different cluster architectures by varying part choice, gene order, gene orientation and operon occupancy. We construct 84 variants of the nifUSVWZM operon, 145 variants of the nifHDKY operon, 155 variants of the nifHDKYENJ operon and 122 variants of the complete 16-gene pathway. The performance and behavior of these variants are characterized by nitrogenase assay and strand-specific RNA sequencing (RNA-seq), and the results are incorporated into subsequent design cycles. We have produced a fully synthetic cluster that recovers 57% of wild-type activity. Our approach allows the performance of genetic parts to be quantified simultaneously in hundreds of genetic contexts. This parallelized design-build-test-learn cycle, which can access previously unattainable regions of genetic space, should provide a useful, fast tool for genetic optimization and hypothesis testing.
A fundamental goal in biotechnology and biology is the development of approaches to better understand the genetic basis of traits. Here we report a versatile method, trackable multiplex recombineering (TRMR), whereby thousands of specific genetic modifications are created and evaluated simultaneously. To demonstrate TRMR, in a single day we modified the expression of >95% of the genes in Escherichia coli by inserting synthetic DNA cassettes and molecular barcodes upstream of each gene. Barcode sequences and microarrays were then used to quantify population dynamics. Within a week we mapped thousands of genes that affect E. coli growth in various media (rich, minimal and cellulosic hydrolysate) and in the presence of several growth inhibitors (beta-glucoside, D-fucose, valine and methylglyoxal). This approach can be applied to a broad range of traits to identify targets for future genome-engineering endeavors.
An inverse metabolic engineering strategy was used to select for Escherichia coli cells with an increased capability to N-glycosylate a specific target protein. We developed a screen for E. coli cells containing extra-chromosomal DNA fragments for improved ability to add precise sugar groups onto the AcrA protein using the glycosylation system from Campylobacter jejuni. Four different sized (1, 2, 4, and 8 kb) genomic DNA libraries were screened, and the sequences that conferred a yield advantage were determined. These advantageous genomic fragments were mapped onto the E. coli W3110 chromosome. Five candidate genes (identified across two or more libraries) were subsequently selected for forward engineering verification in E. coli CLM24 cells, utilizing a combination of internal standards for absolute quantitation and pseudo-selective reaction monitoring (pSRM) and Western blotting validation. An increase in glycosylated protein was quantified in cells overexpressing 4-α-glucantransferase and a phosphoenolpyruvate-dependent sugar phosphotransferase system, amounting to a 3.8-fold (engineered cells total = 5.3 mg L(-1) ) and 6.7-fold (engineered cells total = 9.4 mg L(-1) ) improvement compared to control cells, respectively. Furthermore, increased glycosylation efficiency was observed in cells overexpressing enzymes involved with glycosylation precursor synthesis, enzymes 1-deoxyxylulose-5-phosphate synthase (1.3-fold) and UDP-N-acetylglucosamine pyrophosphorylase (1.6-fold). To evaluate the wider implications of the engineering, we tested a modified Fc fragment of an IgG antibody as the target glycoprotein with two of our engineered cells, and achieved a ca. 75% improved glycosylation efficiency.
Genetic designs can consist of dozens of genes and hundreds of genetic parts. After evaluating a design, it is desirable to implement changes without the cost and burden of starting the construction process from scratch. Here, we report a two-step process where a large design space is divided into deep pools of composite parts, from which individuals are retrieved and assembled to build a final construct. The pools are built via multiplexed assembly and sequenced using next-generation sequencing. Each pool consists of ∼20 Mb of up to 5000 unique and sequence-verified composite parts that are barcoded for retrieval by PCR. This approach is applied to a 16-gene nitrogen fixation pathway, which is broken into pools containing a total of 55 848 composite parts (71.0 Mb). The pools encompass an enormous design space (1043 possible 23 kb constructs), from which an algorithm-guided 192-member 4.5 Mb library is built. Next, all 1030 possible genetic circuits based on 10 repressors (NOR/NOT gates) are encoded in pools where each repressor is fused to all permutations of input promoters. These demonstrate that multiplexing can be applied to encompass entire design spaces from which individuals can be accessed and evaluated.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.