Recoding--the repurposing of genetic codons--is a powerful strategy for enhancing genomes with functions not commonly found in nature. Here, we report computational design, synthesis, and progress toward assembly of a 3.97-megabase, 57-codon Escherichia coli genome in which all 62,214 instances of seven codons were replaced with synonymous alternatives across all protein-coding genes. We have validated 63% of recoded genes by individually testing 55 segments of 50 kilobases each. We observed that 91% of tested essential genes retained functionality with limited fitness effect. We demonstrate identification and correction of lethal design exceptions, only 13 of which were found in 2229 genes. This work underscores the feasibility of rewriting genomes and establishes a framework for large-scale design, assembly, troubleshooting, and phenotypic analysis of synthetic organisms.
We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon architecture: the promoter, terminator, and deep RNA sequence read coverage. We precisely annotated 2,122 promoters and 1,774 terminators, defining 1,510 operons with an average of 1.98 genes per operon. Our analyses revealed an unprecedented view of E. coli operon architecture. A large proportion (36%) of operons are complex with internal promoters or terminators that generate multiple transcription units. For 43% of operons, we observed differential expression of polycistronic genes, despite being in the same operons, indicating that E. coli operon architecture allows fine-tuning of gene expression. We found that 276 of 370 convergent operons terminate inefficiently, generating complementary 3′ transcript ends which overlap on average by 286 nucleotides, and 136 of 388 divergent operons have promoters arranged such that their 5′ ends overlap on average by 168 nucleotides. We found 89 antisense transcripts of 397-nucleotide average length, 7 unannotated transcripts within intergenic regions, and 18 sense transcripts that completely overlap operons on the opposite strand. Of 519 overlapping transcripts, 75% correspond to sequences that are highly conserved in E. coli (>50 genomes). Our data extend recent studies showing unexpected transcriptome complexity in several bacteria and suggest that antisense RNA regulation is widespread.
A systematic search was performed for DNA-binding sequences of YgiP, an uncharacterized transcription factor of Escherichia coli, by using the Genomic SELEX. A total of 688 YgiP-binding loci were identified after genome-wide profiling of SELEX fragments with a high-density microarray (SELEX-chip). Gel shift and DNase-I footprinting assays indicated that YgiP binds to multiple sites along DNA probes with a consensus GTTNATT sequence. Atomic force microscope observation indicated that at low concentrations, YgiP associates at various sites on DNA probes, but at high concentrations, YgiP covers the entire DNA surface supposedly through protein–protein contact. The intracellular concentration of YgiP is very low in growing E. coli cells under aerobic conditions, but increases more than 100-fold to the level as high as the major nucleoid proteins under anaerobic conditions. An E. coli mutant lacking ygiP showed retarded growth under anaerobic conditions. High abundance and large number of binding sites together indicate that YgiP is a nucleoid-associated protein with both architectural and regulatory roles as the nucleoid proteins Fis and IHF. We then propose that YgiP is a novel nucleoid protein of E. coli under anaerobiosis and propose to rename it Dan (DNA-binding protein under anaerobic conditions).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.