This study reports the essential Caulobacter genome at 8 bp resolution determined by saturated transposon mutagenesis and high-throughput sequencing. This strategy is applicable to full genome essentiality studies in a broad class of bacterial species.
Each Caulobacter cell cycle involves differentiation and an asymmetric cell division driven by a cyclical regulatory circuit comprised of four transcription factors (TFs) and a DNA methyltransferase. Using a modified global 5′ RACE protocol, we globally mapped transcription start sites (TSSs) at base-pair resolution, measured their transcription levels at multiple times in the cell cycle, and identified their transcription factor binding sites. Out of 2726 TSSs, 586 were shown to be cell cycle-regulated and we identified 529 binding sites for the cell cycle master regulators. Twenty-three percent of the cell cycle-regulated promoters were found to be under the combinatorial control of two or more of the global regulators. Previously unknown features of the core cell cycle circuit were identified, including 107 antisense TSSs which exhibit cell cycle-control, and 241 genes with multiple TSSs whose transcription levels often exhibited different cell cycle timing. Cumulatively, this study uncovered novel new layers of transcriptional regulation mediating the bacterial cell cycle.
Summary Cytokinesis in Gram-negative bacteria is mediated by a multiprotein machine (the divisome) that invaginates and remodels the inner membrane, peptidoglycan, and outer membrane. Understanding the order of divisome assembly would inform models of the interactions among its components and their respective functions. We leveraged the ability to isolate synchronous populations of Caulobacter crescentus cells to investigate assembly of the divisome and place the arrival of each component into functional context. Additionally, we investigated the genetic dependency of localization among divisome proteins and the cell cycle regulation of their transcript and protein levels to gain insight into the control mechanisms underlying their assembly. Our results revealed a picture of divisome assembly with unprecedented temporal resolution. Specifically, we observed 1) initial establishment of the division site, 2) recruitment of early FtsZ-binding proteins, 3) arrival of proteins involved in peptidoglycan remodeling, 4) arrival of FtsA, 5) assembly of core divisome components, 6) initiation of envelope invagination, 7) recruitment of polar markers and cytoplasmic compartmentalization, and 8) cell separation. Our analysis revealed differences in divisome assembly among Caulobacter and other bacteria that establish a framework for identifying aspects of bacterial cytokinesis that are widely conserved from those that are more variable.
Through advanced mechanistic modeling and the generation of large high-quality datasets, machine learning is becoming an integral part of understanding and engineering living systems. Here we show that mechanistic and machine learning models can be combined to enable accurate genotype-to-phenotype predictions. We use a genome-scale model to pinpoint engineering targets, efficient library construction of metabolic pathway designs, and high-throughput biosensor-enabled screening for training diverse machine learning algorithms. From a single data-generation cycle, this enables successful forward engineering of complex aromatic amino acid metabolism in yeast, with the best machine learning-guided design recommendations improving tryptophan titer and productivity by up to 74 and 43%, respectively, compared to the best designs used for algorithm training. Thus, this study highlights the power of combining mechanistic and machine learning models to effectively direct metabolic engineering efforts.
SummarySmall non-coding RNAs (sRNAs) are active in many bacterial cell functions, including regulation of the cell's response to environmental challenges. We describe the identification of 27 novel Caulobacter crescentus sRNAs by analysis of RNA expression levels assayed using a tiled Caulobacter microarray and a protocol optimized for detection of sRNAs. The principal analysis method involved identification of sets of adjacent probes with unusually high correlation between the individual intergenic probes within the set, suggesting presence of a sRNA. Among the validated sRNAs, two are candidate transposase gene antisense RNAs. The expression of 10 of the sRNAs is regulated by either entry into stationary phase, carbon starvation, or rich versus minimal media. The expression of four of the novel sRNAs changes as the cell cycle progresses. One of these shares a promoter motif with several genes expressed at the swarmerto-stalked cell transition; while another appears to be controlled by the CtrA global transcriptional regulator. The probe correlation analysis approach reported here is of general use for large-scale sRNA identification for any sequenced microbial genome.
Bacteria adapt to shifts from rapid to slow growth, and have developed strategies for long-term survival during prolonged starvation and stress conditions. We report the regulatory response of C. crescentus to carbon starvation, based on combined high-throughput proteome and transcriptome analyses. Our results identify cell cycle changes in gene expression in response to carbon starvation that involve the prominent role of the FixK FNR/CAP family transcription factor and the CtrA cell cycle regulator. Notably, the SigT ECF sigma factor mediates the carbon starvation-induced degradation of CtrA, while activating a core set of general starvation-stress genes that respond to carbon starvation, osmotic stress, and exposure to heavy metals. Comparison of the response of swarmer cells and stalked cells to carbon starvation revealed four groups of genes that exhibit different expression profiles. Also, cell pole morphogenesis and initiation of chromosome replication normally occurring at the swarmer-to-stalked cell transition are uncoupled in carbon-starved cells.
The collection of multiple genome-scale datasets is now routine, and the frontier of research in systems biology has shifted accordingly. Rather than clustering a single dataset to produce a static map of functional modules, the focus today is on data integration, network alignment, interactive visualization and ontological markup. Because of the intrinsic noisiness of high-throughput measurements, statistical methods have been central to this effort. In this review, we briefly survey available datasets in functional genomics, review methods for data integration and network alignment, and describe recent work on using network models to guide experimental validation. We explain how the integration and validation steps spring from a Bayesian description of network uncertainty, and conclude by describing an important near-term milestone for systems biology: the construction of a set of rich reference networks for key model organisms.
SUMMARYIn combination with advanced mechanistic modeling and the generation of high-quality multi-dimensional data sets, machine learning is becoming an integral part of understanding and engineering living systems. Here we show that mechanistic and machine learning models can complement each other and be used in a combined approach to enable accurate genotype-to-phenotype predictions. We use a genome-scale model to pinpoint engineering targets and produce a large combinatorial library of metabolic pathway designs with different promoters which, once phenotyped, provide the basis for machine learning algorithms to be trained and used for new design recommendations. The approach enables successful forward engineering of aromatic amino acid metabolism in yeast, with the new recommended designs improving tryptophan production by up to 17% compared to the best designs used for algorithm training, and ultimately producing a total increase of 106% in tryptophan accumulation compared to optimized reference designs. Based on a single high-throughput data-generation iteration, this study highlights the power of combining mechanistic and machine learning models to enhance their predictive power and effectively direct metabolic engineering efforts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.