Since 2003, MicrobesOnline (http://www.microbesonline.org) has been providing a community resource for comparative and functional genome analysis. The portal includes over 1000 complete genomes of bacteria, archaea and fungi and thousands of expression microarrays from diverse organisms ranging from model organisms such as Escherichia coli and Saccharomyces cerevisiae to environmental microbes such as Desulfovibrio vulgaris and Shewanella oneidensis. To assist in annotating genes and in reconstructing their evolutionary history, MicrobesOnline includes a comparative genome browser based on phylogenetic trees for every gene family as well as a species tree. To identify co-regulated genes, MicrobesOnline can search for genes based on their expression profile, and provides tools for identifying regulatory motifs and seeing if they are conserved. MicrobesOnline also includes fast phylogenetic profile searches, comparative views of metabolic pathways, operon predictions, a workbench for sequence analysis and integration with RegTransBase and other microbial genome resources. The next update of MicrobesOnline will contain significant new functionality, including comparative analysis of metagenomic sequence data. Programmatic access to the database, along with source code and documentation, is available at http://microbesonline.org/programmers.html.
BackgroundGenome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches).DescriptionRegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include near 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualization of regulatory networks and metabolic pathways covered by the reference regulons are available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by an conservative propagation of the reference regulons to closely related genomes.ConclusionsRegPrecise 3.0 gives access to the transcriptional regulons reconstructed in bacterial genomes. Analytical capabilities include exploration of: regulon content, structure and function; TF binding site motifs; conservation and variations in genome-wide regulatory networks across all taxonomic groups of Bacteria. RegPrecise 3.0 was selected as a core resource on transcriptional regulation of the Department of Energy Systems Biology Knowledgebase, an emerging software and data environment designed to enable researchers to collaboratively generate, test and share new hypotheses about gene and protein functions, perform large-scale analyses, and model interactions in microbes, plants, and their communities.
The evolutionary rates of protein-coding genes in an organism span, approximately, 3 orders of magnitude and show a universal, approximately log-normal distribution in a broad variety of species from prokaryotes to mammals. This universal distribution implies a steady-state process, with identical distributions of evolutionary rates among genes that are gained and genes that are lost. A mathematical model of such process is developed under the single assumption of the constancy of the distributions of the propensities for gene loss (PGL). This model predicts that genes of different ages, that is, genes with homologs detectable at different phylogenetic depths, substantially differ in those variables that correlate with PGL. We computationally partition protein-coding genes from humans, flies, and Aspergillus fungus into age classes, and show that genes of different ages retain the universal log-normal distribution of evolutionary rates, with a shift toward higher rates in ''younger'' classes but also with a substantial overlap. The only exception involves human primate-specific genes that show a heavy tail of rapidly evolving genes, probably owing to gene annotation artifacts. As predicted, the gene age classes differ in characteristics correlated with PGL. Compared with ''young'' genes (e.g., mammal-specific human ones), ''old'' genes (e.g., eukaryotespecific), on average, are longer, are expressed at a higher level, possess a higher intron density, evolve slower on the short time scale, and are subject to stronger purifying selection. Thus, genome evolution fits a simple model with approximately uniform rates of gene gain and loss, without major bursts of genomic innovation.gene age ͉ gene expression ͉ genome evolution ͉ intron density
Redox-sensing repressor Rex was previously implicated in the control of anaerobic respiration in response to the cellular NADH/NAD ؉ levels in Gram-positive bacteria. We utilized the comparative genomics approach to infer candidate Rex-binding DNA motifs and assess the Rex regulon content in 119 genomes from 11 taxonomic groups. Both DNA-binding and NADsensing domains are broadly conserved in Rex orthologs identified in the phyla Firmicutes, Thermotogales, Actinobacteria, Chloroflexi, Deinococcus-Thermus, and Proteobacteria. The identified DNA-binding motifs showed significant conservation in these species, with the only exception detected in Clostridia, where the Rex motif deviates in two positions from the generalized consensus, TTGTGAANNNNTTCACAA. Comparative analysis of candidate Rex sites revealed remarkable variations in functional repertoires of candidate Rex-regulated genes in various microorganisms. Most of the reconstructed regulatory interactions are lineage specific, suggesting frequent events of gain and loss of regulator binding sites in the evolution of Rex regulons. We identified more than 50 novel Rex-regulated operons encoding functions that are essential for resumption of the NADH:NAD ؉ balance. The novel functional role of Rex in the control of the central carbon metabolism and hydrogen production genes was validated by in vitro DNA binding assays using the TM0169 protein in the hydrogen-producing bacterium Thermotoga maritima. Bacteria can adapt to changes in their environment by utilizing a range of transcription factors that receive an appropriate intra-or extracellular signal and trigger the specific transcriptional response. The Rex regulator is a transcription factor that responds to intracellular redox potential and negatively controls expression of genes involved in energy metabolism and fermentative growth in Gram-positive bacteria, including the actinobacterium Streptomyces coelicolor (4) and various Firmicutes, such as Bacillus subtilis (35), Staphylococcus aureus (25), and Streptococcus mutans (3). NAD plays an important role in different biological processes, including redox cellular balance. During the catabolism of carbohydrates, NAD ϩ is reduced to NADH by glycolytic enzymes. The NADH formed is reoxidized back either by respiratory electron transport chains or by NADH-linked fermentative enzymes. The DNA-binding activity of Rex is modulated by the intracellular ratio of NADH:NAD ϩ . Under the low NADH:NAD ϩ ratio, the Rex protein binds to the target sites and represses transcription of genes involved in NADH reoxidation, while the increase of NADH concentration results in the dissociation of Rex from DNA and derepression of its target genes (4, 25, 35).Rex was initially described in S. coelicolor, where it controls the cytochrome bd terminal oxidase operon cydABCD and the heme biosynthesis genes hemACD, as well as the membrane-bound proton-translocating NADH dehydrogenase operon nuoA-nuoN (4). The Rex-regulated promoters contain a common DNA motif with consensus 5=-TGTGNNCNNNT...
The RegPrecise database (http://regprecise.lbl.gov) was developed for capturing, visualization and analysis of predicted transcription factor regulons in prokaryotes that were reconstructed and manually curated by utilizing the comparative genomic approach. A significant number of high-quality inferences of transcriptional regulatory interactions have been already accumulated for diverse taxonomic groups of bacteria. The reconstructed regulons include transcription factors, their cognate DNA motifs and regulated genes/operons linked to the candidate transcription factor binding sites. The RegPrecise allows for browsing the regulon collections for: (i) conservation of DNA binding sites and regulated genes for a particular regulon across diverse taxonomic lineages; (ii) sets of regulons for a family of transcription factors; (iii) repertoire of regulons in a particular taxonomic group of species; (iv) regulons associated with a metabolic pathway or a biological process in various genomes. The initial release of the database includes ∼11 500 candidate binding sites for ∼400 orthologous groups of transcription factors from over 350 prokaryotic genomes. Majority of these data are represented by genome-wide regulon reconstructions in Shewanella and Streptococcus genera and a large-scale prediction of regulons for the LacI family of transcription factors. Another section in the database represents the results of accurate regulon propagation to the closely related genomes.
Background: An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes.
The human gut microbiome harbors a diverse array of metabolic pathways contributing to its development and homeostasis via a complex web of diet-dependent metabolic interactions within the microbial community and host. Genomics-based reconstruction and predictive modeling of these interactions would provide a framework for diagnostics and treatment of dysbiosis-related syndromes via rational selection of therapeutic prebiotics and dietary nutrients. Of particular interest are micronutrients, such as B-group vitamins, precursors of indispensable metabolic cofactors, that are produced de novo by some gut bacteria (prototrophs) but must be provided exogenously in the diet for many other bacterial species (auxotrophs) as well as for the mammalian host. Cross-feeding of B vitamins between prototrophic and auxotrophic species is expected to strongly contribute to the homeostasis of microbial communities in the distal gut given the efficient absorption of dietary vitamins in the upper gastrointestinal tract. To confidently estimate the balance of microbiome micronutrient biosynthetic capabilities and requirements using available genomic data, we have performed a subsystems-based reconstruction of biogenesis, salvage and uptake for eight B vitamins (B1, B2, B3, B5, B6, B7, B9, and B12) and queuosine (essential factor in tRNA modification) over a reference set of 2,228 bacterial genomes representing 690 cultured species of the human gastrointestinal microbiota. This allowed us to classify the studied organisms with respect to their pathway variants and infer their prototrophic vs. auxotrophic phenotypes. In addition to canonical vitamin pathways, several conserved partial pathways were identified pointing to alternative routes of syntrophic metabolism and expanding a microbial vitamin “menu” by several pathway intermediates (vitamers) such as thiazole, quinolinate, dethiobiotin, pantoate. A cross-species comparison was applied to assess the extent of conservation of vitamin phenotypes at distinct taxonomic levels (from strains to families). The obtained reference collection combined with 16S rRNA gene-based phylogenetic profiles was used to deduce phenotype profiles of the human gut microbiota across in two large cohorts. This analysis provided the first estimate of B-vitamin requirements, production and sharing capabilities in the human gut microbiome establishing predictive phenotype profiling as a new approach to classification of microbiome samples. Future expansion of our reference genomic collection of metabolic phenotypes will allow further improvement in coverage and accuracy of predictive phenotype profiling of the human microbiome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.