SUMMARY Twenty-five years have passed since the discovery of cyclic dimeric (3′→5′) GMP (cyclic di-GMP or c-di-GMP). From the relative obscurity of an allosteric activator of a bacterial cellulose synthase, c-di-GMP has emerged as one of the most common and important bacterial second messengers. Cyclic di-GMP has been shown to regulate biofilm formation, motility, virulence, the cell cycle, differentiation, and other processes. Most c-di-GMP-dependent signaling pathways control the ability of bacteria to interact with abiotic surfaces or with other bacterial and eukaryotic cells. Cyclic di-GMP plays key roles in lifestyle changes of many bacteria, including transition from the motile to the sessile state, which aids in the establishment of multicellular biofilm communities, and from the virulent state in acute infections to the less virulent but more resilient state characteristic of chronic infectious diseases. From a practical standpoint, modulating c-di-GMP signaling pathways in bacteria could represent a new way of controlling formation and dispersal of biofilms in medical and industrial settings. Cyclic di-GMP participates in interkingdom signaling. It is recognized by mammalian immune systems as a uniquely bacterial molecule and therefore is considered a promising vaccine adjuvant. The purpose of this review is not to overview the whole body of data in the burgeoning field of c-di-GMP-dependent signaling. Instead, we provide a historic perspective on the development of the field, emphasize common trends, and illustrate them with the best available examples. We also identify unresolved questions and highlight new directions in c-di-GMP research that will give us a deeper understanding of this truly universal bacterial second messenger.
The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt at a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available, in which proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and shared with bacteria and/or archaea were included. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis.
Microbial genome sequencing projects produce numerous sequences of deduced proteins, only a small fraction of which have been or will ever be studied experimentally. This leaves sequence analysis as the only feasible way to annotate these proteins and assign to them tentative functions. The Clusters of Orthologous Groups of proteins (COGs) database (http://www.ncbi.nlm.nih.gov/COG/), first created in 1997, has been a popular tool for functional annotation. Its success was largely based on (i) its reliance on complete microbial genomes, which allowed reliable assignment of orthologs and paralogs for most genes; (ii) an orthology-based approach, which used the function(s) of the characterized member(s) of the protein family (COG) to assign function(s) to the entire set of carefully identified orthologs and describe the range of potential functions when there was more than one; and (iii) careful manual curation of the annotation of the COGs, aimed at detailed prediction of the biological function(s) for each COG while avoiding annotation errors and overprediction. Here we present an update of the COGs, the first since 2003, and a comprehensive revision of the COG annotations and expansion of the genome coverage to include representative complete genomes from all bacterial and archaeal lineages down to the genus level. This re-analysis of the COGs shows that the original COG assignments had an error rate below 0.5% and allows an assessment of the progress in functional genomics in the past 12 years. During this time, functions of many previously uncharacterized COGs have been elucidated and tentative functional assignments of many COGs have been validated, either by targeted experiments or through the use of high-throughput methods. A particularly important development is the assignment of functions to several widespread, conserved proteins many of which turned out to participate in translation, in particular rRNA maturation and tRNA modification. The new version of the COGs is expected to become an important tool for microbial genomics.
The archetypal two-component signal transduction systems include a sensor histidine kinase and a response regulator, which consists of a receiver CheY-like domain and a DNA-binding domain. Sequence analysis of the sensor kinases and response regulators encoded in complete bacterial and archaeal genomes revealed complex domain architectures for many of them and allowed the identification of several novel conserved domains, such as PAS, GAF, HAMP, GGDEF, EAL, and HD-GYP. All of these domains are widely represented in bacteria, including 19 copies of the GGDEF domain and 17 copies of the EAL domain encoded in the Escherichia coli genome. In contrast, these novel signaling domains are much less abundant in bacterial parasites and in archaea, with none at all found in some archaeal species. This skewed phyletic distribution suggests that the newly discovered complexity of signal transduction systems emerged early in the evolution of bacteria, with subsequent massive loss in parasites and some horizontal dissemination among archaea. Only a few proteins containing these domains have been studied experimentally, and their exact biochemical functions remain obscure; they may include transformations of novel signal molecules, such as the recently identified cyclic diguanylate. Recent experimental data provide the first direct evidence of the participation of these domains in signal transduction pathways, including regulation of virulence genes and extracellular enzyme production in the human pathogens Bordetella pertussis and Borrelia burgdorferi and the plant pathogen Xanthomonas campestris. Gene-neighborhood analysis of these new domains suggests their participation in a variety of processes, from mercury and phage resistance to maintenance of virulence plasmids. It appears that the real picture of the complexity of phosphorelay signal transduction in prokaryotes is only beginning to unfold.
Summary Bis-(3′-5′)-cyclic dimeric guanosine monophosphate (c-di-GMP) has come to the limelight as a result of the recent advances in microbial genomics and increased interest in multicellular microbial behaviour. Known for more than 15 years as an activator of cellulose synthase in Gluconacetobacter xylinus, c-di-GMP is emerging as a novel global second messenger in bacteria. The GGDEF and EAL domain proteins involved in c-di-GMP synthesis and degradation, respectively, are (almost) ubiquitous in bacterial genomes. These proteins affect cell differentiation and multicellular behaviour as well as interactions between the microorganisms and their eukaryotic hosts and other phenotypes. While the role of GGDEF and EAL domain proteins in bacterial physiology and behaviour has gained appreciation, and significant progress has been achieved in understanding the enzymology of c-di-GMP turnover, many questions regarding c-di-GMP-dependent signalling remain unanswered. Among these, the key questions are the identity of targets of c-di-GMP action and mechanisms of c-di-GMP-dependent regulation. This review discusses phylogenetic distribution of the c-di-GMP signalling pathway in bacteria, recent developments in biochemical and structural characterization of proteins involved in its metabolism, and biological processes affected by c-di-GMP. The accumulated data clearly indicate that a novel ubiquitous signalling system in bacteria has been discovered.
Recent studies identified c-di-GMP as a universal bacterial secondary messenger regulating biofilm formation, motility, production of extracellular polysaccharide and multicellular behavior in diverse bacteria. However, except for cellulose synthase, no protein has been shown to bind c-di-GMP and the targets for c-di-GMP action remain unknown. Here we report identification of the PilZ ("pills") domain (Pfam domain PF07238) in the sequences of bacterial cellulose synthases, alginate biosynthesis protein Alg44, proteins of enterobacterial YcgR and firmicute YpfA families, and other proteins encoded in bacterial genomes and present evidence indicating that this domain is (part of) the long-sought c-di-GMP-binding protein. Association of the PilZ domain with a variety of other domains, including likely components of a bacterial multidrug secretion system, could provide clues to multiple functions of c-di-GMP in bacterial pathogenesis and cell development.
CheY-like phosphoacceptor (or receiver [REC]) domain is a common module in a variety
The Clusters of Orthologous Genes (COG) database, also referred to as the Clusters of Orthologous Groups of proteins, was created in 1997 and went through several rounds of updates, most recently in 2014. The current update, available at https://www.ncbi.nlm.nih.gov/research/COG, substantially expands the scope of the database to include complete genomes of 1187 bacteria and 122 archaea, typically with a single genome per genus. In addition, the current version of the COGs includes the following new features: (i) the recently deprecated NCBI’s gene index (gi) numbers for the encoded proteins are replaced with stable RefSeq or GenBank/ENA/DDBJ coding sequence (CDS) accession numbers; (ii) COG annotations are updated for >200 newly characterized protein families with corresponding references and PDB links, where available; (iii) lists of COGs grouped by pathways and functional systems are added; (iv) 266 new COGs for proteins involved in CRISPR-Cas immunity, sporulation in Firmicutes and photosynthesis in cyanobacteria are included; and (v) the database is made available as a web page, in addition to FTP. The current release includes 4877 COGs. Future plans include further expansion of the COG collection by adding archaeal COGs (arCOGs), splitting the COGs containing multiple paralogs, and continued refinement of COG annotations.