The sequences coding for DNA[cytosine-N4]methyltransferases MvaI (from Micrococcus varians RFL19) and Cfr9I (from Citrobacter freundii RFL9) have been determined. The predicted methylases are proteins of 454 and 300 amino acids, respectively. Primary structure comparison of M.Cfr9I and another m4C-forming methylase, M.Pvu II, revealed extended regions of homology. The sequence comparison of the three DNA[cytosine-N4]-methylases using originally developed software revealed two conserved patterns, DPF-GSGT and TSPPY, which were found similar also to those of adenine and DNA[cytosine-C5]-methylases. These data provided a basis for global alignment and classification of DNA-methylase sequences. Structural considerations led us to suggest that the first region could be the binding site of AdoMet, while the second is thought to be directly involved in the modification of the exocyclic amino group.
CRISPR-associated Rossmann Fold (CARF) and SMODS-associated and fused to various effector domains (SAVED) are key components of cyclic oligonucleotide-based antiphage signaling systems (CBASS) that sense cyclic oligonucleotides and transmit the signal to an effector inducing cell dormancy or death. Most of the CARFs are components of a CBASS built into type III CRISPR–Cas systems, where the CARF domain binds cyclic oligoA (cOA) synthesized by Cas10 polymerase-cyclase and allosterically activates the effector, typically a promiscuous ribonuclease. Additionally, this signaling pathway includes a ring nuclease, often also a CARF domain (either the sensor itself or a specialized enzyme) that cleaves cOA and mitigates dormancy or death induction. We present a comprehensive census of CARF and SAVED domains in bacteria and archaea, and their sequence- and structure-based classification. There are 10 major families of CARF domains and multiple smaller groups that differ in structural features, association with distinct effectors, and presence or absence of the ring nuclease activity. By comparative genome analysis, we predict specific functions of CARF and SAVED domains and partition the CARF domains into those with both sensor and ring nuclease functions, and sensor-only ones. Several families of ring nucleases functionally associated with sensor-only CARF domains are also predicted.
The analysis of ∼2000 bacterial genomes revealed that they all, without a single exception, encode one or more DNA polymerase III α-subunit (PolIIIα) homologs. Classified into C-family of DNA polymerases they come in two major forms, PolC and DnaE, related by ancient duplication. While PolC represents an evolutionary compact group, DnaE can be further subdivided into at least three groups (DnaE1-3). We performed an extensive analysis of various sequence, structure and surface properties of all four polymerase groups. Our analysis suggests a specific evolutionary pathway leading to PolC and DnaE from the last common ancestor and reveals important differences between extant polymerase groups. Among them, DnaE1 and PolC show the highest conservation of the analyzed properties. DnaE3 polymerases apparently represent an ‘impaired’ version of DnaE1. Nonessential DnaE2 polymerases, typical for oxygen-using bacteria with large GC-rich genomes, have a number of features in common with DnaE3 polymerases. The analysis of polymerase distribution in genomes revealed three major combinations: DnaE1 either alone or accompanied by one or more DnaE2s, PolC + DnaE3 and PolC + DnaE1. The first two combinations are present in Escherichia coli and Bacillus subtilis, respectively. The third one (PolC + DnaE1), found in Clostridia, represents a novel, so far experimentally uncharacterized, set.
Examination of crystal structures of restriction endonucleases EcoRI and EcoRV complexes with their cognate DNA revealed a common structural element, which forms the core of both proteins. This element consists of a five-stranded beta-sheet and two alpha-helices packed against it and could be described as alpha-beta sandwich in which helices and beta-strands lie in two stacked layers. While the spatial structure of this alpha-beta sandwich is conserved in both enzymes, there are not detectable similarities between amino acid sequences except of a few residues involved in active site formation. Probably, other restriction endonucleases which have similar organization of the active site might possess similar structural element regardless of DNA sequence recognized and recognition elements in the enzyme used.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.