Bacterial adaptive immune systems employ CRISPRs (clustered regularly interspaced short palindromic repeats) and CRISPR-associated (Cas) proteins for RNA-guided nucleic acid cleavage1,2. Although generally targeted to DNA substrates3–5, the Type III and Type VI CRISPR systems direct interference complexes against single-stranded RNA (ssRNA) substrates6–9. In Type VI systems, the single-subunit C2c2 protein functions as an RNA-guided RNA endonuclease9,10. How this enzyme acquires mature CRISPR RNAs (crRNAs) essential for immune surveillance and its mechanism of crRNA-mediated RNA cleavage remain unclear. Here we show that C2c2 possesses a unique ribonuclease activity responsible for CRISPR RNA maturation that is distinct from its RNA-activated ssRNA-degradation activity. These dual ribonuclease functions are chemically and mechanistically different from each other and from the crRNA-processing behavior of the evolutionarily unrelated CRISPR enzyme Cpf111. We show that the two ribonuclease activities of C2c2 enable multiplexed processing and loading of guide RNAs that in turn allow for sensitive cellular transcript detection.
CRISPR-Cas systems provide microbes with adaptive immunity to infectious nucleic acids and are widely employed as genome editing tools. These tools utilize RNA-guided Cas proteins whose large size (950—1400 amino acids) has been considered essential to their specific DNA- or RNA-targeting activities. Here we present a set of CRISPR-Cas systems from uncultivated archaea that contain Cas14, a family of exceptionally compact RNA-guided nucleases (400—700 amino acids). Despite their small size, Cas14 proteins are capable of targeted single-stranded DNA (ssDNA) cleavage without restrictive sequence requirements. Moreover, target recognition by Cas14 triggers non-specific cutting of ssDNA molecules, an activity that enables high-fidelity SNP genotyping (Cas14-DETECTR). Metagenomic data show that multiple CRISPR-Cas14 systems evolved independently and suggest a potential evolutionary origin of single-effector CRISPR-based adaptive immunity.
Infection by the human pathogen Legionella pneumophila relies on the translocation of ~300 virulence proteins, termed effectors, which manipulate host-cell processes. However, almost no information exists regarding effectors in other Legionella pathogens. Here we sequenced, assembled and characterized the genomes of 38 Legionella species, and predicted their effector repertoire using a previously validated machine-learning approach. This analysis revealed a treasure trove of 5,885 predicted effectors. The effector repertoire of different Legionella species was found to be largely non-overlapping, and only seven core-effectors were shared among all species studied. Species-specific effectors had atypically low GC content, suggesting exogenous acquisition, possibly from their natural protozoan hosts. Furthermore, we detected numerous novel conserved effector domains, and discovered new domain combinations, which allowed inferring yet undescribed effector functions. The effector collection and network of domain architectures described here can serve as a roadmap for future studies of effector function and evolution.
CRISPR-Cas systems provide microbes with adaptive immunity by employing short sequences, termed spacers, that guide Cas proteins to cleave foreign DNA 1,2 . Class 2 CRISPR-Cas systems are streamlined versions in which a single Cas protein bound to RNA recognizes and cleaves targeted sequences 3,4 . The programmable nature of these minimal systems has enabled their repurposing as a versatile technology that is broadly revolutionizing biological and clinical research 5 . However, current CRISPR-Cas technologies are based solely on systems from isolated bacteria, leaving untapped the vast majority of enzymes from organisms that have not been cultured. Metagenomics, the sequencing of DNA extracted from natural microbial communities, provides access to the genetic material of a huge array of uncultivated organisms 6,7 . Here, using genome-resolved metagenomics, we identified novel CRISPR-Cas systems, including the first reported Cas9 in the archaeal domain of life. This divergent Cas9 protein was found in littlestudied nanoarchaea as part of an active CRISPR-Cas system. In bacteria, we discovered two
A large number of highly pathogenic bacteria utilize secretion systems to translocate effector proteins into host cells. Using these effectors, the bacteria subvert host cell processes during infection. Legionella pneumophila translocates effectors via the Icm/Dot type-IV secretion system and to date, approximately 100 effectors have been identified by various experimental and computational techniques. Effector identification is a critical first step towards the understanding of the pathogenesis system in L. pneumophila as well as in other bacterial pathogens. Here, we formulate the task of effector identification as a classification problem: each L. pneumophila open reading frame (ORF) was classified as either effector or not. We computationally defined a set of features that best distinguish effectors from non-effectors. These features cover a wide range of characteristics including taxonomical dispersion, regulatory data, genomic organization, similarity to eukaryotic proteomes and more. Machine learning algorithms utilizing these features were then applied to classify all the ORFs within the L. pneumophila genome. Using this approach we were able to predict and experimentally validate 40 new effectors, reaching a success rate of above 90%. Increasing the number of validated effectors to around 140, we were able to gain novel insights into their characteristics. Effectors were found to have low G+C content, supporting the hypothesis that a large number of effectors originate via horizontal gene transfer, probably from their protozoan host. In addition, effectors were found to cluster in specific genomic regions. Finally, we were able to provide a novel description of the C-terminal translocation signal required for effector translocation by the Icm/Dot secretion system. To conclude, we have discovered 40 novel L. pneumophila effectors, predicted over a hundred additional highly probable effectors, and shown the applicability of machine learning algorithms for the identification and characterization of bacterial pathogenesis determinants.
During evolution segments of homeothermic genomes underwent a GC content increase. Our analyses reveal that two exon-intron architectures have evolved from an ancestral state of low GC content exons flanked by short introns with a lower GC content. One group underwent a GC content elevation that abolished the differential exon-intron GC content, with introns remaining short. The other group retained the overall low GC content as well as the differential exon-intron GC content, and is associated with longer introns. We show that differential exon-intron GC content regulates exon inclusion level in this group, in which disease-associated mutations often lead to exon skipping. This group's exons also display higher nucleosome occupancy compared to flanking introns and exons of the other group, thus "marking" them for spliceosomal recognition. Collectively, our results reveal that differential exon-intron GC content is a previously unidentified determinant of exon selection and argue that the two GC content architectures reflect the two mechanisms by which splicing signals are recognized: exon definition and intron definition.
CRISPR adaptive immunity pathways protect prokaryotic cells against foreign nucleic acids using CRISPR RNA (crRNA)-guided nucleases. In type VI-A CRISPR-Cas systems, the signature protein Cas13a (formerly C2c2) contains two separate ribonuclease activities that catalyze crRNA maturation and ssRNA degradation. The Cas13a protein family occurs across different bacterial phyla and varies widely in both protein sequence and corresponding crRNA sequence conservation. Although grouped phylogenetically together, we show that the Cas13a enzyme family comprises two distinct functional groups that recognize orthogonal sets of crRNAs and possess different ssRNA cleavage specificities. These functional distinctions could not be bioinformatically predicted, suggesting more subtle co-evolution of Cas13a enzymes. Additionally, we find that Cas13a pre-crRNA processing is not essential for ssRNA cleavage, although it enhances ssRNA targeting for crRNAs encoded internally within the CRISPR array. We define two Cas13a protein subfamilies that can operate in parallel for RNA detection both in bacteria and for diagnostic applications.
Current understanding of microorganism–virus interactions, which shape the evolution and functioning of Earth's ecosystems, is based primarily on cultivated organisms. Here we investigate thousands of viral and microbial genomes recovered using a cultivation-independent approach to study the frequency, variety and taxonomic distribution of viral defence mechanisms. CRISPR-Cas systems that confer microorganisms with immunity to viruses are present in only 10% of 1,724 sampled microorganisms, compared with previous reports of 40% occurrence in bacteria and 81% in archaea. We attribute this large difference to the lack of CRISPR-Cas systems across major bacterial lineages that have no cultivated representatives. We correlate absence of CRISPR-Cas with lack of nucleotide biosynthesis capacity and a symbiotic lifestyle. Restriction systems are well represented in these lineages and might provide both non-specific viral defence and access to nucleotides.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.