Small proteins are an emerging class of gene products with diverse roles in bacterial physiology. However, a full understanding of their importance has been hampered by insufficient genome annotations and a lack of comprehensive characterization in microbes other than Escherichia coli. We have taken an integrative approach to accelerate the discovery of small proteins and their putative virulence-associated functions in Salmonella Typhimurium. We merged the annotated small proteome of Salmonella with new small proteins predicted with in silico and experimental approaches. We then exploited existing and newly generated global datasets that provide information on small open reading frame expression during infection of epithelial cells (dual RNA-seq), contribution to bacterial fitness inside macrophages (TraDIS), and potential engagement in molecular interactions (Grad-seq). This integrative approach suggested a new role for the small protein MgrB beyond its known function in regulating PhoQ. We demonstrate a virulence and motility defect of a Salmonella ΔmgrB mutant and reveal an effect of MgrB in regulating the Salmonella transcriptome and proteome under infection-relevant conditions. Our study highlights the power of interpreting available ‘omics’ datasets with a focus on small proteins, and may serve as a blueprint for a data integration-based survey of small proteins in diverse bacteria.
CRISPR-Cas systems recognize foreign genetic material using CRISPR RNAs (crRNAs). In type II systems, a trans-activating crRNA (tracrRNA) hybridizes to crRNAs to drive their processing and utilization by Cas9. While analyzing Cas9-RNA complexes from Campylobacter jejuni, we discovered tracrRNA hybridizing to cellular RNAs, leading to formation of “noncanonical” crRNAs capable of guiding DNA targeting by Cas9. Our discovery inspired the engineering of reprogrammed tracrRNAs that link the presence of any RNA of interest to DNA targeting with different Cas9 orthologs. This capability became the basis for a multiplexable diagnostic platform termed LEOPARD (leveraging engineered tracrRNAs and on-target DNAs for parallel RNA detection). LEOPARD allowed simultaneous detection of RNAs from different viruses in one test and distinguished severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its D614G (Asp614→Gly) variant with single-base resolution in patient samples.
Small proteins encoded by ORFs shorter than 50 codons (sORFs) are often overlooked by annotation engines and are difficult to characterize using traditional biochemical techniques. Ribosome profiling has tremendous potential to empirically improve the annotations of prokaryotic genomes. Recent improvements in ribosome profiling methods for bacterial model organisms have revealed many new sORFs in well-characterized genomes. Antibiotics that trap ribosomes just after initiation have played a key role in these developments by allowing unambiguous identification of the start codons (and hence the reading frame) for novel ORFs. Here we describe these new methods and highlight critical controls and considerations for adapting ribosome profiling to different prokaryotic species.
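To make the size cutoff in the abstract above concrete, here is a minimal sketch of a forward-strand scan for candidate sORFs. It is not the ribosome profiling method described in the abstract. It only illustrates the "shorter than 50 codons" definition on raw sequence. The function name `find_sorfs`, the start codon set (ATG, GTG, TTG), and the choice to count sense codons without the stop codon are assumptions made for illustration.

```python
# Hypothetical helper: naive forward-strand sORF scan (illustration only).
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed bacterial start codons
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, codon_limit=50, min_codons=2):
    """Return (start, end, n_codons) tuples for ORFs with fewer than
    codon_limit sense codons (start included, stop excluded).

    Coordinates are 0-based, end-exclusive, and include the stop codon.
    Only the first start codon upstream of each stop is reported per frame.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        i = frame
        while i + 3 <= len(seq):
            if seq[i:i + 3] not in START_CODONS:
                i += 3
                continue
            # Walk codon by codon until the first in-frame stop.
            j = i + 3
            while j + 3 <= len(seq) and seq[j:j + 3] not in STOP_CODONS:
                j += 3
            if j + 3 > len(seq):
                break  # no in-frame stop remains in this frame
            n_codons = (j - i) // 3
            if min_codons <= n_codons < codon_limit:
                hits.append((i, j + 3, n_codons))
            i = j + 3  # resume scanning after the stop codon
    return hits
```

A scan like this over-predicts heavily, which is why the abstract emphasizes empirical evidence from ribosome profiling and initiation-site trapping to confirm start codons and reading frames.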
Helicobacter pylori, one of the most prevalent human pathogens, was long thought to lack small regulatory RNAs (sRNAs), which are otherwise considered abundant in all bacteria. However, our recent analysis of the primary transcriptome of H. pylori discovered an unexpectedly large number of sRNAs, and suggested that this model organism also uses riboregulation to control the expression of its genes. Nonetheless, whereas most enterobacterial sRNAs require the RNA chaperone Hfq for function, Epsilonproteobacteria including H. pylori seem to have no Hfq homologue, which prompted us to search for other auxiliary proteins in sRNA-mediated regulation. Therefore, we have developed two orthogonal methods to isolate and investigate in vivo and in vitro assembled RNA-protein complexes in H. pylori: (i) an affinity chromatography strategy based on aptamer-tagged sRNAs of interest to identify their protein binding partners; and (ii) a rapid method for chromosomal FLAG-tagging of proteins to facilitate co-immunoprecipitation of associated RNA species. Using these methods, we have identified RNA-protein interactions between the ribosomal protein S1 and various mRNAs and sRNAs of H. pylori. Moreover, both methods reported a stable RNA-protein complex between the abundant HPnc6910 sRNA and HP1334, a protein of unknown function that is encoded downstream of HPnc6910. Given that 50% of all bacteria may lack Hfq, our methods can be useful to identify RNA-protein interactions in a wider range of bacterial pathogens.
Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread translation in genomic regions previously thought to be non-coding, driving the development of ORF detection tools using Ribo-seq data. However, only a handful of tools have been designed for bacteria, and these have not yet been systematically compared. Here, we aimed to identify tools that use Ribo-seq data to correctly determine the translational status of annotated bacterial ORFs and also discover novel translated regions with high sensitivity. To this end, we generated a large set of annotated ORFs from four diverse bacterial organisms, manually labeled for their translation status based on Ribo-seq data, which are available for future benchmarking studies. This set was used to investigate the predictive performance of seven Ribo-seq-based ORF detection tools (REPARATION_blast, DeepRibo, Ribo-TISH, PRICE, smORFer, ribotricer and SPECtre), as well as IRSOM, which uses coding potential and RNA-seq coverage only. DeepRibo and REPARATION_blast robustly predicted translated ORFs, including sORFs, with no significant difference for ORFs in close proximity to other genes versus stand-alone genes. However, no tool predicted a set of novel, experimentally verified sORFs with high sensitivity. Start codon predictions with smORFer show the value of initiation site profiling data to further improve the sensitivity of ORF prediction tools in bacteria. Overall, we find that bacterial tools perform well for sORF detection, although there is potential for improving their performance, applicability, usability and reproducibility.
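The benchmarking idea in the abstract above, comparing each tool's calls against manually labeled ORFs, can be sketched as a simple confusion-matrix calculation. This is a minimal illustration under stated assumptions, not the evaluation procedure used in the study: the function name `benchmark` is hypothetical, and ORFs are matched by exact identifier here, whereas a real comparison would likely need coordinate or overlap-based matching.

```python
# Hypothetical helper: score one tool's predictions against labeled ORFs.
def benchmark(labels, predicted):
    """Compute sensitivity, specificity and precision.

    labels:    dict mapping ORF id -> True if labeled as translated
    predicted: set of ORF ids that the tool called translated
    """
    tp = sum(1 for orf, translated in labels.items()
             if translated and orf in predicted)
    fn = sum(1 for orf, translated in labels.items()
             if translated and orf not in predicted)
    fp = sum(1 for orf, translated in labels.items()
             if not translated and orf in predicted)
    tn = sum(1 for orf, translated in labels.items()
             if not translated and orf not in predicted)

    def ratio(numerator, denominator):
        # Return NaN rather than raising when a class is empty.
        return numerator / denominator if denominator else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
    }


# Toy example with made-up ORF ids (not data from the study).
toy_labels = {"orfA": True, "orfB": True, "orfC": False, "orfD": False}
toy_calls = {"orfA", "orfC"}
scores = benchmark(toy_labels, toy_calls)
```

Reporting sensitivity separately for sORFs and for longer ORFs would mirror the abstract's point that a tool can perform well overall while still missing novel, experimentally verified sORFs.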
In contrast to extensively studied prokaryotic ‘small’ transcriptomes (encompassing all small non-coding RNAs), small proteomes (here defined as including proteins ≤ 70 aa) are only now entering the limelight. The absence of a complete small protein catalogue in most prokaryotes precludes our understanding of how these molecules affect physiology. So far, archaeal genomes have not yet been analysed broadly with a dedicated focus on small proteins. Here, we present a combinatorial approach, integrating experimental data from small protein-optimised mass spectrometry (MS) and ribosome profiling (Ribo-seq), to generate a high confidence inventory of small proteins in the model archaeon Haloferax volcanii. We demonstrate by MS and Ribo-seq that 67% of the 317 annotated small open reading frames (sORFs) are translated under standard growth conditions. Furthermore, annotation-independent analysis of Ribo-seq data showed ribosomal engagement for 47 novel sORFs in intergenic regions. Seven of these were also detected by proteomics, in addition to an eighth novel small protein solely identified by MS. We also provide independent experimental evidence in vivo for the translation of 12 sORFs (annotated and novel) using epitope tagging and western blotting, underlining the validity of our identification scheme. Several novel sORFs are conserved in Haloferax species and might have important functions. Based on our findings, we conclude that the small proteome of H. volcanii is larger than previously appreciated, and that combining MS with Ribo-seq is a powerful approach for the discovery of novel small protein coding genes in archaea.
Motivation: Ribosome profiling (Ribo-seq) is a powerful approach based on deep sequencing of cDNA libraries generated from ribosome-protected RNA fragments to explore the translatome of a cell, and is especially useful for the detection of small proteins (50–100 amino acids) that are recalcitrant to many standard biochemical and in silico approaches. While pipelines are available to analyze Ribo-seq data, none are designed explicitly for the automatic processing and analysis of data from bacteria, nor are they focused on the discovery of unannotated open reading frames (ORFs). Results: We present HRIBO (High-throughput annotation by Ribo-seq), a workflow to enable reproducible and high-throughput analysis of bacterial Ribo-seq data. The workflow performs all required pre-processing and quality control steps. Importantly, HRIBO outputs annotation-independent ORF predictions based on two complementary bacteria-focused tools, and integrates them with additional feature information and expression values. This facilitates the rapid and high-confidence discovery of novel ORFs and their prioritization for functional characterization. Availability and implementation: HRIBO is a free and open source project available under the GPL-3 license at: https://github.com/RickGelhausen/HRIBO.
CRISPR–Cas systems provide bacteria with adaptive immunity against phages and plasmids; however, pathways regulating their activity are not well defined. We recently developed a high-throughput genome-wide method (SorTn-seq) and used this to uncover CRISPR–Cas regulators. Here, we demonstrate that the widespread Rsm/Csr pathway regulates the expression of multiple CRISPR–Cas systems in Serratia (type I-E, I-F and III-A). The main pathway component, RsmA (CsrA), is an RNA-binding post-transcriptional regulator of carbon utilisation, virulence and motility. RsmA binds cas mRNAs and suppresses type I and III CRISPR–Cas interference in addition to adaptation by type I systems. Coregulation of CRISPR–Cas and flagella by the Rsm pathway allows modulation of adaptive immunity when changes in receptor availability would alter susceptibility to flagella-tropic phages. Furthermore, we show that Rsm controls CRISPR–Cas in other genera, suggesting conservation of this regulatory strategy. Finally, we identify genes encoding RsmA homologues in phages, which have the potential to manipulate the physiology of host bacteria and might provide an anti-CRISPR activity.