Advances in proteomics and sequencing have highlighted many non-annotated open reading frames (ORFs) in eukaryotic genomes. Genome annotations, cornerstones of today's research, mostly rely on protein prior knowledge and on ab initio prediction algorithms. Such algorithms notably enforce an arbitrary criterion of one coding sequence (CDS) per transcript, leading to a substantial underestimation of the coding potential of eukaryotes. Here, we present OpenProt, the first database fully endorsing a polycistronic model of eukaryotic genomes to date. OpenProt contains all possible ORFs longer than 30 codons across 10 species, and cumulates supporting evidence such as protein conservation, translation and expression. OpenProt annotates all known proteins (RefProts), novel predicted isoforms (Isoforms) and novel predicted proteins from alternative ORFs (AltProts). It incorporates cutting-edge algorithms to evaluate protein orthology and re-interrogate publicly available ribosome profiling and mass spectrometry datasets, supporting the annotation of thousands of predicted ORFs. The constantly growing database currently cumulates evidence from 87 ribosome profiling and 114 mass spectrometry studies from several species, tissues and cell lines. All data is freely available and downloadable from a web platform (www.openprot.org) supporting a genome browser and advanced queries for each species. Thus, OpenProt enables a more comprehensive landscape of eukaryotic genomes’ coding potential.
Escherichia coli BW25113 is the parent strain of the Keio collection comprising nearly 4,000 single-gene deletion mutants. We report the complete 4,631,469-bp genome sequence of this strain and the key variations from the type strain E. coli MG1655.
OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one’s bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.
The gut microbiota is a suspected hotspot for bacterial conjugation due to its high density and diversity of microorganisms. However, the contribution of different conjugative plasmid families to horizontal gene transfer in this environment remains poorly characterized. Here, we systematically quantified the transfer rates in the mouse intestinal tract for 13 conjugative plasmids encompassing 10 major incompatibility groups. The vast majority of these plasmids were unable to perform conjugation in situ or only reached relatively low transfer rates. Surprisingly, IncI2 conjugative plasmid TP114 was identified as a proficient DNA delivery system in this environment, with the ability to transfer to virtually 100% of the probed recipient bacteria. We also show that a type IV pilus present in I-complex conjugative plasmids plays a crucial role for the transfer of TP114 in the mouse intestinal microbiota, most likely by contributing to mating pair stabilization. These results provide new insights on the mobility of genes in the gut microbiota and highlights TP114 as a very efficient DNA delivery system of interest for microbiome editing tools.
Antibiotic resistance threatens our ability to treat infectious diseases, spurring interest in alternative antimicrobial technologies. The use of bacterial conjugation to deliver CRISPR-cas systems programmed to precisely eliminate antibiotic-resistant bacteria represents a promising approach but requires high in situ DNA transfer rates. We have optimized the transfer efficiency of conjugative plasmid TP114 using accelerated laboratory evolution. We hence generated a potent conjugative delivery vehicle for CRISPR-cas9 that can eliminate > 99.9% of targeted antibioticresistant Escherichia coli in the mouse gut microbiota using a single dose. We then applied this system to a Citrobacter rodentium infection model, achieving full clearance within four consecutive days of treatment.
The near‐minimal bacterium Mesoplasma florum is an interesting model for synthetic genomics and systems biology due to its small genome (~ 800 kb), fast growth rate, and lack of pathogenic potential. However, fundamental aspects of its biology remain largely unexplored. Here, we report a broad yet remarkably detailed characterization of M. florum by combining a wide variety of experimental approaches. We investigated several physical and physiological parameters of this bacterium, including cell size, growth kinetics, and biomass composition of the cell. We also performed the first genome‐wide analysis of its transcriptome and proteome, notably revealing a conserved promoter motif, the organization of transcription units, and the transcription and protein expression levels of all protein‐coding sequences. We converted gene transcription and expression levels into absolute molecular abundances using biomass quantification results, generating an unprecedented view of the M. florum cellular composition and functions. These characterization efforts provide a strong experimental foundation for the development of a genome‐scale model for M. florum and will guide future genome engineering endeavors in this simple organism.
Bacteria have evolved defence mechanisms against bacteriophages. Restriction-modification systems provide innate immunity by degrading invading DNAs that lack proper methylation. CRISPR–Cas systems provide adaptive immunity by sampling the genome of past invaders and cutting the DNA of closely related DNA molecules. These barriers also restrict horizontal gene transfer mediated by conjugative plasmids. IncC conjugative plasmids are important contributors to the global dissemination of multidrug resistance among pathogenic bacteria infecting animals and humans. Here, we show that IncC conjugative plasmids are highly resilient to host defence systems during entry into a new host by conjugation. Using a TnSeq strategy, we uncover a conserved operon containing five genes (vcrx089–vcrx093) that confer a novel host defence evasion (hde) phenotype. We show that vcrx089–vcrx090 promote resistance against type I restriction-modification, whereas vcrx091–vcxr093 promote CRISPR–Cas evasion by repairing double-strand DNA breaks via recombination between short sequence repeats. vcrx091, vcrx092 and vcrx093 encode a single-strand binding protein, and a single-strand annealing recombinase and double-strand exonuclease related to Redβ and λExo of bacteriophage λ, respectively. Homologous genes of the integrative and conjugative element R391 also provide CRISPR–Cas evasion. Hence, the conserved hde operon considerably broadens the host range of large families of mobile elements spreading multidrug resistance.
R‐loop disassembly by the human helicase Senataxin contributes to genome integrity and to proper transcription termination at a subset of RNA polymerase II genes. Whether Senataxin also contributes to transcription termination at other classes of genes has remained unclear. Here, we show that Sen1, one of two fission yeast homologues of Senataxin, promotes efficient termination of RNA polymerase III (RNAP3) transcription in vivo. In the absence of Sen1, RNAP3 accumulates downstream of RNAP3‐transcribed genes and produces long exosome‐sensitive 3′‐extended transcripts. Importantly, neither of these defects was affected by the removal of R‐loops. The finding that Sen1 acts as an ancillary factor for RNAP3 transcription termination in vivo challenges the pre‐existing view that RNAP3 terminates transcription autonomously. We propose that Sen1 is a cofactor for transcription termination that has been co‐opted by different RNA polymerases in the course of evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.