Galaxy is a mature, browser accessible workbench for scientific computing. It enables scientists to share, analyze and visualize their own data, with minimal technical impediments. A thriving global community continues to use, maintain and contribute to the project, with support from multiple national infrastructure providers that enable freely accessible analysis and training services. The Galaxy Training Network supports free, self-directed, virtual training with >230 integrated tutorials. Project engagement metrics have continued to grow over the last 2 years, including source code contributions, publications, software packages wrapped as tools, registered users and their daily analysis jobs, and new independent specialized servers. Key Galaxy technical developments include an improved user interface for launching large-scale analyses with many files, interactive tools for exploratory data analysis, and a complete suite of machine learning tools. Important scientific developments enabled by Galaxy include Vertebrate Genome Project (VGP) assembly workflows and global SARS-CoV-2 collaborations.
Ribonucleases are crucial enzymes in RNA metabolism and post-transcriptional regulatory processes in bacteria. Cyanobacteria encode the two essential ribonucleases RNase E and RNase J. Cyanobacterial RNase E is shorter than homologues in other groups of bacteria and lacks both the chloroplast-specific N-terminal extension as well as the C-terminal domain typical for RNase E of enterobacteria. In order to investigate the function of RNase E in the model cyanobacterium Synechocystis sp. PCC 6803, we engineered a temperature-sensitive RNase E mutant by introducing two site-specific mutations, I65F and the spontaneously occurred V94A. This enabled us to perform RNA-seq after the transient inactivation of RNase E by a temperature shift (TIER-seq) and to map 1472 RNase-E-dependent cleavage sites. We inferred a dominating cleavage signature consisting of an adenine at the −3 and a uridine at the +2 position within a single-stranded segment of the RNA. The data identified mRNAs likely regulated jointly by RNase E and an sRNA and potential 3′ end-derived sRNAs. Our findings substantiate the pivotal role of RNase E in post-transcriptional regulation and suggest the redundant or concerted action of RNase E and RNase J in cyanobacteria.
There is an ongoing explosion of scientific datasets being generated, brought on by recent technological advances in many areas of the natural sciences. As a result, the life sciences have become increasingly computational in nature, and bioinformatics has taken on a central role in research studies. However, basic computational skills, data analysis, and stewardship are still rarely taught in life science educational programs, resulting in a skills gap in many of the researchers tasked with analysing these big datasets. In order to address this skills gap and empower researchers to perform their own data analyses, the Galaxy Training Network (GTN) has previously developed the Galaxy Training Platform (https://training.galaxyproject.org), an open access, community-driven framework for the collection of FAIR (Findable, Accessible, Interoperable, Reusable) training materials for data analysis utilizing the user-friendly Galaxy framework as its primary data analysis platform. Since its inception, this training platform has thrived, with the number of tutorials and contributors growing rapidly, and the range of topics extending beyond life sciences to include topics such as climatology, cheminformatics, and machine learning. While initially aimed at supporting researchers directly, the GTN framework has proven to be an invaluable resource for educators as well. We have focused our efforts in recent years on adding increased support for this growing community of instructors. New features have been added to facilitate the use of the materials in a classroom setting, simplifying the contribution flow for new materials, and have added a set of train-the-trainer lessons. Here, we present the latest developments in the GTN project, aimed at facilitating the use of the Galaxy Training materials by educators, and its usage in different learning environments.
Mutations of cilia-associated molecules cause multiple developmental defects that are collectively termed ciliopathies. However, several ciliary proteins, involved in gating access to the cilium, also assume localizations at other cellular sites including the nucleus, where they participate in DNA damage responses to maintain tissue integrity. Molecular insight into how these molecules execute such diverse functions remains limited. A mass spectrometry screen for ANKS6-interacting proteins suggested an involvement of ANKS6 in RNA processing and/or binding. Comparing the RNA-binding properties of the known RNA-binding protein BICC1 with the three ankyrin-repeat proteins ANKS3, ANKS6 (NPHP16) and INVERSIN (NPHP2) confirmed that certain nephronophthisis (NPH) family members can interact with RNA molecules. We also observed that BICC1 and INVERSIN associate with stress granules in response to translational inhibition. Furthermore, BICC1 recruits ANKS3 and ANKS6 into TIA-1-positive stress granules after exposure to hippuristanol. Our findings uncover a novel function of NPH family members, and provide further evidence that NPH family members together with BICC1 are involved in stress responses to maintain tissue and organ integrity.
Background Cross-linking and immunoprecipitation followed by next-generation sequencing (CLIP-seq) is the state-of-the-art technique used to experimentally determine transcriptome-wide binding sites of RNA-binding proteins (RBPs). However, it relies on gene expression, which can be highly variable between conditions and thus cannot provide a complete picture of the RBP binding landscape. This creates a demand for computational methods to predict missing binding sites. Although there exist various methods using traditional machine learning and lately also deep learning, we encountered several problems: many of these are not well documented or maintained, making them difficult to install and use, or are not even available. In addition, there can be efficiency issues, as well as little flexibility regarding options or supported features. Results Here, we present RNAProt, an efficient and feature-rich computational RBP binding site prediction framework based on recurrent neural networks. We compare RNAProt with 1 traditional machine learning approach and 2 deep-learning methods, demonstrating its state-of-the-art predictive performance and better run time efficiency. We further show that its implemented visualizations capture known binding preferences and thus can help to understand what is learned. Since RNAProt supports various additional features (including user-defined features, which no other tool offers), we also present their influence on benchmark set performance. Finally, we show the benefits of incorporating additional features, specifically structure information, when learning the binding sites of an hairpin loop binding RBP. Conclusions RNAProt provides a complete framework for RBP binding site predictions, from data set generation over model training to the evaluation of binding preferences and prediction. It offers state-of-the-art predictive performance, as well as superior run time efficiency, while at the same time supporting more features and input types than any other tool available so far. RNAProt is easy to install and use, comes with comprehensive documentation, and is accompanied by informative statistics and visualizations. All this makes RNAProt a valuable tool to apply in future RBP binding site research.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.