We have designed and developed a data integration and visualization platform that provides evidence about the association of known and potential drug targets with diseases. The platform is designed to support identification and prioritization of biological targets for follow-up. Each drug target is linked to a disease using integrated genome-wide data from a broad range of data sources. The platform provides either a target-centric workflow to identify diseases that may be associated with a specific target, or a disease-centric workflow to identify targets that may be associated with a specific disease. Users can easily transition between these target- and disease-centric workflows. The Open Targets Validation Platform is accessible at https://www.targetvalidation.org.
Single-cell RNA-sequencing (scRNA-seq) allows studying heterogeneity in gene expression in large cell populations. Such heterogeneity can arise due to technical or biological factors, making decomposing sources of variation difficult. We here describe f-scLVM (factorial single-cell latent variable model), a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity. Our model jointly estimates the relevance of individual factors, refines gene set annotations, and infers factors without annotation. In applications to multiple scRNA-seq datasets, we find that f-scLVM robustly decomposes scRNA-seq datasets into interpretable components, thereby facilitating the identification of novel subpopulations.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-017-1334-8) contains supplementary material, which is available to authorized users.
RNA modifications such as m6A methylation form an additional layer of complexity in the transcriptome. Nanopore direct RNA sequencing can capture this information in the raw current signal for each RNA molecule, enabling the detection of RNA modifications using supervised machine learning. However, experimental approaches provide only site-level training data, whereas the modification status for each single RNA molecule is missing. Here we present m6Anet, a neural-network-based method that leverages the multiple instance learning framework to specifically handle missing read-level modification labels in site-level training data. m6Anet outperforms existing computational methods, shows similar accuracy as experimental approaches, and generalizes with high accuracy to different cell lines and species without retraining model parameters. In addition, we demonstrate that m6Anet captures the underlying read-level stoichiometry, which can be used to approximate differences in modification rates. Overall, m6Anet offers a tool to capture the transcriptome-wide identification and quantification of m6A from a single run of direct RNA sequencing.
RNA modifications such as m6A methylation form an additional layer of complexity in the transcriptome. Nanopore direct RNA sequencing captures this information in the raw current signal for each RNA molecule, enabling the detection of RNA modifications using supervised machine learning. However, experimental approaches provide only site-level training data, whereas the modification status for each single RNA molecule is missing. Here we present m6Anet, a neural network-based method that leverages the Multiple Instance Learning framework to specifically handle missing read-level modification labels in site-level training data. m6Anet outperforms existing computational methods, shows similar accuracy as experimental approaches, and generalises to different cell lines with almost identical accuracy. We demonstrate that m6Anet captures the underlying read-level stoichiometry that can be used to approximate differences in modification rates. m6Anet achieves this without retraining model parameters, enabling the transcriptome-wide identification and quantification of m6A from a single run of direct RNA sequencing.
Differences in RNA expression can provide insights into the molecular identity of a cell, pathways involved in human diseases, and variation in RNA levels across patients associated with clinical phenotypes. RNA modifications such as m6A have been found to contribute to molecular functions of RNAs. However, quantification of differences in RNA modifications has been challenging. Here we develop a computational method (xPore) to identify differential RNA modifications from direct RNA sequencing data. We evaluate our method on transcriptome-wide m6A profiling data, demonstrating that xPore identifies positions of m6A sites at single base resolution, estimates the fraction of modified RNAs in the cell, and quantifies the differential modification rate across conditions. We apply the method to direct RNA-Sequencing data from 6 cell lines and find that many m6A sites are preserved, while a subset of m6A sites show significant differences in their modification rates across cell types. Together, we show that RNA modifications can be identified from direct RNA-sequencing with high accuracy, enabling the analysis of differential modifications and expression from a single high throughput experiment.Availability : xPore is available as open source software ( https://github.com/GoekeLab/xpore )
The
growing availability of multiomic data provides a highly comprehensive
view of cellular processes at the levels of mRNA, proteins, metabolites,
and reaction fluxes. However, due to probabilistic interactions between
components depending on the environment and on the time course, casual,
sometimes rare interactions may cause important effects in the cellular
physiology. To date, interactions at the pathway level cannot be measured
directly, and methodologies to predict pathway cross-correlations
from reaction fluxes are still missing. Here, we develop a multiomic
approach of flux-balance analysis combined with Bayesian factor modeling
with the aim of detecting pathway cross-correlations and predicting
metabolic pathway activation profiles. Starting from gene expression
profiles measured in various environmental conditions, we associate
a flux rate profile with each condition. We then infer pathway cross-correlations
and identify the degrees of pathway activation with respect to the
conditions and time course using Bayesian factor modeling. We test
our framework on the most recent metabolic reconstruction of Escherichia coli in both static and dynamic environments,
thus predicting the functionality of particular groups of reactions
and how it varies over time. In a dynamic environment, our method
can be readily used to characterize the temporal progression of pathway
activation in response to given stimuli.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.