Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
The importance of post-transcriptional regulation by small non-coding RNAs has recently been recognized in both pro-and eukaryotes. Small RNAs (sRNAs) regulate gene expression posttranscriptionally by base pairing with the mRNA. Here we use dynamical simulations to characterize this regulation mode in comparison to transcriptional regulation mediated by protein-DNA interaction and to post-translational regulation achieved by protein-protein interaction. We show quantitatively that regulation by sRNA is advantageous when fast responses to external signals are needed, consistent with experimental data about its involvement in stress responses. Our analysis indicates that the half-life of the sRNA-mRNA complex and the ratio of their production rates determine the steady-state level of the target protein, suggesting that regulation by sRNA may provide fine-tuning of gene expression. We also describe the network of regulation by sRNA in Escherichia coli, and integrate it with the transcription regulation network, uncovering mixed regulatory circuits, such as mixed feed-forward loops. The integration of sRNAs in feedforward loops provides tight repression, guaranteed by the combination of transcriptional and post-transcriptional regulations.
Summary Genome-wide identification of the mechanism of action (MoA) of small-molecule compounds characterizing their targets, effectors, and activity modulators, represents a highly relevant yet elusive goal, with critical implications for assessment of compound efficacy and toxicity. Current approaches are labor-intensive and mostly limited to elucidating high-affinity binding target proteins. We introduce a regulatory network-based approach that elucidates genome-wide MoA proteins based on the assessment of the global dysregulation of their molecular interactions following compound perturbation. Analysis of cellular perturbation profiles identified established MoA proteins for 70% of the tested compounds and elucidated novel proteins that were experimentally validated. Finally, unknown-MoA compound analysis revealed altretamine, an anticancer drug, as an inhibitor of glutathione peroxidase 4 lipid repair activity, which was experimentally confirmed, thus revealing unexpected similarity to the activity of sulfasalazine. This suggests that regulatory network analysis can provide valuable mechanistic insight into the elucidation of small molecule MoA and compound similarity.
The Groverian entanglement measure, G(ψ), is applied to characterize a variety of pure quantum states |ψ of multiple qubits. The Groverian measure is calculated analytically for certain states of high symmetry, while for arbitrary states it is evaluated using a numerical procedure. In particular, it is calculated for the class of Greenberger-Horne-Zeilinger states, the W states as well as for random pure states of n qubits. The entanglement generated by Grover's algorithm is evaluated by calculating G(ψ) for the intermediate states that are obtained after t Grover iterations, for various initial states and for different sets of the marked states.
Breast cancer is the most common malignancy in women and is responsible for hundreds of thousands of deaths annually. As with most cancers, it is a heterogeneous disease and different breast cancer subtypes are treated differently. Understanding the difference in prognosis for breast cancer based on its molecular and phenotypic features is one avenue for improving treatment by matching the proper treatment with molecular subtypes of the disease. In this work, we employed a competition-based approach to modeling breast cancer prognosis using large datasets containing genomic and clinical information and an online real-time leaderboard program used to speed feedback to the modeling team and to encourage each modeler to work towards achieving a higher ranked submission. We find that machine learning methods combined with molecular features selected based on expert prior knowledge can improve survival predictions compared to current best-in-class methodologies and that ensemble models trained across multiple user submissions systematically outperform individual models within the ensemble. We also find that model scores are highly consistent across multiple independent evaluations. This study serves as the pilot phase of a much larger competition open to the whole research community, with the goal of understanding general strategies for model optimization using clinical and molecular profiling data and providing an objective, transparent system for assessing prognostic models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.