The generation and interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA‐seq) can be a complex task. While raw data quality control, alignment, and quantification can be streamlined via efficient algorithms that can deliver the preprocessed expression matrix, a common bottleneck in the analysis of such large datasets is the subsequent in‐depth, iterative processes of data exploration, statistical testing, visualization, and interpretation. Specific tools for these workflow steps are available but require a level of technical expertise which might be prohibitive for life and clinical scientists, who are left with essential pieces of information distributed among different tabular and list formats. Our protocols are centered on the joint use of our Bioconductor packages (pcaExplorer, ideal, GeneTonic) for interactive and reproducible workflows. All our packages provide an interactive and accessible experience via Shiny web applications, while still documenting the steps performed with RMarkdown as a framework to guarantee the reproducibility of the analyses, reducing the overall time to generate insights from the data at hand. These protocols guide readers through the essential steps of Exploratory Data Analysis, statistical testing, and functional enrichment analyses, followed by integration and contextualization of results. In our packages, the core elements are linked together in interactive widgets that make drill‐down tasks efficient by viewing the data at a level of increased detail. Thanks to their interoperability with essential classes and gold‐standard pipelines implemented in the open‐source Bioconductor project and community, these protocols will permit complex tasks in RNA‐seq data analysis, combining interactivity and reproducibility for following modern best scientific practices and helping to streamline the discovery process for transcriptome data. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Exploratory Data Analysis with pcaExplorer Basic Protocol 2: Differential Expression Analysis with ideal Basic Protocol 3: Interpretation of RNA‐seq results with GeneTonic Support Protocol: Downloading and installing pcaExplorer, ideal, and GeneTonic Alternate Protocol: Using functions from pcaExplorer, ideal, and GeneTonic in custom analyses
In Molecular Tumor Boards (MTBs), therapy recommendations for cancer patients are discussed. To aid decision-making based on the patient’s molecular profile, the research platform cBioPortal was extended based on users’ requirements. Additionally, a comprehensive dockerized workflow was developed to support the deployment of cBioPortal and connected services. In this work, we present the challenges and experiences of nearly two years of implementing and deploying an MTB platform based on cBioPortal and compare those to findings of a previous study.
Neuroblastoma (NBL) and medulloblastoma (MB) are aggressive pediatric cancers which can benefit from therapies targeting gangliosides. Therefore, we compared the ganglioside profile of 9 MB and 14 NBL samples by thin layer chromatography and mass spectrometry. NBL had the highest expression of GD2 (median 0.54 nmol GD2/mg protein), and also expressed complex gangliosides. GD2-low samples expressed GD1a and were more differentiated. MB mainly expressed GD2 (median 0.032 nmol GD2/mg protein) or GM3. Four sonic hedgehog-activated (SHH) as well as one group 4 and one group 3 MBs were GD2-positive. Two group 3 MB samples were GD2-negative but GM3-positive. N-glycolyl neuraminic acid-containing GM3 was neither detected in NBL nor MB by mass spectrometry. Furthermore, a GD2-phenotype predicting two-gene signature (ST8SIA1 and B4GALNT1) was applied to RNA-Seq datasets, including 86 MBs and validated by qRT-PCR. The signature values were decreased in group 3 and wingless-activated (WNT) compared to SHH and group 4 MBs. These results suggest that while NBL is GD2-positive, only some MB patients can benefit from a GD2-directed therapy. The expression of genes involved in the ganglioside synthesis may allow the identification of GD2-positive MBs. Finally, the ganglioside profile may reflect the differentiation status in NBL and could help to define MB subtypes.
Background Extensive sequencing of tumor tissues has greatly improved our understanding of cancer biology over the past years. The integration of genomic and clinical data is increasingly used to select personalized therapies in dedicated tumor boards (Molecular Tumor Boards) or to identify patients for basket studies. Genomic alterations and clinical information can be stored, integrated and visualized in the open-access resource cBioPortal for Cancer Genomics. cBioPortal can be run as a local instance enabling storage and analysis of patient data in single institutions, in the respect of data privacy. However, uploading clinical input data and genetic aberrations requires the elaboration of multiple data files and specific data formats, which makes it difficult to integrate this system into clinical practice. To solve this problem, we developed cbpManager. Results cbpManager is an R package providing a web-based interactive graphical user interface intended to facilitate the maintenance of mutations data and clinical data, including patient and sample information, as well as timeline data. cbpManager enables a large spectrum of researchers and physicians, regardless of their informatics skills to intuitively create data files ready for upload in cBioPortal for Cancer Genomics on a daily basis or in batch. Due to its modular structure based on R Shiny, further data formats such as copy number and fusion data can be covered in future versions. Further, we provide cbpManager as a containerized solution, enabling a straightforward large-scale deployment in clinical systems and secure access in combination with ShinyProxy. cbpManager is freely available via the Bioconductor project at https://bioconductor.org/packages/cbpManager/ under the AGPL-3 license. It is already used at six University Hospitals in Germany (Mainz, Gießen, Lübeck, Halle, Freiburg, and Marburg). Conclusion In summary, our package cbpManager is currently a unique software solution in the workflow with cBioPortal for Cancer Genomics, to assist the user in the interactive generation and management of study files suited for the later upload in cBioPortal.
Data visualization and interactive data exploration are important aspects of illustrating complex concepts and results from analyses of omics data. A suitable visualization has to be intuitive and accessible. Web-based dashboards have become popular tools for the arrangement, consolidation, and display of such visualizations. However, the combination of automated data processing pipelines handling omics data and dynamically generated, interactive dashboards is poorly solved. Here, we present i2dash, an R package intended to encapsulate functionality for the programmatic creation of customized dashboards. It supports interactive and responsive (linked) visualizations across a set of predefined graphical layouts. i2dash addresses the needs of data analysts/software developers for a tool that is compatible and attachable to any R-based analysis pipeline, thereby fostering the separation of data visualization on one hand and data analysis tasks on the other hand. In addition, the generic design of i2dash enables the development of modular extensions for specific needs. As a proof of principle, we provide an extension of i2dash optimized for single-cell RNA sequencing analysis, supporting the creation of dashboards for the visualization needs of such experiments. Equipped with these features, i2dash is suitable for extensive use in large-scale sequencing/bioinformatics facilities. Along this line, we provide i2dash as a containerized solution, enabling a straightforward large-scale deployment and sharing of dashboards using cloud services. i2dash is freely available via the R package archive CRAN (https://CRAN.R-project.org/package=i2dash).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.