The generation and interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA‐seq) can be a complex task. While raw data quality control, alignment, and quantification can be streamlined via efficient algorithms that can deliver the preprocessed expression matrix, a common bottleneck in the analysis of such large datasets is the subsequent in‐depth, iterative processes of data exploration, statistical testing, visualization, and interpretation. Specific tools for these workflow steps are available but require a level of technical expertise which might be prohibitive for life and clinical scientists, who are left with essential pieces of information distributed among different tabular and list formats. Our protocols are centered on the joint use of our Bioconductor packages (pcaExplorer, ideal, GeneTonic) for interactive and reproducible workflows. All our packages provide an interactive and accessible experience via Shiny web applications, while still documenting the steps performed with RMarkdown as a framework to guarantee the reproducibility of the analyses, reducing the overall time to generate insights from the data at hand. These protocols guide readers through the essential steps of Exploratory Data Analysis, statistical testing, and functional enrichment analyses, followed by integration and contextualization of results. In our packages, the core elements are linked together in interactive widgets that make drill‐down tasks efficient by viewing the data at a level of increased detail. Thanks to their interoperability with essential classes and gold‐standard pipelines implemented in the open‐source Bioconductor project and community, these protocols will permit complex tasks in RNA‐seq data analysis, combining interactivity and reproducibility for following modern best scientific practices and helping to streamline the discovery process for transcriptome data. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Exploratory Data Analysis with pcaExplorer Basic Protocol 2: Differential Expression Analysis with ideal Basic Protocol 3: Interpretation of RNA‐seq results with GeneTonic Support Protocol: Downloading and installing pcaExplorer, ideal, and GeneTonic Alternate Protocol: Using functions from pcaExplorer, ideal, and GeneTonic in custom analyses
In Molecular Tumor Boards (MTBs), therapy recommendations for cancer patients are discussed. To aid decision-making based on the patient’s molecular profile, the research platform cBioPortal was extended based on users’ requirements. Additionally, a comprehensive dockerized workflow was developed to support the deployment of cBioPortal and connected services. In this work, we present the challenges and experiences of nearly two years of implementing and deploying an MTB platform based on cBioPortal and compare those to findings of a previous study.
Neuroblastoma (NBL) and medulloblastoma (MB) are aggressive pediatric cancers which can benefit from therapies targeting gangliosides. Therefore, we compared the ganglioside profile of 9 MB and 14 NBL samples by thin layer chromatography and mass spectrometry. NBL had the highest expression of GD2 (median 0.54 nmol GD2/mg protein), and also expressed complex gangliosides. GD2-low samples expressed GD1a and were more differentiated. MB mainly expressed GD2 (median 0.032 nmol GD2/mg protein) or GM3. Four sonic hedgehog-activated (SHH) as well as one group 4 and one group 3 MBs were GD2-positive. Two group 3 MB samples were GD2-negative but GM3-positive. N-glycolyl neuraminic acid-containing GM3 was neither detected in NBL nor MB by mass spectrometry. Furthermore, a GD2-phenotype predicting two-gene signature (ST8SIA1 and B4GALNT1) was applied to RNA-Seq datasets, including 86 MBs and validated by qRT-PCR. The signature values were decreased in group 3 and wingless-activated (WNT) compared to SHH and group 4 MBs. These results suggest that while NBL is GD2-positive, only some MB patients can benefit from a GD2-directed therapy. The expression of genes involved in the ganglioside synthesis may allow the identification of GD2-positive MBs. Finally, the ganglioside profile may reflect the differentiation status in NBL and could help to define MB subtypes.
Background Extensive sequencing of tumor tissues has greatly improved our understanding of cancer biology over the past years. The integration of genomic and clinical data is increasingly used to select personalized therapies in dedicated tumor boards (Molecular Tumor Boards) or to identify patients for basket studies. Genomic alterations and clinical information can be stored, integrated and visualized in the open-access resource cBioPortal for Cancer Genomics. cBioPortal can be run as a local instance enabling storage and analysis of patient data in single institutions, in the respect of data privacy. However, uploading clinical input data and genetic aberrations requires the elaboration of multiple data files and specific data formats, which makes it difficult to integrate this system into clinical practice. To solve this problem, we developed cbpManager. Results cbpManager is an R package providing a web-based interactive graphical user interface intended to facilitate the maintenance of mutations data and clinical data, including patient and sample information, as well as timeline data. cbpManager enables a large spectrum of researchers and physicians, regardless of their informatics skills to intuitively create data files ready for upload in cBioPortal for Cancer Genomics on a daily basis or in batch. Due to its modular structure based on R Shiny, further data formats such as copy number and fusion data can be covered in future versions. Further, we provide cbpManager as a containerized solution, enabling a straightforward large-scale deployment in clinical systems and secure access in combination with ShinyProxy. cbpManager is freely available via the Bioconductor project at https://bioconductor.org/packages/cbpManager/ under the AGPL-3 license. It is already used at six University Hospitals in Germany (Mainz, Gießen, Lübeck, Halle, Freiburg, and Marburg). Conclusion In summary, our package cbpManager is currently a unique software solution in the workflow with cBioPortal for Cancer Genomics, to assist the user in the interactive generation and management of study files suited for the later upload in cBioPortal.
Data visualization and interactive data exploration are important aspects of illustrating complex concepts and results from analyses of omics data. A suitable visualization has to be intuitive and accessible. Web-based dashboards have become popular tools for the arrangement, consolidation, and display of such visualizations. However, the combination of automated data processing pipelines handling omics data and dynamically generated, interactive dashboards is poorly solved. Here, we present i2dash, an R package intended to encapsulate functionality for the programmatic creation of customized dashboards. It supports interactive and responsive (linked) visualizations across a set of predefined graphical layouts. i2dash addresses the needs of data analysts/software developers for a tool that is compatible and attachable to any R-based analysis pipeline, thereby fostering the separation of data visualization on one hand and data analysis tasks on the other hand. In addition, the generic design of i2dash enables the development of modular extensions for specific needs. As a proof of principle, we provide an extension of i2dash optimized for single-cell RNA sequencing analysis, supporting the creation of dashboards for the visualization needs of such experiments. Equipped with these features, i2dash is suitable for extensive use in large-scale sequencing/bioinformatics facilities. Along this line, we provide i2dash as a containerized solution, enabling a straightforward large-scale deployment and sharing of dashboards using cloud services. i2dash is freely available via the R package archive CRAN (https://CRAN.R-project.org/package=i2dash).
Non-conventional T cells, such as γδ T and invariant natural killer T (iNKT) cells, are emerging players in fighting cancer. Alpha-galactosylceramide (α-GalCer) is used as an exogenous ligand to activate iNKT cells. Human cells don’t have a direct pathway producing α-GalCer, which, however, can be produced by bacteria. We searched the literature for bacteria strains that are able to produce α-GalCer and used available sequencing data to analyze their presence in human tumor tissues and their association with survival. The modulatory effect of antibiotics on the concentration of α-GalCer was analyzed in mice. The human gut bacteria Bacteroides fragilis, Bacteroides vulgatus, and Prevotella copri produce α-GalCer structures that are able to activate iNKT cells. In mice, α-GalCer was depleted upon treatment with vancomycin. The three species were detected in colon adenocarcinoma (COAD) and rectum adenocarcinoma tissues, and Prevotella copri was also detected in bone tumors and glioblastoma tissues. Bacteroides vulgatus in COAD tissues correlated with better survival. In conclusion, α-GalCer-producing bacteria are part of the human gut microbiome and can infiltrate tumor tissues. These results suggest a new mechanism of interaction between bacteria and immune cells: α-GalCer produced by bacteria may activate non-conventional T cells in tumor tissues, where they can exert a direct or indirect anti-tumor activity.
Immunotherapy with immune checkpoint blockers (ICB) is associated with striking clinical success, but only in a small fraction of patients. Thus, we need computational biomarker-based methods that can anticipate which patients will respond to treatment. Current established biomarkers are imperfect due to their incomplete view of the tumor and its microenvironment. We have recently presented a novel approach that integrates transcriptomics data with biological knowledge to study tumors at a more holistic level. Validated in four different solid cancers, our approach outperformed the state-of-the-art methods to predict response to ICB. Here, we introduce estimate systems immune response (easier), an R/Bioconductor package that applies our approach to quantify biomarkers and assess patients’ likelihood to respond to immunotherapy, providing just the patients’ baseline bulk-tumor RNA-sequencing (RNA-seq) data as input.
Objective The objective was to develop a dataset definition, information model, and FHIR® specification for key data elements contained in a German molecular genomics (MolGen) report to facilitate genomic and phenotype integration in electronic health records. Materials and Methods A dedicated expert group participating in the German Medical Informatics Initiative reviewed information contained in MolGen reports, determined the key elements, and formulated a dataset definition. HL7’s Genomics Reporting Implementation Guide (IG) was adopted as a basis for the FHIR® specification which was subjected to a public ballot. In addition, elements in the MolGen dataset were mapped to the fields defined in ISO/TS 20428:2017 standard to evaluate compliance. Results A core dataset of 76 data elements, clustered into 6 categories was created to represent all key information of German MolGen reports. Based on this, a FHIR specification with 16 profiles, 14 derived from HL7®’s Genomics Reporting IG and 2 additional profiles (of the FamilyMemberHistory and RiskAssessment resources), was developed. Five example resource bundles show how our adaptation of an international standard can be used to model MolGen report data that was requested following oncological or rare disease indications. Furthermore, the map of the MolGen report data elements to the fields defined by the ISO/TC 20428:2017 standard, confirmed the presence of the majority of required fields. Conclusions Our report serves as a template for other research initiatives attempting to create a standard format for unstructured genomic report data. Use of standard formats facilitates integration of genomic data into electronic health records for clinical decision support.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.