Motivation -To curate and organize expensive spaceflight experiments conducted aboard space stations and maximize the scientific return of investment, while democratizing access to vast amounts of spaceflight related omics data generated from several model organisms. Results -The GeneLab Data System (GLDS) is an open access database containing fully coordinated and curated "omics" (genomics, transcriptomics, proteomics, metabolomics) data, detailed metadata and radiation dosimetry for a variety of model organisms. GLDS is supported by an integrated data system allowing federated search across several public bioinformatics repositories. Archived datasets can be queried using full-text search (e.g., keywords, Boolean and wildcards) and results can be sorted in multifactorial manner using assistive filters. GLDS also provides a collaborative platform built on GenomeSpace for sharing files and analyses with collaborators. It currently houses 172 datasets and supports standard guidelines for submission of datasets, MIAME (for microarray), ENCODE Consortium Guidelines (for RNA-seq) and MIAPE Guidelines (for proteomics).
Translating fundamental biological discoveries from NASA Space Biology program into health risk from space flights has been an ongoing challenge. We propose to use NASA GeneLab database to gain new knowledge on potential systemic responses to space. Unbiased systems biology analysis of transcriptomic data from seven different rodent datasets reveals for the first time the existence of potential “master regulators” coordinating a systemic response to microgravity and/or space radiation with TGF-β1 being the most common regulator. We hypothesized the space environment leads to the release of biomolecules circulating inside the blood stream. Through datamining we identified 13 candidate microRNAs (miRNA) which are common in all studies and directly interact with TGF-β1 that can be potential circulating factors impacting space biology. This study exemplifies the utility of the GeneLab data repository to aid in the process of performing novel hypothesis–based research.
Microbiome samples are inherently defined by the environment in which they are found. Therefore, data that provide context and enable interpretation of measurements produced from biological samples, often referred to as metadata, are critical. Important contributions have been made in the development of community-driven metadata standards; however, these standards have not been uniformly embraced by the microbiome research community. To understand how these standards are being adopted, or the barriers to adoption, across research domains, institutions, and funding agencies, the National Microbiome Data Collaborative (NMDC) hosted a workshop in October 2019. This report provides a summary of discussions that took place throughout the workshop, as well as outcomes of the working groups initiated at the workshop.
The mission of NASA’s GeneLab database (https://genelab.nasa.gov/) is to collect, curate, and provide access to the genomic, transcriptomic, proteomic and metabolomic (so-called ‘omics’) data from biospecimens flown in space or exposed to simulated space stressors, maximizing their utilization. This large collection of data enables the exploration of molecular network responses to space environments using a systems biology approach. We review here the various components of the GeneLab platform, including the new data repository web interface, and the GeneLab Online Data Entry (GEODE) web portal, which will support the expansion of the database in the future to include companion non-omics assay data. We discuss our design for GEODE, particularly how it promotes investigators providing more accurate metadata, reducing the curation effort required of GeneLab staff. We also introduce here a new GeneLab Application Programming Interface (API) specifically designed to support tools for the visualization of processed omics data. We review the outreach efforts by GeneLab to utilize the spaceflight data in the repository to generate novel discoveries and develop new hypotheses, including spearheading data analysis working groups, and a high school student training program. All these efforts are aimed ultimately at supporting precision risk management for human space exploration.
Accurate assessment of risks of long-term space missions is critical for human space exploration. It is essential to have a detailed understanding of the biological effects on humans living and working in deep space. Ionizing radiation from galactic cosmic rays (GCR) is a major health risk factor for astronauts on extended missions outside the protective effects of the Earth's magnetic field. Currently, there are gaps in our knowledge of the health risks associated with chronic low-dose, low-dose-rate ionizing radiation, specifically ions associated with high (H) atomic number (Z) and energy (E). The NASA GeneLab project ( https://genelab.nasa.gov/ ) aims to provide a detailed library of omics datasets associated with biological samples exposed to HZE. The GeneLab Data System (GLDS) includes datasets from both spaceflight and ground-based studies, a majority of which involve exposure to ionizing radiation. In addition to detailed information on radiation exposure for ground-based studies, GeneLab is adding detailed, curated dosimetry information for spaceflight experiments. GeneLab is the first comprehensive omics database for space-related research from which an investigator can generate hypotheses to direct future experiments, utilizing both ground and space biological radiation data. The GLDS is continually expanding as omics-related data are generated by the space life sciences community. Here we provide a brief summary of the space radiation-related data available at GeneLab.
Summary With the development of transcriptomic technologies, we are able to quantify precise changes in gene expression profiles from astronauts and other organisms exposed to spaceflight. Members of NASA GeneLab and GeneLab-associated analysis working groups (AWGs) have developed a consensus pipeline for analyzing short-read RNA-sequencing data from spaceflight-associated experiments. The pipeline includes quality control, read trimming, mapping, and gene quantification steps, culminating in the detection of differentially expressed genes. This data analysis pipeline and the results of its execution using data submitted to GeneLab are now all publicly available through the GeneLab database. We present here the full details and rationale for the construction of this pipeline in order to promote transparency, reproducibility, and reusability of pipeline data; to provide a template for data processing of future spaceflight-relevant datasets; and to encourage cross-analysis of data from other databases with the data available in GeneLab.
Users of the ISAID indexing system create complex, precise, and accurate indexing for full-text documents much faster than users of manual methods. Furthermore, the natural language processing methods that ISAID uses to suggest indexes contributes substantially to increased indexing speed and accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.