2023
DOI: 10.1038/s41597-023-01968-9
|View full text |Cite
|
Sign up to set email alerts
|

Developing a standardized but extendable framework to increase the findability of infectious disease datasets

Abstract: Biomedical datasets are increasing in size, stored in many repositories, and face challenges in FAIRness (findability, accessibility, interoperability, reusability). As a Consortium of infectious disease researchers from 15 Centers, we aim to adopt open science practices to promote transparency, encourage reproducibility, and accelerate research advances through data reuse. To improve FAIRness of our datasets and computational tools, we evaluated metadata standards across established biomedical data repositori… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 45 publications
0
2
0
Order By: Relevance
“…To define which properties need to be collected about a dataset, a schema defines the set of field names within the data and what they represent (i.e., description (dataset description), creator (author(s) who generated and/or processed the data), measurementTechnique (experimental technique(s) used to collect the data), etc. 1 ). More detailed schemas also define the allowable values (controlled vocabularies, or ontologies, which are formal representations of allowed values and their relationship to each other) and constraints such as type or expected number for each property.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…To define which properties need to be collected about a dataset, a schema defines the set of field names within the data and what they represent (i.e., description (dataset description), creator (author(s) who generated and/or processed the data), measurementTechnique (experimental technique(s) used to collect the data), etc. 1 ). More detailed schemas also define the allowable values (controlled vocabularies, or ontologies, which are formal representations of allowed values and their relationship to each other) and constraints such as type or expected number for each property.…”
mentioning
confidence: 99%
“…Often, data standards, tools, software, platforms, and resources are developed as pilot projects or as side effects of hypothesis-driven scientific grants. For example, the NIAID Systems Biology Data Dissemination Working Group developed and implemented an infectious disease-specific Dataset and ComputationalTool schema, increasing the FAIRness of nearly 400 datasets and computational tools using it 1 . The schema is straightforward, yet has potential to exponentially enhance biological and biomedical dataset accessibility and reuse via increased exposure through dataset aggregation projects like Google Dataset Search.…”
mentioning
confidence: 99%