2017
DOI: 10.7717/peerj.3509
|View full text |Cite
|
Sign up to set email alerts
|

GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis

Abstract: There are many steps in analyzing transcriptome data, from the acquisition of raw data to the selection of a subset of representative genes that explain a scientific hypothesis. The data produced can be represented as networks of interactions among genes and these may additionally be integrated with other biological databases, such as Protein-Protein Interactions, transcription factors and gene annotation. However, the results of these analyses remain fragmented, imposing difficulties, either for posterior ins… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
9
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 76 publications
0
9
0
Order By: Relevance
“…However, they do not integrate their data in a framework that allows scalable and detailed querying (e.g., quickly extracting all water table and temperature data from multiple sites into a single table, for scaling of water table-temperature relationships from individual sites to a broader geographical range). The field of bioinformatics is further along in this regard: for molecular meta-omic data, numerous databases (e.g., MIGS/MIMS, MIMAS, IMG/M, GeneLab) (Hermida et al, 2006;Field et al, 2008;Gattiker et al, 2009;Chen et al, 2019;Ray et al, 2019) and integrative data management platforms (e.g., KBase, MOD-CO, ODG, GeNNet, BioKNO, MGV, OMMS, mixOmics) (Sujansky, 2001;Symons & Nieselt, 2011;Perez-Arriaga et al, 2015;Yoon, Kim & Kim, 2017;Costa et al, 2017;Rohart et al, 2017;Guhlin et al, 2017;Manzoni et al, 2018;Arkin et al, 2018;Brandizi et al, 2018;Rambold et al, 2019) have been developed, and often include standardization of sample metadata to enable efficient data integration. Notable among these are KBase (https://kbase.us/) (Arkin et al, 2018), which provides "apps" through which users can process their data in a framework that tracks processing steps ("provenance") in an accessible format, and MOD-CO (Rambold et al, 2019), a bioinformatics data processing tool that includes a conceptual schema and data model to track metadata and workflows.…”
Section: Introductionmentioning
confidence: 99%
“…However, they do not integrate their data in a framework that allows scalable and detailed querying (e.g., quickly extracting all water table and temperature data from multiple sites into a single table, for scaling of water table-temperature relationships from individual sites to a broader geographical range). The field of bioinformatics is further along in this regard: for molecular meta-omic data, numerous databases (e.g., MIGS/MIMS, MIMAS, IMG/M, GeneLab) (Hermida et al, 2006;Field et al, 2008;Gattiker et al, 2009;Chen et al, 2019;Ray et al, 2019) and integrative data management platforms (e.g., KBase, MOD-CO, ODG, GeNNet, BioKNO, MGV, OMMS, mixOmics) (Sujansky, 2001;Symons & Nieselt, 2011;Perez-Arriaga et al, 2015;Yoon, Kim & Kim, 2017;Costa et al, 2017;Rohart et al, 2017;Guhlin et al, 2017;Manzoni et al, 2018;Arkin et al, 2018;Brandizi et al, 2018;Rambold et al, 2019) have been developed, and often include standardization of sample metadata to enable efficient data integration. Notable among these are KBase (https://kbase.us/) (Arkin et al, 2018), which provides "apps" through which users can process their data in a framework that tracks processing steps ("provenance") in an accessible format, and MOD-CO (Rambold et al, 2019), a bioinformatics data processing tool that includes a conceptual schema and data model to track metadata and workflows.…”
Section: Introductionmentioning
confidence: 99%
“…However, they do not integrate their data in a framework that allows scalable and detailed querying (for example, quickly extracting all water table and temperature data from multiple sites into a single table, for scaling of water table-temperature relationships from individual sites to a broader geographical range). The field of bioinformatics is further along in this regard: for molecular meta-omic data, numerous databases (e.g., MIGS/MIMS, MIMAS, IMG/M, GeneLab) (Hermida et al, 2006;Field et al, 2008;Gattiker et al, 2009;Chen et al, 2019;Ray et al, 2019) and integrative data management platforms (e.g., KBase, MOD-CO, ODG, GeNNet, BioKNO, MGV, OMMS, mixOmics) (Sujansky, 2001;Symons & Nieselt, 2011;Perez-Arriaga et al, 2015;Yoon, Kim & Kim, 2017;Costa et al, 2017;Rohart et al, 2017;Guhlin et al, 2017;Manzoni et al, 2018;Arkin et al, 2018;Brandizi et al, 2018;Rambold et al, 2019) have been developed, and often include standardization of sample metadata to enable efficient data integration. Notable among these are KBase (https://kbase.us/) (Arkin et al, 2018), which provides "apps" through which users can process their data in a framework that tracks processing steps ("provenance") in an accessible format, and MOD-CO (Rambold et al, PeerJ reviewing PDF | (2019:10:42335:1:1:NEW 22 May 2020)…”
Section: Introductionmentioning
confidence: 99%
“…A recent paper presents GeNNet 6 , which describes the rationale for scripted workflows and the use of graph databases in reproducible research. In this paper a scripted workflow in R 7 , use of Neo4j to store data and the use of Docker for reproducibility is described.…”
Section: Introductionmentioning
confidence: 99%