The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2019
DOI: 10.1007/978-1-4939-9074-0_24
|View full text |Cite
|
Sign up to set email alerts
|

Scalable Workflows and Reproducible Data Analysis for Genomics

Abstract: Biological, clinical, and pharmacological research now often involves analyses of genomes, transcriptomes, proteomes, and interactomes, within and between individuals and across species. Due to large volumes, the analysis and integration of data generated by such high-throughput technologies have become computationally intensive, and analysis can no longer happen on a typical desktop computer. In this chapter we show how to describe and execute the same analysis using a number of workflow systems and how these… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
30
0
2

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 32 publications
(32 citation statements)
references
References 33 publications
0
30
0
2
Order By: Relevance
“…ics data analysis workflow with flexibility, extensibility and reproducibility [39,40,41,42,43,44]. Herein, we used Nextflow to encapsulate the seven data processing pipelines so that users can directly call different pipelines through a uniform set of APIs.…”
Section: Nextflow Pipeline Design and Featuresmentioning
confidence: 99%
“…ics data analysis workflow with flexibility, extensibility and reproducibility [39,40,41,42,43,44]. Herein, we used Nextflow to encapsulate the seven data processing pipelines so that users can directly call different pipelines through a uniform set of APIs.…”
Section: Nextflow Pipeline Design and Featuresmentioning
confidence: 99%
“…Reproducibility in biomedical analyses has become important recently (Grüning et al, 2018; Strozzi et al, 2019). For workflows, this means that anyone should be able to reconstruct the exact workflow run, including the correct sequence of steps, the actual commands, the runtime parameters and options, and the handling of data, e.g., chunking for parallelization, to reach the exact same conclusions despite differences in hardware, operating systems, and software dependencies.…”
Section: Resultsmentioning
confidence: 99%
“…Workflow management systems are developed for robust and easy implementation of computational pipelines; nevertheless, they differ significantly in terms of workflows, definitions, job scheduling, and features (5,6,7,8). For example, Snakemake uses a "pullbased" strategy to check for specific output files and schedule jobs accordingly (5,6), whereas Nextflow uses a "push-based" scheme in which a "process" defined in the workflow pushes its outputs to downstream "processes" (7,24). The SciPipe (5) workflow library is written in the GO language; similar to Nextflow it implements dataflow based task scheduling.…”
Section: Discussionmentioning
confidence: 99%