2019
DOI: 10.1093/gigascience/giz044
|View full text |Cite
|
Sign up to set email alerts
|

SciPipe: A workflow library for agile development of complex and dynamic bioinformatics pipelines

Abstract: Background The complex nature of biological data has driven the development of specialized software tools. Scientific workflow management systems simplify the assembly of such tools into pipelines, assist with job automation, and aid reproducibility of analyses. Many contemporary workflow tools are specialized or not designed for highly complex workflows, such as with nested loops, dynamic scheduling, and parametrization, which is common in, e.g., machine learning. Findings … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 29 publications
(20 citation statements)
references
References 38 publications
(53 reference statements)
0
17
0
Order By: Relevance
“…Workflow management systems are developed for robust and easy implementation of computational pipelines; nevertheless, they differ significantly in terms of workflows, definitions, job scheduling, and features (5,6,7,8). For example, Snakemake uses a "pullbased" strategy to check for specific output files and schedule jobs accordingly (5,6), whereas Nextflow uses a "push-based" scheme in which a "process" defined in the workflow pushes its outputs to downstream "processes" (7,24). The SciPipe (5) workflow library is written in the GO language; similar to Nextflow it implements dataflow based task scheduling.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Workflow management systems are developed for robust and easy implementation of computational pipelines; nevertheless, they differ significantly in terms of workflows, definitions, job scheduling, and features (5,6,7,8). For example, Snakemake uses a "pullbased" strategy to check for specific output files and schedule jobs accordingly (5,6), whereas Nextflow uses a "push-based" scheme in which a "process" defined in the workflow pushes its outputs to downstream "processes" (7,24). The SciPipe (5) workflow library is written in the GO language; similar to Nextflow it implements dataflow based task scheduling.…”
Section: Discussionmentioning
confidence: 99%
“…A major challenge in bioinformatics analysis of RNA-Seq datasets is implementing data processing pipelines in an efficient, modular, and reproducible manner (4,5,6,7). A majority of existing bioinformatics tools are standalone linux programs, executed via the shell; writing bioinformatic pipelines as shell, perl, or python scripts is a very common practice among bioinformaticians.…”
Section: Introductionmentioning
confidence: 99%
“…Workflow management systems together with Linux containers offer a solution to efficiently analyze large scale datasets in a highly reproducible, scalable and parallelizable manner. During the last decade, an increasing interest in the field has led to the development of different programs such as Snakemake (Köster andRahmann, 2012), NextFlow (Di Tommaso et al, 2017), Galaxy (Afgan et al, 2018), SciPipe (Lampa et al, 2019) or GenPipes (Bourgey et al, 2019), among others. These tools enable the prototyping and deployment of pipelines by abstracting computational processes and representing pipelines as directed graphs, in which nodes represent tasks to be executed and edges represent either data flow or execution dependencies between different tasks.…”
Section: Overview Of the Masterofpores Workflowmentioning
confidence: 99%
“…While Nipype is a widely used tool in neuroimaging and has many readily-defined interfaces available for researchers, the others require non-insignificant development to describe interfaces for common neuroimaging applications such as FSL ( Jenkinson et al, 2012 ) or MRtrix ( Tournier and Calamante, 2012 ). PSOM ( Bellec et al, 2012 ) and Scipipe ( Lampa et al, 2018 ) are functionally similar to Nipype but have been developed for GNU Octave/MATLAB and Golang, respectively. Several domains have more specialized tools which accomplish similar feats in their area of interest.…”
Section: Emergent Technologies In Reproducible Neurosciencementioning
confidence: 99%