2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
DOI: 10.1109/ccgrid.2014.128

From Scripted HPC-Based NGS Pipelines to Workflows on the Cloud

Abstract: In this paper we describe our initial experiences in the Cloud-e-Genome project with moving the whole exome sequencing pipeline from the scripted HPC-based solution to a workflow enactment system running in the cloud. We discuss shortcomings of the existing approach based on scripts and list benefits that a workflow-based solution can provide. Despite the effort it involved to wrap all required tools in the form of workflow blocks and the restrictions of the dataflow model used to represent workflows …

Cited by 9 publications (8 citation statements)
References 27 publications
“…This paper extends our preliminary workshop publication [13] which reported on initial progress on the Cloud-e-Genome project, a collaboration between the School of Computing Science and Institute of Genetic Medicine at Newcastle University. This extended version offers the following new contributions:…”
Section: Contributions and Relevance To This Journal Special Issue (mentioning)
confidence: 59%
“…For instance, one may allocate virtual clusters in the cloud, e.g. using StarCluster 13 or CloudMan [36], and then simply transfer data and scripts verbatim. that is atypical of the usually lower performance of the cloud than HPC (cf.…”
Section: Related Work (mentioning)
confidence: 99%
“…De Oliveira et al [11] propose a provenance based task scheduling algorithm for single site cloud environments. Some adaptation of SWfMSs [6,9] in the cloud environment can provide the parallelism in workflow level or activity level, which is coarse-grained, at a single site cloud. These methods cannot perform parallelism of the tasks of the same activities and they cannot handle the distributed input data at different sites.…”
Section: Related Work (mentioning)
confidence: 99%
“…To verify our algorithm for real workflow, we used one from the Cloud e-Genome project [22] (Figure 10). The project's overall goal is to facilitate the adoption of genetic testing in clinical practice at a population scale.…”
Section: B Workflows From a Real Scientific Application (mentioning)
confidence: 99%