2018
DOI: 10.1101/488643
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Galaxy-Kubernetes integration: scaling bioinformatics workflows in the cloud

Abstract: SummaryMaking reproducible, auditable and scalable data-processing analysis workflows is an important challenge in the field of bioinformatics. Recently, software containers and cloud computing introduced a novel solution to address these challenges. They simplify software installation, management and reproducibility by packaging tools and their dependencies. In this work we implemented a cloud provider agnostic and scalable container orchestration setup for the popular Galaxy workflow environment. This soluti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
3
3

Relationship

4
5

Authors

Journals

citations
Cited by 19 publications
(15 citation statements)
references
References 10 publications
0
15
0
Order By: Relevance
“…More recently, through efforts within the Phe-noMeNal H2020 Project [26], Galaxy gained the ability to deploy inside Kubernetes in an automated fashion through the use of Helm Charts [27]. The Galaxy-stable Helm chart makes use of community maintained Galaxy container images (Docker-Galaxy-stable), which are used on other container orchestrators, and include support for SSH file transfer protocol (SFTP) access, database backends and even deployment/compatibility of HPC scheduling systems like Condor or SLURM for hybrid cloud setups.…”
Section: Galaxymentioning
confidence: 99%
“…More recently, through efforts within the Phe-noMeNal H2020 Project [26], Galaxy gained the ability to deploy inside Kubernetes in an automated fashion through the use of Helm Charts [27]. The Galaxy-stable Helm chart makes use of community maintained Galaxy container images (Docker-Galaxy-stable), which are used on other container orchestrators, and include support for SSH file transfer protocol (SFTP) access, database backends and even deployment/compatibility of HPC scheduling systems like Condor or SLURM for hybrid cloud setups.…”
Section: Galaxymentioning
confidence: 99%
“…Galaxy (https://galaxyproject.org/) is a web application workflow environment written in Python, capable of distributing jobs among a plethora of batch schedulers (PBS, LSF, GoDocker, DRMAA based schedulers, etc. ), local machine (through containers or conda), and cloud providers (through Kubernetes and others). In the HPC case, Galaxy provides the flexibility to use either containers (Docker or Singularity) or directly Conda packages.…”
Section: Moving From Desktop Applications To Distributed Hpc and Cloumentioning
confidence: 99%
“…Kubernetes works with many container engines and enables automated deployment on cloud or on-premise clusters. One of the successful adoptions of Kubernetes in genomics is the Galaxy project [53]. However, Kubernetes does not reduce the deployment complexity for applications with complex workflows.…”
Section: Container Scalabilitymentioning
confidence: 99%