2015
DOI: 10.7717/peerj.1273

The impact of Docker containers on the performance of genomic pipelines

Abstract: Genomic pipelines consist of several pieces of third party software and, because of their experimental nature, frequent changes and updates are commonly necessary thus raising serious deployment and reproducibility issues. Docker containers are emerging as a possible solution for many of these problems, as they allow the packaging of pipelines in an isolated and self-contained manner. This makes it easy to distribute and execute pipelines in a portable manner across a wide range of computing platforms. Thus, t…

Cited by 111 publications (53 citation statements)
References 19 publications
“…One study evaluating Docker in the context of HPC use [47] had to limit access to the Docker-enabled cluster only to “trusted” users due to security concerns (personal communication). The solution proposed here combines mature container building tools provided for multiple operating systems by the Docker project with HPC compatibility through Singularity.…”
Section: Discussion (mentioning)
confidence: 99%
“…Nevertheless, there are auxiliary tools that help the deployment and distribution process, mainly when the applications require complex setup tools. For example, it is possible to use software containers like Docker to distribute complex software and help deploy it, ensuring the whole community can run the software [21–23].…”
Section: Why Should We Care About Software Development Recommendations? (mentioning)
confidence: 99%
“…Due to the absence of established tools/workflows to handle multi-omic datasets, most of the aforementioned studies utilized non-standardized, ad hoc analyses, mostly consisting of custom workflows, thereby creating a challenge in reproducing the analyses [10, 45–47]. Given that the lack of reproducible bioinformatic workflows is not limited to those used for the multi-omic analysis of microbial consortia [10, 45–47], several approaches have recently been developed with the explicit aim of enhancing software reproducibility.…”
Section: Introduction (mentioning)
confidence: 99%
“…Given that the lack of reproducible bioinformatic workflows is not limited to those used for the multi-omic analysis of microbial consortia [10, 45–47], several approaches have recently been developed with the explicit aim of enhancing software reproducibility. These include a wide range of tools for constructing bioinformatic workflows [48–50] as well as containerizing bioinformatic tools/pipelines using Docker [29, 46–48].…”
Section: Introduction (mentioning)
confidence: 99%