2014
DOI: 10.1002/cpe.3274

Experiences building Globus Genomics: a next‐generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services

Abstract: We describe Globus Genomics, a system that we have developed for rapid analysis of large quantities of next-generation sequencing (NGS) genomic data. This system achieves a high degree of end-to-end automation that encompasses every stage of data analysis including initial data retrieval from remote sequencing centers or storage (via the Globus file transfer system); specification, configuration, and reuse of multi-step processing pipelines (via the Galaxy workflow system); creation of custom Amazon Machine Im…

Citations: cited by 67 publications (50 citation statements)
References: 26 publications (33 reference statements)
“…We aim to leverage billing capabilities we have developed for the Globus Genomics [33] service for this purpose. The Globus Genomics model charges users for the cumulative costs associated with each analysis job as well as a fixed subscription for using the service.…”
Section: Pricing Strategies (mentioning)
confidence: 99%
“…To address these challenges, Globus Genomics [58,59] was developed at the Computation Institute (CI), a joint institute between the University of Chicago and Argonne National Laboratory. Globus Genomics is a cloud-based integrated solution for NGS data analysis.…”
Section: Bioinformatics Platforms (mentioning)
confidence: 99%
“…It extends the existing Galaxy workflow system by adding data management capabilities for transferring large quantities of data efficiently and reliably (via Globus Transfer), domain-specific analyses tools preconfigured for immediate use by researchers (via user-specific tools integration), automatic deployment on cloud for on-demand resource allocation and pay-as-you-go pricing (via Globus Provision), and a cloud provisioning tool for auto-scaling (via HTCondor scheduler). Genome sequencing is notoriously data-intensive, and Globus Transfer [59] is designed for fast and secure movement of large amounts of data. Setting up a production instance of Galaxy is a nontrivial task that involves a number of manual installation and configuration steps for both the platform and any dependent software packages - steps that can be both error-prone and time-consuming.…”
Section: Bioinformatics Platforms (mentioning)
confidence: 99%
“…Most of them are commercial products; among these, the best known and most widely used are BaseSpace (Illumina, Inc.) [13], CLCBio [14], Lasergene (DNASTAR, Inc.) [15], and Geneious [16]. Among the free and hybrid (partly commercial, partly free) packages, the popular ones are Galaxy [17,18], Globus Genomics [19], PATRIC [20], and UGENE [21,22]. They offer very broad capabilities; most of these software packages can be installed on a local computer or server, and they run under various operating systems.…”
Section: Software (unclassified)