2021
DOI: 10.1109/mcse.2021.3052101
|View full text |Cite
|
Sign up to set email alerts
|

Using Jupyter for Reproducible Scientific Workflows

Abstract: Literate computing has emerged as an important tool for computational studies and open science, with growing folklore of best practices. In this work, we report two case studiesone in computational magnetism and another in computational mathematics-where domain-specific software was exposed to the Jupyter environment. This enables high level control of simulations and computation, interactive exploration of computational results, batch processing on HPC resources, and reproducible workflow documentation in Jup… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
38
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 73 publications
(53 citation statements)
references
References 11 publications
(10 reference statements)
0
38
0
Order By: Relevance
“…Although this shows the potential for combining exploration and explanation of new analysis methods in a common framework, Rule et al explain how reproducibility is better achieved when users develop their notebooks following general purpose guidelines [13]. Eventually, to really reproduce the analysis, users have be familiar with additional systems, (i.e., data and software repositories) [1], requiring them to learn new technologies and formats. Obtaining actionable metadata that would also foster the automation of…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Although this shows the potential for combining exploration and explanation of new analysis methods in a common framework, Rule et al explain how reproducibility is better achieved when users develop their notebooks following general purpose guidelines [13]. Eventually, to really reproduce the analysis, users have be familiar with additional systems, (i.e., data and software repositories) [1], requiring them to learn new technologies and formats. Obtaining actionable metadata that would also foster the automation of…”
Section: Related Workmentioning
confidence: 99%
“…Pursuing reproducible research in Jupyter Notebook requires users to adopt general good practices [13] and learn how to use complementary systems [1]. We illustrate how SWIRRL assists users in this objective.…”
Section: Automating Notebook Reproducibilitymentioning
confidence: 99%
“…For data-proximate computational resources, we have implemented a JupyterHub, an open-source platform gaining rapid attention in the scientific community (Fangohr et al, 2019;Beg et al, 2021), on Google Cloud Platform in collaboration with 2i2c.org, a non-profit organization based in the U.S. The authentication for each user/collaborator to the JupyterHub is done via Github.…”
Section: Cloud-based Jupyterhubmentioning
confidence: 99%
“…Jupyter is also recognised by Hardisty and Wittenburg [9] as one possible CWFR solutions. Beg et al [29] also discuss the reproducibility of Jupyter notebooks as a scientific workflow. Jupyter notebook provides a one-study, one-document concept that is easily shareable.…”
Section: Notebook and Gitmentioning
confidence: 99%