2022
DOI: 10.1007/978-3-030-96498-6_6
|View full text |Cite
|
Sign up to set email alerts
|

Transitioning from File-Based HPC Workflows to Streaming Data Pipelines with openPMD and ADIOS2

Abstract: This paper aims to create a transition path from file-based IO to streaming-based workflows for scientific applications in an HPC environment. By using the openPMP-api, traditional workflows limited by filesystem bottlenecks can be overcome and flexibly extended for in situ analysis. The openPMD-api is a library for the description of scientific data according to the Open Standard for Particle-Mesh Data (openPMD). Its approach towards recent challenges posed by hardware heterogeneity lies in the decoupling of … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…A highperformance, openPMD C++ and Python reference implementation, co-developed by LBNL and HZDR/CASUS, is used in massively parallel codes and in data analysis [27]. Recently, data streaming techniques were developed to transition traditional post-processing scripts to online, massively parallel data analysis workflows, which can be co-located with simulations and enable rapidly prototyping for scientific intransit analysis [28].…”
Section: Modernizing Blast For Exascalementioning
confidence: 99%
“…A highperformance, openPMD C++ and Python reference implementation, co-developed by LBNL and HZDR/CASUS, is used in massively parallel codes and in data analysis [27]. Recently, data streaming techniques were developed to transition traditional post-processing scripts to online, massively parallel data analysis workflows, which can be co-located with simulations and enable rapidly prototyping for scientific intransit analysis [28].…”
Section: Modernizing Blast For Exascalementioning
confidence: 99%
“…This instance demonstrates Julia's potential as a unified language that can seamlessly connect different stages of an HPC workflow, from computational simulation to data visualization. Future work includes further performance tuning and evaluation of the trade-offs for in-memory streaming data pipelines [34]. By bridging various components in a typical HPC workflow, Julia could emerge as a compelling option for scientists and engineers working on complex, large-scale computational problems.…”
Section: Parallel I/omentioning
confidence: 99%
“…A challenge arises in the rapid setup of in situ processing pipelines, since typical approaches of implementing a virtual detector in the simulation code itself can be more complicated compared to scripted (serial) analysis workflows on files. Innovative software design can potentially alleviate this challenge, with approaches such as streaming data with close-to-file-like analysis syntax [135] or embedding of interpreters into running simulations to execute scripts on in-memory data. Continued research in this area and close collaboration with computer science and engineering teams will be essential to ensure that flexible, science-case specific analysis can be designed by accelerator and beam physicists.…”
Section: Jinst 16 T10003mentioning
confidence: 99%