2021
DOI: 10.1080/15361055.2020.1851073
|View full text |Cite
|
Sign up to set email alerts
|

A Framework for International Collaboration on ITER Using Large-Scale Data Transfer to Enable Near-Real-Time Analysis

Abstract: The global nature of the ITER project along with its projected approximately petabyte-per-day data generation presents not only a unique challenge but also an opportunity for the fusion community to rethink, optimize, and enhance our scientific discovery process. Recognizing this, collaborative research with computational scientists was undertaken over the past several years to create a framework for large-scale data movement across wide-area networks to enable global near-real-time analysis of fusion data. Th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 32 publications
(28 reference statements)
0
7
0
Order By: Relevance
“…Large-scale data analysis for experimental diagnostics can be accelerated using data science and networking techniques to stream the data from the experiment to large, remote HPC centers. By working with data streams, and leveraging the large HPC compute resources, better and more data analysis can be performed, which can better inform fusion scientists between plasma shots on the best way to optimize the next shot [413]. A demonstration of this used the streaming framework DELTA [414] to stream ECEI diagnostic data from the KSTAR tokamak in Korea to the NERSC HPC center in the USA, and complete spectral analysis of all channel pairs using multiple CPUs on the Cori supercomputer.…”
Section: Diagnostics and Fusion Data Streamsmentioning
confidence: 99%
“…Large-scale data analysis for experimental diagnostics can be accelerated using data science and networking techniques to stream the data from the experiment to large, remote HPC centers. By working with data streams, and leveraging the large HPC compute resources, better and more data analysis can be performed, which can better inform fusion scientists between plasma shots on the best way to optimize the next shot [413]. A demonstration of this used the streaming framework DELTA [414] to stream ECEI diagnostic data from the KSTAR tokamak in Korea to the NERSC HPC center in the USA, and complete spectral analysis of all channel pairs using multiple CPUs on the Cori supercomputer.…”
Section: Diagnostics and Fusion Data Streamsmentioning
confidence: 99%
“…The work discussed in this contribution focuses on measurements of KSTAR plasmas on Cori, located at the National Energy Research Scientific Compute Center. These sites are connected through the Energy Sciences Network and data transfer rate benchmarks are reported in Churchill et al [2021]. Second, Delta uses ADIOS2 Godoy et al [2020] as a streaming library.…”
Section: The Adaptable Near Real-time Analysis Frameworkmentioning
confidence: 99%
“…In this paper we are presenting a streaming data analysis framework that aims to connect fusion experiments with remote HPC resources Choi et al [2016], Ralph , , Churchill et al [2021]. The streaming paradigm implemented by this framework allows the big-and fast data generated by fusion experiments to be seamlessly analyzed using supercomputers as they are generated.…”
Section: Introductionmentioning
confidence: 99%
“…Increasing data volume and data rates are met by a dichotomy between growth in computing power and growth in capacity of the storage systems at high-performance computing (HPC) systems. Over the last 20 years, the floating point operations of all systems listed in the top 500 list have increased approximately tenfold every four years 6 . The performance of the I/O systems of the top 500 supercomputer has only been tracked for the last four years, and the peak bandwidth of the fastest system increased approximately four-fold 7 .…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we present a streaming data analysis framework that aims to connect fusion experiments with remote HPC resources [2,6,19,32]. The streaming paradigm implemented by this framework allows the data generated by fusion experiments to be seamlessly analyzed using supercomputers as they are generated.…”
Section: Introductionmentioning
confidence: 99%