Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2015
DOI: 10.1145/2807591.2807663
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(17 citation statements)
references
References 29 publications
0
16
0
Order By: Relevance
“…In the in-situ approach, simulation and analyzes are co-located in the same node, while in the in-transit approach, the data analyzes are outsourced onto dedicated nodes [12]. Several studies have shown that large-scale simulations with in-situ could benefit from co-scheduling approaches [11,15]. The difficulty consists in ensuring that the in-situ part processes the data fast enough to avoid slowing down the main simulation, which is directly related to co-scheduling issues: how to partition the resources across the concurrent analysis applications that share the CMP?…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In the in-situ approach, simulation and analyzes are co-located in the same node, while in the in-transit approach, the data analyzes are outsourced onto dedicated nodes [12]. Several studies have shown that large-scale simulations with in-situ could benefit from co-scheduling approaches [11,15]. The difficulty consists in ensuring that the in-situ part processes the data fast enough to avoid slowing down the main simulation, which is directly related to co-scheduling issues: how to partition the resources across the concurrent analysis applications that share the CMP?…”
Section: Related Workmentioning
confidence: 99%
“…In this paper, we also use CAT to partition the LLC into several areas when coscheduling applications, but with the objective of optimizing the throughput of in-situ or in-transit analysis for large-scale simulations. Indeed, in such simulations, data is generated at each iteration and periodically analyzed by parallel processes on dedicated nodes, concurrently of the main simulation [11]. If these dedicated nodes belong to the main simulation platform (thereby reducing the number of available cores for simulation), we speak of in-situ processing, while if they belong to an auxiliary platform, we speak of of in-transit processing [12].…”
Section: Introductionmentioning
confidence: 99%
“…Scientific data that generated from large‐scale simulations and observational devices are growing rapidly in data size and complexity. A fundamental challenge of managing scientific data on high‐performance computing (HPC) systems is to provide efficient data locating services in both file level and record level.…”
Section: Introductionmentioning
confidence: 99%
“…These solutions are starting to be implemented for HPC applications. Sewell et al [27] explain that in the case of the HACC application (a cosmological code), petabytes of data are created to be analyzed later. The analysis is done by multiple independent processes.…”
Section: Introductionmentioning
confidence: 99%