SC18: International Conference for High Performance Computing, Networking, Storage and Analysis 2018
DOI: 10.1109/sc.2018.00076
|View full text |Cite
|
Sign up to set email alerts
|

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In-Situ Workflows

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(17 citation statements)
references
References 31 publications
0
17
0
Order By: Relevance
“…But the performance improvements that they reported were negligible (0.1-6%). Nevertheless, there is a growing trend in using ML techniques to solve storage and OS problems: predicting index structures in key-value stores [17,38], memory allocation [47], TCP congestion control [24], offline black-box optimization for storage parameters [8], database query optimization [37], local and distributed caching [60,66] and cloud resource management [16,19,20].…”
Section: Related Workmentioning
confidence: 99%
“…But the performance improvements that they reported were negligible (0.1-6%). Nevertheless, there is a growing trend in using ML techniques to solve storage and OS problems: predicting index structures in key-value stores [17,38], memory allocation [47], TCP congestion control [24], offline black-box optimization for storage parameters [8], database query optimization [37], local and distributed caching [60,66] and cloud resource management [16,19,20].…”
Section: Related Workmentioning
confidence: 99%
“…For example, the scheduling of data transfer between tasks can too often create bottlenecks between computation and communication phases, and manual optimizations are often complex (Huang et al, 2019). We can train ML models to classify the workflow phases to optimize data movements, to orchestrate I/O (Meng et al, 2014; Wang et al, 2015), and to manage hierarchical storage (Dong et al, 2016) and data staging (Subedi et al, 2018). Also, as in-situ execution becomes more prevalent (Huang et al, 2019; Kwan-Liu, 2009; Subedi et al, 2018), ML can play an important role in automating the placement of tasks to automatically find an optimal trade-off.…”
Section: Current Challenges In Scientific Workflowsmentioning
confidence: 99%
“…TRIO [19] explores how to efficiently move large checkpointing datasets to the PFS by utilizing the burst buffers. Data Elevator [20] and Stacker [21] are similar to NORNS in that they focus on asynchronously moving data across I/O layers to optimize scientific workflows. The former specializes on applications using HDF5 while the latter optimizes data movements using machine learning techniques.…”
Section: Related Workmentioning
confidence: 99%
“…Unfortunately, while computing and network resources can be shared and managed effectively by state-of-the-art job schedulers, storage resources are still mostly considered as black boxes by these infrastruc-978-1-7281-4734-5/19/$31.00 ©2019 IEEE [18]. While there has been increasing interest in HPC to use burst buffers to optimize the I/O path of datadriven workflows through autonomous, asynchronous data staging [19] [20] [21], these research efforts have not considered I/O as a first class entity in resource scheduling decisions. Thus, we argue that the integration of application I/O needs with scheduling and resource managers is critical to effectively use and manage a hierarchical storage stack that can include as many layers as NVRAM, node-local burst buffers, shared burst buffers, parallel file system, campaign storage, and archival storage.…”
Section: Introductionmentioning
confidence: 99%