2022
DOI: 10.1145/3514245
|View full text |Cite
|
Sign up to set email alerts
|

A Case For Intra-rack Resource Disaggregation in HPC

Abstract: The expected halt of traditional technology scaling is motivating increased heterogeneity in high performance computing (HPC) systems with the emergence of numerous specialized accelerators. As heterogeneity increases, so does the risk of underutilizing expensive hardware resources if we preserve today’s rigid node configuration and reservation strategies. This has sparked interest in resource disaggregation to enable finer-grain allocation of hardware resources to applications. However, there is currently no … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(22 citation statements)
references
References 64 publications
0
22
0
Order By: Relevance
“…Other works utilize performance monitoring infrastructure to characterize application and system performance in HPC [6,9,19,20,24,25]. In particular, the paper presented by Ji et al analyzed various application memory usage in terms of object access patterns [6].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Other works utilize performance monitoring infrastructure to characterize application and system performance in HPC [6,9,19,20,24,25]. In particular, the paper presented by Ji et al analyzed various application memory usage in terms of object access patterns [6].…”
Section: Related Workmentioning
confidence: 99%
“…Peng et al focused on the memory subsystem and studied the temporal and spatial memory usage in two production HPC systems at LLNL [20]. Michelogiannakis et al [9] performed a detailed analysis of key metrics sampled in NERSC's Cori as a data-driven study of the potential of resource disaggregation in HPC.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…We evaluate three metrics of memory usage in this section capacity, page access, and bandwidth. While capacity usage has been extensively studied in multiple large-scale studies on leadership clusters [1], [2], [24], the other two metrics on page and bandwidth usage are important for understanding the need for memory resources and optimization opportunities.…”
Section: A Characterization Of Memory Usagementioning
confidence: 99%
“…Such static coarse-grained provisioning simplifies resource management. However, recent studies on large-scale supercomputers indicate that node-level memory utilization can be as low as 15% [1], [2]. One reason is that memory bandwidth and capacity are tightly coupled in current HPC systems.…”
Section: Introductionmentioning
confidence: 99%