Proceedings of the 34th ACM International Conference on Supercomputing 2020
DOI: 10.1145/3392717.3392764
|View full text |Cite
|
Sign up to set email alerts
|

Mapping and scheduling HPC applications for optimizing I/O

Abstract: In HPC platforms, concurrent applications are sharing the same file system. This can lead to conflicts, especially as applications are more and more data intensive. I/O contention can represent a performance bottleneck. The access to bandwidth can be split in two complementary yet distinct problems. The mapping problem and the scheduling problem. The mapping problem consists in selecting the set of applications that are in competition for the I/O resource. The scheduling problem consists then, given I/O reques… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(23 citation statements)
references
References 32 publications
0
22
0
Order By: Relevance
“…Too many I/O phases [9] ✓ ✓ Bandwidth limited by a single OST I/O bandwidth [9], [10] ✓ Limited by the small data size [9] ✓ ✓ Unbalanced I/O workload among ranks [9] ✓ ✓ Large number of small I/O requests [9] ✓ ✓ Unbalanced I/O workload on OSTs [9], [11] ✓ ✓ ✓ Bad file system weather [9], [12] ✓ Redundant/overlapping I/O accesses [24], [25] ✓ ✓ I/O resource contention at OSTs [13], [14] ✓ Heavy metadata load [10] ✓…”
Section: Dxt Systemmentioning
confidence: 99%
“…Too many I/O phases [9] ✓ ✓ Bandwidth limited by a single OST I/O bandwidth [9], [10] ✓ Limited by the small data size [9] ✓ ✓ Unbalanced I/O workload among ranks [9] ✓ ✓ Large number of small I/O requests [9] ✓ ✓ Unbalanced I/O workload on OSTs [9], [11] ✓ ✓ ✓ Bad file system weather [9], [12] ✓ Redundant/overlapping I/O accesses [24], [25] ✓ ✓ I/O resource contention at OSTs [13], [14] ✓ Heavy metadata load [10] ✓…”
Section: Dxt Systemmentioning
confidence: 99%
“…PADLL is able to control the rate of both data and metadata workflows. Other systems are directly implemented within core layers of the HPC I/O stack, including the PFS [14], [18], [20], [22], [23], scheduler [21], and I/O libraries [16], [17]. These solutions are intrusive and offer limited maintainability and portability.…”
Section: Related Workmentioning
confidence: 99%
“…If usage i is higher than demand i , the controller assigns the minimum between demand i and the fair_share (8-9). The algorithm then distributes leftover rate (left R ) across actives jobs (11)(12)(13)(14). Specifically, it computes the overall rate used by all jobs (11), and assigns left R based on their usage proportion, given by usage_proportion i (13)(14).…”
Section: Algorithm 1 Prop Sharing W/o False Resource Allocationmentioning
confidence: 99%
See 2 more Smart Citations