2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC) 2020
DOI: 10.1109/hipc50609.2020.00036
|View full text |Cite
|
Sign up to set email alerts
|

Extending SLURM for Dynamic Resource-Aware Adaptive Batch Scheduling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(6 citation statements)
references
References 24 publications
0
1
0
Order By: Relevance
“…They showed experimentally how average response time and utilization could be improved when considering malleability. In some cases, for example [40], the scheduler should be able to collect data about the performance of running applications to decide who should receive additional processors or from whom processors should be subtracted. A variant of backfilling scheduling in [41] enables malleability by shrinking the resources of running jobs to make room for jobs that run with a reduced amount of resources only if the estimated slowdown improves over the static approach.…”
Section: Introductionmentioning
confidence: 99%
“…They showed experimentally how average response time and utilization could be improved when considering malleability. In some cases, for example [40], the scheduler should be able to collect data about the performance of running applications to decide who should receive additional processors or from whom processors should be subtracted. A variant of backfilling scheduling in [41] enables malleability by shrinking the resources of running jobs to make room for jobs that run with a reduced amount of resources only if the estimated slowdown improves over the static approach.…”
Section: Introductionmentioning
confidence: 99%
“…SLURMSTEPD daemons launch the local processes and interact with them via the Process Management Interface (PMI). To incorporate elasticity, some modifications to the design and functionality of SLURM are proposed in [19] to support its malleability framework [21,22].…”
mentioning
confidence: 99%
“…Chadha et al [22] use an MPI extension and SLURM to build a malleable job scheduling policy guided by a ratio between the number of processes and the compute time. They use a two queue system to separate rigid and evolving jobs and use malleability if a new job cannot start.…”
Section: Malleable Job Schedulingmentioning
confidence: 99%
“…Prabhakaran [89], using Charm++ malleability, implemented shrink/expand operations in a production scheduler, together with a scheduling strategy based on equipartition and combining rigid, evolving, and malleable jobs. Chadha [22] uses an MPI extension and SLURM to build a malleable job scheduling policy guided by a ratio between the number of processes and the compute time. Martin [73] introduces FLEX-MPI library for dynamic reconfiguration of MPI applications based on checkpoint and restart, while Comprés at al.…”
Section: Related Workmentioning
confidence: 99%