2015
DOI: 10.1002/cpe.3735
|View full text |Cite
|
Sign up to set email alerts
|

Qespera: an adaptive framework for prediction of queue waiting times in supercomputer systems

Abstract: Production parallel systems are space-shared, and resource allocation on such systems is usually performed using a batch queue scheduler. Jobs submitted to the batch queue experience a variable delay before the requested resources are granted. Predicting this delay can assist users in planning experiment time-frames and choosing sites with less turnaround times and can also help meta-schedulers make scheduling decisions. In this paper, we present an integrated adaptive framework, Qespera, for prediction of que… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
8
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 20 publications
1
8
0
Order By: Relevance
“…We evaluated our queue waiting time prediction framework using production supercomputer workload traces with varying site and job characteristics, including two Top500 systems, obtained from Parallel Workloads Archive [21]. The detailed results and analyses are contained in our previous work [17]. In summary, our predictions results in up to 22% reduction in the average absolute error and up to 56% reduction in the percentage prediction errors over existing strategies including QBETS [43] and IBL [44] across workloads.…”
Section: Queue Waiting Time Predictionmentioning
confidence: 99%
“…We evaluated our queue waiting time prediction framework using production supercomputer workload traces with varying site and job characteristics, including two Top500 systems, obtained from Parallel Workloads Archive [21]. The detailed results and analyses are contained in our previous work [17]. In summary, our predictions results in up to 22% reduction in the average absolute error and up to 56% reduction in the percentage prediction errors over existing strategies including QBETS [43] and IBL [44] across workloads.…”
Section: Queue Waiting Time Predictionmentioning
confidence: 99%
“…Different prediction approaches exist in order to estimate the performance of jobs running in clusters. Previous research analyses have focused on predicting execution time of jobs [12,13], waiting time [14,15] and slowdown time of applications spawned when resources are shared [16].…”
Section: Related Workmentioning
confidence: 99%
“…In our previous experiments, we generated the parameters by using a GA approach. Murali and Vadhiyar [24] use a computationally cheaper approach: they compute the correlation between each feature and labels and use this value as their estimator's weights. In this section, we compare this strategy with the GA optimization approach, considering both how much faster models can be trained and how accurate such models are.…”
Section: Speeding-up Predictionsmentioning
confidence: 99%
“…More recently, Murali and Vadhiyar [24] proposed a framework called Qespera for prediction of queue waiting times for HPC settings. The proposed framework is based on spatial clustering using history of job submissions and executions.…”
Section: Queue Time Predictionsmentioning
confidence: 99%