2018 International Conference on High Performance Computing &Amp; Simulation (HPCS) 2018
DOI: 10.1109/hpcs.2018.00058
|View full text |Cite
|
Sign up to set email alerts
|

A Scalable Framework for Online Power Modelling of High-Performance Computing Nodes in Production

Abstract: Power and thermal design and management are critical components of high performance computing (HPC) systems, due to their cutting-edge position in terms of high power density and large total power consumption. Many HPC power management strategies rely on the availability of accurate compact power models, capable of predicting power consumption and tracking its sensitivity to workload parameters and operating points. In this paper we describe a methodology and a framework for training two of the best-in-class p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
9
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(10 citation statements)
references
References 24 publications
(55 reference statements)
1
9
0
Order By: Relevance
“…By comparing Fig. 2 with the results in [27], we note that the new model performs slightly worse than the old one, as expected because we use significantly less features and we are estimating also the individual cores contributions. However the additional error is not very significant, and anyways for more than 90% of the points the error in package power is below 10W, which corresponds to roughly 1W error per core (assuming uniform distribution of the power) and about 10% of the maximum power.…”
Section: B Power Modelsupporting
confidence: 70%
See 4 more Smart Citations
“…By comparing Fig. 2 with the results in [27], we note that the new model performs slightly worse than the old one, as expected because we use significantly less features and we are estimating also the individual cores contributions. However the additional error is not very significant, and anyways for more than 90% of the points the error in package power is below 10W, which corresponds to roughly 1W error per core (assuming uniform distribution of the power) and about 10% of the maximum power.…”
Section: B Power Modelsupporting
confidence: 70%
“…In order to meet all these requirements, we have decided to extend the distributed and scalable monitoring framework that we presented in [26], [27]. As far as the thermal model is concerned, we have used a variation to the algorithm presented in [23], which makes use of an identification algorithm (see Sec.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations