Workshop Proceedings of the 48th International Conference on Parallel Processing 2019
DOI: 10.1145/3339186.3339215
|View full text |Cite
|
Sign up to set email alerts
|

Paving the Way Toward Energy-Aware and Automated Datacentre

Abstract: Energy efficiency and datacentre automation are critical targets of the research and deployment agenda of CINECA and its research partners in the Energy Efficient System Laboratory of the University of Bologna and the Integrated System Laboratory in ETH Zurich. In this manuscript, we present the primary outcomes of the research conducted in this domain and under the umbrella of several European, National and Private funding schemes. These outcomes consist of: (i) the ExaMon scalable, flexible, holistic monitor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

4
4

Authors

Journals

citations
Cited by 20 publications
(27 citation statements)
references
References 29 publications
(35 reference statements)
0
22
0
Order By: Relevance
“…The profiling analysis described in this section has been performed to measure the power consumption of ARMIDA nodes and GPUs, as well as the expected execution times of the considered applications on all the configurations available in the reference hardware system. The profiling has been performed through ExaMon (Exascale Monitoring) a highly scalable framework for the performance and energy monitoring of HPC servers [2].…”
Section: A Profiling Analysismentioning
confidence: 99%
“…The profiling analysis described in this section has been performed to measure the power consumption of ARMIDA nodes and GPUs, as well as the expected execution times of the considered applications on all the configurations available in the reference hardware system. The profiling has been performed through ExaMon (Exascale Monitoring) a highly scalable framework for the performance and energy monitoring of HPC servers [2].…”
Section: A Profiling Analysismentioning
confidence: 99%
“…The system is interfaced with the power sensor via a 12-bit 8-channels SAR ADC, and with existing in-band / out-of-band telemetries to collect hardware performance counters (e.g., Amester [39], IPMI [40], and RAPL [17]). Moreover, it includes (i) hardware support for the Precision Time Protocol (PTP), which allows sub-microsecond measurements synchronization [43], [44], (ii) two Programmable Real-Time Units (PRU0 and PRU1), that we exploit for real-time feature extraction on-board, and (iii) an ARM Cortex-A8 processor with NEON technology, useful for DSP processing and edge ML inference (e.g., by leveraging the ARM NN SDK [45], which enables efficient translation of existing NN frameworks -such as TensorFlow -to ARM Cortex-A CPUs); • a scalable distributed data analytics framework, namely ExaMon [15], [16], that we use to collect at a lower rate -from seconds to milliseconds -power and performance activity of the all cluster and thus to carry out cluster-level analytics. To send data from the distributed monitoring agents (i.e., daemons running on the BBBs) to the centralized monitoring unit, we adopted MQTT [46], which is a robust, lightweight and scalable publish-subscribe protocol, already used for large-scale systems both in industry and academia (e.g., Amazon, Facebook, [46], [47]).…”
Section: Dig and Ad On The Edge Of A Top500's Supercomputermentioning
confidence: 99%
“…For the performance metrics, we use a time granularity of 20 ms i.e., in line with SoA DCs / SCs monitoring tools [15]), dedicating 2 cores for ExaMon to prevent that the in-band monitoring can affect the measurements with noise. For this purpose, we exploit isolcpus, which is a kernel parameter that can be set from the boot loader configurations.…”
Section: A Building Dataset For Analysis and Comparison With Soamentioning
confidence: 99%
See 1 more Smart Citation
“…When running an unoptimized Stream benchmark, Monte Cimone obtained just the 15.5% of the peak bandwidth, while Marconi100 and Armida obtained an efficiency of 48.2% and 63.21% respectively, pointing to significant margins for improvement in application and software stack tuning to the hardware target. • We extended the ExaMon monitoring framework [12] to monitor the Monte Cimone cluster. We characterised the power consumption of various applications executed on Monte Cimone.…”
Section: Introductionmentioning
confidence: 99%