2013
DOI: 10.1145/2442087.2442095

Achieving autonomous power management using reinforcement learning

Abstract: System level power management must consider the uncertainty and variability that come from the environment, the application and the hardware. A robust power management technique must be able to learn the optimal decision from past events and improve itself as the environment changes. This article presents a novel on-line power management technique based on model-free constrained reinforcement learning (Q-learning). The proposed learning algorithm requires no prior information of the workload and dynamically ad…
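As a hedged illustration of the abstract's core idea, the sketch below shows a model-free Q-learning loop over discretized workload states and voltage-frequency (V-F) actions. All names, constants, and the cost formulation are assumptions for illustration, not the paper's actual algorithm.

```python
import random
from collections import defaultdict

# Hedged sketch of model-free Q-learning for power management.
# States discretize the observed workload; actions index V-F settings.
# ALPHA/GAMMA/EPSILON and the cost signal are illustrative choices.
ACTIONS = [0, 1, 2, 3]             # indices into a table of V-F settings
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1

q = defaultdict(float)             # (state, action) -> expected cost

def choose_action(state):
    """Epsilon-greedy selection over a cost-minimizing Q-table."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)                  # explore
    return min(ACTIONS, key=lambda a: q[(state, a)])   # exploit

def update(state, action, cost, next_state):
    """One Q-learning step; 'cost' would combine measured power with
    a penalty for violating the performance constraint."""
    best_next = min(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += ALPHA * (cost + GAMMA * best_next - q[(state, action)])
```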

Cited by 89 publications (71 citation statements)
References 34 publications
“…The advantage of our approach employing EPD (2) during initial learning (i.e., the RL step) is illustrated in Table II, which highlights the average number of explorations for three applications compared against an existing approach [21]. It can be observed that our approach benefits from reduced exploration due to the relationship between current performance and the V-F action (4) [21].…”
Section: Number of Explorations
confidence: 99%
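The excerpt credits reduced exploration to the link between current performance and the chosen V-F action. Below is a minimal sketch of that general idea, assuming a hypothetical performance-slack signal; the EPD-based rule itself is defined in the citing paper and is not reproduced here.

```python
# Hedged sketch of performance-guided exploration pruning; levels and
# the slack heuristic are made-up placeholders, not the paper's rule.
VF_LEVELS = [(0.9, 300), (1.0, 600), (1.1, 900), (1.2, 1200)]  # (volts, MHz)

def plausible_actions(current_level, perf_slack):
    """Restrict exploration using observed performance slack: when the
    performance target is being missed (slack < 0), only equal or faster
    V-F levels are worth exploring; otherwise only equal or slower ones."""
    if perf_slack < 0:
        return list(range(current_level, len(VF_LEVELS)))
    return list(range(0, current_level + 1))
```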
“…Predicting the state of the system is a key step in RL, and in our methodology the expected workload is classified into a system state at the beginning of each decision epoch [8], [9]. The state of the system is represented using the CPU Cycle Count (CC), obtained using the performance monitoring unit.…”
Section: A State Prediction and Q-table
confidence: 99%
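A minimal sketch of the state-classification step described above, assuming made-up Cycle Count thresholds; per the excerpt, the CC reading comes from the performance monitoring unit at each decision epoch.

```python
# Illustrative state classifier: map the CPU Cycle Count (CC) read from
# the performance monitoring unit to a discrete state at each decision
# epoch. The bin boundaries are placeholders, not the paper's values.
CC_BINS = [1_000_000, 5_000_000, 20_000_000, 80_000_000]

def classify_state(cycle_count):
    """Return the index of the first bin whose upper bound exceeds CC."""
    for state, upper in enumerate(CC_BINS):
        if cycle_count < upper:
            return state
    return len(CC_BINS)  # heaviest-workload state
```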
“…The proposed run-time approach is validated on Texas Instruments' PandaBoard featuring ARM A9 cores and an Intel quad-core system running Linux. The proposed approach is compared with one representative approach from each category of related works: the reinforcement learning-based technique of [4], the prediction-based DVFS technique of [1] and the multinomial logistic regression-based technique of [9].…”
Section: B Parameter Fixing
confidence: 99%