2022
DOI: 10.1002/cpe.7285
|View full text |Cite
|
Sign up to set email alerts
|

Efficient exact algorithms for continuous bi‐objective performance‐energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms

Abstract: Performance and energy are the two most important objectives for optimization on heterogeneous high performance computing platforms. This work studies a mathematical problem motivated by the bi-objective optimization of data-parallel applications on such platforms for performance and energy. First, we formulate the problem and present an exact algorithm of polynomial complexity solving the problem where all the application profiles of objective type one are continuous and strictly increasing, and all the appli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…The decomposition of the matrix A is computed using a model-based workload partitioning algorithm [26], [27] Therefore, the output by the algorithm is an array d of three elements, where d[0], d [1], and d [2], respectively, contain the number of rows of A and C assigned to {S CPU , S GPU 1 , S GPU 2 }.…”
Section: B Performance Functions and Workload Distributionmentioning
confidence: 99%
See 3 more Smart Citations
“…The decomposition of the matrix A is computed using a model-based workload partitioning algorithm [26], [27] Therefore, the output by the algorithm is an array d of three elements, where d[0], d [1], and d [2], respectively, contain the number of rows of A and C assigned to {S CPU , S GPU 1 , S GPU 2 }.…”
Section: B Performance Functions and Workload Distributionmentioning
confidence: 99%
“…The OpenH matrix multiplication application performs the best since it employs the performance-optimal workload distribution and mapping of the software components to the CPU cores of the hybrid server. Furthermore, the performanceoptimal workload distribution is the load-balanced distribution between the three software components ( [28], [27]) since their execution times are linear functions of workload size.…”
Section: Performance Of Openh Parallel Matrix Multiplication Applicationmentioning
confidence: 99%
See 2 more Smart Citations