2013
DOI: 10.1007/978-3-642-36949-0_60
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Design Space Exploration of GPGPU Architectures

Abstract: The goal of this work is to revisit GPU design and introduce a fast, low-cost and effective approach to optimize resource allocation in future GPUs. We have achieved this goal by using the Plackett-Burman methodology to explore the design space efficiently. We further formulate the design exploration problem as that of a constraint optimization. Our approach produces the optimum configuration in 84% of the cases, and in case that it does not, it produces the second optimal case with a performance penalty of le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 11 publications
0
2
0
Order By: Relevance
“…Mirsoleimani et al [6] also employ a linear regression model, which was used to find the most important GPU parameters affecting the run time of different GPGPU programs, where GPGPU is the acronym for General Purpose Computation on Graphics Processing Units. Jooya et al [7] presented an approach to find the GPU configuration which delivers the best run time under a given transistor budget. It uses the Plackett-Burman scheme to set up the experiments and solves a knapsack problem.…”
Section: Related Workmentioning
confidence: 99%
“…Mirsoleimani et al [6] also employ a linear regression model, which was used to find the most important GPU parameters affecting the run time of different GPGPU programs, where GPGPU is the acronym for General Purpose Computation on Graphics Processing Units. Jooya et al [7] presented an approach to find the GPU configuration which delivers the best run time under a given transistor budget. It uses the Plackett-Burman scheme to set up the experiments and solves a knapsack problem.…”
Section: Related Workmentioning
confidence: 99%
“…3a gives an overview of the Bifrost architecture. 2 The architecture features up to 32 unified Shader Core (SC)s, and a single logical L2 GPU cache that is split into several fully coherent physical cache segments. Full system coherency support and shared main memory tightly couples the GPU and CPU memory systems.…”
Section: Background: Arm Bifrost Gpumentioning
confidence: 99%