Proceedings of the 10th Workshop on MEmory Performance: DEaling With Applications, Systems and Architecture 2009
DOI: 10.1145/1621960.1621962

Performance tuning and analysis of future vector processors based on the roofline model

Abstract: Because of a recent steep drop in the ratio of memory bandwidth to computational performance (B/F) of vector processors, their advantage against scalar ones regarding relatively high sustained performance is decaying. To cover the insufficient B/F rate, an on-chip vector cache mechanism is promising for the vector processors. Although the effectiveness of the vector cache has been evaluated, cache-conscious tuning of vector codes and the analysis of the obtained performance have not been discussed yet. Under t…
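
As a rough illustration of why a falling B/F ratio matters, the basic roofline bound can be sketched as below. This is the generic form of the roofline model (attainable performance is the smaller of the compute roof and bandwidth times operational intensity), not the paper's own analysis; the function names and the machine numbers are hypothetical and chosen only for the example.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, intensity_flops_per_byte):
    """Basic roofline bound: the lower of the compute roof and the memory roof."""
    return min(peak_gflops, bandwidth_gbs * intensity_flops_per_byte)


def ridge_point(peak_gflops, bandwidth_gbs):
    """Operational intensity (flop/byte) where the two roofs meet, i.e. 1 / (B/F)."""
    return peak_gflops / bandwidth_gbs


# Hypothetical machine parameters, not taken from the paper.
PEAK_GFLOPS = 100.0    # peak floating-point rate
BANDWIDTH_GBS = 250.0  # memory bandwidth, so B/F = 2.5

if __name__ == "__main__":
    for intensity in (0.125, 0.25, 0.5, 1.0):
        bound = attainable_gflops(PEAK_GFLOPS, BANDWIDTH_GBS, intensity)
        print(f"intensity {intensity} flop/byte: at most {bound} GFLOP/s")
    print("ridge point:", ridge_point(PEAK_GFLOPS, BANDWIDTH_GBS), "flop/byte")
```

Under these assumed numbers, a kernel whose operational intensity lies below the ridge point is limited by memory bandwidth. Lowering B/F moves the ridge point to the right, so more kernels fall into the bandwidth-bound region.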

Cited by 7 publications (3 citation statements)
References 13 publications
“…In particular, the Roofline model [10] provides insights into inherent architectural bottlenecks and potential application optimizations. Its usefulness is patent in several works [9], both at the application [4], [6], [8] and at the architectural level [5], [7].…”
Section: Introduction (mentioning)
confidence: 99%
“…This model has been applied to real-life codes in the past to analyze and report performance including oceanic climate models [5], combustion modeling [6] and even seismic imaging [7]. It has also been used to evaluate the effectiveness of implementation-time optimizations like autotuning [8], or cache-blocking on specific hardware platforms like vector processors [9] and GPUs [10]. Tools are available to plot the machine-specific parameters of the roofline model automatically [11].…”
Section: Roofline Performance Analysis (mentioning)
confidence: 99%
“…The roofline model [1], proposed in 2008, is a visual performance model that makes the identification of potential bottlenecks easier and provides a guideline to explore the architecture. It has been proved to be flexible enough to characterize not only multicore architectures but also innovative architectures ([2][3][4]). In the GPU community the model has been well accepted ([5][6][7]), due to the similarity of GPU architectures and multicore processors.…”
Section: Related Work (mentioning)
confidence: 99%