2018
DOI: 10.1007/978-3-319-92040-5_9
|View full text |Cite
|
Sign up to set email alerts
|

Applicability of the ECM Performance Model to Explicit ODE Methods on Current Multi-core Processors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 21 publications
0
12
0
Order By: Relevance
“…In this work, we propose Offsite, an offline AT approach that automatically identifies the most efficient implementation variant(s) during installation time based on performance predictions. These predictions stem from an analytic performance prediction methodology for explicit ODE methods proposed by [19] that uses a combined white and black box model approach based on the ECM model. The main contributions of this paper are:…”
Section: Main Contributionsmentioning
confidence: 99%
“…In this work, we propose Offsite, an offline AT approach that automatically identifies the most efficient implementation variant(s) during installation time based on performance predictions. These predictions stem from an analytic performance prediction methodology for explicit ODE methods proposed by [19] that uses a combined white and black box model approach based on the ECM model. The main contributions of this paper are:…”
Section: Main Contributionsmentioning
confidence: 99%
“…With estimates for individual kernels in place we can now present multicore-scaling data for the full PCG algorithm. Composing the model from single-loop predictions is simple due to the time-based formulation of the ECM model [21]. In the case of PCG we have three invocations of daxpby, two of dot, one gs forward-and backward-sweep each, as well as one of stencil.…”
Section: Compositionmentioning
confidence: 99%
“…Both have been subject to intense study, refinements, and validation, and their areas of applicability are well understood. However, while there is ample data available for Roofline on a wide variety of architectures [15,18], one drawback of previous applications of the ECM model [2,4,12,21,22,24,26] is that they were mostly restricted to Intel processors. We provide the first thorough cross-architecture study of the model.…”
Section: Related Workmentioning
confidence: 99%
“…In the online phase we can measure the runtime of the fastest specialized implementation variant and avoid the execution of the general implementation variants if their synchronization overhead is expected to be higher than the runtime of the best specialized implementation. Another idea is to use performance models in the offline phase, such as the ECM model, in order to predict the execution time of variants or at least to forecast their performance ranking [32]. Figure 8 demonstrates that for variants with loop tiling an appropriate selection of the tile size is important to achieve a good performance.…”
Section: Experimental Setup and Evaluationmentioning
confidence: 99%
“…Furthermore, the offline phase can be used to estimate synchronization and communication overheads for different configurations and input scenarios with the help of micro-benchmarks. This information can be used together with a performance model to predict a performance ranking of different implementation variants [32] and different parameter values. All evaluated configurations can be arranged in a decision tree according to their ranking, which is then forwarded to the online tuning phase.…”
Section: Summary Of the Observations For Ode Solversmentioning
confidence: 99%