2017
DOI: 10.1002/cpe.4401
|View full text |Cite
|
Sign up to set email alerts
|

A performance spectrum for parallel computational frameworks that solve PDEs

Abstract: Time to solution I n t e n s i t y R a t eO p e r a t i o n s p e r b y t e t r a n s f e r r e d Proposed performance spectrum that documents time, intensity and rate. Abstract. Important computational physics problems are often large-scale in nature, and it is highly desirable to have robust and high performing computational frameworks that can quickly address these problems. However, it is no trivial task to determine whether a computational framework is performing efficiently or is scalable. The aim of thi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
23
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(23 citation statements)
references
References 47 publications
0
23
0
Order By: Relevance
“…• Arithmetic intensity: A logical extension of the TAS spectrum performance analysis would be to incorporate the Arithmetic Intensity (AI), used in the performance spectrum [16] and roofline performance model [40]. The AI of an algorithm or software is a measure that aids in estimating how efficiently the hardware resources and capabilities can be utilized.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…• Arithmetic intensity: A logical extension of the TAS spectrum performance analysis would be to incorporate the Arithmetic Intensity (AI), used in the performance spectrum [16] and roofline performance model [40]. The AI of an algorithm or software is a measure that aids in estimating how efficiently the hardware resources and capabilities can be utilized.…”
Section: Resultsmentioning
confidence: 99%
“…By taking into account the total time-to-solution, numerical accuracy with respect to an error norm, and the computation rate, a cost-benefit analysis can be performed to determine which algorithm and discretization are particularly suited for an application. This work extends the performance spectrum model in [16] for interpretation of hardware and algorithmic tradeoffs in numerical PDE simulation. As a proof-of-concept, popular finite element software packages are used to illustrate this analysis for Poisson's equation.…”
mentioning
confidence: 96%
“…In the final region, the problem size is a perfect match for the number of processes, which minimizes solve time and results in optimal scaling. More about static scaling can be found in [6,9,26,31]. Strong scaling plot generated using the 3.52M DOF mesh for the electrode domain and the 3.14M DOF mesh for the electrolyte domain and 16 processes per node.…”
Section: Parallel Scalingmentioning
confidence: 99%
“…Unlike in the solid domains, the electrolyte problem contains coupling on more than just the interface boundary. This could lead to a degradation in parallel performance because sparse matrix-vector multiplications have very low arithmetic intensities and can present as serial bottlenecks at the memory levels of each server node (see [9] for further discussion). Fig.…”
Section: Parallel Scalingmentioning
confidence: 99%
“…Note the use of the log scale and that, toward the right, the GPUs are performing significantly better than the 42 CPU cores, while towards the left the CPUs are faster. Figure 3 presents an alternative view of the same data, known as a static scaling or work-time spectrum plot [4,5]. It has the advantage that both the asymptotic bandwidth and the latency of the operations can be directly read from the figure.…”
Section: Petsc Vector Operationsmentioning
confidence: 99%