2016
DOI: 10.1007/978-3-319-40528-5_1
|View full text |Cite
|
Sign up to set email alerts
|

Hardware-Based Efficiency Advances in the EXA-DUNE Project

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 15 publications
0
11
0
Order By: Relevance
“…The higher order of the derivatives in the Laplacian imply more work, in particular for the face integrals where both the values and the gradients must be computed from Eq. (7). In Table 2, we count the interpolation of the values and normal derivatives as two invocations to a face normal interpolation to quantify the increased cost, even though they are implemented by a single pass through the data.…”
Section: Tensor Product Algorithmsmentioning
confidence: 99%
See 2 more Smart Citations
“…The higher order of the derivatives in the Laplacian imply more work, in particular for the face integrals where both the values and the gradients must be computed from Eq. (7). In Table 2, we count the interpolation of the values and normal derivatives as two invocations to a face normal interpolation to quantify the increased cost, even though they are implemented by a single pass through the data.…”
Section: Tensor Product Algorithmsmentioning
confidence: 99%
“…Design Choice 4 In order to simplify implementation and re-use 2D kernels, the local coordinate system on faces is always set such that reference cell gradients touch the d − 1 tangential directions first and the face-normal direction comes last by adjusting the order of components in the geometry tensors rather than changing indices of evaluators, see Eq. (7). Data Structure 1 lists a slim way of storing a pair of faces in case of vectorization.…”
Section: Vectorization Layout For Face Integralsmentioning
confidence: 99%
See 1 more Smart Citation
“…The concept of matrix-free evaluation with sum factorization has been widely adopted by now, like in the deal.II [1], DUNE [5,40,60], Firedrake [63], mfem [2], Nek5000 [28] or Nektar++ [13] projects. These fast evaluation techniques are directly applicable to explicit time stepping schemes, as we have demonstrated for wave propagation in [42,53,[65][66][67][68] and the compressible Navier-Stokes equations [24].…”
Section: Implementation Of Sum Factorization In the Dealii Librarymentioning
confidence: 99%
“…Certainly the combination of using explicit time-stepping for high-order methods alongside performance portable and architecture independent methods running on both CPUs and GPUs has been demonstrate in solvers such as PyFR [13]. Additionally, work by other finite element groups such as deal.II [14] and Dune [15] on mostly tensor product quadrilateral and hexahedral elements has been conducted, but typically from the CPU perspective. In this work, we aim to demonstrate how architecture-independent programming can be applied from the context of implicit time-stepping.…”
Section: Introductionmentioning
confidence: 99%