2017
DOI: 10.1177/1094342017732395
|View full text |Cite
|
Sign up to set email alerts
|

Acceleration of the IMplicit–EXplicit nonhydrostatic unified model of the atmosphere on manycore processors

Abstract: We present the acceleration of an IMplicit-EXplicit (IMEX) non-hydrostatic atmospheric model on manycore processors such as GPUs and Intel's MIC architecture. IMEX time integration methods sidestep the constraint imposed by the Courant-Friedrichs-Lewy condition on explicit methods through corrective implicit solves within each time step. In this work, we implement and evaluate the performance of IMEX on manycore processors relative to explicit methods. Using 3D-IMEX at Courant number C=15 , we obtained a speed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(28 citation statements)
references
References 43 publications
0
25
0
Order By: Relevance
“…Over the last two decades or so, the spectral element (SE) method has been considered as a numerical method for the fluid flow solver in global weather/climate models (Baer et al, ; Choi & Hong, ; Fournier et al, ; Giraldo et al, ; Kelly & Giraldo, ). The main motivations were the SE methods' near‐perfect scalability (Dennis et al, ), GPU (Graphics Processing Unit) acceleration (e.g., Abdi, Giraldo, et al, ; Abdi, Wilcox, et al, ), high‐order accuracy for smooth problems, and mesh refinement capabilities. For some time the Community Earth System Model (CESM; Hurrell et al, ) has supported a SE dynamical core option in the atmosphere component CAM (Community Atmosphere Model; Neale et al, ) discretized on a cubed‐sphere grid (Figure a).…”
Section: Introductionmentioning
confidence: 99%
“…Over the last two decades or so, the spectral element (SE) method has been considered as a numerical method for the fluid flow solver in global weather/climate models (Baer et al, ; Choi & Hong, ; Fournier et al, ; Giraldo et al, ; Kelly & Giraldo, ). The main motivations were the SE methods' near‐perfect scalability (Dennis et al, ), GPU (Graphics Processing Unit) acceleration (e.g., Abdi, Giraldo, et al, ; Abdi, Wilcox, et al, ), high‐order accuracy for smooth problems, and mesh refinement capabilities. For some time the Community Earth System Model (CESM; Hurrell et al, ) has supported a SE dynamical core option in the atmosphere component CAM (Community Atmosphere Model; Neale et al, ) discretized on a cubed‐sphere grid (Figure a).…”
Section: Introductionmentioning
confidence: 99%
“…The second assessment concerns the tests we conducted with OCCA kernels in the NUMA code on fully 1D and 3D Implicit-Explicit time-integrated dynamics. We describe in detail these results in the RESULTS section and present them in more detail in Abdi, Giraldo, Constantinescu, Carr III, Wilcox, and Warburton [4].…”
Section: Assess Performancementioning
confidence: 99%
“…We have validated our results with standard benchmark problems in numerical weather prediction and evaluated the performance and strong scalability of the IMEX method using up to 4192 GPUs. The results regarding the implementation of the IMEX methods in NUMA on many-core architectures can be found in Abdi, Giraldo, Constantinescu, Carr III, Wilcox, and Warburton [4].…”
Section: Figure 1: Overview Of Occa Api Kernel Languages and Programentioning
confidence: 99%
See 1 more Smart Citation
“…Promising approaches for satisfying the latter condition are exponential time integrators [36,47]; (b) to overcome the overly restrictive time-step limitations of EBTI schemes combined with highly scalable horizontal discretizations, either through horizontal/vertical splitting (HEVI) [2,8,40] or through combining SISL PBTI methods with discontinuous Galerkin (DG) discretization [99]; and (c) to further the scalability and the adaptation of algorithms to emerging HPC architectures involving SE [32] or fully-implicit time-stepping approaches [113], and further through exploiting additional parallelism with time-parallel algorithms [33].…”
Section: Discussion and Concluding Remarksmentioning
confidence: 99%