Evaluation of the Intel Xeon Phi 7120 and NVIDIA K80 as accelerators for two-dimensional panel codes

Einkemmer, Lukas

doi:10.1371/journal.pone.0178156

Cited by 7 publications

(5 citation statements)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, as opposed to the case in section 4.2, the resulting ODE cannot be integrated exactly in time. For the time discretization, we will then use the order two Magnus integrator (18). We call the resulting scheme Hermite Kronecker Magnus Pseudospectral method (HKMP).…”

Section: Schrödinger Equation With Time Dependent Potentialmentioning

confidence: 99%

“…It has increasingly been realized that in order to fully exploit present and future highperformance computing systems we require algorithms that parallelize well and which can be implemented efficiently on accelerators, such as GPUs [5]. In particular, for GPU computing much research effort has been undertaken to obtain efficient implementations (see, e.g., [6,8,17,18,19,31,34,39,41,44]).…”

Section: Implementation On Multi-core Cpus and Gpusmentioning

confidence: 99%

See 1 more Smart Citation

A $μ$-mode integrator for solving evolution equations in Kronecker form

Caliari,

Cassini,

Einkemmer

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

In this paper, we propose a µ-mode integrator for computing the solution of stiff evolution equations. The integrator is based on a d-dimensional splitting approach and uses exact (usually precomputed) one-dimensional matrix exponentials. We show that the action of the exponentials, i.e. the corresponding batched matrix-vector products, can be implemented efficiently on modern computer systems. We further explain how µ-mode products can be used to compute spectral transformations efficiently even if no fast transform is available. We illustrate the performance of the new integrator by solving three-dimensional linear and nonlinear Schrödinger equations, and we show that the µ-mode integrator can significantly outperform numerical methods well established in the field. We also discuss how to efficiently implement this integrator on both multi-core CPUs and GPUs. Finally, the numerical experiments show that using GPUs results in performance improvements between a factor of 10 and 20, depending on the problem.

show abstract

Section: Schrödinger Equation With Time Dependent Potentialmentioning

confidence: 99%

Section: Implementation On Multi-core Cpus and Gpusmentioning

confidence: 99%

A $μ$-mode integrator for solving evolution equations in Kronecker form

Caliari,

Cassini,

Einkemmer

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…The original REXI scheme and the differences to REXII The original REXI scheme was developed for matrices A with real entries. It is based on (15) where τ A is substituted for ix. A further simplification comes from the fact that e τ A is real which suggests to neglect the imaginary part of (15).…”

Section: The New Scheme Rexii For Matricesmentioning

confidence: 99%

“…If A = iB, where B is a real diagonalizable matrix, then the transformation matrix V is real. If moreover f 0 is real, the use of (15) for approximating e τ A is justified, and we end up with the following scheme:…”

Section: The New Scheme Rexii For Matricesmentioning

confidence: 99%

“…Therefore, methods which are highly parallelizable and work well on these new computer architectures are needed to take advantage of their computational power. A significant body of research has been accumulated in recent years that considers numerical methods that are well suited for such systems (see, e.g., [15,16,21,27,28]). More specifically, in the context of exponential integrators we refer to [17,18].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

An accurate and time-parallel rational exponential integrator for hyperbolic and oscillatory PDEs

Caliari,

Einkemmer,

Moriggl

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

Rational exponential integrators (REXI) are a class of numerical methods that are well suited for the time integration of linear partial differential equations with imaginary eigenvalues. Since these methods can be parallelized in time (in addition to the spatial parallelization that is commonly performed) they are well suited to exploit modern high performance computing systems. In this paper, we propose a novel REXI scheme that drastically improves accuracy and efficiency. The chosen approach will also allow us to easily determine how many terms are required in the approximation in order to obtain accurate results. We provide comparative numerical simulations for a shallow water equation that highlight the efficiency of our approach and demonstrate that REXI schemes can be efficiently implemented on graphic processing units.

show abstract