2021
DOI: 10.1177/10943420211016525
|View full text |Cite
|
Sign up to set email alerts
|

Highly efficient lattice Boltzmann multiphase simulations of immiscible fluids at high-density ratios on CPUs and GPUs through code generation

Abstract: A high-performance implementation of a multiphase lattice Boltzmann method based on the conservative Allen-Cahn model supporting high-density ratios and high Reynolds numbers is presented. Meta-programming techniques are used to generate optimized code for CPUs and GPUs automatically. The coupled model is specified in a high-level symbolic description and optimized through automatic transformations. The memory footprint of the resulting algorithm is reduced through the fusion of compute kernels. A roofline ana… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 19 publications
(5 citation statements)
references
References 43 publications
0
5
0
Order By: Relevance
“…Hereafter, the performance model relies on the memory bandwidth and the number of memory accesses, as LB schemes are generally memory-bound on GPUs. 12,13 Applying this model to our family of LB schemes leads to the following theoretical peak performance:…”
Section: Performance Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Hereafter, the performance model relies on the memory bandwidth and the number of memory accesses, as LB schemes are generally memory-bound on GPUs. 12,13 Applying this model to our family of LB schemes leads to the following theoretical peak performance:…”
Section: Performance Analysismentioning
confidence: 99%
“…For all these reasons, isothermal and weakly compressible LBMs are very well-suited for parallel and high performance computing on both central processing units (CPUs) 11 and graphical processing units (GPUs). 12,13 Unfortunately, the development of fully compressible LBMs is not as straightforward and successful. 14 In fact, it took more than a decade to get the right formulation of the velocity discretization, equilibrium state, and collision model to recover the polyatomic compressible NSF equations with variable Prandtl number.…”
Section: Introductionmentioning
confidence: 99%
“…3.1 waLBerla waLBerla [9,10] is a massively parallel multi-physics framework developed at Friedrich-Alexander University Erlangen-Nuremberg (FAU) with focus on simulations using the lattice Boltzmann method. Its specific software design combined with the code generation framework lbmpy [18] allows for highly optimised, performance-portable solutions on block-structured grids for various application areas, like phase-field simulations, and particle-laden flows [19,20]. It supports shared and distributed memory parallelism with OpenMP and MPI, respectively, automated SIMD vectorisation, and the execution on NVIDIA graphics cards.…”
Section: Framework Descriptionmentioning
confidence: 99%
“…Thus, complex mathematical models like multi-phase solidification models [1] can be described in a high-level latex-like representation close to usual descriptions in the literature. Furthermore, other packages like the Lattice-Boltzmann-Method (LBM) code generation framework lbmpy [2], [3] build on top of pystencils to derive highly optimized numerical schemes for solving flow problems. From the numerical schemes, pystencils then generates low-level C/OpenMP code to target CPUs or CUDA/OpenCL for GPUs.…”
Section: B Pystencilsmentioning
confidence: 99%
“…The curvature of the phase field is computed with a finite difference discretization, which adds a 3D7pt stencil to the conventional, very memory intensive D3Q15 LBM stencil. This makes it particularly complicated to achieve good performance results [3], but at the same time interesting for automatic performance estimations, which can give significant insight into the complex structure of the compute kernels.…”
Section: Multi Phase Lbmmentioning
confidence: 99%