2008
DOI: 10.1016/j.camwa.2007.08.001
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of implementations of the lattice-Boltzmann method

Abstract: Simplicity of coding is usually an appealing feature of the lattice-Boltzmann method (LBM). Conventional implementations of LBM are often based on the two-lattice or the two-step algorithm, which however suffer from high memory consumption and poor computational performance, respectively. The aim of this work was to identify implementations of LBM that would achieve high computational performance with low memory consumption. Effects of memory addressing schemes were investigated in particular. Data layouts for… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 54 publications
(27 citation statements)
references
References 17 publications
0
27
0
Order By: Relevance
“…This is "only" 2 times slower than native optimized implementations on almost similar equipments (AMD R Opteron TM 2Ghz and Intel R Xeon TM 3.2Ghz) [12]. We therefore expect to reach at least similar results if applying our optimization technic to a native implementation.…”
Section: Comparison Of Simple and Optimized Algorithmsmentioning
confidence: 63%
“…This is "only" 2 times slower than native optimized implementations on almost similar equipments (AMD R Opteron TM 2Ghz and Intel R Xeon TM 3.2Ghz) [12]. We therefore expect to reach at least similar results if applying our optimization technic to a native implementation.…”
Section: Comparison Of Simple and Optimized Algorithmsmentioning
confidence: 63%
“…This paper also applies the techniques of blocking for repeated iterations and eliminating unnecessary dependencies and branches, resulting in performance improvements of 1.2-1.3 relative to the original code. A similar effort (Mattila et al, 2008) looked at memory addressing schemes and data layouts in LBM codes across multiple platforms and evaluated the results in terms of computational speed, cache performance, and memory consumption, with the conclusion that the optimal approach is dependent on the particular case and the desired trade-off between memory consumption and performance. Wellein et al (2005) also considered data layouts as well as other optimization strategies such as blocking, again across multiple platforms.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, LBM contains two distinct steps: streaming and collision. In streaming step, data are coupled to and from adjacent lattice nodes, while in collision step, data are usually independent of the underlying lattice type and computations are performed in this step (Mattila et al 2008). …”
Section: Introductionmentioning
confidence: 99%