2014
DOI: 10.1016/j.jcp.2014.01.023
|View full text |Cite
|
Sign up to set email alerts
|

GPU computing of compressible flow problems by a meshless method with space-filling curves

Abstract: A graphic processing unit (GPU) implementation of a meshless method for solving compressible flow problems is presented in this paper. Least-square fit is used to discretise the spatial derivatives of Euler equations and an upwind scheme is applied to estimate the flux terms. The compute unified device architecture (CUDA) C programming model is employed to efficiently and flexibly port the meshless solver from CPU to GPU. Considering the data locality of randomly distributed points, space-filling curves are ad… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 23 publications
(20 citation statements)
references
References 39 publications
(54 reference statements)
0
17
0
Order By: Relevance
“…Since 3D compressible multiphase flow simulations request very much more intensive computation than 1D problems, it is necessary to consider parallel computing implementations via many-core GPUs. The benefits of utilising GPU acceleration in CFD can be found in recent works pertaining to single-phase flows which successfully accelerated the flow solvers by several times and even orders of magnitude [37][38][39][40][41]. It is very likely that applying many core GPU techniques to dramatically reduce the computing time for simulating compressible multiphase flows will be attractive and beneficial to the academic and industrial communities.…”
Section: Introductionmentioning
confidence: 99%
“…Since 3D compressible multiphase flow simulations request very much more intensive computation than 1D problems, it is necessary to consider parallel computing implementations via many-core GPUs. The benefits of utilising GPU acceleration in CFD can be found in recent works pertaining to single-phase flows which successfully accelerated the flow solvers by several times and even orders of magnitude [37][38][39][40][41]. It is very likely that applying many core GPU techniques to dramatically reduce the computing time for simulating compressible multiphase flows will be attractive and beneficial to the academic and industrial communities.…”
Section: Introductionmentioning
confidence: 99%
“…It should be emphasized that when using GPU for parallel computing, the challenge is not only parallelization of the code, but also optimization of the access to device memory by making the best use of registers and shared memory (Corrigan, Camelli, Löhner, & Wallin, 2011;Julien & Inanc, 2009). Meanwhile, it should be pointed out that the shared memory is difficult to use for an irregular meshless structure of clouds of points due to the unpredictable memory access pattern (see Ma et al (2014) for details); therefore, the present work is mainly focused on the best use of registers by searching for an appropriate size of thread block used for the meshless solver; this will be discussed in section 4.2.2.…”
Section: Cuda Programming Modelmentioning
confidence: 99%
“…As mentioned above, Ma et al (2014) renumbered the points of meshless clouds to improve the performance of the global memory address; here, we manage the kernel algorithm directly by developing a point-based algorithm to achieve an appropriate global memory address. In this way, the GPU kernels can own the same thread structure, and the new measurement can therefore be taken by combining part of the kernels to enable further improvement of the performance.…”
Section: Point-based Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the context fluid dynamics and meshless methods, Kelly et al (2014) explore a GPU implementation of an incompressible Navier-Stokes code based on a Radial-Basis Function Collocation Meshless Method for two-fluid flows. For compressive flows, Ma et al (2014) show a GPU implementation.…”
Section: Introductionmentioning
confidence: 99%