2018
DOI: 10.1016/j.jcp.2018.01.037
|View full text |Cite
|
Sign up to set email alerts
|

A GPU-accelerated implicit meshless method for compressible flows

Abstract: This paper develops a recently proposed GPU based two-dimensional explicit meshless method (Ma et al., 2014) by devising and implementing an efficient parallel LU-SGS implicit algorithm to further improve the computational efficiency. The capability of the original 2D meshless code is extended to deal with 3D complex compressible flow problems. To resolve the inherent data dependency of the standard LU-SGS method, which causes thread-racing conditions destabilizing numerical computation, a generic rainbow colo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(8 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…Emelyanov et al [21] discussed the popular CFD benchmark solution of the flow over a smooth flat plate on a GPU with various meshes, and the speedup reached more than 46 times. Zhang et al [22] performed an implicit meshless method for compressible flow on an NVIDIA GTX TITAN GPU, and the solution agrees well with experimental results.…”
Section: Introductionmentioning
confidence: 64%
“…Emelyanov et al [21] discussed the popular CFD benchmark solution of the flow over a smooth flat plate on a GPU with various meshes, and the speedup reached more than 46 times. Zhang et al [22] performed an implicit meshless method for compressible flow on an NVIDIA GTX TITAN GPU, and the solution agrees well with experimental results.…”
Section: Introductionmentioning
confidence: 64%
“…24) present more inefficient when the computation domain gets larger, especially for time-marching kernel. Consequently, for the same scheme, speedups keep declining when mesh sizes increase for both method 3 and optimization of reduction, while speedups should improve with the increasing mesh size under certain [9,16,39]. The difference mainly comes from limited performance of time-marching kernel, whose increased multiple of computation time exceeds that of meshes, leading to the significant decreasing performance.…”
Section: D Flow Past a Forward-facing Stepmentioning
confidence: 99%
“…As described in Algorithm 1, a mass of independent arithmetic operations associated with the dominated or non-dominated identifications (see Algorithm 1, line 6) are proved to be time-consuming. Fortunately, such tasks are mostly weak-dependent compute-intensive and are very suitable for GPU parallel architecture [39][40][41]. Therefore, such a kind of computation is implemented on the GPU to achieve acceleration.…”
Section: Gpu-accelerated Infill Criterion For the Moego Algorithmmentioning
confidence: 99%