2013
DOI: 10.1016/j.jpdc.2012.07.010

Towards accelerating smoothed particle hydrodynamics simulations for free-surface flows on multi-GPU clusters

Abstract: Starting from the single graphics processing unit (GPU) version of the Smoothed Particle Hydrodynamics (SPH) code DualSPHysics, a multi-GPU SPH program is developed for free-surface flows. The approach is based on a spatial decomposition technique, whereby different portions (sub-domains) of the physical system under study are assigned to different GPUs. Communication between devices is achieved with the use of Message Passing Interface (MPI) application programming interface (API) routines. The use of the sor…

Citations: cited by 53 publications (22 citation statements)
References: 16 publications
“…The computationally most expensive simulation (3D, h = 0.480 m, dp = 10 mm) took 60 h per second of real time and included 23 million particles. Problems involving more than 1000 million particles for free-surface flows have been already investigated with DualSPHysics using the multi-GPU version (Valdez-Balderas et al, 2013). Nevertheless, Section 4.2 including a convergence study and Section 4.3 show that dp = 10 mm provides a good compromise between simulation time and accuracy for the present purpose based on the CPU version.…”
Section: Implementation In Dualsphysics and Performance
Citation type: mentioning
confidence: 99%
“…Using CUDA as the programming framework for SPH leads to possible confusion with the word 'kernel'. An SPH kernel is the weighting function used in the SPH interpolation process in Equations (5)-(10). A CUDA kernel, however, is defined as a C function that is set up and executed N times in parallel by N different CUDA threads.…”
Section: Dualsphysics and Its Neighbour List
Citation type: mentioning
confidence: 99%
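
The two senses of 'kernel' in the statement above can be illustrated with a minimal Python sketch. It assumes the widely used 3D cubic-spline weighting function with support radius 2h; the quoted paper does not say which SPH kernel it uses, and the function names `cubic_spline_w` and `per_particle_weights` are hypothetical. The plain loop only stands in for the per-thread work that a CUDA kernel would run in parallel.

```python
import math


def cubic_spline_w(r, h):
    """SPH kernel (weighting function): 3D cubic spline, support radius 2h.

    Assumed form for illustration only; not taken from the cited papers.
    """
    if r < 0 or h <= 0:
        raise ValueError("r must be non-negative and h must be positive")
    q = r / h
    sigma = 1.0 / (math.pi * h ** 3)  # 3D normalisation constant
    if q <= 1.0:
        return sigma * (1.0 - 1.5 * q ** 2 + 0.75 * q ** 3)
    if q <= 2.0:
        return sigma * 0.25 * (2.0 - q) ** 3
    return 0.0  # no contribution outside the support radius


def per_particle_weights(distances, h):
    """Evaluate the SPH kernel once per neighbour distance.

    In a CUDA code, each iteration of this loop would typically be the
    work of one thread of a CUDA kernel (the 'other' kind of kernel).
    """
    return [cubic_spline_w(r, h) for r in distances]
```

Calling `per_particle_weights([0.0, 0.5, 1.0, 2.5], h=1.0)` returns one weight per distance, with the last entry equal to zero because it lies outside the 2h support.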
“…Hérault et al [6] and Crespo et al [3] presented comparisons of GPU and CPU codes with significantly improved speedups for the GPU codes. Valdez-Balderas et al [7] and Dominguez et al [8] proposed the use of a multiple GPU system which will allow the simulation of hundreds of millions of particles, up to a billion particles. However, to the best of the authors' knowledge, there has been no study implementing a multi-phase SPH model on GPUs.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…An algorithm was proposed to incorporate, into the TS scheme, a constitutive model based on continuum damage mechanics such that the scheme could be used to simulate brittle materials. The TS scheme is more computationally expensive than both the AV and PM schemes (the 2nd O scheme is approximately 2.5× more expensive than the AV scheme), however advances in massively parallel GPU computing techniques may reduce this burden to the point where the TS scheme is practical [20,36,31,6].…”
Section: Application
Citation type: mentioning
confidence: 99%