Unsteady Turbulent Simulations on a Cluster of Graphics Processors

Phillips, Everett; Davis, Roger L.; Owens, John D.

doi:10.2514/6.2010-5036

Cited by 14 publications

(6 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Phillips et al [86] developed one of the first GPU solvers capable of simulating turbulence using the k-ω model, extending on the group's previous work porting portions of the existing MBFLO solver to the GPU [85]. In addition, their new solver was capable of running on a cluster of multiple CPU/GPU nodes, using a domain decomposition technique to give each node responsibility for a block of the overall domain.…”

Section: Turbulent Flowmentioning

confidence: 99%

“…The CPU only drove the simulation and passed information between the blocks of the domain, using MPI to transfer information between independent cluster nodes. Phillips et al [86] also improved performance by implementing a novel asynchronous memory transfer using CUDA streams; in their previous work, the GPU remained idle while the CPU transferred memory between different blocks (i.e., subdomains). Here, each block was further divided in half such that the GPU could continue to perform calculations on one half while the CPU transferred memory associated with the other half of the block; this improved performance up to 40%.…”

Section: Turbulent Flowmentioning

confidence: 99%

“…Here, each block was further divided in half such that the GPU could continue to perform calculations on one half while the CPU transferred memory associated with the other half of the block; this improved performance up to 40%. Phillips et al [86] tested their code using a simulation of unsteady turbulent flow over a cylinder, finding that a cluster of eight GPUs performed about nine times faster than an equivalent parallel code running on eight quad-core CPUs.…”

Section: Turbulent Flowmentioning

confidence: 99%

“…DeLeon et al [29] parallelized their solver to run on a cluster of multiple GPUs using MPI, such that the overall code contained two levels of parallelism. The overhead of communication between blocks was minimized by using the same asynchronous memory transfer as Phillips et al [86]. They did not compare the performance of their LES GPU solver to an equivalent CPU version, but simulations of turbulent channel flow with approximately 9.4 million grid cells took 45 hours to complete running on a cluster with eight total GPUs.…”

Section: Turbulent Flowmentioning

confidence: 99%

See 3 more Smart Citations

Recent progress and challenges in exploiting graphics processors in computational fluid dynamics

Niemeyer

Sung

2013

J Supercomput

View full text Add to dashboard Cite

The progress made in accelerating simulations of fluid flow using GPUs, and the challenges that remain, are surveyed. The review first provides an introduction to GPU computing and programming, and discusses various considerations for improved performance. Case studies comparing the performance of CPU-and GPU-based solvers for the Laplace and incompressible Navier-Stokes equations are performed in order to demonstrate the potential improvement even with simple codes. Recent efforts to accelerate CFD simulations using GPUs are reviewed for laminar, turbulent, and reactive flow solvers. Also, GPU implementations of the lattice Boltzmann method are reviewed. Finally, recommendations for implementing CFD codes on GPUs are given and remaining challenges are discussed, such as the need to develop new strategies and redesign algorithms to enable GPU acceleration.

show abstract

Section: Turbulent Flowmentioning

confidence: 99%

Section: Turbulent Flowmentioning

confidence: 99%

Section: Turbulent Flowmentioning

confidence: 99%

Section: Turbulent Flowmentioning

confidence: 99%

See 2 more Smart Citations

Recent progress and challenges in exploiting graphics processors in computational fluid dynamics

Niemeyer

Sung

2013

J Supercomput

View full text Add to dashboard Cite

show abstract

“…Manavski et al [43] used GPUs as an accelerator for Smith-Waterman sequence alignment. Phillips et al [44] implemented a multi-block turbulent flow solver in GPU processors.…”

Section: Implementation Of Peridynamics In Gpumentioning

confidence: 99%

Discretized peridynamics for brittle and ductile solids

Liu

Hong

2011

Numerical Meth Engineering

View full text Add to dashboard Cite

Peridynamics is a theory of continuum mechanics expressed in forms of integral equations rather than partial differential equations. In this paper, a peridynamics code is implemented using a graphics processing unit for highly parallel computation, and numerical studies are conducted to investigate the responses of brittle and ductile material models. Stress-strain behavior with different grid sizes and horizons is studied for a brittle material model. A comparison of stresses and strains between finite element analysis (FEA) and peridynamic solutions is performed for a ductile material. By applying the proposed procedure to bridge the material model defined for peridynamic bonds and the corresponding macroscale material model for FEA, peridynamics and FEA show good agreements as regards the stresses and strains.

show abstract

Comparison of Flow Parameters in Internal Combustion Engines

Anetor

2012

Arab J Sci Eng

View full text Add to dashboard Cite

Unsteady Turbulent Simulations on a Cluster of Graphics Processors

Cited by 14 publications

References 19 publications

Recent progress and challenges in exploiting graphics processors in computational fluid dynamics

Recent progress and challenges in exploiting graphics processors in computational fluid dynamics

Discretized peridynamics for brittle and ductile solids

Comparison of Flow Parameters in Internal Combustion Engines

Contact Info

Product

Resources

About