2013
DOI: 10.1016/j.cpc.2013.07.005
|View full text |Cite
|
Sign up to set email alerts
|

A performance comparison of different graphics processing units running direct N-body simulations

Abstract: Hybrid computational architectures based on the joint power of Central Processing Units and Graphic Processing Units (GPUs) are becoming popular and powerful hardware tools for a wide range of simulations in biology, chemistry, engineering, physics, etc.. In this paper we present a comparison of performance of various GPUs available on market when applied to the numerical integration of the classic, gravitational, N-body problem. To do this, we developed an OpenCL version of the parallel code (HiGPUs) used for… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(3 citation statements)
references
References 28 publications
0
3
0
Order By: Relevance
“…However, it also requires more operations than a fourth order integrator, something which is discussed in detail in [5]. Previous work [15,19,20] indicates that double-single accuracy is sufficient for a sixth order integrator. However, to give the user the choice we implemented both a double-single and a double precision version of this method.…”
Section: Sixth Order Performancementioning
confidence: 99%
See 1 more Smart Citation
“…However, it also requires more operations than a fourth order integrator, something which is discussed in detail in [5]. Previous work [15,19,20] indicates that double-single accuracy is sufficient for a sixth order integrator. However, to give the user the choice we implemented both a double-single and a double precision version of this method.…”
Section: Sixth Order Performancementioning
confidence: 99%
“…This code is able to run on multiple GPUs and supports up to eighth order accuracy. In [19,20], the authors introduce the HiGPUs N -body code. This standalone code contains a sixth order integrator, and supports CUDA, OpenCL and IEEE-754 double precision accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…Potentially, GPUs can achieve hundreds or even thousands of GFLOPS, whereas general CPUs are only capable of dozens of GFLOPS at present. NVIDIA's computed unified device architecture (CUDA) provides a C-like programming model for exploiting the massively parallel processing power of NVIDIA's GPU (NVIDIA 2013), and it is now employed widely for many parallel computation applications (Lu et al 2013, Capuzzo-Dolcetta and Spera 2013, Westphal et al 2014. Some studies have also used NVIDIA GPUs to accelerate PSTM.…”
Section: Introductionmentioning
confidence: 99%