2013
DOI: 10.1145/2400682.2400718
|View full text |Cite
|
Sign up to set email alerts
|

Partans

Abstract: GPGPUs are a powerful and energy-efficient solution for many problems. For higher performance or larger problems, it is necessary to distribute the problem across multiple GPUs, increasing the already high programming complexity.In this article, we focus on abstracting the complexity of multi-GPU programming for stencil computation. We show that the best strategy depends not only on the stencil operator, problem size, and GPU, but also on the PCI express layout. This adds nonuniform characteristics to a seemin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 50 publications
(1 citation statement)
references
References 15 publications
0
0
0
Order By: Relevance
“…In any other case, for all neighbor cells, the current amount of particles is read (l. 10-12). In the collision step, all cells which are obstacles reverse the airflow (l. [16][17][18][19][20][21][22]. All other cells calculate the particles streaming from the next cells (l. 27-34).…”
Section: Example Applications For Three-dimensional Stencil Operationsmentioning
confidence: 99%
“…In any other case, for all neighbor cells, the current amount of particles is read (l. 10-12). In the collision step, all cells which are obstacles reverse the airflow (l. [16][17][18][19][20][21][22]. All other cells calculate the particles streaming from the next cells (l. 27-34).…”
Section: Example Applications For Three-dimensional Stencil Operationsmentioning
confidence: 99%