2008
DOI: 10.1504/ijcse.2008.021111
|View full text |Cite
|
Sign up to set email alerts
|

Using GPUs to improve multigrid solver performance on a cluster

Abstract: This article explores the coupling of coarse and fine-grained parallelism for Finite Element simulations based on efficient parallel multigrid solvers. The focus lies on both system performance and a minimally invasive integration of hardware acceleration into an existing software package, requiring no changes to application code. Because of their excellent price performance ratio, we demonstrate the viability of our approach by using commodity graphics processors (GPUs) as efficient multigrid preconditioners.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
61
0
1

Year Published

2010
2010
2013
2013

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 76 publications
(62 citation statements)
references
References 57 publications
0
61
0
1
Order By: Relevance
“…Section 2.4), we are not able to perform any meaningful tests for strong scalability as well. Feast has been shown to scale well [2], and we have previously performed limited strong scalability tests on few, but more powerful GPUs [12]. Thus, we can assume that the strong scalability holds true on more GPU nodes as well.…”
Section: Weak Scalability and Power Considerationsmentioning
confidence: 92%
See 4 more Smart Citations
“…Section 2.4), we are not able to perform any meaningful tests for strong scalability as well. Feast has been shown to scale well [2], and we have previously performed limited strong scalability tests on few, but more powerful GPUs [12]. Thus, we can assume that the strong scalability holds true on more GPU nodes as well.…”
Section: Weak Scalability and Power Considerationsmentioning
confidence: 92%
“…Here, we only very briefly summarize the ideas and focus on the changes to the Feast kernel while treating the GPUbased multigrid solver as a black box. For a more detailed description of the implementation and a discussion of various tradeoffs, we refer to previous work [12].…”
Section: Integration Of Hardware Acceleratorsmentioning
confidence: 99%
See 3 more Smart Citations