Proceedings of the International Conference on Supercomputing 2011
DOI: 10.1145/1995896.1995937
|View full text |Cite
|
Sign up to set email alerts
|

Using GPUs to compute large out-of-card FFTs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
19
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(20 citation statements)
references
References 12 publications
1
19
0
Order By: Relevance
“…As stated in the introduction, and as illustrated in the above figure, our performance is substantially better than those reported in the most recent related works of [1] and [11].…”
Section: A the Case Of Three Periodic Boundary Conditionssupporting
confidence: 38%
See 1 more Smart Citation
“…As stated in the introduction, and as illustrated in the above figure, our performance is substantially better than those reported in the most recent related works of [1] and [11].…”
Section: A the Case Of Three Periodic Boundary Conditionssupporting
confidence: 38%
“…They reported a performance of around 50 GFLOPS on four nodes, somewhat lower than our performance on a single node with a Tesla C1060 (in fact, our performance number is an underestimate since it does not include all the components of our Poisson solver). Another recent work is reported by Gu et al [11], which tries to optimize both CPU-GPU data transfer and GPU computations for 1D, 2D, and 3D FFTs. In particular, they develop a blocked buffered technique for 1D FFTs which achieves a high bandwidth on the CPU-GPU data channel.…”
Section: Introductionmentioning
confidence: 99%
“…To demonstrate the improvement of our PCI bandwidth, we used the same subarray test as Gu's work [6], where there are C regular subarrays of length W each.…”
Section: Pci Bandwidth Evaluationmentioning
confidence: 99%
“…Gu et al [12] propose a number of techniques for performing large FFTs when the data is maintained out of a single devices memory space.…”
Section: Related Workmentioning
confidence: 99%