2010 IEEE International Symposium on Parallel &Amp; Distributed Processing (IPDPS) 2010
DOI: 10.1109/ipdps.2010.5470417
|View full text |Cite
|
Sign up to set email alerts
|

An introductory exascale feasibility study for FFTs and multigrid

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(21 citation statements)
references
References 6 publications
0
21
0
Order By: Relevance
“…For example, we only consider the pencil decomposition of the transpose method for computing a 3D FFT. Other distributed FFT algorithms exist, but for realistic problem sizes on current and future large-scale systems the pencil decomposition is the best option [17,22]. We also assume ideal problem sizes: n is a power of two and all dimensions are equally sized.…”
Section: Limitationsmentioning
confidence: 99%
See 2 more Smart Citations
“…For example, we only consider the pencil decomposition of the transpose method for computing a 3D FFT. Other distributed FFT algorithms exist, but for realistic problem sizes on current and future large-scale systems the pencil decomposition is the best option [17,22]. We also assume ideal problem sizes: n is a power of two and all dimensions are equally sized.…”
Section: Limitationsmentioning
confidence: 99%
“…The second study is Gahvari's and Gropp's theoretical analysis of feasible latency and bandwidth regimes at exascale, using LogGP modeling and pencil/transpose-based FFTs as one benchmark [9,22]. Their model is more general than ours in that it is agnostic about specific architectural forms at exascale; however, ours may be more prescriptive about the necessary changes by explicitly modeling particular architectural features.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…PME calculation, due to the data transposes required in three dimensional Fourier transforms, is highly communication intensive [22], and therefore very challenging to scale. While NAMD supports both slab (one dimensional decomposition) and pencil (two dimensional decomposition) PME, this paper addresses only the pencil form due to its superior scaling characteristics [6].…”
Section: Namdmentioning
confidence: 99%
“…Constructing the grid and extracting the result from it is shown at left and the 3-D FFT forward and backward at right. Pencil based distributed parallel implementations of 3-D FFT have communication requirements that are well studied in the literature [6], so we present a minimal summary of the critical issues for completeness. Furthermore, the communication process from reciprocal space to real space is the reverse of the real to reciprocal process, therefore only the forward path will be considered in detail.…”
Section: Namdmentioning
confidence: 99%