2014
DOI: 10.1137/130945569
|View full text |Cite
|
Sign up to set email alerts
|

A Parallel Directional Fast Multipole Method

Abstract: Abstract. This paper introduces a parallel directional fast multipole method (FMM) for solving N -body problems with highly oscillatory kernels, with a focus on the Helmholtz kernel in three dimensions. This class of oscillatory kernels requires a more restrictive low-rank criterion than that of the low-frequency regime, and thus effective parallelizations must adapt to the modified data dependencies. We propose a simple partition at a fixed level of the octree and show that, if the partitions are properly bal… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(11 citation statements)
references
References 44 publications
1
10
0
Order By: Relevance
“…Noting that, for any surface Γ N , there are only sixty-four boxes overall on level d = 3 of the algorithm (and, in general, even fewer relevant boxes), we see that a definite limit exists on the parallelism achievable by this approach. The method presented in [14] uses this strategy in an MPI context, and it is therefore subject to such a hard limitation on achievable parallelism (although in a somewhat mitigated form, owing to the characteristics of that algorithm, as discussed in Section 1). To avoid such limitations we consider an alternate OpenMP parallelization strategy specifically enabled by the characteristics of the IFGF algorithm, as described in what follows.…”
Section: Openmp Parallelizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Noting that, for any surface Γ N , there are only sixty-four boxes overall on level d = 3 of the algorithm (and, in general, even fewer relevant boxes), we see that a definite limit exists on the parallelism achievable by this approach. The method presented in [14] uses this strategy in an MPI context, and it is therefore subject to such a hard limitation on achievable parallelism (although in a somewhat mitigated form, owing to the characteristics of that algorithm, as discussed in Section 1). To avoid such limitations we consider an alternate OpenMP parallelization strategy specifically enabled by the characteristics of the IFGF algorithm, as described in what follows.…”
Section: Openmp Parallelizationmentioning
confidence: 99%
“…FMM-type algorithms. Indeed, in contrast to the incremental propagation and surface evaluation approach inherent in the IFGF method, previous acceleration methods rely on the FFT algorithmwhich, as discussed in Section 1, leads to inefficiencies in the upper portions of the corresponding octree structures [2,14].…”
Section: Openmp Parallelizationmentioning
confidence: 99%
“…The memory savings that the fast algorithms provide stem from the fact that the far-field part of the interaction matrix is replaced with the operators in (6). These are the same for groups that have the same position relative to each other.…”
Section: Elsementioning
confidence: 99%
“…The resulting performance is typically a bit better for volume formulations then for boundary formulations, since the computational density is higher in the former case. A particular issue for the MLFMA formulation of electromagnetic scattering problems is that the work per element (group) in the tree data structure increases with the level, and additional partitioning strategies are needed for the coarser part of the structure [6,30,56].…”
Section: State Of the Artmentioning
confidence: 99%
“…We emphases that these algebraic representations are not only valid for linear elliptic operators, but also valid for operators associated with low-to-medium frequency Helmholtz equations and radial basis function kernel matrices. When operators admit high-frequency property, the low-rank structure appears in a very different way comparing to that in all aforementioned fast algorithms, and are also well-studied by the community [8,9,13,14,[31][32][33][34]38].…”
Section: Introductionmentioning
confidence: 99%