2012 IEEE 14th International Conference on High Performance Computing and Communication &Amp; 2012 IEEE 9th International Confe 2012
DOI: 10.1109/hpcc.2012.88
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Tuning of the Fast Multipole Method Based on Integrated Performance Prediction

Abstract: The Fast Multipole Method (FMM) is an efficient, widely used method for the solution of N-body problems. One of the main data structures is a hierarchical tree data structure describing the separation into near-field and farfield particle interactions. This article presents a method for automatic tuning of the FMM by selecting the optimal FMM tree depth based on an integrated performance prediction of the FMM computations. The prediction method exploits benchmarking of significant parts of the FMM implementati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 22 publications
0
4
0
Order By: Relevance
“…In [9] a parallel sorting for the particles in the particle systems is presented which improves the locality of interacting particles for computation on a distributed memory architecture. A more application specific optimization has been presented in [10], which introduces a method for automatic tuning of the FMM by selecting the optimal FMM tree depth based on an integrated performance prediction of the FMM computations.…”
Section: Summary Of the Performance Resultsmentioning
confidence: 99%
“…In [9] a parallel sorting for the particles in the particle systems is presented which improves the locality of interacting particles for computation on a distributed memory architecture. A more application specific optimization has been presented in [10], which introduces a method for automatic tuning of the FMM by selecting the optimal FMM tree depth based on an integrated performance prediction of the FMM computations.…”
Section: Summary Of the Performance Resultsmentioning
confidence: 99%
“…Especially for irregular applications, it is necessary to consider the input data because for these applications, the execution path often is heavily dependent on the data and, furthermore, the data layout in the memory cannot be predicted easily. Examples for such applications are as follows: sparse linear algebra, where the layout of the matrices may vary considerably and possibly a suitable storage format has to be selected ; particle simulations, where specific measures have to be taken to ensure an efficient execution with different particle distributions and particle distributions changing over the time ; stencil computations, especially on fractal structures which may grow unpredictably ; and neural simulations, where the behaviour of each neuron and its connections to other neurons may change over the simulation time . …”
Section: Autotuning and Model‐based Autotuningmentioning
confidence: 99%
“…Applications investigated less frequently are particle simulations , stencil computations and the visualisation of molecular dynamic processes . The particle simulation is a fast multipole method, which uses different methods for calculating interactions between particles of small and of long distance. The depth of the fast multipole method tree determines the numbers of both kinds of interactions.…”
Section: Existing Work Applying Model‐based Autotuningmentioning
confidence: 99%
See 1 more Smart Citation