2013
DOI: 10.1007/978-3-642-40047-6_82
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs

Abstract: Nanosimulations present a big HPC challenge as they present increasing performance demands in heterogeneous execution environments. In this paper, we present our optimization methodology for BigDFT, a nanosimulation software using Density Functional Theory. We explore autotuning possibilities for BigDFT's 3D convolutions by studying optimization techniques for several architectures. Namely, we focus on processors with vector units and on GPU acceleration. We report on the portability and the performance gains … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…GPU acceleration has been achieved for wave function based DFT methods, e.g. plane wave methods, wavelets, grid and local orbitals [11,20,22,24]. Closer to this work are projects in the class of linear-scaling methods like SIESTA or CP2K [6,21].…”
Section: Related Workmentioning
confidence: 98%
“…GPU acceleration has been achieved for wave function based DFT methods, e.g. plane wave methods, wavelets, grid and local orbitals [11,20,22,24]. Closer to this work are projects in the class of linear-scaling methods like SIESTA or CP2K [6,21].…”
Section: Related Workmentioning
confidence: 98%
“…The CPU convolutions of BigDFT have thus been thoroughly optimized. In a recent paper (Videau et al, 2013), the optimization of the CPU convolutions of BigDFT has been extensively considered. One example of a specific convolution, called MagicFilter (Genovese et al, 2010), can be seen in Listing 16.…”
Section: Creating An Autotuned Convolution Library For Bigdft Using Bmentioning
confidence: 99%
“…Several works have studied data layout restructuring for specific applications [21,12] and for stencils [9]. In a recent work [1] the authors proposed a framework to analyze binary codes, and to formulate user-targeted hints about SIMDization potentials and hindrances.…”
Section: Introductionmentioning
confidence: 99%