2013
DOI: 10.2528/pier13030606
|View full text |Cite
|
Sign up to set email alerts
|

B-Calm: An Open-Source Multi-Gpu-Based 3d-FDTD With Multi-Pole Dispersion for Plasmonics

Abstract: Abstract-Numerical calculations based on finite-difference timedomain (FDTD) simulations for metallic nanostructures in a broad optical spectrum require an accurate modeling of the permittivity of dispersive materials. In this paper, we present the algorithms behind B-CALM (Belgium-CAlifornia Light Machine), an open-source 3D-FDTD solver simultaneously operating on multiple Graphical Processing Units (GPUs) and efficiently utilizing multi-pole dispersion models while hiding latency in inter-GPU memory transfer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0
1

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 16 publications
0
11
0
1
Order By: Relevance
“…In the first part of the research an algorithm to solve the 3D FDTD problem with GPU using CUDA platform was proposed and analyzed. GPU was used widely before in this field and some recent researches (Wahl, Ly Gagnon, Debaes, Van Erps, Vermeulen, Miller, Thienpont 2013) show that this type of calculations can be automatically parallelized on several GPUs using special software packages. Other alternative could be the usage of implementation simplifying tools such as (Hoshino, Maruyama, Matsuoka, Takaki 2013), which in general has approximately 50% lower performance than CUDA, however, for some applications it can reach up to 98% of CUDA efficiency.…”
Section: Scientific Novelty Of the Thesismentioning
confidence: 99%
“…In the first part of the research an algorithm to solve the 3D FDTD problem with GPU using CUDA platform was proposed and analyzed. GPU was used widely before in this field and some recent researches (Wahl, Ly Gagnon, Debaes, Van Erps, Vermeulen, Miller, Thienpont 2013) show that this type of calculations can be automatically parallelized on several GPUs using special software packages. Other alternative could be the usage of implementation simplifying tools such as (Hoshino, Maruyama, Matsuoka, Takaki 2013), which in general has approximately 50% lower performance than CUDA, however, for some applications it can reach up to 98% of CUDA efficiency.…”
Section: Scientific Novelty Of the Thesismentioning
confidence: 99%
“…Different types of GPUs were used to compare their real computability. Researchers from Belgium used CUDA on up to four GPUs for a 3-D FDTD application with multi-pole dispersion for plasma [7,8], and found an almost linear speedup with respect to the number of GPUs.…”
Section: Introductionmentioning
confidence: 99%
“…There are two major approaches to programming multi-GPU based parallel FD-FDTD: open computing language (OpenCL) [3,4] and compute unified device architecture (CUDA) [5][6][7][8]. OpenCL is a framework and programming language that executes across heterogeneous platforms consisting of CPUs, GPUs, or other processors.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, the discrete Green's function (DGF) [1][2][3][4] has been proven to be an efficient tool facilitating the finite-difference time-domain (FDTD) method [5][6][7][8][9][10][11]. DGF is a response of the FDTD grid to the Kronecker delta current source.…”
Section: Introductionmentioning
confidence: 99%