2010 International Conference on Microwave and Millimeter Wave Technology 2010
DOI: 10.1109/icmmt.2010.5524901
|View full text |Cite
|
Sign up to set email alerts
|

Overcoming the GPU memory limitation on FDTD through the use of overlapping subgrids

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2016
2016

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 5 publications
0
8
0
Order By: Relevance
“…In order to reduce the communication cost between CPU and GPU side, temporal blocking method, where each sub-domain is computed for multiple time steps at once without frequent communication, has been introduced [5], [6], [7] as shown in Figure 4. We name TBS as the number of time steps computed at once on the GPU side.…”
Section: A Naive Methods and Temporal Blocking Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to reduce the communication cost between CPU and GPU side, temporal blocking method, where each sub-domain is computed for multiple time steps at once without frequent communication, has been introduced [5], [6], [7] as shown in Figure 4. We name TBS as the number of time steps computed at once on the GPU side.…”
Section: A Naive Methods and Temporal Blocking Methodsmentioning
confidence: 99%
“…So, it has to copy current sub-domain back to CPU side and copy the next sub-domain to GPU side to continue which cause frequent communication between CPU and GPU. There is temporal blocking method [5], [6], [7], [8] which can solve this frequent communication problem. It has been utilized to improve cache locality [9], [10], [11] and to reduce the number of inter process communication [12].…”
Section: Introductionmentioning
confidence: 99%
“…There are many recent works such as [12][13][14][15][16] that use GPUs to accelerate FDTD. GPU acceleration of 2-D and 3-D FDTD for electromagnetic field analysis is proposed in [12,13], respectively.…”
Section: The Fdtd Simulation Of Cylindrical Resonatormentioning
confidence: 99%
“…This vector wraps both modulus (speed) and phase (direction) in a single value reducing and reduce the rise of directional errors for small velocities. Since the vector is self-normalized, the angle between the measured velocity v e and the correct one v c is given by Equation (12). This error measurement is calculated for every pixel for which a velocity measurement was recovered.…”
Section: Multi-criteria Motivation For Tunning Mcgmmentioning
confidence: 99%
“…To solve this problem, research [12] has often proposed a data reuse alternative with the aim of minimizing the memory traffic between GPU and CPU. Another approach in the field of rendering meshes can be found in [13] a solution that uses more efficient algorithms in terms of memory consumption alongside other techniques based on simplification or information compression.…”
Section: Introductionmentioning
confidence: 99%