“…We conduct a large-scale parameter search to find the optimal parameters for each combination of stencil pattern and GPU. Here, around 10,000 and 5,000 parameter configurations are explored for each 2D (b T = [2,20], b S = [1, 32] × [32,2048], n thr = [1, 32] × [32,1024]) and 3D stencil ( [2,12], [1,4] × [1,32] × [32,256], [1,4] × [1,32] × [32,256]), respectively. We set 8,192 2 and 512 3 as 2D/3D grid size and 120 as iteration count for parameter search.…”