2011
DOI: 10.1631/jzus.c1100051
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating geospatial analysis on GPUs using CUDA

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 34 publications
(27 citation statements)
references
References 14 publications
0
26
0
Order By: Relevance
“…Each thread in a GPU block, represents a single LoS calculation; therefore 16,000 GPU threads for example would represent 16,000 individual parallel LoS calculations, disregarding certain GPU bandwidth limits. This volume of threads fits well with the GPGPU paradigm, which argues for a very high number of independent threaded operations being executed simultaneously over a sustained [10,14]. There has been a concerted effort to discover the potential performance benefits of using the GPU as a viewshed processor [15,12,3], which aims to either modify existing CPU algorithms, or design new algorithms specifically for CUDA hardware; [6] presents a novel algorithm for 'combing' the DEM via thread directions.…”
Section: Introductionmentioning
confidence: 79%
See 2 more Smart Citations
“…Each thread in a GPU block, represents a single LoS calculation; therefore 16,000 GPU threads for example would represent 16,000 individual parallel LoS calculations, disregarding certain GPU bandwidth limits. This volume of threads fits well with the GPGPU paradigm, which argues for a very high number of independent threaded operations being executed simultaneously over a sustained [10,14]. There has been a concerted effort to discover the potential performance benefits of using the GPU as a viewshed processor [15,12,3], which aims to either modify existing CPU algorithms, or design new algorithms specifically for CUDA hardware; [6] presents a novel algorithm for 'combing' the DEM via thread directions.…”
Section: Introductionmentioning
confidence: 79%
“…It can be then stated that XDraw-O is the most efficient algorithm, across the test cases for generating viewsheds in the general case. Further analysis will be required to determine if this performance difference is maintained when memory optimization techniques are applied to the LoS algorithms, such as those proposed by Zhao, Padmanabhan and Wang [16] for the DDA algorithm or the work of Xia, Yang and Xingmin [14] for creating more optimized ray traversals across a DEM. It can be assumed that the number of GPU cores would significantly affect the performance of the algorithm.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…A basic octree in the three-dimensional space is an 8-way branching tree, wherein at each level a cubic domain is decomposed into eight equal-size cubes. By traversing all the leaf nodes of the octree, the generated sub-domains are represented by a specific data structure which is linked and stored in a single directional list [33]. Big traffic data in the computational domain can be decomposed into data pieces in sub-domains by the octree structure.…”
Section: Octree-based Computational Domain Decompositionmentioning
confidence: 99%
“…Beutel et al (2010) realized a GPU-based natural neighbour interpolation algorithm to generate DEM and accomplished a 10 times speedup ratio. Xia et al (2011) implemented IDW algorithm on a GPU and made a range of speedups from 12 to 33 depending on the scale and resolution of dataset. Hence, fully harvesting the computational power of a GPU has the potential to shorten the computing time of HASM.…”
Section: Introductionmentioning
confidence: 99%