Accelerating geospatial analysis on GPUs using CUDA

Xia, Yingjie; Kuang, Li; Li, Xiumei

doi:10.1631/jzus.c1100051

Cited by 34 publications

(27 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Each thread in a GPU block, represents a single LoS calculation; therefore 16,000 GPU threads for example would represent 16,000 individual parallel LoS calculations, disregarding certain GPU bandwidth limits. This volume of threads fits well with the GPGPU paradigm, which argues for a very high number of independent threaded operations being executed simultaneously over a sustained [10,14]. There has been a concerted effort to discover the potential performance benefits of using the GPU as a viewshed processor [15,12,3], which aims to either modify existing CPU algorithms, or design new algorithms specifically for CUDA hardware; [6] presents a novel algorithm for 'combing' the DEM via thread directions.…”

Section: Introductionmentioning

confidence: 79%

“…It can be then stated that XDraw-O is the most efficient algorithm, across the test cases for generating viewsheds in the general case. Further analysis will be required to determine if this performance difference is maintained when memory optimization techniques are applied to the LoS algorithms, such as those proposed by Zhao, Padmanabhan and Wang [16] for the DDA algorithm or the work of Xia, Yang and Xingmin [14] for creating more optimized ray traversals across a DEM. It can be assumed that the number of GPU cores would significantly affect the performance of the algorithm.…”

Section: Resultsmentioning

confidence: 99%

“…The performance results detailed correlate well with the performance increases of many previous GPU viewshed studies. Xia, Yang and Xingmin [14] and Zhao, Padmanabhan and Wang [16] demonstrated that significant performance improvements can be made with careful consideration to data operations. Narasiman et al [9] also demonstrated that spatial independency is a key factor when generating viewsheds on a GPU.…”

Section: Resultsmentioning

confidence: 99%

See 2 more Smart Citations

GPU enabled XDraw viewshed analysis

Cauchi-Saunders

Lewis

2015

Journal of Parallel and Distributed Computing

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 79%

Section: Resultsmentioning

confidence: 99%

Section: Resultsmentioning

confidence: 99%

See 1 more Smart Citation

GPU enabled XDraw viewshed analysis

Cauchi-Saunders

Lewis

2015

Journal of Parallel and Distributed Computing

View full text Add to dashboard Cite

“…A basic octree in the three-dimensional space is an 8-way branching tree, wherein at each level a cubic domain is decomposed into eight equal-size cubes. By traversing all the leaf nodes of the octree, the generated sub-domains are represented by a specific data structure which is linked and stored in a single directional list [33]. Big traffic data in the computational domain can be decomposed into data pieces in sub-domains by the octree structure.…”

Section: Octree-based Computational Domain Decompositionmentioning

confidence: 99%

Formalizing computational intensity of big traffic data understanding and analysis for parallel computing

2015

View full text Add to dashboard Cite

“…Beutel et al (2010) realized a GPU-based natural neighbour interpolation algorithm to generate DEM and accomplished a 10 times speedup ratio. Xia et al (2011) implemented IDW algorithm on a GPU and made a range of speedups from 12 to 33 depending on the scale and resolution of dataset. Hence, fully harvesting the computational power of a GPU has the potential to shorten the computing time of HASM.…”

Section: Introductionmentioning

confidence: 99%

Speeding up the high-accuracy surface modelling method with GPU

et al. 2015

View full text Add to dashboard Cite

In order to find a solution for accurate, topographic data-demanding applications, such as catchment hydrologic modeling and assessments of anthropic activities impact on environmental systems, high-accuracy surface modeling (HASM) method is developed. Although it can produce a digital elevation model (DEM) surface of higher accuracy than classical methods, e.g. inverse distance weighted, spline and kriging, HASM requires numerous iterations to solve large linear systems, which impede its applications in high-resolution and large-scale surface interpolation. This paper aims to demonstrate the utilization of graphics' processing units (GPUs) device to accelerate HASM in constructing large-scale and highresolution DEM surfaces. We parallelized the linear system algorithm for solving HASM with Compute Unified Device Architecture, a parallel programming model developed by NVIDIA. We designed a memory-saving strategy to enable the HASM algorithm to run on GPUs. The speedup ratio of GPU-based algorithm was tested and compared with CPUbased algorithm through simulations of both ideal Gaussian synthetic surface and real topographic surface in the loess plateau of Gansu province. The GPU-parallelized algorithm can attain an over 109 speedup ratio with the CPUbased algorithm as a reference. The speedup ratio increases with the scale and resolution of the dataset. The memory management strategy efficiently reduces the memory usage by more than eight times the grid cell number. Implementing HASM in the GPUs device enables modeling large-scale and high-resolution surfaces in a reasonable time period and implies the potential benefits from the use of GPUs as massive, parallel co-processors for arithmeticintensive data-processing applications.

show abstract

Accelerating geospatial analysis on GPUs using CUDA

Cited by 34 publications

References 14 publications

GPU enabled XDraw viewshed analysis

GPU enabled XDraw viewshed analysis

Formalizing computational intensity of big traffic data understanding and analysis for parallel computing

Speeding up the high-accuracy surface modelling method with GPU

Contact Info

Product

Resources

About