2010
DOI: 10.1016/j.parco.2009.12.002

Solving path problems on the GPU

Abstract: We consider the computation of shortest paths on Graphic Processing Units (GPUs). The blocked recursive elimination strategy we use is applicable to a class of algorithms (such as all-pairs shortest-paths, transitive closure, and LU decomposition without pivoting) having similar data access patterns. Using the all-pairs shortest-paths problem as an example, we uncover potential gains over this class of algorithms. The impressive computational power and memory bandwidth of the GPU make it an attractive platform…
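
To make the idea of blocking concrete, here is a minimal Python sketch of the classic iterative tiled Floyd-Warshall scheme for all-pairs shortest paths. It is not the paper's method: the abstract describes a recursive blocked elimination strategy, and this sketch swaps in the simpler three-phase tiled variant only to illustrate the shared data access pattern. All names (`floyd_warshall`, `blocked_floyd_warshall`, `_relax_tile`, the tile size `b`) are hypothetical and chosen for illustration.

```python
import math

INF = math.inf  # marks "no edge" in the distance matrix


def floyd_warshall(dist):
    """Reference (unblocked) Floyd-Warshall on a square list-of-lists matrix."""
    n = len(dist)
    d = [row[:] for row in dist]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d


def _relax_tile(d, n, b, ti, tj, tk):
    """Relax tile (ti, tj) in place using the pivot vertices of block tk."""
    for k in range(tk * b, min((tk + 1) * b, n)):
        for i in range(ti * b, min((ti + 1) * b, n)):
            for j in range(tj * b, min((tj + 1) * b, n)):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]


def blocked_floyd_warshall(dist, b=2):
    """Tiled Floyd-Warshall with b x b tiles (illustrative sketch only)."""
    n = len(dist)
    d = [row[:] for row in dist]
    nb = (n + b - 1) // b  # number of tiles per dimension
    for t in range(nb):
        # Phase 1: the diagonal tile depends only on itself.
        _relax_tile(d, n, b, t, t, t)
        # Phase 2: tiles sharing a row or column with the diagonal tile.
        for o in range(nb):
            if o != t:
                _relax_tile(d, n, b, t, o, t)
                _relax_tile(d, n, b, o, t, t)
        # Phase 3: all remaining tiles; these are mutually independent,
        # which is what makes the scheme attractive for parallel hardware.
        for i in range(nb):
            if i == t:
                continue
            for j in range(nb):
                if j != t:
                    _relax_tile(d, n, b, i, j, t)
    return d


graph = [
    [0, 3, INF, 7],
    [8, 0, 2, INF],
    [5, INF, 0, 1],
    [2, INF, INF, 0],
]
print(blocked_floyd_warshall(graph, b=2))
```

Within each round, the phase 3 tiles read only tiles finished in phases 1 and 2, so they can be processed in any order or concurrently. In this pure-Python sketch the tile size `b` changes only the traversal order, not the result.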

Citations: cited by 88 publications (61 citation statements)
References: 28 publications
“…[9]) has been the choice of several parallel implementation as the algorithm allows one to study cache blocking techniques. Examples of this approach can be seen in Buluc et al [5], Matsumoto et al [28] and Katz et al [23]. The above works report results on a variety of CPU and GPU architectures.…”
Section: Related Work (mentioning)
confidence: 99%
“…As graphs corresponding to real-world and practical applications have a massive size, parallel processing is often necessary. It is therefore natural that a lot of current research is directed towards efficient algorithmics on a variety of modern and emerging multi- and many-core architectures [4,28,5,34].…”
Section: Introduction (mentioning)
confidence: 99%
“…Actual algorithms based on this proof are given by various researchers, with minor differences. Our decision to use the DC algorithm as our starting point is inspired by its demonstrated better cache reuse on CPUs [33], and its impressive performance attained on the many-core graphical processor units [11].…”
Section: Previous Work (mentioning)
confidence: 99%
“…SSSP algorithms based on ∆-stepping [32] scale better in practice but their performance is input dependent and scales with O(m+d·L·log n), where d is the maximum vertex degree and L is the maximum shortest path weight from the source. Consequently, it is likely that a Floyd-Warshall based approach would be competitive even for sparse graphs, as realized on graphical processing units [11].…”
Section: Introduction (mentioning)
confidence: 99%
“…In particular, recent GPU cards produced by NVIDIA Corporation provide substantial benefits for parallel computation, and the company itself supplies an easy-to-implement environment for developers and researchers. Recently, the effectiveness and advantages of using GPUs for technical computations have been widely reported [6]- [8].…”
Section: Introduction (mentioning)
confidence: 99%