2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2017
DOI: 10.1109/ipdps.2017.117
|View full text |Cite
|
Sign up to set email alerts
|

Multi-GPU Graph Analytics

Abstract: We present a single-node, multi-GPU programmable graph processing library that allows programmers to easily extend single-GPU graph algorithms to achieve scalable performance on large graphs with billions of edges. Directly using the single-GPU implementations, our design only requires programmers to specify a few algorithm-dependent concerns, hiding most multi-GPU related implementation details. We analyze the theoretical and practical limits to scalability in the context of varying graph primitives and datas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
39
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 47 publications
(39 citation statements)
references
References 28 publications
0
39
0
Order By: Relevance
“…Another limitation of existing systems is that they are integrated solutions that come with their own programming models, runtime systems, and communication runtimes, which makes it difficult to reuse infrastructure to build new systems. For example, all existing GPU graph analytics systems such as Gunrock [56,69], Groute [8], and IrGL [55] are limited to a single node, and there is no way to reuse infrastructure from existing distributed graph analytics systems to build GPU-based distributed graph analytics systems from these single-node systems.…”
Section: Introductionmentioning
confidence: 99%
“…Another limitation of existing systems is that they are integrated solutions that come with their own programming models, runtime systems, and communication runtimes, which makes it difficult to reuse infrastructure to build new systems. For example, all existing GPU graph analytics systems such as Gunrock [56,69], Groute [8], and IrGL [55] are limited to a single node, and there is no way to reuse infrastructure from existing distributed graph analytics systems to build GPU-based distributed graph analytics systems from these single-node systems.…”
Section: Introductionmentioning
confidence: 99%
“…We compare our results with previous efforts in Table II. When compared against single-node multi-GPU Gunrock [5], this work is a little slower when using the same graphs, which may be the effect of more optimizations in Gunrock's traversal kernels. As we add more GPUs in this work, we see the gap in performance is narrowing, which indicates better scalability; and the memory size improvements we made in this paper allows us to process larger graphs on one node, up to scale 28 on 4 GPUs, than any other GPU-based previous work.…”
Section: Overall Results and Comparisonsmentioning
confidence: 99%
“…Using GPUs in the same node for BFS yields impressive per-node performance [5], [9], [11], [12], but because all [5] their communication is within a node and thus faster than within a cluster, their per-node performance is superior to cluster-based solutions. However, their graphs must fit into one node's memory (GPU or CPU), and this inherently limits the maximum size of a processed graph.…”
Section: Bfs Within Single Nodementioning
confidence: 99%
“…In this paper, we address this problem by exploiting graphics processing units (GPUs). Because of massive hardware parallelism and high memory bandwidth, GPUs have been widely used in diverse applications including machine learning [5][6][7], graph processing [8][9][10], big data analytics [11,12], image processing [13], and fluid dynamics [14]. In order to reap the power of GPUs, the algorithmic steps need to be mapped delicately onto the architecture of GPUs, especially the thread and memory hierarchy.…”
Section: Introductionmentioning
confidence: 99%