2017 IEEE 33rd International Conference on Data Engineering (ICDE) 2017
DOI: 10.1109/icde.2017.153
|View full text |Cite
|
Sign up to set email alerts
|

Spinner: Scalable Graph Partitioning in the Cloud

Abstract: Several organizations, like social networks, store and routinely analyze large graphs as part of their daily operation. Such graphs are typically distributed across multiple servers, and graph partitioning is critical for efficient graph management. Existing partitioning algorithms focus on finding graph partitions with good locality, but disregard the pragmatic challenges of integrating partitioning into large-scale graph management systems deployed on the cloud, such as dealing with the scale and dynamicity … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
75
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 70 publications
(76 citation statements)
references
References 29 publications
(24 reference statements)
1
75
0
Order By: Relevance
“…Our initial experiments (see Figure 1) and earlier works [18,29,33] indicate that two important dimensions for the performance of Giraph jobs are the number of vertices and the number of edges. For this reason, we specify two weights for the vertices, w…”
Section: Multi-dimensional Partitioningmentioning
confidence: 87%
See 1 more Smart Citation
“…Our initial experiments (see Figure 1) and earlier works [18,29,33] indicate that two important dimensions for the performance of Giraph jobs are the number of vertices and the number of edges. For this reason, we specify two weights for the vertices, w…”
Section: Multi-dimensional Partitioningmentioning
confidence: 87%
“…While one-dimensional balanced graph partitioning has been studied extensively and a number of tools exist [7,13,14,22,23,33,41,42] (see also surveys by Bichot and Siarry [9] and by Buluç et al [12]), to the best of our knowledge none of the practical algorithms for this problem have been previously based on running gradient descent on a continuous relaxation. Existing approaches are inherently discrete and are based on combinations of various discrete algorithms: greedy heuristics (METIS [23], Fennel [41]), branch-and-bound [13], label propagation and local search (balanced label propagation [42], Social Hash Partitioner [22], Spinner [33]), as well as hybrid approaches (linear embedding method combined with various optimizations [7]). Due to the combinatorial nature of these algorithms, their generalizations to the multi-dimensional case appear to be non-straightforward without substantial losses in performance, while our continuous relaxation handles multiple balance constraints uniformly.…”
Section: Previous Workmentioning
confidence: 99%
“…Multilevel graph partitioners such as METIS [29] and ParMETIS [30] do not scale very well. Spinner [31] is a highly scalable graph partitioner that, like SHP, performs iterative random permutations and greedy selection of the best permutation. There is a large number of streaming graph partitioning algorithms, such as HDRF [17], H-load [32], and ADWISE [33].…”
Section: Related Workmentioning
confidence: 99%
“…Several query-agnostic partitioning algorithms optimizing the number of cut vertices or edges have been proposed [7,15,23,25,30,34,36,37,42]. As shown in Section 4, even best-case edgecut partitioning algorithms lead to suboptimal locality, workload balancing, and query latency.…”
Section: Related Workmentioning
confidence: 99%