Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2012
DOI: 10.1145/2339530.2339722
|View full text |Cite
|
Sign up to set email alerts
|

Streaming graph partitioning for large distributed graphs

Abstract: Extracting knowledge by performing computations on graphs is becoming increasingly challenging as graphs grow in size. A standard approach distributes the graph over a cluster of nodes, but performing computations on a distributed graph is expensive if large amount of data have to be moved. Without partitioning the graph, communication quickly becomes a limiting factor in scaling the system up. Existing graph partitioning heuristics incur high computation and communication cost on large graphs, sometimes as hi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
270
0
2

Year Published

2013
2013
2020
2020

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 328 publications
(280 citation statements)
references
References 26 publications
2
270
0
2
Order By: Relevance
“…If the splits have similar score, then the element is assigned to the split with the lowest index. While different penalty functions can be used, we go for a linear weighted function, as it has experimentally shown good performance results [21]. This forces our distribution to be uniform.…”
Section: A Greedy Model-partitioning Algorithm For Distributed Transfmentioning
confidence: 99%
See 2 more Smart Citations
“…If the splits have similar score, then the element is assigned to the split with the lowest index. While different penalty functions can be used, we go for a linear weighted function, as it has experimentally shown good performance results [21]. This forces our distribution to be uniform.…”
Section: A Greedy Model-partitioning Algorithm For Distributed Transfmentioning
confidence: 99%
“…Stanton and Kliot [21] compared a set of lightweight Streaming Graph Partitioning for Large Distributed Graphs and compare their performance to some well-known offline algorithms. They run their benchmark on large collections of datasets, and showed up to 76% of average gain.…”
Section: Data-drivenmentioning
confidence: 99%
See 1 more Smart Citation
“…First, ensuring good application performance, our ultimate goal, requires graph partitions that minimise communication and achieve load balancing. Minimising communication allows more parallelisation and improves system scalability, while load balancing can have a significant impact on overall processing time [21,35]. However, these are often conflicting requirements.…”
Section: Challengesmentioning
confidence: 99%
“…A bandwidth aware graph partitioning framework to minimize the network traffic in partitioning and processing is proposed in [19]. Finally, another recent work [20] shows that using simple partitioning heuristics can bring a significant performance improvement that surpasses the widely-used offline metis partitioner.…”
Section: Related Workmentioning
confidence: 99%