Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2015
DOI: 10.1145/2807591.2807632
|View full text |Cite
|
Sign up to set email alerts
|

Data partitioning strategies for graph workloads on heterogeneous clusters

Abstract: Large scale graph analytics are an important class of problem in the modern data center. However, while data centers are trending towards a large number of heterogeneous processing nodes, graph analytics frameworks still operate under the assumption of uniform compute resources. In this paper, we develop heterogeneity-aware data ingress strategies for graph analytics workloads using the popular Power-Graph framework. We illustrate how simple estimates of relative node computational throughput can guide heterog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(15 citation statements)
references
References 18 publications
0
14
0
Order By: Relevance
“…These systems use graph partitioning to scale out computations on graphs or sparse matrices that do not fit in the memory of a single node. In the graph analytics literature, partitioning strategies are classified into Edge-Cuts [4,[22][23][24]41,42,50,51] and Vertex-Cuts [8,12,18,26,29,37,39,44]. In the matrix literature, they are classified into 1D and 2D partitionings [7,11].…”
Section: Related Workmentioning
confidence: 99%
“…These systems use graph partitioning to scale out computations on graphs or sparse matrices that do not fit in the memory of a single node. In the graph analytics literature, partitioning strategies are classified into Edge-Cuts [4,[22][23][24]41,42,50,51] and Vertex-Cuts [8,12,18,26,29,37,39,44]. In the matrix literature, they are classified into 1D and 2D partitionings [7,11].…”
Section: Related Workmentioning
confidence: 99%
“…Such sketches represent data structures that approximate properties of a data stream. LeBeane et al [97] proposed on-line graphpartitioning multiple strategies to optimise data-ingress across heterogeneous clusters. SWORD [98] handles the partitioning and placement for OLTP workloads.…”
Section: Graph-based Data Placementmentioning
confidence: 99%
“…With a new composite graph, we further leverage a hybrid-cut algorithm [28], which attempts to perform an either vertex-or edge-cut method for the balance of workload and communication. We first give priority in the use of a vertex-cut partition (with a greedy heuristic algorithm [15]) to minimize the number of vertex mirrors for reducing the total storage as well as communication requirements (of symbol assignment).…”
Section: Activation-induced Graph Partitionmentioning
confidence: 99%
“…The vertex-cut partition [9,15,31] considers evenly placing all edges of graph for mitigating imbalanced communication. LeBeane et al [28] developed severalf graph partitioning strategies to investigate the heterogeneity of graph processing. Further, Song et al [44] presented a profiling-based approach to fully exploit the computational capability of each compute node for guiding better graph partitioning with minimal overhead.…”
Section: Related Workmentioning
confidence: 99%