Proceedings of the ACM International Conference on Computing Frontiers 2016
DOI: 10.1145/2903150.2903180
|View full text |Cite
|
Sign up to set email alerts
|

Breadth first search vectorization on the Intel Xeon Phi

Abstract: Breadth First Search (BFS) is a building block for graph algorithms and has recently been used for large scale analysis of information in a variety of applications including social networks, graph databases and web searching. Due to its importance, a number of different parallel programming models and architectures have been exploited to optimize the BFS. However, due to the irregular memory access patterns and the unstructured nature of the large graphs, its efficient parallelization is a challenge. The Xeon … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 26 publications
0
9
0
Order By: Relevance
“…Reference Approach optimisation vectorisation 2012 Saule and Ç atalyürek [17] top-down no optimisation automatic 2013 Gao et al [19] top-down bitmaps intrinsics 2013 Stanic et al [ The key contribution of this work builds on the studies carried out by Gao et al in [19] and [5]. In the first study, they present the vectorisation of the top-down BFS algorithm arXiv:1704.02259v2 [cs.DC] 20 Apr 2017 using vector intrinsic functions, which was outperformed by [15], which clarified the impact of prefetching, thread affinity and vector unit usage rate. The second study is related with the vectorisation of the hybrid BFS algorithm.…”
Section: Yearmentioning
confidence: 99%
See 2 more Smart Citations
“…Reference Approach optimisation vectorisation 2012 Saule and Ç atalyürek [17] top-down no optimisation automatic 2013 Gao et al [19] top-down bitmaps intrinsics 2013 Stanic et al [ The key contribution of this work builds on the studies carried out by Gao et al in [19] and [5]. In the first study, they present the vectorisation of the top-down BFS algorithm arXiv:1704.02259v2 [cs.DC] 20 Apr 2017 using vector intrinsic functions, which was outperformed by [15], which clarified the impact of prefetching, thread affinity and vector unit usage rate. The second study is related with the vectorisation of the hybrid BFS algorithm.…”
Section: Yearmentioning
confidence: 99%
“…Particularly, the vectorisation of the hybrid involves the vectorised version of both algorithms. The vectorisation of the top-down algorithm is described and analysed in [15], whereas the vectorisation of the bottom-up is described further in Section 5. Table 2 shows an example of the switching points to swap between the top-down and the bottom-up approaches of the , , , ←getCounters() 14: swap(in, out) 15: ← 0 16: end while hybrid BFS algorithm for a graph created by the Graph 500 graph generator introduced in Section 6.…”
Section: The Hybrid Bfs Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…(3) Low-level single node: These are single-node native platforms which are performance-oriented and include architecture-dependent optimization [47]. As a consequence, such implementations are able to exploit better the hardware capability, leveraging good performance, but the programming and tuning efforts are high and, more importantly, portability is difficult to achieve.…”
Section: Pad: Graph-processing Platformsmentioning
confidence: 99%
“…There exist some BFS schemes for GPUs [31], [28], [11], [39], [14], [18]. However, they usually underutilize the available SIMD and vectorization parallelism as they focus on work-optimal traditional BFS or BFS based on SpMSpV that use fine-grained irregular memory accesses.…”
Section: Introductionmentioning
confidence: 99%