2019
DOI: 10.1007/s41019-019-0088-6
|View full text |Cite
|
Sign up to set email alerts
|

Novel Parallel Algorithms for Fast Multi-GPU-Based Generation of Massive Scale-Free Networks

Abstract: A novel parallel algorithm is presented for generating random scale-free networks using the preferential attachment model. The algorithm, named cuPPA, is custom-designed for "single instruction multiple data (SIMD)" style of parallel processing supported by modern processors such as graphical processing units (GPUs). To the best of our knowledge, our algorithm is the first to exploit GPUs, and also the fastest implementation available today, to generate scale-free networks using the preferential attachment mod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 21 publications
0
10
0
Order By: Relevance
“…In recent years, more paralleled deep learning methods have been brought up [11]. In the aspect of algorithms, several algorithms have been brought up to accelerate multi-GPU implementation or make the inference more accurate [1,26] and faster [7,12]. Moreover, there are researches have been done to integrate DP and MP [8].…”
Section: Multi-gpu Parallel Computingmentioning
confidence: 99%
“…In recent years, more paralleled deep learning methods have been brought up [11]. In the aspect of algorithms, several algorithms have been brought up to accelerate multi-GPU implementation or make the inference more accurate [1,26] and faster [7,12]. Moreover, there are researches have been done to integrate DP and MP [8].…”
Section: Multi-gpu Parallel Computingmentioning
confidence: 99%
“…Alam et al [12] transfer the algorithm into the distributed memory model and show how dependency chains, which are short in practice, are resolved e ciently in parallel. Alternatively if multi-edges are acceptable, [294] can be adapted to multi-GPU scenarios [11] yielding an even more scalable approach.…”
Section: Algorithmic Similarities Between Ba and Node Copymentioning
confidence: 99%
“…In that spirit, we execute EM ES on an undirected version of the crawled ClueWeb12 graph's core [324] which we obtain by deleting all nodes corresponding to uncrawled URLs. 11 Performing k = m swaps on this graph with n ≈ 9.8 • 10 8 nodes and m ≈ 3.7 • 10 10 edges is feasible in less than 19.1 h on SysB. Bhuiyan et al propose a distributed edge switching algorithm and evaluate it on a compute cluster with 64 nodes each equipped with two Intel Xeon E5-2670 2.60GHz 8-core processors and 64GB RAM [43].…”
Section: Em Esmentioning
confidence: 99%
See 2 more Smart Citations