Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020
DOI: 10.1145/3318464.3380562
|View full text |Cite
|
Sign up to set email alerts
|

Memory-Aware Framework for Efficient Second-Order Random Walk on Large Graphs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 33 publications
0
8
0
Order By: Relevance
“…Sampling from a discrete probability distribution 𝑃 = {𝑝 0 , 𝑝 1 , ..., 𝑝 π‘›βˆ’1 } is to select an element β„Ž from {0, 1, ..., 𝑛 βˆ’ 1} based on 𝑃 (i.e., 𝑃 [β„Ž = 𝑖] = 𝑝 𝑖 ). In this paper, we focus on five sampling techniques, including naive sampling, inverse transformation sampling [40], alias sampling [58], rejection sampling [50] and a special case of rejection sampling [65] because they are efficient and widely used [46,52,53,59,65]. Naive sampling only works on the uniform discrete distribution, while the other four can handle non-uniform and select the element β„Ž in two phases: initialization, which preprocesses the distribution 𝑃, and generation, which picks an element on the basis of the initialization result.…”
Section: Sampling Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Sampling from a discrete probability distribution 𝑃 = {𝑝 0 , 𝑝 1 , ..., 𝑝 π‘›βˆ’1 } is to select an element β„Ž from {0, 1, ..., 𝑛 βˆ’ 1} based on 𝑃 (i.e., 𝑃 [β„Ž = 𝑖] = 𝑝 𝑖 ). In this paper, we focus on five sampling techniques, including naive sampling, inverse transformation sampling [40], alias sampling [58], rejection sampling [50] and a special case of rejection sampling [65] because they are efficient and widely used [46,52,53,59,65]. Naive sampling only works on the uniform discrete distribution, while the other four can handle non-uniform and select the element β„Ž in two phases: initialization, which preprocesses the distribution 𝑃, and generation, which picks an element on the basis of the initialization result.…”
Section: Sampling Methodsmentioning
confidence: 99%
“…RW algorithm optimization. Due to the importance of the RW-based applications, a variety of algorithm-specific optimizations have been proposed for different RW applications, e.g., PPR [17,35,54,60,62], Node2Vec [69] and second-order random walks [53]. In contrast, we aim to design a generic and efficient random walk framework on which users can easily implement different kinds of random walk applications.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The use of the Alias Method for Node2Vec incurs the "memory explosion problem" [54] since the preprocessing phase for a second-order random walk on a graph with |E| edges has a support whose cardinality is O eij ∈E deg (j) , where deg(j) is the degree of the destination node of the edge e ij ∈ E. Therefore, the time and memory complexity needed for preprocessing make the Alias method intractable even on relatively small graphs. For instance, on the unfiltered Human STRING PPI graph (19.354 nodes and 5.879.727 edges) it would require 777 GB of RAM.…”
Section: Efficient Sampling For Node2vec Random Walksmentioning
confidence: 99%
“…The empirical results demonstrate that UniNet can be 10X-900X faster than the existing open-sourced versions. Compared to the state-of-the-art sampling techniques (e.g., KnightKing [35], Memory-aware sampler [32]), our M-H based edge sampler achieves 9.6%-73.2% efficiency improvement in most of the settings and can scale to much larger networks.…”
Section: Introductionmentioning
confidence: 99%