Proceedings of the 17th International Conference on World Wide Web 2008
DOI: 10.1145/1367497.1367738
|View full text |Cite
|
Sign up to set email alerts
|

Fast algorithms for topk personalized pagerank queries

Abstract: In entity-relation (ER) graphs (V, E), nodes V represent typed entities and edges E represent typed relations. For dynamic personalized PageRank queries, nodes are ranked by their steady-state probabilities obtained using the standard random surfer model. In this work, we propose a framework to answer top-k graph conductance queries. Our top-k ranking technique leads to a 4× speedup, and overall, our system executes queries 200-1600× faster than whole-graph PageRank. Some queries might contain hard predicates … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
53
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 55 publications
(53 citation statements)
references
References 4 publications
(6 reference statements)
0
53
0
Order By: Relevance
“…Moreover, we also show that -these can be obtained highly efficiently, if necessary, leveraging existing approximation algorithms [2,4,14,17,21,23,41] and/or parallel implementations [3,32] for computing the PPR scores, -the proposed formulations are reuse-promoting in the sense that, it is possible to divide the work relative to individual seed nodes and cache the intermediary results obtained during the computation -these cached results can then be reused for future queries sharing seed nodes, and -especially in systems with large query throughputs, it may be possible to cluster queries based on the partial overlaps between the seed sets and, thus, significantly reduce the overall robust PPR computation costs.…”
Section: Our Contributions: Robust Personalized Pagerank (Rpr)mentioning
confidence: 84%
See 2 more Smart Citations
“…Moreover, we also show that -these can be obtained highly efficiently, if necessary, leveraging existing approximation algorithms [2,4,14,17,21,23,41] and/or parallel implementations [3,32] for computing the PPR scores, -the proposed formulations are reuse-promoting in the sense that, it is possible to divide the work relative to individual seed nodes and cache the intermediary results obtained during the computation -these cached results can then be reused for future queries sharing seed nodes, and -especially in systems with large query throughputs, it may be possible to cluster queries based on the partial overlaps between the seed sets and, thus, significantly reduce the overall robust PPR computation costs.…”
Section: Our Contributions: Robust Personalized Pagerank (Rpr)mentioning
confidence: 84%
“…Alternatively, PowerIteration [27] or using iterative approximations [14,30], which explicitly simulate the dissemination of probability mass by repeatedly applying the transition process to an initial distribution π 0 until a convergence criterion is satisfied. Recent advances on PPR computation include top-k and approximate personalized PageRank algorithms [2,4,14,17,21,23,41] and parallelized implementations on MapReduce or Pregel based systems [3,32,36,38]. The FastRWR algorithm [41], for example partitions the graph into subgraphs and indexes partial intermediary solutions.…”
Section: Obtaining Pagerank and Personalized Pagerank Scoresmentioning
confidence: 99%
See 1 more Smart Citation
“…Unlike existing approximate PPR algorithms (Avrachenkov et al, 1995;Bahmani et al, 2010;Csalogany et al, 2005;Chakrabarti, 2007;Fujiwara et al, 2012;Gupta et al, 2008;Tong et al, 2006b;Song et al, 2009), LR-PPR is location sensitive. Therefore, given the set, S, of seed nodes and the corresponding localities, G S , the computation focuses on the combined locality G + (V + , E + ) ⊆ G, where…”
Section: Combined Locality and Its Boundarymentioning
confidence: 99%
“…Unfortunately, for large data sets, both of these processes are prohibitively expensive. Recent advances on personalized PageRank include top-k and approximate personalized PageRank algorithms (Avrachenkov et al, 1995;Bahmani et al, 2010;Csalogany et al, 2005;Chakrabarti, 2007;Fujiwara et al, 2012;Gupta et al, 2008;Tong et al, 2006b;Song et al, 2009), parallelized implementations on MapReduce or Pregel based batch data processing systems (Bahmani et al, 2011;Malewicz et al, 2010), and techniques to eliminate seed noise (Huang et al, 2014). The FastRWR algorithm, presented in (Tong et al, 2006b), for example partitions the graph into subgraphs and indexes partial intermediary solutions.…”
Section: Measuring Proximity With Pagerankmentioning
confidence: 99%