Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology 2009
DOI: 10.1145/1516360.1516418
|View full text |Cite
|
Sign up to set email alerts
|

Efficiently indexing shortest paths by exploiting symmetry in graphs

Abstract: Shortest path queries (SPQ) are essential in many graph analysis and mining tasks. However, answering shortest path queries on-the-fly on large graphs is costly. To online answer shortest path queries, we may materialize and index shortest paths. However, a straightforward index of all shortest paths in a graph of N vertices takes O(N 2 ) space. In this paper, we tackle the problem of indexing shortest paths and online answering shortest path queries. As many large real graphs are shown richly symmetric, the c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
46
0

Year Published

2009
2009
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 59 publications
(47 citation statements)
references
References 28 publications
1
46
0
Order By: Relevance
“…Our paper addresses a different problem than the one in [20,21] since we are interested only on the length of a shortest path, not the path itself. Recently, Xiao et al have exploit graph symmetry to obtain speed-ups for PPSP queries over simple BFS traversals [45]. Our algorithms work only on the precomputed node-to-landmark-distances and do not perform any Dijkstra-type computation at query-time.…”
Section: Related Workmentioning
confidence: 99%
“…Our paper addresses a different problem than the one in [20,21] since we are interested only on the length of a shortest path, not the path itself. Recently, Xiao et al have exploit graph symmetry to obtain speed-ups for PPSP queries over simple BFS traversals [45]. Our algorithms work only on the precomputed node-to-landmark-distances and do not perform any Dijkstra-type computation at query-time.…”
Section: Related Workmentioning
confidence: 99%
“…Naturally, how two nodes are related to each other reflects the topology of the graph G. Path-length based definitions, such as those proposed by [7,16,34,42,44,45] help capture the relatedness of a pair of nodes solely based on the properties of the nodes and edges on the shortest path between the pair. [12] and [13] were among the first works which recognized that random-walks can also be used for measuring the significance of the graph nodes relative to a given seed node set, S ⊆ V : authors observed that, if one constructs a random-walk graph such that transition probabilities represent the separation between the seed nodes in the graph then the random-walk would spend more time on nodes that are closer to the seed nodes in S. More specifically, in [12] the authors proposed to construct a transition matrix, T S , where edges leading away from the seed nodes are weighted less than those edges leading towards the seed nodes.…”
Section: Context-sensitive Node Significance and Personalizationmentioning
confidence: 99%
“…Note that localities can be distance-constrained or size-constrained. Common definitions include h-hop neighborhoods (Boldi et al, 2011;Cohen et al, 2003;Wei, 2010;Xiao et al, 2009;Zhou et al, 2009), reachability neighborhoods (Cohen et al, 2003), cluster/partition neighborhoods (Feige et al, 2005;Karypis and Kumar, 1998;Newman, 2006), or hitting distance neighborhoods (Chen et al, 2008;Mei et al, 2008). One straight-forward way to identify the locality of a seed node n is to perform breadth-first search around n to locate the closest L nodes in linear time to the size of the locality.…”
Section: Locality Selectionmentioning
confidence: 99%
“…Due to the wide-spread use of graphs in analysis, mining, and visualization of interconnected data, there are many definitions of the node distance and proximity. Path-length based definitions, such as those used by Palmer et al (2006), Boldi et al (2011), Cohen et al (2003), Wei (2010), Xiao et al (2009), Zhou et al (2009) , are 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 useful when the relatedness can be captured solely based on the properties of the nodes and edges on the shortest path (based on some definition of path-length). Randomwalk based definitions, such as hitting distance (Chen et al, 2008;Mei et al, 2008) and personalized PageRank (PPR) score (Balmin et al, 2004;Chakrabarti, 2007;Jeh and Widom, 2002;Tong et al, 2006a;Tong et al, 2007;Liu et al, 2013;Lofgren et al, 2014;Maehara et al, 2014), of node relatedness, on the other hand, also take into account the density of the edges: intuitively, as in path-length based definitions, a node can be said to be...…”
Section: Introductionmentioning
confidence: 99%