2015 IEEE 31st International Conference on Data Engineering 2015
DOI: 10.1109/icde.2015.7113334

Scalable SPARQL querying using path partitioning

Abstract: The emerging need for conducting complex analysis over big RDF datasets calls for scale-out solutions that can harness a computing cluster to process them. Queries over RDF data often involve complex self-joins, which are very expensive to run if the data are not carefully partitioned across the cluster, because distributed joins over massive amounts of data then become necessary. Existing RDF data partitioning methods can nicely localize simple queries but still need to resort to expensive d…

Citations: cited by 36 publications (21 citation statements)
References: 20 publications (21 reference statements)
“…In Sedge [142] a reimplementation of Pregel [84] is used to process distributed queries. A more common approach is, to use Hadoop to process only distributed joins as done in [63], [39], [143], SPA [81], VB-Partitioner [79], SHAPE [80] and [137].…”
Section: Distributed RDF Stores (mentioning)
confidence: 99%
“…Nevertheless, SPA suffers from a large amount of data duplication problem. Wu et al [13] introduce a path strategy-based method, which decomposes the RDF graph into the end-to-end paths. Then such paths are considered as the finest partitioning elements to realize the partitioning.…”
Section: Related Work (mentioning)
confidence: 99%
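
The path-based idea quoted above (decompose the RDF graph into end-to-end paths, then treat each path as the finest unit of partitioning) can be illustrated with a minimal sketch. It assumes an acyclic graph given as (subject, predicate, object) triples, and it assigns paths to partitions round-robin purely for illustration. The function names `end_to_end_paths` and `assign_paths` are hypothetical, and the placement step is not the method of the cited paper.

```python
from collections import defaultdict


def end_to_end_paths(triples):
    """Enumerate paths from source vertices (no incoming edge) to sink
    vertices (no outgoing edge). Assumes the graph is acyclic."""
    out_edges = defaultdict(list)
    has_incoming = set()
    vertices = set()
    for s, p, o in triples:
        out_edges[s].append((p, o))
        has_incoming.add(o)
        vertices.update((s, o))

    sources = sorted(v for v in vertices if v not in has_incoming)
    paths = []

    def walk(vertex, path):
        edges = out_edges.get(vertex, [])
        if not edges:  # reached a sink: the path is complete
            paths.append(tuple(path))
            return
        for p, o in edges:
            walk(o, path + [(vertex, p, o)])

    for src in sources:
        walk(src, [])
    return paths


def assign_paths(paths, k):
    """Illustrative placement only: path i goes to partition i % k.
    A triple that lies on several paths may be copied to several partitions."""
    partitions = [set() for _ in range(k)]
    for i, path in enumerate(paths):
        partitions[i % k].update(path)
    return partitions


triples = [
    ("a", "p", "b"),
    ("b", "q", "c"),
    ("a", "r", "d"),
    ("e", "s", "b"),
]
paths = end_to_end_paths(triples)
parts = assign_paths(paths, 2)
```

Because whole paths stay together, any query whose pattern follows a single path can be answered inside one partition; the cost in this naive version is that shared triples such as `("b", "q", "c")` appear on more than one path.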
“…Therefore, RDF data partitioning is a vital technique for high efficient cloud storage. Recently, many RDF data partitioning methods have been proposed [4, 8–13]. Most of them are based on graph partitioning that usually distributes closely related nodes to the same storage node.…”
Section: Introduction (mentioning)
confidence: 99%