2013
DOI: 10.14778/2556549.2556571
|View full text |Cite
|
Sign up to set email alerts
|

Scaling queries over big RDF graphs with semantic hash partitioning

Abstract: Massive volumes of big RDF data are growing beyond the performance capacity of conventional RDF data management systems operating on a single node. Applications using large RDF data demand efficient data partitioning solutions for supporting RDF data access on a cluster of compute nodes. In this paper we present a novel semantic hash partitioning approach and implement a Semantic HAsh Partitioning-Enabled distributed RDF data management system, called Shape. This paper makes three original contributions. First… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
118
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
4
3
3

Relationship

2
8

Authors

Journals

citations
Cited by 110 publications
(119 citation statements)
references
References 15 publications
(21 reference statements)
1
118
0
Order By: Relevance
“…Most existing works are based on scanning the data a-priori and either saving new pieces of information about it, or providing alternative data representations. The works in [7,9,13,14,16] are based on techniques that mainly focus on join optimizations by indexing the data. These works do not consider structured data and data typing.…”
Section: Related Workmentioning
confidence: 99%
“…Most existing works are based on scanning the data a-priori and either saving new pieces of information about it, or providing alternative data representations. The works in [7,9,13,14,16] are based on techniques that mainly focus on join optimizations by indexing the data. These works do not consider structured data and data typing.…”
Section: Related Workmentioning
confidence: 99%
“…Lee and Liu present a novel semantic hash approach that utilizes access locality to partition big graphs across multiple computing nodes by maximizing the intrapartition processing capability and minimizing the interpartition communication cost [22]. Huang et al use a graph partitioning algorithm instead of simple hash partitioning by source vertex, destination vertex, and labeled or unlabeled edge [23].…”
Section: Related Workmentioning
confidence: 99%
“…And one more interesting thing is that the key range generated by YCSB is uniform, when turn to highly skewed key range distribution, a more carefully PreSplit design is critical to achieve load balance. In HConfig system, we allow external data partitioning algorithms such as [32] to be plugged into the PreSplit policy.…”
Section: Multi-databases With Variable Blocksmentioning
confidence: 99%