2017 IEEE 33rd International Conference on Data Engineering (ICDE) 2017
DOI: 10.1109/icde.2017.38
|View full text |Cite
|
Sign up to set email alerts
|

gMark: Schema-Driven Generation of Graphs and Queries

Abstract: Abstract-Massive graph data sets are pervasive in contemporary application domains. Hence, graph database systems are becoming increasingly important. In the experimental study of these systems, it is vital that the research community has shared solutions for the generation of database instances and query workloads having predictable and controllable properties. In this paper, we present the design and engineering principles of gMark, a domain-and query language-independent graph instance and query workload ge… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
56
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 30 publications
(57 citation statements)
references
References 17 publications
0
56
0
1
Order By: Relevance
“…The results of such query is small and constant, since the number of countries and languages is constant; they do not vary even when the dataset size is exponentially increased, and thus such results are expected for some of the generated queries. Actually these kinds of queries are intentionally generated by gMark for benchmarking purposes (check [3]). …”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The results of such query is small and constant, since the number of countries and languages is constant; they do not vary even when the dataset size is exponentially increased, and thus such results are expected for some of the generated queries. Actually these kinds of queries are intentionally generated by gMark for benchmarking purposes (check [3]). …”
Section: Resultsmentioning
confidence: 99%
“…We generate data according to the Social Network Benchmark (SNB) schema of the Linked Data Benchmark Council (LDBC) [5]. The data and the workload tests are generated by gMark [3]. gMark is a graph and query workload generator based on an input schema.…”
Section: Discussionmentioning
confidence: 99%
“…The optimization issues around RQ evaluation have never been formally addressed in the database literature and, even for the simple UC2RPQ class, current graph database engines perform poorly (see (Bagan et al 2017)). Even though our goal is not to provide a RQ optimizer, we touch base with some simple optimizations.…”
Section: Related Workmentioning
confidence: 99%
“…These graphshenceforth denoted by G -are diverse in terms of their density (increasing from SNB to WD) and of their in-degree and out-degree distributions (Bagan et al 2017). They represent two extreme cases to be considered in benchmarking graph database engines.…”
Section: Experimental Analysismentioning
confidence: 99%
See 1 more Smart Citation