DOI: 10.1007/978-3-540-85713-6_18

Evaluating Performance and Quality of XML-Based Similarity Joins

Cited by 8 publications (6 citation statements)
References 10 publications
“…We assume ordered trees where the sibling order matters. Ribeiro and Härder [2008] introduce extended pq-grams. Extended pq-grams consider, in addition to the structural similarity between trees, also the string similarity between leaf nodes.…”
Section: Related Work (mentioning)
confidence: 99%
“…DBLP [6] stores bibliographic data in XML format and includes, among others, authors, titles, and venues of computer science publications. Due to its availability and intuitiveness, the DBLP dataset has been used in many works for experimental purposes, e.g., as a collection of sets [44, 45], as a collection of trees [37, 38, 46], as a large hierarchical document [34, 40], and as a coauthor network graph [42, 49]. In this section, we show the impact of differences in the data preparation process that converts raw DBLP XML data into the desired input format.…”
Section: A Link Is Not Enough (mentioning)
confidence: 99%
“…According to the main author, the other datasets are proprietary. In the case of [45], the raw data (R) was provided by the authors. However, the input data could not be generated due to incomplete preparation instructions.…”
Section: Data Reproducibility In Similarity Joins (mentioning)
confidence: 99%
“…2.1.2 Ordered trees. For ordered trees, we use epq-grams (Ribeiro and Härder, 2008), an extension of the concept of pq-grams (Augsten et al., 2010). Informally, all subtrees of a specific shape, as defined by the values of p and q, are called pq-grams of the corresponding tree.…”
Section: Textual Representation (mentioning)
confidence: 99%
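
The statement above describes pq-grams only informally, so a small sketch may help make the "specific shape" concrete. It covers plain pq-grams, not the extended pq-grams (epq-grams) of Ribeiro and Härder, which additionally compare leaf text. The tree encoding as `(label, [children])` tuples, the `"*"` padding label, and the function names `pq_grams` and `bag_distance` are assumptions made for illustration; they do not come from any of the cited papers' code.

```python
from collections import Counter

NULL = "*"  # assumed padding label for missing ancestors/children


def pq_grams(tree, p=2, q=3):
    """Return the list (bag) of pq-grams of an ordered, labeled tree.

    A tree is a (label, [children]) tuple. Each pq-gram is a tuple of
    p + q labels: an anchor node with its p-1 ancestors (the stem),
    followed by q consecutive children of the anchor (the base).
    """
    grams = []

    def visit(node, stem):
        label, children = node
        stem = stem[1:] + (label,)      # shift the anchor into the stem
        base = (NULL,) * q
        if not children:
            grams.append(stem + base)   # a leaf yields one all-null base
            return
        for child in children:
            base = base[1:] + (child[0],)   # slide the window over children
            grams.append(stem + base)
            visit(child, stem)
        for _ in range(q - 1):              # pad on the right of the last child
            base = base[1:] + (NULL,)
            grams.append(stem + base)

    visit(tree, (NULL,) * p)
    return grams


def bag_distance(t1, t2, p=2, q=3):
    """Normalized bag-overlap distance between two pq-gram profiles."""
    c1, c2 = Counter(pq_grams(t1, p, q)), Counter(pq_grams(t2, p, q))
    shared = sum((c1 & c2).values())
    total = sum(c1.values()) + sum(c2.values())
    return 1 - 2 * shared / total


if __name__ == "__main__":
    t = ("a", [("b", []), ("c", [])])
    for gram in pq_grams(t):
        print(gram)
    print(bag_distance(t, ("a", [("c", []), ("b", [])])))
```

Because the base is a sliding window over siblings, swapping two children changes the profile, which is why this representation suits ordered trees where sibling order matters.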