2020
DOI: 10.14778/3380750.3380754
|View full text |Cite
|
Sign up to set email alerts
|

Effective and efficient retrieval of structured entities

Abstract: Structured entities are commonly abstracted, such as from XML, RDF or hidden-web databases. Direct retrieval of various structured entities is highly demanded in data lakes, e.g., given a JSON object, to find the XML entities that denote the same real-world object. Existing approaches on evaluating structured entity similarity emphasize too much the structural inconsistency. Indeed, entities from heterogeneous sources could have very distinct structures, owing to various information representation conventions.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 34 publications
0
5
0
Order By: Relevance
“…After getting the schema graph and the pattern graph, we try to find the most similar sub-structure of the schema graph with the pattern graph, and a target trace family corresponding to this sub-structure. The concept of trace family follows the idea of entity family [2], and is defined as the set of all the traces sharing a common sub-structure, which means all the events happen as the sub-structure defined in the traces. The similarity between any sub-structure S of the schema graph and the pattern graph p is computed as follows:…”
Section: Representing Trace Datamentioning
confidence: 99%
“…After getting the schema graph and the pattern graph, we try to find the most similar sub-structure of the schema graph with the pattern graph, and a target trace family corresponding to this sub-structure. The concept of trace family follows the idea of entity family [2], and is defined as the set of all the traces sharing a common sub-structure, which means all the events happen as the sub-structure defined in the traces. The similarity between any sub-structure S of the schema graph and the pattern graph p is computed as follows:…”
Section: Representing Trace Datamentioning
confidence: 99%
“…Extracting tree/graph structured schema is important in query processing [3] and data repairing [4,5], etc. For example, entities from heterogeneous sources could have very distinct structures, owing to various information representation conventions.…”
Section: Introductionmentioning
confidence: 99%
“…Entity family could be regarded as a schema that defines the structure of the entities. [3] proposed a novel hierarchy smooth function to combine the term scores in different nodes of structured entities in the same entity family. Such an entity family is not only important in learning the parameters for effective score combination but also useful in pruning entities for efficient entity retrieval.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent years, embedding techniques are widely applied in various fields in database community, such as cardinality estimation [2,3], query optimization [4,5], language understanding [6], entity resolution [7,8], document retrieval [9], graph learning [10,11], and advertising recommendation [12], to learn the semantic representations of categorical features. Among these fields, Deep Learning Recommendation Models (DLRMs) are one of the most important applications of embedding techniques: they account for 35% of Amazon's revenue in 2018 [13][14][15], and consume more than 50% training and 80% inference cycles at Meta's data centers in 2020 [16,17].…”
Section: Introduction 1background and Motivationmentioning
confidence: 99%