2018
DOI: 10.1007/s00224-018-9874-1
|View full text |Cite
|
Sign up to set email alerts
|

A Logic for Document Spanners

Abstract: Document spanners are a formal framework for information extraction that was introduced by Fagin, Kimelfeld, Reiss, and Vansummeren (PODS 2013, JACM 2015). One of the central models in this framework are core spanners, which formalize the query language AQL that is used in IBM's SystemT. As shown by Freydenberger and Holldack (ICDT 2016, ToCS 2018, there is a connection between core spanners and EC reg , the existential theory of concatenation with regular constraints. The present paper further develops this … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 29 publications
(21 citation statements)
references
References 37 publications
0
19
0
Order By: Relevance
“…Let A be a vset-automaton. Analogously to regex formulas, we denote by Ref ( Example 2.6 suggests that vset-automata can be converted into equivalent functional vset-automata; but Freydenberger [15] showed that although this is possible with standard automata constructions, the resulting blow-up may be exponential in the number of variables.…”
Section: Variable-set Automatamentioning
confidence: 99%
See 1 more Smart Citation
“…Let A be a vset-automaton. Analogously to regex formulas, we denote by Ref ( Example 2.6 suggests that vset-automata can be converted into equivalent functional vset-automata; but Freydenberger [15] showed that although this is possible with standard automata constructions, the resulting blow-up may be exponential in the number of variables.…”
Section: Variable-set Automatamentioning
confidence: 99%
“…A core spanner is defined similarly to a regular spanner, but it also allows the string-equality selection predicate on spans (an example can be found further down in this section). Later developments of the framework include the exploration of the complexity of static-analysis tasks on core spanners [16], incorporating inconsistency repairing in spanners [13], a logical characterization [15], and a uniform model for relational data and IE [28].…”
Section: Introductionmentioning
confidence: 99%
“…We use two main models for representing spanners: regex-formulas and VSet-automata. For both, we follow Freydenberger [10], defining the semantics of these models using so-called ref-words. We also introduce here a class of VSet-automata, dfVSA, that have determinism properties essential to the tractability of problems we study in the paper.…”
Section: Preliminaries On Regular Spannersmentioning
confidence: 99%
“…The first condition follows directly from Lemma 5.3. 10 Thus it only remains to show that the second condition holds. For the sake of contradiction, assume that it does not hold.…”
mentioning
confidence: 98%
“…The study of word equations (or the existential theory of equations over free monoids) is an important topic found at the intersection of algebra and computer science, with significant connections to, e.g., combinatorial group or monoid theory [64,63,21], unification [78,46,47]), and, more recently, data base theory [33,32].…”
Section: Word Equationsmentioning
confidence: 99%