Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology 2009
DOI: 10.1145/1516360.1516472
|View full text |Cite
|
Sign up to set email alerts
|

Provenance for nested subqueries

Abstract: Data provenance is essential in applications such as scientific computing, curated databases, and data warehouses. Several systems have been developed that provide provenance functionality for the relational data model. These systems support only a subset of SQL, a severe limitation in practice since most of the application domains that benefit from provenance information use complex queries. Such queries typically involve nested subqueries, aggregation and/or user defined functions. Without support for these … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 24 publications
(16 citation statements)
references
References 15 publications
0
16
0
Order By: Relevance
“…This minimizes the main problem of the technique presented in [4], which was the huge number of tuples that the user must consider in order to determine the validity of the result produced by a relation. Previous works deal with the problem of tracking provenance information for query results [9,7], but to the best of our knowledge, none of them treat the case of missing tuples, which is important in our setting. The proposed algorithm looks for particular but common error sources, like tuples missed in the from section or in and conditions (that is, intersect components in our representation).…”
Section: Discussionmentioning
confidence: 99%
“…This minimizes the main problem of the technique presented in [4], which was the huge number of tuples that the user must consider in order to determine the validity of the result produced by a relation. Previous works deal with the problem of tracking provenance information for query results [9,7], but to the best of our knowledge, none of them treat the case of missing tuples, which is important in our setting. The proposed algorithm looks for particular but common error sources, like tuples missed in the from section or in and conditions (that is, intersect components in our representation).…”
Section: Discussionmentioning
confidence: 99%
“…We select 11 out of the 22 TPC-H queries to evaluate optimization of provenance capture for complex queries. The technique [31] we are using supports all TPC-H queries, but instrumentations for nested subqueries have not been implemented in GProM yet.…”
Section: Tpc-h Queriesmentioning
confidence: 99%
“…The closest to our work is the Perm System [81][84] [85][86] that extends the PostgreSQL DBMS and rewrites queries to obtain a provenance query that determines the source data. The reduction rules described in this paper also compute the source data for the coverage rules, but the implementation does not depend on a given DBMS and adds additional information needed to allow the reduction procedures to select a subset of the source data.…”
Section: Testing Database Applicationsmentioning
confidence: 99%