Proceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems 2020
DOI: 10.1145/3375395.3387970
|View full text |Cite
|
Sign up to set email alerts
|

Coping with Incomplete Data: Recent Advances

Abstract: Handling incomplete data in a correct manner is a notoriously hard problem in databases. Theoretical approaches rely on the computationally hard notion of certain answers, while practical solutions rely on ad hoc query evaluation techniques based on threevalued logic. Can we find a middle ground, and produce correct answers efficiently? The paper surveys results of the last few years motivated by this question. We reexamine the notion of certainty itself, and show that it is much more varied than previously th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
16
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(17 citation statements)
references
References 66 publications
1
16
0
Order By: Relevance
“…One way of resolving this is to assume that conditions involving NULL evaluate to false rather than unknown. The idea in itself is not new, it was present in old query languages such as Quel, and was recently studied in connection with various proposals on using 2-valued logic in place of SQL's 3VL, see [23,24]. In such a logic, the above MANDATORY condition would not hold for a person whose passport expiry is NULL, thus fulfilling our intuition about these constraints.…”
Section: Null Valuesmentioning
confidence: 98%
“…One way of resolving this is to assume that conditions involving NULL evaluate to false rather than unknown. The idea in itself is not new, it was present in old query languages such as Quel, and was recently studied in connection with various proposals on using 2-valued logic in place of SQL's 3VL, see [23,24]. In such a logic, the above MANDATORY condition would not hold for a person whose passport expiry is NULL, thus fulfilling our intuition about these constraints.…”
Section: Null Valuesmentioning
confidence: 98%
“…This corresponds to the open-world semantics of incompleteness. Therefore, we assume that the the target domain of queries always uses the OWA semantics, while the source can use either the CWA or OWA semantics; see (Console et al 2020).…”
Section: Relational Databasesmentioning
confidence: 99%
“…All these notions are traditionally tested against a single setting: first-order (or closely related) queries over relational databases, interpreted under set semantics. While understanding certainty and its computational properties in this setting was very useful (Console et al 2020), it nonetheless falls short of what one needs to deal with in realistic everyday queries, like those written in SQL.…”
Section: Introductionmentioning
confidence: 99%
“…Analyzing uncertain data without accounting for its uncertainty can create hard to trace errors with severe real world implications. Incomplete database techniques [23] have emerged as a principled way to model and manage uncertainty in data 1 . An incomplete database models uncertainty by encoding a set of possible worlds, each of which is one possible state of the real world.…”
Section: Introductionmentioning
confidence: 99%
“…Many computational problems are intractable over incomplete databases. Even approximations (e.g., [28,31,46]) are often still not efficient enough, are insufficiently expressive, or exclude useful answers [23,26]. Thus, typical database users resort to a cruder, but more practical alternative: resolving uncertainty using heuristics and then treating the result as a deterministic database [67].…”
Section: Introductionmentioning
confidence: 99%