Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020
DOI: 10.1145/3318464.3383132
|View full text |Cite
|
Sign up to set email alerts
|

Optimal Join Algorithms Meet Top-k

Abstract: Top-k queries have been studied intensively in the database community and they are an important means to reduce query cost when only the "best" or "most interesting" results are needed instead of the full output. While some optimality results exist, e.g., the famous Threshold Algorithm, they hold only in a fairly limited model of computation that does not account for the cost incurred by large intermediate results and hence is not aligned with typical database-optimizer cost models. On the other hand, the idea… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 86 publications
0
7
0
Order By: Relevance
“…Since this algorithm assumes restricted key-to-key joins, later works generalize it to more general joins [37,46,58,84], which may even involve theta-joins [60]. However, they retain the middleware cost model, hence do not provide any non-trivial guarantees when the actual join cost is taken into account [79]. Ilyas et al [47] survey some of these approaches, along with some related ones such as building top-𝑘 indexes [22,77] or views [25,43].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Since this algorithm assumes restricted key-to-key joins, later works generalize it to more general joins [37,46,58,84], which may even involve theta-joins [60]. However, they retain the middleware cost model, hence do not provide any non-trivial guarantees when the actual join cost is taken into account [79]. Ilyas et al [47] survey some of these approaches, along with some related ones such as building top-𝑘 indexes [22,77] or views [25,43].…”
Section: Related Workmentioning
confidence: 99%
“…Ranked enumeration. Requiring join answers to be returned in a specific order gives rise to ranked enumeration [29,31,78,79,85,86], where the main goal is to quickly return the most important answers without having to materialize and sort the entire output. Ranked enumeration generalizes the well-known top-𝑘 paradigm, removing the requirement of having to specify 𝑘 in advance.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A ranked enumeration-or "any-𝑘"-algorithm needs to perform well for all possible values of 𝑘. Any-𝑘 algorithms can conceptually be seen as a fusion of top-𝑘 [77] and anytime algorithms [131] that gradually improve their result over time. Secondly and more importantly, algorithms for top-𝑘 joins, including the famous Threshold Algorithm [58], were developed for a middleware-centric cost model that only accounts for accesses to external data sources, and not for the cost of potentially huge intermediate results [121]. In contrast, our goals are strict guarantees for the time and memory for every 𝑘 th ranked answer in the standard RAM-model of computation.…”
Section: Introductionmentioning
confidence: 99%
“…: spanning trees [26], s-t paths [28,35], s-t cuts [47], and (weighted) perfect matchings [38] (See also [20], for more information). Among others, enumerating multiple solutions with some specific order has attracted special interests in the field of database research, which they call ranked enumeration [18,41,46]. When we wish to enumerate K solutions in the non-decreasing order of their cardinality, this is equivalent to K-best enumeration.…”
Section: Introductionmentioning
confidence: 99%