2015 IEEE 31st International Conference on Data Engineering 2015
DOI: 10.1109/icde.2015.7113294
|View full text |Cite
|
Sign up to set email alerts
|

Smooth Scan: Statistics-oblivious access paths

Abstract: Abstract-Query optimizers depend heavily on statistics representing column distributions to create efficient query plans. In many cases, though, statistics are outdated or non-existent, and the process of refreshing statistics is very expensive, especially for ad-hoc workloads on ever bigger data. This results in suboptimal plans that severely hurt performance. The main problem is that any decision, once made by the optimizer, is fixed throughout the execution of a query. In particular, each logical operator t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
8
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 29 publications
(34 reference statements)
1
8
0
Order By: Relevance
“…Each query consists of one select-project-join block 4 . The join graph of the query is shown in Figure 2.…”
Section: The Job Queriesmentioning
confidence: 99%
See 2 more Smart Citations
“…Each query consists of one select-project-join block 4 . The join graph of the query is shown in Figure 2.…”
Section: The Job Queriesmentioning
confidence: 99%
“…The query runtime heavily depends on how the system's optimizer uses the estimates and how much trust it puts into these numbers. A sophisticated engine may employ adaptive operators (e.g., [4,8]) and thus mitigate the impact of misestimations. The results do, however, demonstrate that the state-of-the-art in cardinality estimation is far from perfect.…”
Section: Estimates For Joinsmentioning
confidence: 99%
See 1 more Smart Citation
“…On the contrary, our join processing does not need to rely on extra statistics as in zone-maps. Adaptivity during run-time regarding the decision of scanning a base relation or use a secondary index has been studied in [10,11] for disk-based systems.…”
Section: Rdf and Sparqlmentioning
confidence: 99%
“…F1 Query eliminates both of these cliffs from its implementation of sorting and aggregation. Successful examples of cliff avoidance or removal include SmoothScan [16] and dynamic destaging in hash joins [52]. Dynamic re-optimization would introduce a huge cliff if a single row "too many" will stop execution and re-start the compile-time optimizer.…”
Section: Robust Performancementioning
confidence: 99%