Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data 2014
DOI: 10.1145/2588555.2588579
|View full text |Cite
|
Sign up to set email alerts
|

The analytical bootstrap

Abstract: Sampling is one of the most commonly used techniques in Approximate Query Processing (AQP)-an area of research that is now made more critical by the need for timely and cost-effective analytics over "Big Data". Assessing the quality (i.e., estimating the error) of approximate answers is essential for meaningful AQP, and the two main approaches used in the past to address this problem are based on either (i) analytic error quantification or (ii) the bootstrap method. The first approach is extremely efficient bu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 74 publications
(4 citation statements)
references
References 40 publications
0
4
0
Order By: Relevance
“…For example, the confidence interval [3.5, 5.5] with the confidence level 95% means that we have 95% confidence to ensure that the accurate result will fall into the interval [3.5, 5.5]. In our progressive execution model, the expected performance is that the width of Currently, there are three widely-used methods in AQP system to do error estimation: closed-form estimates based on either the central limit theorem (CLT) [26], large deviation inequalities such as Hoeffding bounds [12], and the bootstrap [8,30]. As discussed before, the Zip-F law of natural languages motivated us to use bootstrap techniques in our method.…”
Section: Quantifying Results Errormentioning
confidence: 99%
See 2 more Smart Citations
“…For example, the confidence interval [3.5, 5.5] with the confidence level 95% means that we have 95% confidence to ensure that the accurate result will fall into the interval [3.5, 5.5]. In our progressive execution model, the expected performance is that the width of Currently, there are three widely-used methods in AQP system to do error estimation: closed-form estimates based on either the central limit theorem (CLT) [26], large deviation inequalities such as Hoeffding bounds [12], and the bootstrap [8,30]. As discussed before, the Zip-F law of natural languages motivated us to use bootstrap techniques in our method.…”
Section: Quantifying Results Errormentioning
confidence: 99%
“…These techniques either compute an error bound much wider than the real which lost guidance to users or require data to follow the normal distribution while it's not suitable for natural languages. Another estimation technique, bootstrap [23,30], can be applied to arbitrary queries. However, before bootstrap techniques have poor performance to apply in our progressive execution model due to lots of duplicate computation.…”
Section: Error Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…The analytical bootstrap method [25], reduces the overhead of the bootstrap error estimation, removing the need for re-sampling.…”
Section: Analyticalmentioning
confidence: 99%