2012
DOI: 10.1007/978-3-642-29035-0_12
|View full text |Cite
|
Sign up to set email alerts
|

Halt or Continue: Estimating Progress of Queries in the Cloud

Abstract: Abstract. With cloud-based data management gaining more ground by day, the problem of estimating the progress of MapReduce queries in the cloud is of paramount importance. This problem is challenging to solve for two reasons: i) cloud is typically a large-scale heterogeneous environment, which requires progress estimation to tailor to non-uniform hardware characteristics, and ii) cloud is often built with cheap and commodity hardware that is prone to fail, so our estimation should be able to dynamically adjust… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2013
2013

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 11 publications
0
2
0
Order By: Relevance
“…The secondphase sampler allows reducers perform sampling from mappers' output during shuffle phase of join_job, and produces distributed stratified random samples from multiple tables respectively. You can find more details in our paper [8].…”
Section: Data Managermentioning
confidence: 99%
See 1 more Smart Citation
“…The secondphase sampler allows reducers perform sampling from mappers' output during shuffle phase of join_job, and produces distributed stratified random samples from multiple tables respectively. You can find more details in our paper [8].…”
Section: Data Managermentioning
confidence: 99%
“…Then the component computes the most critical path of the PERT network, which can represent the execution of the whole query. Our paper [8] has the detailed discussion about how to make an estimate of the query progress according to the critical path.…”
Section: Online Aggregation Executormentioning
confidence: 99%