2013
DOI: 10.14778/2536222.2536224
|View full text |Cite
|
Sign up to set email alerts
|

Optimization strategies for A/B testing on HADOOP

Abstract: In this work, we present a set of techniques that considerably improve the performance of executing concurrent MapReduce jobs. Our proposed solution relies on proper resource allocation for concurrent Hive jobs based on data dependency, inter-query optimization and modeling of Hadoop cluster load. To the best of our knowledge, this is the first work towards Hive/MapReduce job optimization which takes Hadoop cluster load into consideration.We perform an experimental study that demonstrates 233% reduction in exe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 20 publications
(19 reference statements)
0
1
0
Order By: Relevance
“…Conclusion: Solr's search performance far exceeds that of traditional relational databases, and Solr has proven to be highly efficient. Literature [10] proposed a series of methods to optimize concurrent MapReduce. Optimize Hadoop group load modeling based on datadependent dependencies and allocate appropriate resources for concurrent tasks.…”
Section: Introductionmentioning
confidence: 99%
“…Conclusion: Solr's search performance far exceeds that of traditional relational databases, and Solr has proven to be highly efficient. Literature [10] proposed a series of methods to optimize concurrent MapReduce. Optimize Hadoop group load modeling based on datadependent dependencies and allocate appropriate resources for concurrent tasks.…”
Section: Introductionmentioning
confidence: 99%