2017 IEEE 10th International Conference on Cloud Computing (CLOUD) 2017
DOI: 10.1109/cloud.2017.55
|View full text |Cite
|
Sign up to set email alerts
|

Scalable Performance Tuning of Hadoop MapReduce: A Noisy Gradient Approach

Abstract: Hadoop MapReduce is a framework for distributed storage and processing of large datasets that is quite popular in big data analytics. It has various configuration parameters (knobs) which play an important role in deciding the performance i.e., the execution time of a given big data processing job. Default values of these parameters do not always result in good performance and hence it is important to tune them. However, there is inherent difficulty in tuning the parameters due to two important reasons -firstl… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 17 publications
0
4
0
1
Order By: Relevance
“…The goal of the SPSA algorithm in this article is to maximize the objective function f (θ) by updating the configuration parameters θ of the Elasticsearch over time. In order to optimize the configuration we resort to a modified version of the Simultaneous Perturbation Stochastic Approximation algorithm (Kumar et al, 2017 ). The implemented algorithm in this project is presented in Algorithm 1 .…”
Section: Methodsmentioning
confidence: 99%
“…The goal of the SPSA algorithm in this article is to maximize the objective function f (θ) by updating the configuration parameters θ of the Elasticsearch over time. In order to optimize the configuration we resort to a modified version of the Simultaneous Perturbation Stochastic Approximation algorithm (Kumar et al, 2017 ). The implemented algorithm in this project is presented in Algorithm 1 .…”
Section: Methodsmentioning
confidence: 99%
“…A recent related study of the search-based method is BestConfig [15], which uses the divide-and-diverge sampling method and the recursive bound-and-search algorithm to automatically tune configurations with limited resources for general systems. Kumar et al [16] propose a noise-gradient algorithm, called simultaneous perturbation stochastic approximation (SPSA), to optimize Hadoop's performance. Moreover, several studies explore the search-based method, such as ACTS [17] and MRONLINE [18].…”
Section: Related Workmentioning
confidence: 99%
“…Os autores Kumar et al (2016) usaram um algoritmo Noisy Gradient para otimizar valores para os parâmetros de con guração e alcançaram melhorias no tempo de execução próximas a 66% quando comparados com a con guração padrão do Hadoop.…”
Section: Trabalhos Relacionadosunclassified