2014
DOI: 10.3182/20140824-6-za-1003.01319
|View full text |Cite
|
Sign up to set email alerts
|

A Control Approach for Performance of Big Data Systems

Abstract: International audienceWe are at the dawn of a huge data explosion therefore companies have fast growing amounts of data to process. For this purpose Google developed MapReduce, a parallel programming paradigm which is slowly becoming the de facto tool for Big Data analytics. Although to some extent its use is already wide-spread in the industry, ensuring performance constraints for such a complex system poses great challenges and its management requires a high level of expertise. This paper answers these chall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
26
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(26 citation statements)
references
References 22 publications
0
26
0
Order By: Relevance
“…However, no such details are provided in the case of [72]. In contrast, the authors of [43,83] used a PI feedback controller for big data application. They focused to adjusts the computing nodes of a map reduce cluster to guarantee the desired service time of map reduce jobs.…”
Section: Classicmentioning
confidence: 99%
See 3 more Smart Citations
“…However, no such details are provided in the case of [72]. In contrast, the authors of [43,83] used a PI feedback controller for big data application. They focused to adjusts the computing nodes of a map reduce cluster to guarantee the desired service time of map reduce jobs.…”
Section: Classicmentioning
confidence: 99%
“…A solution can be centralized at one of the following three levels, i.e., Application, Node or Cloud. The solutions that focused on horizontal elasticity from the SP perspective are centralized at Application level (e.g., [43,44,48,50,61,72]), whereas the control solutions that cater CPs perspective runs centrally at Cloud level, where they could be responsible for different applications (e.g., [33,48,86]). The application level control solutions can be executed outside of the cloud environment and therefore they can control interactions with multiple control.…”
Section: Architecturementioning
confidence: 99%
See 2 more Smart Citations
“…Considering the scale of these workloads and the uncertainties of their execution environments, this can quickly lead to a waste of resources since the static configuration does not adapt to the current runtime condition. Optimizing Hadoop execution has therefore attracted a lot of research attention resulting in a number of different approaches in particular in the domain of selfadaptive software systems [1,2,4,5,6,7,8,9]. However, this research effort is often hindered by the accidental complexity of setting representative Hadoop deployment in different distributed environments and comparing different approaches.…”
Section: Introductionmentioning
confidence: 99%