2012
DOI: 10.1007/978-3-642-33078-0_30
|View full text |Cite
|
Sign up to set email alerts
|

On Modelling and Prediction of Total CPU Usage for Applications in MapReduce Environments

Abstract: recently, businesses have started using MapReduce as a popular computation framework for processing large amount of data, such as spam detection, and different data mining tasks, in both public and private clouds. Two of the challenging questions in such environments are (1) choosing suitable values for MapReduce configuration parameters -e.g., number of mappers, number of reducers, and DFS block size-, and (2) predicting the amount of resources that a user should lease from the service provider. Currently, th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 19 publications
(15 citation statements)
references
References 15 publications
0
15
0
Order By: Relevance
“…The number of servers per PN varied between 10 and 60 for all evaluated scenarios. Note that we used processor cycles in GHz as a measure of the total processing capability of a node [30]. Table II summarizes the input parameters to the model.…”
Section: A Results Of Veracity Scenariosmentioning
confidence: 99%
“…The number of servers per PN varied between 10 and 60 for all evaluated scenarios. Note that we used processor cycles in GHz as a measure of the total processing capability of a node [30]. Table II summarizes the input parameters to the model.…”
Section: A Results Of Veracity Scenariosmentioning
confidence: 99%
“…Also, Rahman et al [56] and Tiwari et al [65] studied the effect of compiler parameters on both performance and power/energy consumption for scientific computing. A lot of modeling and tuning effort has recently been devoted to the specific application of MapReduce [5,15,30,49,57,58].…”
Section: Related Workmentioning
confidence: 99%
“…6) Physical link capacity constraints (27) Constraint (27) ensures that the summation of the wavelengths in a virtual link traversing a physical link does not exceed the capacity of the fibers in the physical link. 7) Wavelengths capacity constraint (28) Constraint (28) ensures that the summation of the wavelengths traversing a physical link does not exceed the total number of wavelengths in that link. 8) Number of aggregation ports utilized by regular traffic constraint (29) 9) Number of aggregation ports utilized by CHT traffic constraint (30) 10) Number of aggregation ports utilized by INF traffic constraint (31) Constraints (29)(30)(31) insight into what minimizes power consumption in our proposed progressive processing big data networks.…”
Section: Power Consumption Of Optical Switch Installed At Node I N (Wmentioning
confidence: 99%
“…The storage capacity of the PNs was assigned to be large enough. Note that we used processor cycles in GHz as a measure of the total processing capability of a node [28]. Table III summarizes the input parameters.…”
Section: Complexity Analysismentioning
confidence: 99%