2017
DOI: 10.1007/978-3-319-72401-0_10
|View full text |Cite
|
Sign up to set email alerts
|

Performance Assurance Model for Applications on SPARK Platform

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 21 publications
(18 citation statements)
references
References 9 publications
0
18
0
Order By: Relevance
“…A Spark job execution consists of multiple parallel or sequential stages and each stage is composed of multiple tasks. The performance of each task is influenced by several factors at each task phase [46,103].…”
Section: Batch Processing Systemsmentioning
confidence: 99%
See 2 more Smart Citations
“…A Spark job execution consists of multiple parallel or sequential stages and each stage is composed of multiple tasks. The performance of each task is influenced by several factors at each task phase [46,103].…”
Section: Batch Processing Systemsmentioning
confidence: 99%
“…However, system performance is directly linked to a vast array of configuration parameters, which control various aspects of system execution, ranging from low-level memory settings and thread counts to higher-level decisions such as 43:2 H. Herodotou et al resource management and load balancing [88]. Improper settings of configuration parameters are shown to have detrimental effects on the overall system performance and stability [39,61,103].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For establishing the task execution cost model in Spark, we improve the method proposed by Singhal and Singh [15] and add the cost generated by sorting operation. When calculating the Stage cost, reading input data, merging and sorting intermediate data, and writing output data are considered, that is…”
Section: Cost Modelmentioning
confidence: 99%
“…1) Multi-Objective Optimization: One approach to achieve multi-objective optimization is to build cost models for each of the heterogeneous computing units. A cost model could be a performance prediction model, also known as surrogate models, to estimate performance efficiency of workload on different platforms for scheduling, e.g., Luo et al [64] presents power-performance characteristic of analytic queries on database systems, and Singhal et al [65], [66], [67] and [68] present similar cost models for relational database, Hive, Hadoop, and Spark platforms deployed on conventional CPU systems. However, none of these include design space for hardware accelerators.…”
Section: Optimization Challengesmentioning
confidence: 99%