Proceedings of the Sixth International Conference on Emerging Databases: Technologies, Applications, and Theory 2016
DOI: 10.1145/3007818.3007819
|View full text |Cite
|
Sign up to set email alerts
|

A comparative analysis of iterative MapReduce systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 5 publications
0
2
0
Order By: Relevance
“…Kang and Lee [47] examined five resource management frameworks including Apache Hadoop and Spark with respect to performance overheads (disk input/output, network communication, scheduling, etc.) in supporting iterative computation.…”
Section: Machine Learning and Iterative Tasks Supportmentioning
confidence: 99%
“…Kang and Lee [47] examined five resource management frameworks including Apache Hadoop and Spark with respect to performance overheads (disk input/output, network communication, scheduling, etc.) in supporting iterative computation.…”
Section: Machine Learning and Iterative Tasks Supportmentioning
confidence: 99%
“…Multiple Spark jobs initiated by different threads may run concurrently within each Spark application which gets its own executor processes. Spark runs long-running processes and threads, which stay up through the entire duration of the application and execute tasks in multiple threads, to avoid the overhead of repeatedly invoking tasks [9,10]. Allocation of executor resources on the cluster can be controlled by Spark YARN client using the --num-executors option, which overrides Spark's built-in DRA mechanism [18].…”
Section: Spark Architecture and Resilient Distributed Dataset (Rdd)mentioning
confidence: 99%