2013
DOI: 10.1016/j.future.2012.05.027
|View full text |Cite
|
Sign up to set email alerts
|

Scalable parallel computing on clouds using Twister4Azure iterative MapReduce

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
31
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 48 publications
(31 citation statements)
references
References 17 publications
0
31
0
Order By: Relevance
“…We will also extend current work to include an allreduce collective that is an alternative approach to Kmeans. The resultant Map-Collective model that captures the full range of traditional MapReduce and MPI features will be evaluated on Azure [22] as well as IaaS/HPC environments. We will integrate Twister with Infiniband RDMA based protocol and compare various communication scenarios.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We will also extend current work to include an allreduce collective that is an alternative approach to Kmeans. The resultant Map-Collective model that captures the full range of traditional MapReduce and MPI features will be evaluated on Azure [22] as well as IaaS/HPC environments. We will integrate Twister with Infiniband RDMA based protocol and compare various communication scenarios.…”
Section: Discussionmentioning
confidence: 99%
“…Spark has "broadcast" and "gather". Our Twister4Azure system [22] supports "allgather" and "allreduce" and in a later paper we will describe the integration of these different collectives into a single system that runs interoperably on HPC clusters (Twister) or PaaS cloud systems (Twister4Azure) changing the implementation to optimize performance for each infrastructure. The same high level collective primitive is used on each platform with different under-the-hood optimizations.…”
Section: Collective Communication and Intermediate Data Handlingmentioning
confidence: 99%
“…Microsoft Daytona 24 proposes an iterative Map-Reduce runtime for Windows Azure to support data analytics and machine learning algorithms. Twister [58] is an enhanced MapReduce runtime with an extended programming model for iterative Map-Reduce computations. Hadoop [20] is the most popular open source implementation of Map-Reduce on top of HDFS, as said in the previous section.…”
Section: Parallel Runtime Environmentsmentioning
confidence: 99%
“…In [16] an integrated task and data placement algorithm for workflows on clouds, based on graph partitioning is derived, with the goal of minimizing data transfers. The approach to use data locality for efficient task scheduling is also widely applied to MapReduce, where various improvements over default Hadoop scheduling are proposed [27]. Bharathi et al [10] analyze the impact of data staging strategies on workflow execution on clouds.…”
Section: Related Workmentioning
confidence: 99%