2013 42nd International Conference on Parallel Processing 2013
DOI: 10.1109/icpp.2013.78
|View full text |Cite
|
Sign up to set email alerts
|

High-Performance Design of Hadoop RPC with RDMA over InfiniBand

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
36
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
3
2

Relationship

4
5

Authors

Journals

citations
Cited by 102 publications
(36 citation statements)
references
References 8 publications
0
36
0
Order By: Relevance
“…With these design changes, MapReduce job execution can be greatly accelerated by leveraging the benefits of high-performance interconnects. The high performance design of Hadoop (Hadoop-RDMA) [3] also shows significant performance benefits achievable through RDMA-capable interconnects using enhanced designs of various components (HDFS [6], MapReduce [12], RPC [9]) inside Hadoop. On the other hand, much performance modeling research [4,8,2,1,13,5,7,10,11] has been carried out to deeply analyze the default MapReduce framework.…”
Section: Motivationmentioning
confidence: 99%
“…With these design changes, MapReduce job execution can be greatly accelerated by leveraging the benefits of high-performance interconnects. The high performance design of Hadoop (Hadoop-RDMA) [3] also shows significant performance benefits achievable through RDMA-capable interconnects using enhanced designs of various components (HDFS [6], MapReduce [12], RPC [9]) inside Hadoop. On the other hand, much performance modeling research [4,8,2,1,13,5,7,10,11] has been carried out to deeply analyze the default MapReduce framework.…”
Section: Motivationmentioning
confidence: 99%
“…The data shuffling phase of the MapReduce job is communication intensive, and can immensely benefit from these enhancements. Recent studies [25,26,33,35,46] have shed light on the possibilities of such performance gains for different Big Data middleware on HPC clusters, including MapReduce [46,47,57]. To evaluate its potential, we require benchmarks that can give us insights into the factors that affect MapReduce as an independent component.…”
Section: Introductionmentioning
confidence: 99%
“…Recent research[44][45][46][47] has revealed significant performance benefits can be obtained by leveraging the benefits of interconnects such as InfiniBand for faster execution of MapReduce jobs, along with enhanced features including RDMA-based shuffle, in-memory and pipelined merge during reduce, and pre-fetching and caching of map outputs. This RDMA-enhanced hybrid design known as HOMR[47] is publicly available as a part of the RDMA for Apache Hadoop project (RDMA-Apache-Hadoop-2.x)[19,23,25,33,44,47].In this section, we demonstrate the advantages of utilizing our stand-alone Hadoop MapReduce micro-benchmark suite by evaluating the HOMR against the default Hadoop MapReduce designs, with different storage architectures. We employ the RDMA for Apache Hadoop 2.x v0.9.7 package, which is based on Apache Hadoop 2.6.0., for our experiments.…”
mentioning
confidence: 98%
“…The trend of converging big data and high performance computing (HPC) is emerging [6][7][8][9][10] . As a specific example of this trend, DataMPI [11][12] is proposed, which aims at extending MPI by a key-value pair based communication operations to provide high performance communication in large-scale data computing scenario.…”
Section: Introductionmentioning
confidence: 99%