2015 IEEE International Conference on Cloud Engineering 2015
DOI: 10.1109/ic2e.2015.35
|View full text |Cite
|
Sign up to set email alerts
|

Harp: Collective Communication on Hadoop

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 14 publications
0
9
0
Order By: Relevance
“…References in current paper give more details of different applications of this concept. We have applied HPC-ABDS to design high performance run time [1] [12], understand the mapping of applications into the software stack [2], design middleware for data-intensive analytics [3] [13], and to delineate a systematic big data benchmark approach [8] [9]. In this paper, we have aimed at an overall discussion of all layers which can be used in many applications of HPC-ABDS and in fact in studies of just the Commodity big data stack.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…References in current paper give more details of different applications of this concept. We have applied HPC-ABDS to design high performance run time [1] [12], understand the mapping of applications into the software stack [2], design middleware for data-intensive analytics [3] [13], and to delineate a systematic big data benchmark approach [8] [9]. In this paper, we have aimed at an overall discussion of all layers which can be used in many applications of HPC-ABDS and in fact in studies of just the Commodity big data stack.…”
Section: Discussionmentioning
confidence: 99%
“…Note recent developments at the programming layer like Apache Hive and Drill, which offer high-layer access models like SQL implemented on scalable NoSQL data systems. The communication layer includes Publish-subscribe technology used in many approaches to streaming data as well the HPC communication technologies (MPI) which are much faster than most default Apache approaches but can be added [12] to some systems like Hadoop whose modern version is modular and allows plug-ins for HPC stalwarts like MPI and sophisticated load balancers. The programming layer includes both the classic batch processing typified by Hadoop and streaming by Storm.…”
Section: Introductionmentioning
confidence: 99%
“…It contains 1 million words, each with 10 thousand topics, resulting in a total of 10 billion parameters. The [20]. One is "rtt", which follows Computation Model B (uses "rotateGlobal") in Parallelism Form I and Computation Model D in Form II.…”
Section: Methodsmentioning
confidence: 99%
“…HPC with MPI suggests that one could integrate categories 11.3 and 11.4 into a single environment. This approach is illustrated by the Harp plug-in for Hadoop which supports both models [27]. We recently added the map-streaming architecture of Table 11.5 and Figure 2b); recall that Table 2 listed 41 streaming applications in the 51 use cases.…”
Section: Hardware and Software Archi-tecture Issues 41 Six Importantmentioning
confidence: 99%
“…However important data analytics involve full matrix algorithms. For example, recent papers [27,29,30] on a new Multidimensional Scaling method use conjugate gradient solvers with full matrices as opposed to the new sparse conjugate gradient benchmark HPCG being developed for supercomputer (Top500) evaluations [31].…”
Section: Comparison Between Data Intensive and Simulation Problemsmentioning
confidence: 99%