2016 IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS) 2016
DOI: 10.1109/icpads.2016.0108
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating Spark RDD Operations with Local and Remote GPU Devices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(6 citation statements)
references
References 7 publications
0
6
0
Order By: Relevance
“…The parallel batch processing technique also demonstrates the efficiency of the proposed technique for 8 GB of data, as it results in performance improvements ranging from 1.52š‘„ (Hilbert) to 2.34š‘„ (Montecarlo). The results shown in Figure 8 showcase that the proposed batch processing techniques enable applications to utilize volumes of data that exceed the physical memory capacity of the hardware device; a capability can be extremely beneficial, especially for Java-based Big Data frameworks such as Apache Spark [25,28] and Flink [3,5].…”
Section: Going Beyond the Gpu Memory Capacitymentioning
confidence: 99%
“…The parallel batch processing technique also demonstrates the efficiency of the proposed technique for 8 GB of data, as it results in performance improvements ranging from 1.52š‘„ (Hilbert) to 2.34š‘„ (Montecarlo). The results shown in Figure 8 showcase that the proposed batch processing techniques enable applications to utilize volumes of data that exceed the physical memory capacity of the hardware device; a capability can be extremely beneficial, especially for Java-based Big Data frameworks such as Apache Spark [25,28] and Flink [3,5].…”
Section: Going Beyond the Gpu Memory Capacitymentioning
confidence: 99%
“…In addition to the traditional Spark architecture discussed so far, some work has been done to support additional hardware acceleration. One paper [42] proposed a Spark modification to support the use of GPUs by invoking CUDA kernels for computationally intensive tasks. Several caching methods were investigated to reduce network overhead, and experiments using both local and remote GPUs showed significant speed improvements.…”
Section: Mapreduce Architecture With Sparkmentioning
confidence: 99%
“…We obtained 19 papers 20ā€22,32,75ā€89 . These papers were published in five distinct journals publications, 13 conferences/workshops and one PhD thesis.…”
Section: Related Workmentioning
confidence: 99%
“…Regarding Apache Spark, it is known that the intermediate datasets, the Resilient Distributed Datasets (RDDs), are stored in the distributed memory over the set machines under consideration. To speed up computeā€intensive operations, the work in Reference 82 focuses on GPU devices, and also a modified Spark framework to invoke CUDA kernels for these computeā€intensive operations was deployed. The implementation transformed RDDs into array structures and transferred to the GPU devices, which actually can be remote to the host machine since the number of local GPU devices is limited.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation