2011 IEEE 17th International Conference on Parallel and Distributed Systems 2011
DOI: 10.1109/icpads.2011.131
|View full text |Cite
|
Sign up to set email alerts
|

StreamMR: An Optimized MapReduce Framework for AMD GPUs

Abstract: Abstract-MapReduce is a programming model from Google that facilitates parallel processing on a cluster of thousands of commodity computers. The success of MapReduce in cluster environments has motivated several studies of implementing MapReduce on a graphics processing unit (GPU), but generally focusing on the NVIDIA GPU.Our investigation reveals that the design and mapping of the MapReduce framework needs to be revisited for AMD GPUs due to their notable architectural differences from NVIDIA GPUs. For instan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2013
2013
2015
2015

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(8 citation statements)
references
References 11 publications
0
8
0
Order By: Relevance
“…Data processing using MapReduce has many advantages such as horizontal scalability, fault tolerance, high performance, high throughput and commodity hardware. Although it was primarily designed for index construction in search engines , it can be used in data analysis as well – quite a few algorithms can be expressed in MapReduce .…”
Section: Related Work and Backgroundmentioning
confidence: 99%
“…Data processing using MapReduce has many advantages such as horizontal scalability, fault tolerance, high performance, high throughput and commodity hardware. Although it was primarily designed for index construction in search engines , it can be used in data analysis as well – quite a few algorithms can be expressed in MapReduce .…”
Section: Related Work and Backgroundmentioning
confidence: 99%
“…Recently many studies to reduce the execution time of MapReduce operation using a graphics processing unit (GPU) have been actively conducted [2,5,6,7,8]. Single-Instruction, Multiple-Data (SIMD) processors on a GPU can quickly evaluate applications (e.g., String Match, Word Count, Kmeans, etc.)…”
Section: Introductionmentioning
confidence: 99%
“…This data has to be stored as the intermediate data on a local storage. Although Mars [2], StreamMR [5], MapCG [6] implemented a MapReduce framework with atomic-free operations, a general model for analyzing big data was not considered. Using the computation characterization of map and reduce operation, the separating scheduling scheme of MapReduce tasks on a CPU and a GPU was also addressed on the limited size of input data.…”
Section: Introductionmentioning
confidence: 99%
“…HadoopCL [15] was a seamless combination of OpenCL and Hadoop and provides an easy-to-learn and flexible API in a particular high-performance computing system. Last but not least, beyond the NVIDIA GPU-based systems, there were also optimized MapReduce frameworks for AMD GPUs (StreamMR [12]) and Intel Xeon Phi coprocessors (MrPhi [18]). However, no existing system provides a language-level easy-to-use programming model for GPU clusters as in Vispark.…”
Section: Related Workmentioning
confidence: 99%