2013
DOI: 10.1007/s11227-013-0906-y
|View full text |Cite
|
Sign up to set email alerts
|

High performance data clustering: a comparative analysis of performance for GPU, RASC, MPI, and OpenMP implementations

Abstract: Compared to Beowulf clusters and shared-memory machines, GPU and FPGA are emerging alternative architectures that provide massive parallelism and great computational capabilities. These architectures can be utilized to run compute-intensive algorithms to analyze ever-enlarging datasets and provide scalability. In this paper, we present four implementations of K-means data clustering algorithm for different high performance computing platforms. These four implementations include a CUDA implementation for GPUs, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 5 publications
0
5
0
Order By: Relevance
“…We also made comparisons with other recent works in the literature, that only in one case achieved results comparable but not better than ours, except for the FPGA solution proposed in [27]. However, FPGA memory constraint does not allow to process images with more than 17, 692 pixels.…”
Section: Discussionmentioning
confidence: 71%
See 3 more Smart Citations
“…We also made comparisons with other recent works in the literature, that only in one case achieved results comparable but not better than ours, except for the FPGA solution proposed in [27]. However, FPGA memory constraint does not allow to process images with more than 17, 692 pixels.…”
Section: Discussionmentioning
confidence: 71%
“…A comparative analysis similar to the one we conducted is reported in [27]. Authors exploited GPUs, OpenMP, Message Passing Interface (MPI) and FPGAs.…”
Section: Comparisons and Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Peralta et al [17] proposed a twolevel parallelised cluster framework that combined process-level and thread-level parallelisms. For large-scale automated FI system (AFIS), cluster-based solutions are expensive, as their cost depends on the number of high-end computers used in the cluster [18]. DSP-based accelerators offer limited parallelism and they may not be adequate for exploiting the maximum parallelism of minutiaebased fingerprint matching algorithms.…”
Section: Introductionmentioning
confidence: 99%