High performance data clustering: a comparative analysis of performance for GPU, RASC, MPI, and OpenMP implementations

Yang, Luobin; Chiu, Steve C.; Liao, Wei-keng; Thomas, Michael A.

doi:10.1007/s11227-013-0906-y

Cited by 14 publications

(5 citation statements)

References 5 publications


“…We also made comparisons with other recent works in the literature, that only in one case achieved results comparable but not better than ours, except for the FPGA solution proposed in [27]. However, FPGA memory constraint does not allow to process images with more than 17,692 pixels.…”

Section: Discussion (mentioning)

confidence: 71%

“…A comparative analysis similar to the one we conducted is reported in [27]. Authors exploited GPUs, OpenMP, Message Passing Interface (MPI) and FPGAs.…”

Section: Comparisons and Discussion (mentioning)

confidence: 99%

“…Finally, concerning the FPGA implementation, experiments were reported only with a 17,692 9-dimensionality dataset. Classification time is ~100 ms, but, as stated in [27], the FPGA resources, especially memory banks, were not enough to process bigger datasets. The authors of this work do not use external DDR memory, therefore, the FPGA performance is limited due to this design choice.…”

Section: Comparisons and Discussion (mentioning)

confidence: 99%

“…

| Work | Maximum Image Size | Data Dimensionality | Technology | Speed-Up |
|---|---|---|---|---|
| [18] | 2,000,000 | 8 | GPU NVIDIA GTX 280 | 220 |
| [19] | 500,000 | 2 | GPU NVIDIA 9600 GT | 14 |
| [20] | 1,000,000 | 2 | GPU NVIDIA 8800 GTX | 60 |
| [21] | 15,052,800 | 3 | 4 × GPU NVIDIA GTX 750Ti | 60 |
| [22] | 1,000,000 | 32 | GPU NVIDIA GTX 280 | N.A. |
| [23] | 16,777,216 | 3 | GPU NVIDIA Tesla C2050 | 25 |
| [24] | 245,057 | 4 | GPU NVIDIA GeForce 210 | 386 |
| [25] | 500,000 | 16 | GPU NVIDIA Quadro K5000 | 88 |
| [26] | N.A. | N.A. | GPU NVIDIA GTX 1080 | 18.5 |
| [27] | 20,000 | 10 | 2 × AMD Opteron quad-core | 8 |
| [27] | 65,536 | 10 | GPU NVIDIA Tesla 2050 | 60 |
| [27] | 17,692 | 9 | Mitrion MVP FPGA Simulator | N.A. |
| Our work | 264,408 | 128 | GPU NVIDIA GTX 1060 | 126 |

…”

Section: Paper (mentioning)

confidence: 99%


Parallel K-Means Clustering for Brain Cancer Detection Using Hyperspectral Images

et al. 2018


The precise delineation of brain cancer is a crucial task during surgery. There are several techniques employed during surgical procedures to guide neurosurgeons in the tumor resection. However, hyperspectral imaging (HSI) is a promising non-invasive and non-ionizing imaging technique that could improve and complement the currently used methods. The HypErspectraL Imaging Cancer Detection (HELICoiD) European project has addressed the development of a methodology for tumor tissue detection and delineation exploiting HSI techniques. In this approach, the K-means algorithm emerged in the delimitation of tumor borders, which is of crucial importance. The main drawback is the computational complexity of this algorithm. This paper describes the development of the K-means clustering algorithm on different parallel architectures, in order to provide real-time processing during surgical procedures. This algorithm will generate an unsupervised segmentation map that, combined with a supervised classification map, will offer guidance to the neurosurgeon during the tumor resection task. We present parallel K-means clustering based on OpenMP, CUDA and OpenCL paradigms. These algorithms have been validated through an in-vivo hyperspectral human brain image database. Experimental results show that the CUDA version can achieve a speed-up of ~150× with respect to sequential processing. The remarkable result obtained in this paper makes possible the development of a real-time classification system.


“…Peralta et al [17] proposed a two-level parallelised cluster framework that combined process-level and thread-level parallelisms. For large-scale automated FI system (AFIS), cluster-based solutions are expensive, as their cost depends on the number of high-end computers used in the cluster [18]. DSP-based accelerators offer limited parallelism and they may not be adequate for exploiting the maximum parallelism of minutiae-based fingerprint matching algorithms.…”

Section: Introduction (mentioning)

confidence: 99%