2020
DOI: 10.1109/tpds.2019.2955467

Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer

Abstract: This paper presents an automatic k-means clustering solution targeting the Sunway TaihuLight supercomputer. We first introduce a multi-level parallel partition approach that not only partitions by dataflow and centroid, but also by dimension, which unlocks the potential of the hierarchical parallelism in the heterogeneous many-core processor and the system architecture of the supercomputer. The parallel design is able to process large-scale clustering problems with up to 196,608 dimensions and over 160,000 tar…
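
The three partition levels named in the abstract (dataflow, centroid, dimension) can be illustrated with a minimal serial sketch of the k-means assignment step. This is not the paper's implementation: the function names `chunks` and `assign` and the three `n_*_parts` parameters are hypothetical, and each nested loop only stands in for work that a real system would distribute across nodes, core groups, or cores.

```python
from typing import List, Sequence


def chunks(n: int, parts: int) -> List[range]:
    """Split range(n) into at most `parts` contiguous, near-equal blocks."""
    parts = max(1, min(parts, n))
    base, extra = divmod(n, parts)
    out, start = [], 0
    for i in range(parts):
        size = base + (1 if i < extra else 0)
        out.append(range(start, start + size))
        start += size
    return out


def assign(
    points: Sequence[Sequence[float]],
    centroids: Sequence[Sequence[float]],
    n_data_parts: int = 2,      # hypothetical: number of sample blocks
    n_centroid_parts: int = 2,  # hypothetical: number of centroid blocks
    n_dim_parts: int = 2,       # hypothetical: number of dimension blocks
) -> List[int]:
    """Return the index of the nearest centroid for every point."""
    n, k, d = len(points), len(centroids), len(points[0])
    labels = [0] * n
    for data_block in chunks(n, n_data_parts):            # level 1: samples
        for i in data_block:
            best_dist, best_c = float("inf"), -1
            for cent_block in chunks(k, n_centroid_parts):  # level 2: centroids
                for c in cent_block:
                    # Level 3: each dimension block yields a partial squared
                    # distance; the partial results are then sum-reduced.
                    partial = [
                        sum((points[i][j] - centroids[c][j]) ** 2 for j in dim_block)
                        for dim_block in chunks(d, n_dim_parts)
                    ]
                    dist = sum(partial)
                    if dist < best_dist:
                        best_dist, best_c = dist, c
            labels[i] = best_c
    return labels


points = [[0.0, 0.0, 0.0], [0.1, 0.0, 0.0], [10.0, 10.0, 10.0], [9.0, 10.0, 11.0]]
centroids = [[0.0, 0.0, 0.0], [10.0, 10.0, 10.0]]
labels = assign(points, centroids)
```

Splitting by dimension is what lets a single point-to-centroid distance be shared among several workers: each worker only needs its own slice of the point and of the centroid, at the cost of one extra reduction per distance.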

Cited by 9 publications (8 citation statements)
References 32 publications

“…Although the KMeans API is supposed to be highly optimized and fast, it turns out that our GPU code appears ×1.4 to ×9.3 faster than the KMeans of RAPIDS v0.19 on RTX 2080Ti. We guess this significant difference is mainly induced by the wrapper overhead of Python interface (as our tests do not last long) and by the youth of RAPIDS (v0.19 in April 2021). k-Means of Yu et al. [16] on one node of Sunway TaihuLight supercomputer. The SW26010 manycore processor offers 260 cores and has a significantly different design from other multicore and manycore processors [16].…”
Section: Experimental Evaluation (mentioning)
confidence: 99%
“…We guess this significant difference is mainly induced by the wrapper overhead of Python interface (as our tests do not last long) and by the youth of RAPIDS (v0.19 in April 2021). k-Means of Yu et al. [16] on one node of Sunway TaihuLight supercomputer. The SW26010 manycore processor offers 260 cores and has a significantly different design from other multicore and manycore processors [16]. This processor appeared in 2016 as part of the Sunway TaihuLight supercomputer which was at that time ranked #1 in the Top500 list from 2016 to 2018.…”
Section: Experimental Evaluation (mentioning)
confidence: 99%
“…The current teaching quality evaluation mostly adopts the method of student evaluation. Usually, the educational administration department publishes the teacher's teaching quality evaluation form on the Internet at the mid-term or end of the period, scores the teacher according to the evaluation items in the evaluation form, and determines the teacher's teaching quality evaluation grade according to the evaluation results after statistics by the educational administration department [5]. This evaluation method can only obtain simple evaluation results, can not analyze the evaluation data, can not give full play to the guiding role of teaching evaluation in teaching, and does not make full use of the existing data.…”
Section: Introduction (mentioning)
confidence: 99%