2018
DOI: 10.1007/s10586-018-2199-7
|View full text |Cite|
|
Sign up to set email alerts
|

RETRACTED ARTICLE: Research on semi supervised K-means clustering algorithm in data mining

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(9 citation statements)
references
References 9 publications
0
7
0
Order By: Relevance
“…The K-means clustering algorithm will be used in this research, not only because it's one of the most commonly used clustering techniques but also because it has been applied in many scientific and technological fields [6,19,27]. The K-means method has not only suffered from a major problem of which the algorithm produces empty clusters [3] added to that the problem produced by the random nature of cluster's initial centres selection that causes the algorithm to tend to sub optimal solutions [17]. Kmeans clustering algorithm will be used to group transcribed textual documents obtained from audio sources into topics by applying a similarity measure based on the Chi-square method, which is designed to eliminate non informative words that will more likely be erroneous words when applied on transcribed documents [5].…”
Section: K-means Clustering Algorithmmentioning
confidence: 99%
“…The K-means clustering algorithm will be used in this research, not only because it's one of the most commonly used clustering techniques but also because it has been applied in many scientific and technological fields [6,19,27]. The K-means method has not only suffered from a major problem of which the algorithm produces empty clusters [3] added to that the problem produced by the random nature of cluster's initial centres selection that causes the algorithm to tend to sub optimal solutions [17]. Kmeans clustering algorithm will be used to group transcribed textual documents obtained from audio sources into topics by applying a similarity measure based on the Chi-square method, which is designed to eliminate non informative words that will more likely be erroneous words when applied on transcribed documents [5].…”
Section: K-means Clustering Algorithmmentioning
confidence: 99%
“…The parameters involved should be used cautiously as incompatible use of parameters of clustering like, Number of Clusters (k-means) and Density Limit, may lead to situations like improper density shape of clusters, ambiguity in finding centroid and the noise [5][6][7]. Mainly The improved semi supervised K mean clustering is used for the greedy iteration to find the K mean clustering is presented in [8]. In this work, modification of iterative objective function for semi supervised K clustering in dealing with multi-objective optimization problems of insufficient is illustrated.…”
Section: Introductionmentioning
confidence: 99%
“…In the common methods of clustering, there is no previous information, and as such, it is called the unsupervised learning method [2,3]; however, in the real world, some information [4] is normally available, or we can obtain from Oracle. is information can be in different forms and can be used in the process of clustering [5][6][7][8][9][10][11][12][13].…”
Section: Introductionmentioning
confidence: 99%
“…If the information is presented as pairwise constraints (where a document pair must be in the same cluster (ML), while a document pair should not be located in the same cluster (CL)), and these pairwise constraints are used in the process of clustering, this method will be called pairwise constrained clustering [6,14,15]. Pairwise constraints can be useful in the clustering process in two ways: when enough informative pairwise constraints exist, where the accuracy and efficiency of the clustering can be improved, and when we want to change the process of clustering and personalize it [10,12,16].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation