2019
DOI: 10.1007/s13369-019-04121-0
|View full text |Cite
|
Sign up to set email alerts
|

Initial Seed Selection for Mixed Data Using Modified K-means Clustering Algorithm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 10 publications
0
7
0
Order By: Relevance
“…In this section, a discussion of various existing techniques is presented, which are used to cluster the categorical data. The advantages and its limitations of these existing techniques [14][15][16][17][18] are also illustrated.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…In this section, a discussion of various existing techniques is presented, which are used to cluster the categorical data. The advantages and its limitations of these existing techniques [14][15][16][17][18] are also illustrated.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Sajidha [17] proposed the modified K-means clustering algorithm by considering every attribute of datasets for selecting the initial seed. The datasets were clustered with the mixed attributes easily because the developed study was independent of user-defined parameters.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Different initial seeds may lead to distinct results. It is also difficult to determine the number of clusters due to its nature of a supervised algorithm ( Sajidha et al, 2020 ). By applying the dummy variable for qualitative traits, k -means can be modified as weighted k -means clustering ( Huang et al, 2005 ; Foss and Markatou, 2018 ), so that it is able to deal with qualitative and quantitative traits at the same time.…”
Section: Introductionmentioning
confidence: 99%
“…Hence, methods to detect outliers and to moderate their effects are needed. The importance of density and distance of data points while identifying the initial seed points for -means for numerical data, -modes for categorical data and mixed datasets using modified -means algorithm, is elucidated in the work [2]- [4] in which the initial seed points were effectively identified. One of the major drawbacks of the partition based clustering algorithm is that they cannot detect the presence of outliers.…”
Section: Introductionmentioning
confidence: 99%