Twenty-First International Conference on Machine Learning - ICML '04 2004
DOI: 10.1145/1015330.1015404
|View full text |Cite
|
Sign up to set email alerts
|

Entropy-based criterion in categorical clustering

Abstract: Entropy-type measures for the heterogeneity of clusters have been used for a long time. This paper studies the entropy-based criterion in clustering categorical data. It first shows that the entropy-based criterion can be derived in the formal framework of probabilistic clustering models and establishes the connection between the criterion and the approach based on dissimilarity coefficients. An iterative Monte-Carlo procedure is then presented to search for the partitions minimizing the criterion. Experiments… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
134
0

Year Published

2004
2004
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 133 publications
(135 citation statements)
references
References 14 publications
1
134
0
Order By: Relevance
“…Initial results have shown that the entropy criterion can be very effective in clustering categorical data. Paper [34] also shows that the entropy criterion can be formally derived in the framework of probabilistic clustering models, which strongly supports that the entropy criterion is a meaningful and reliable similarity measure, particularly good for categorical data.…”
Section: Entropy-based Categorical Clusteringmentioning
confidence: 53%
See 3 more Smart Citations
“…Initial results have shown that the entropy criterion can be very effective in clustering categorical data. Paper [34] also shows that the entropy criterion can be formally derived in the framework of probabilistic clustering models, which strongly supports that the entropy criterion is a meaningful and reliable similarity measure, particularly good for categorical data.…”
Section: Entropy-based Categorical Clusteringmentioning
confidence: 53%
“…Originated from information theory, entropy has been applied in various areas, such as pattern discovery [8], numerical clustering [17] and information retrieval [40]. Due to the lack of intuitive distance definition for categorical values, recently entropy has been applied in clustering categorical data [5,34,11,20,13]. Initial results have shown that the entropy criterion can be very effective in clustering categorical data.…”
Section: Entropy-based Categorical Clusteringmentioning
confidence: 99%
See 2 more Smart Citations
“…In fact, entropy and related information gain criterion are widely used in learning, e.g. for contextual clustering [25] or decision trees [34].…”
Section: Highmentioning
confidence: 99%