2017
DOI: 10.1080/10618600.2017.1305278
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Categorical Data via Ensembling Dissimilarity Matrices

Abstract: We present a technique for clustering categorical data by generating many dissimilarity matrices and averaging over them. We begin by demonstrating our technique on low dimensional categorical data and comparing it to several other techniques that have been proposed. Then we give conditions under which our method should yield good results in general. Our method extends to high dimensional categorical data of equal lengths by ensembling over many choices of explanatory variables. In this context we compare our … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
28
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 15 publications
(29 citation statements)
references
References 46 publications
0
28
0
Order By: Relevance
“…In general, the clusters in different  ( ) 's do not correspond in a one-to-one manner according to their indices. For instances, (2) 's may be (1) 's with permuted subscript k. To further complicate matters, even permutation may not be a suitable relationship.…”
Section: Consider a Set Of Instancesmentioning
confidence: 99%
See 4 more Smart Citations
“…In general, the clusters in different  ( ) 's do not correspond in a one-to-one manner according to their indices. For instances, (2) 's may be (1) 's with permuted subscript k. To further complicate matters, even permutation may not be a suitable relationship.…”
Section: Consider a Set Of Instancesmentioning
confidence: 99%
“…1 Cluster alignment: Let  ( ) = { (1) 1 , (1) 2 , … , (1) }, = 1, … , . The number of clusters K t varies.…”
Section: Consider a Set Of Instancesmentioning
confidence: 99%
See 3 more Smart Citations