2016
DOI: 10.1007/s10115-016-0930-3
|View full text |Cite
|
Sign up to set email alerts
|

An adaptive version of k-medoids to deal with the uncertainty in clustering heterogeneous data using an intermediary fusion approach

Abstract: Abstract. This paper introduces Hk -medoids, a modified version of the standard kmedoids algorithm. The modification extends the algorithm for the problem of clustering complex heterogeneous objects that are described by a diversity of data types, e.g. text, images, structured data and time series. We first proposed an intermediary fusion approach to calculate fused similarities between objects, SMF, taking into account the similarities between the component elements of the objects using appropriate similarity… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 40 publications
0
5
0
Order By: Relevance
“…By contrast, using k-means for the ODP-239 dataset shows a high result of 84.36 in terms of f-measure with QSTWVR. This finding may be due to the ability of QSTWVR low-dimensional word embeddings to capture the implicit semantics of words efficiently; thus, it is more appropriate for short text [ 44 , 60 ]. Likewise, the ODP-239 dataset result obtained with k-medoids is 82.3 in terms of f-measure using QSTDVR, which is also high compared with other clustering methods.…”
Section: Discussionmentioning
confidence: 99%
“…By contrast, using k-means for the ODP-239 dataset shows a high result of 84.36 in terms of f-measure with QSTWVR. This finding may be due to the ability of QSTWVR low-dimensional word embeddings to capture the implicit semantics of words efficiently; thus, it is more appropriate for short text [ 44 , 60 ]. Likewise, the ODP-239 dataset result obtained with k-medoids is 82.3 in terms of f-measure using QSTDVR, which is also high compared with other clustering methods.…”
Section: Discussionmentioning
confidence: 99%
“…by fusing distances) [33]. The intermediate fusion approach we use is an adaptation of work by Mojahed et al [34] who used a k-medoids clustering algorithm to cluster objects that were represented by different data types (e.g. text, images and TSs).…”
Section: Time Series Analysismentioning
confidence: 99%
“…There are several studies that have applied machine learning methods to multimedia datasets. Mojahed et al [10,11] applied clustering ensemble methods to multimedia datasets. They generated five different heterogeneous datasets containing a mixture of both structured and unstructured datasets.…”
Section: Related Workmentioning
confidence: 99%