2021
DOI: 10.1016/j.ins.2021.04.076
|View full text |Cite
|
Sign up to set email alerts
|

Clustering mixed numerical and categorical data with missing values

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 72 publications
(29 citation statements)
references
References 28 publications
0
19
0
1
Order By: Relevance
“…Any missing value of the variables representing monthly average temperatures, total monthly rainfalls, days with hail events in a month, days with fog occurrence in a month and days with storm activity in a month were calculated as the average of all the values registered for the corresponding month for all the years included in the dataset. In this context, notice that there are novel proposals for data-mining methods that inherently deal with missing values [21].…”
Section: Data Preprocessing On the Cloudmentioning
confidence: 99%
“…Any missing value of the variables representing monthly average temperatures, total monthly rainfalls, days with hail events in a month, days with fog occurrence in a month and days with storm activity in a month were calculated as the average of all the values registered for the corresponding month for all the years included in the dataset. In this context, notice that there are novel proposals for data-mining methods that inherently deal with missing values [21].…”
Section: Data Preprocessing On the Cloudmentioning
confidence: 99%
“…In the following, the effective application of HAC on inhomogeneous data of PUH-environments is presented, leading to the obvious suggestion that HAC could be far more intensely used for building classification tasks in urban planning. For a detailed description of a novel clustering algorithm (k-CMM), especially suited to mixed numerical and categorical data, see e.g., Dinh et al [24].…”
Section: Data/buildings Classification and Applied Clustering Algorithmsmentioning
confidence: 99%
“…(1) In OCS approach, the missing values are viewed as additional attributes to be optimized and then impute missing values at each iteration till it reaches the best estimates, (2) NPS is a OCS modification, which computes the partial distances, and missing values are estimated by their nearest prototype counterparts during each iteration. In the hybrid clustering-based imputation method, Dinh et al [32] proposed a framework of clustering mixed numerical and categorical data with missing values, it used the decision-tree-based method to find the set of correlated data instance and used the mean and kernel-based methods to obtain cluster centers at numerical and categorical attributes, and they applied the dissimilarity measure to calculate the distances between instance and cluster centers.…”
Section: Computational Intelligence Imputationmentioning
confidence: 99%