2021
DOI: 10.1016/j.seps.2020.100907
|View full text |Cite
|
Sign up to set email alerts
|

Robust multivariate analysis for mixed-type data: Novel algorithm and its practical application in socio-economic research

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…Although Gower's metric is widely used when dealing with mixed datasets, it presents several shortcomings. For example, it gives more weight to categorical variables than to quantitative ones; it does not take into account the possible associations or correlations between variables; and it is not robust against atypical data (see [9,42,43]).…”
Section: Discussionmentioning
confidence: 99%
“…Although Gower's metric is widely used when dealing with mixed datasets, it presents several shortcomings. For example, it gives more weight to categorical variables than to quantitative ones; it does not take into account the possible associations or correlations between variables; and it is not robust against atypical data (see [9,42,43]).…”
Section: Discussionmentioning
confidence: 99%
“…The FS-DB algorithm was introduced by [8] for the analysis and clustering of mixed data. In particular, the FS-DB algorithm combines the forward search method [9,10] with a distance-based tool, used in [5], to detect outliers in mixed-type datasets.…”
Section: Fs-db Algorithmmentioning
confidence: 99%
“…Two possible distance measures are available in the FS-DB algorithm: a distance measure based on Gower's classical similarity coefficient and a metric obtained via related metric scaling, which satisfies several axioms related to the property of identifying and discarding redundant information. See [8] for details.…”
Section: Fs-db Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…Gower's distance is defined as δ 2 (z i , z j ) = 1 − s ij . There are other possible, more robust, metrics that can be used with mixed data (see, e.g., [20]), but, for simplicity, we use Gower's distance, as it is implemented in the R cluster package and supported in the dblm and dbglm functions of the dbstats package that we employed in the analysis of the real data sets and in the simulations.…”
Section: The Choice Of the Distancementioning
confidence: 99%