2021
DOI: 10.1590/1982-7849rac2021200081
|View full text |Cite
|
Sign up to set email alerts
|

Cluster Analysis in Practice: Dealing with Outliers in Managerial Research

Abstract: Context: in recent years, cluster analysis has stimulated researchers to explore new ways to understand data behavior. The computational ease of this method and its ability to generate consistent outputs, even in small datasets, explain that to some extent. However, researchers are often mistaken in holding that clustering is a terrain in which anything goes. The literature shows the opposite: they must be careful, especially regarding the effect of outliers on cluster formation. Objective: in this tutorial p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0
3

Year Published

2021
2021
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 24 publications
0
4
0
3
Order By: Relevance
“…After evaluating the properties of the research variables, the first step was to build the configurations based on the startup clustering in relation to the imperatives. The aim was to find configurations of startups that were internally homogeneous, but heterogeneous between each other (Distefano, 2012;Lopes & Gosling, 2020).…”
Section: Discussionmentioning
confidence: 99%
“…After evaluating the properties of the research variables, the first step was to build the configurations based on the startup clustering in relation to the imperatives. The aim was to find configurations of startups that were internally homogeneous, but heterogeneous between each other (Distefano, 2012;Lopes & Gosling, 2020).…”
Section: Discussionmentioning
confidence: 99%
“…K-means clustering is a popular method; however, we should perhaps experiment using the k-medoids clustering method. k-medoids is a partitioning method that is best suited for domains requiring robustness to outliers, inconsistent distance metrics, or the dataset with no clear definition of mean or median [53]. The k-medoids algorithm returns medoids which are the actual data points in the dataset.…”
Section: Limitationsmentioning
confidence: 99%
“…Though k-means clustering has become a popular data-clustering algorithm [69,70], it is sensitive to outliers [68]. An alternative clustering method that is robust to outliers is the partitioning around medoids (PAM) algorithm [69,71]. PAM requires that the optimal number of clusters be determined before the algorithm is applied [70].…”
Section: Cluster Analysismentioning
confidence: 99%