1987
DOI: 10.1007/bf01890074
|View full text |Cite
|
Sign up to set email alerts
|

A test for spatial homogeneity in cluster analysis

Abstract: Cluster homogeneity, Spatial uniformity, Cluster validity, Data normalization,

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

1994
1994
2021
2021

Publication Types

Select...
5
2
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 24 publications
(7 citation statements)
references
References 32 publications
0
6
0
Order By: Relevance
“…According to Lawson and Jures [44], "if too few points are chosen, then the nearest-neighbor distances chosen will not be representative of the entire distribution of distances." If too many points are chosen, [45] warn that the "assumptions about the Beta distribution will be invalid." Previous authors recommend sampling 5 − 10% of the data [44,46].…”
Section: Clusterability Via Spatial Randomnessmentioning
confidence: 99%
“…According to Lawson and Jures [44], "if too few points are chosen, then the nearest-neighbor distances chosen will not be representative of the entire distribution of distances." If too many points are chosen, [45] warn that the "assumptions about the Beta distribution will be invalid." Previous authors recommend sampling 5 − 10% of the data [44,46].…”
Section: Clusterability Via Spatial Randomnessmentioning
confidence: 99%
“…One of the most important subjects in cluster analysis is to understand the spatial relationships between data objects in each cluster, such as dense or sparse regions in a dataset [10]. It becomes a problem, when some clustering algorithms do not obey these relationships and distributions.…”
Section: Data Distribution Patterns For Clusteringmentioning
confidence: 99%
“…U j are the minimum distances of the sub-models from m random points in the sampling window. To define the sampling window, we either took 25 to 75 percentile of the feature values or from δ to max.value-δ along each dimension, where δ denotes the standard deviation of the feature value (Dubes & Zeng 1987; Zeng & Richard C Dubes 1985; Zeng & Richard C. Dubes 1985). To estimate p-value, we repeat the above procedure 1000 times and measured the H value.…”
Section: Methodsmentioning
confidence: 99%