2018
DOI: 10.1007/978-3-319-92871-5_11
|View full text |Cite
|
Sign up to set email alerts
|

A Statistical Performance Analysis of Graph Clustering Algorithms

Abstract: Measuring graph clustering quality remains an open problem. To address it, we introduce quality measures based on comparisons of intra-and inter-cluster densities, an accompanying statistical test of the significance of their differences and a step-by-step routine for clustering quality assessment. Our null hypothesis does not rely on any generative model for the graph, unlike modularity which uses the configuration model as a null model. Our measures are shown to meet the axioms of a good clustering quality f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
16
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
1
1

Relationship

4
3

Authors

Journals

citations
Cited by 12 publications
(17 citation statements)
references
References 39 publications
1
16
0
Order By: Relevance
“…Here, we slightly modify the procedure to generate inter-cluster edges. In our previous article [35], we varied the proportion of vertices inside and outside each cluster that shared an edge. Here, we vary edge probabilities.…”
Section: Experimental Set-up and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Here, we slightly modify the procedure to generate inter-cluster edges. In our previous article [35], we varied the proportion of vertices inside and outside each cluster that shared an edge. Here, we vary edge probabilities.…”
Section: Experimental Set-up and Resultsmentioning
confidence: 99%
“…Here, we streamline our statistical test. In our previous article [35], we conducted two separate tests. We formulated two null hypotheses,K intra = K andK inter = K, to avoid the effects of a possible correlation between K intra andK inter (K is a graph constant, not the result of a clustering).…”
Section: Hypothesis Testingmentioning
confidence: 99%
“…The links between density and clustered patterns of connectivity were shown in Miasnikof et al [26]. Under such a pattern of connectivity, it is expected that the densities of induced subgraphs obtained by sampling vertices within a neighborhood will exhibit, on average, higher densities than the graph's global density.…”
Section: Underlying Assumptions and Densitiesmentioning
confidence: 93%
“…We then examine the relationship between mean Jaccard [10], Otsuka-Ochiai [17] and Burt's distances [2], on one hand, and intra-cluster density [14,13,15,16] within each cluster, on the other. Because these distances are pairwise measures, we compare their mean value for a given cluster to the cluster's internal density.…”
Section: Distance Measurements Under Studymentioning
confidence: 99%
“…At the cluster level, this distance takes the form of subsets of densely connected vertices. The link between clustering and density has been discussed in depth, recently [14,15,13,16]. In this article, our ultimate goal is to transform a graph's adjacency matrix into a |V | × |V | similarity or distance matrix D = [d ij ], where the distance between each pair of vertices is given by the element d ij .…”
Section: Introductionmentioning
confidence: 99%

Graph Distances and Clustering

Miasnikof,
Shestopaloff,
Pitsoulis
et al. 2020
Preprint
Self Cite