Comparing Two Clusterings Using Matchings between Clusters of Clusters

Cazals, Frédéric; Mazauric, Dorian; Tetley, Romain; Watrigant, Rémi

doi:10.1145/3345951

Cited by 6 publications

(7 citation statements)

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Several research papers [ 10 , 11 ] have analyzed the stability of a given clustering algorithm while varying its parameters, and to compare clusters yielded by different algorithms, using comparison schemes based on matchings, information theory, and use of various indices (Rand, Jaccard). This was generalized to accommodate many-to-many matchings between clusters, via the D-family-matching on the intersection graph , with D as the upper bound on the diameter of the graph induced by the clusters of any meta-cluster by [ 12 ]. While this problem is NP-complete and hard to approximate, a polynomial time, spanning tree based heuristic was presented.…”

Section: Related Workmentioning

confidence: 99%

“…In this paper, we are dealing with clusters obtained through two different datasets which are therefore, non-uniform in size and could represent different metrics. We present a novel way of using a specialized edge-weighing formulation in intersection graph and to find the correspondences through maximum-weighted bipartite matching of the graph in section [5.2, 5.3], figure [ 12 , 15 ]. It is noteworthy that our edge-weight metric can be generalized to compute similarity between two sets containing objects at different levels of hierarchies in a hierarchical dataset.…”

Section: Model and Techniquesmentioning

confidence: 99%

See 1 more Smart Citation

Clusters of COVID-19 Indicators in India: Characterization, Correspondence and Change Analysis

2022

View full text Add to dashboard Cite

We conduct a long-term epidemiology study of COVID-19 in India from Mar 2020 to May 2021 using a number of indicators such as active cases, daily new cases, and deaths, on a micro (district level, per capita) and macro level (state level). Our automated shape-based cluster discovery of the per capita daily new cases ( case rate ) during the first wave in India (between Mar 2020 and Jan 2021) revealed four distinct shape patterns: sharp-rise and decline, steady-rise and decline, plateau and multiple relatively high peaks. These clusters exhibit a strong geographical correlation. To determine the correspondence between clusters obtained by different indicators, we design a novel metric for determining edge-weights in their intersection graph . This is used for comparative analysis and to develop informative hierarchical cartographic visualizations. We then perform dynamic cluster analysis for different time windows to answer some pertinent questions. Is the second wave similar to or different from the first wave ? How has the relative ranking (on micro- and macro-level indicators) of the states varied over the last one year? How much medical resources have been stressed during the peak? We demonstrate that using multiple indicators, we can assess the impact of the epidemic holistically in a particular geography. Our analysis techniques and insights obtained can help the local and state governments in monitoring and managing COVID-19 situation and fine-tuning the ongoing vaccination drive in India.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Model and Techniquesmentioning

confidence: 99%

Clusters of COVID-19 Indicators in India: Characterization, Correspondence and Change Analysis

2022

View full text Add to dashboard Cite

show abstract

“…For this purpose, we applied a cluster matching framework, called D-family matching. 22 It first defines the "intersection graph" G of N i and N j as a bipartite graph where the vertices in the two partite sets correspond to the clusters of N i and N j . Each pair of clusters of N i and N j has an edge with the weight equal to the size of their intersection.…”

Section: Rq2: Inter-annotator Agreementmentioning

confidence: 99%

User‐guided global explanations for deep image recognition: A user study

Hamidi-Haines

Fern

et al. 2021

Applied AI Letters

View full text Add to dashboard Cite

We study a user‐guided approach for producing global explanations of deep networks for image recognition. The global explanations are produced with respect to a test data set and give the overall frequency of different “recognition reasons” across the data. Each reason corresponds to a small number of the most significant human‐recognizable visual concepts used by the network. The key challenge is that the visual concepts cannot be predetermined and those concepts will often not correspond to existing vocabulary or have labeled data sets. We address this issue via an interactive‐naming interface, which allows users to freely cluster significant image regions in the data into visually similar concepts. Our main contribution is a user study on two visual recognition tasks. The results show that the participants were able to produce a small number of visual concepts sufficient for explanation and that there was significant agreement among the concepts, and hence global explanations, produced by different participants.

show abstract

“…In addition, studies comparing algorithms or generated clusters have been performed [37]. Cazals et al proposed a framework to analyze the stability of clustering algorithms and compare clusters by introducing meta-clusters [10]. They defined the family-matching problems on an intersection graph.…”

Section: Clusteringmentioning

confidence: 99%

MaxMin clustering for historical analogy

2020

View full text Add to dashboard Cite

Historical analogy is the ability to use historical knowledge to consider solutions for a present event, and it can be promoted by group learning. However, group creation for promoting the ability has been unexplored. This study proposes a novel clustering algorithm, named MaxMin clustering (MMC), to enhance discussions of group learning toward promoting historical analogy. The key concept is group formation by aggregating similar and different users. MMC uses aspects provided by users for the same present event. Subsequently, it solves maximum and minimum optimization problems to find similar and different users by counting the number of aspects shared by them. MMC is implemented and evaluated through comparison with other clustering algorithms; the comparison is based on the degree to which the generated clusters satisfy conditions for enhancing discussions of group learning toward promoting historical analogy. The experimental results prove that only MMC can generate suitable groups.

show abstract

Comparing Two Clusterings Using Matchings between Clusters of Clusters

Cited by 6 publications

References 45 publications

Clusters of COVID-19 Indicators in India: Characterization, Correspondence and Change Analysis

Clusters of COVID-19 Indicators in India: Characterization, Correspondence and Change Analysis

User‐guided global explanations for deep image recognition: A user study

MaxMin clustering for historical analogy

Contact Info

Product

Resources

About