2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS) 2018
DOI: 10.1109/focs.2018.00082
|View full text |Cite
|
Sign up to set email alerts
|

Epsilon-Coresets for Clustering (with Outliers) in Doubling Metrics

Abstract: We study the problem of constructing ε-coresets for the (k, z)-clustering problem in a doubling metric M (X, d). An ε-coreset is a weighted subset S ⊆ X with weight function w : S → R ≥0 , such that for any k-subset C ∈ [X] k , it holds thatWe present an efficient algorithm that constructs an ε-coreset for the (k, z)-clustering problem in M (X, d), where the size of the coreset only depends on the parameters k, z, and the doubling dimension ddim(M ). To the best of our knowledge, this is the first efficient -c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
75
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 47 publications
(79 citation statements)
references
References 46 publications
3
75
1
Order By: Relevance
“…In general, a bounded doubling dimension does not imply a bounded VC dimension of the induced range space and vice versa. Recently, Huang et al [34] showed that if we allow a small (1 + ε)distortion of the distance function d, the shattering dimension can be upper bounded by O(ε −O(ddim(M,d)) ). It is conceivable that the doubling dimension of the metric space of the discrete Fréchet distance and Hausdorff distance is bounded, as long as the underlying metric has bounded doubling dimension.…”
Section: Range Spaces Induced By Distance Measuresmentioning
confidence: 99%
“…In general, a bounded doubling dimension does not imply a bounded VC dimension of the induced range space and vice versa. Recently, Huang et al [34] showed that if we allow a small (1 + ε)distortion of the distance function d, the shattering dimension can be upper bounded by O(ε −O(ddim(M,d)) ). It is conceivable that the doubling dimension of the metric space of the discrete Fréchet distance and Hausdorff distance is bounded, as long as the underlying metric has bounded doubling dimension.…”
Section: Range Spaces Induced By Distance Measuresmentioning
confidence: 99%
“…ε-coresets are very popular in the field of clustering, cf. [19,18,10,21] and they are becoming a topic in other fields, too, cf. [22,29,31,15,26].…”
Section: Introductionmentioning
confidence: 99%
“…This analysis was proven powerful in various metric spaces, such as doubling spaces by Huang, Jiang, Li and Wu [HJLW18], graphs of bounded treewidth by Baker, Braverman, Huang, Jiang, Krauthgamer, Wu [BBH + 20] or the shortest-path metric of a graph excluding a fixed minor by Braverman, Jiang, Krauthgamer and Wu [BJKW21]. However, range spaces of even heavily constrained metrics do not necessarily have small VC-dimension (e.g.…”
Section: Introductionmentioning
confidence: 99%
“…However, range spaces of even heavily constrained metrics do not necessarily have small VC-dimension (e.g. bounded doubling dimension does not imply bounded VC-dimension or vice versa [HJLW18,LL06]), and applying previous techniques requires heavy additional machinery to adapt the VC-dimension approach to them. Moreover, the bounds provided are far from the bound obtained for Euclidean spaces: their dependency in k is at least Ω(k 2 ), leaving a significant gap to the best lower bounds of Ω(k).…”
Section: Introductionmentioning
confidence: 99%