Proceedings of the Thirty-Seventh Annual ACM Symposium on Theory of Computing 2005
DOI: 10.1145/1060590.1060622
|View full text |Cite
|
Sign up to set email alerts
|

Coresets in dynamic geometric data streams

Abstract: A dynamic geometric data stream consists of a sequence of m insert/delete operations of points from the discrete space {1, . . . , ∆} d [26]. We develop streaming (1 + )-approximation algorithms for k-median, k-means, MaxCut, maximum weighted matching (MaxWM), maximum travelling salesperson (MaxTSP), maximum spanning tree (MaxST), and average distance over dynamic geometric data streams. Our algorithms maintain a small weighted set of points (a coreset) that approximates with probability 2/3 the current point … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
104
0

Year Published

2005
2005
2012
2012

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 111 publications
(104 citation statements)
references
References 30 publications
0
104
0
Order By: Relevance
“…Coresets in the context of the Euclidean k-median and the Euclidean k-means problem have been known for some time (see [21,3,15,10,13]). A (k, )-coreset for a set P is a small (weighted) set such that for any set C of k cluster centers the (weighted) clustering cost of the coreset is an approximation for the clustering cost of the original set P with relative error at most .…”
Section: Related Workmentioning
confidence: 99%
“…Coresets in the context of the Euclidean k-median and the Euclidean k-means problem have been known for some time (see [21,3,15,10,13]). A (k, )-coreset for a set P is a small (weighted) set such that for any set C of k cluster centers the (weighted) clustering cost of the coreset is an approximation for the clustering cost of the original set P with relative error at most .…”
Section: Related Workmentioning
confidence: 99%
“…Later, standard constant factor approximation algorithms were given that run in time O(n k) (see, e.g., [47,54]). These sublinear-time results have been extended in many different ways, e.g., to efficient data streaming algorithms and very fast algorithms for Euclidean kmedian and also to k-means, see, e.g., [9,13,17,29,38,39,44,45,48]. For another clustering problem, the min-sum k-clustering problem (which is complement to the Max-k-Cut), for the basic case of k = 2, Indyk [42] (see also [41]) gave a (1+ε)-approximation algorithm that runs in time O(2 1/ε O(1) n (log n) O(1) ), which is sublinear in the full input description size.…”
Section: Other Resultsmentioning
confidence: 99%
“…• problems on geometric streams where space is partitioned in one or higher dimensions into geometrically spaced grids [138,96].…”
Section: Exponential Histogramsmentioning
confidence: 99%
“…This line of reasoning has led to new geometric results on Turnstile data streams for minimum spanning tree weight estimation, bichromatic matching, facility location and k-median clustering [138,96,95]. Many open problems remain.…”
Section: Computational Geometrymentioning
confidence: 99%