Smoothed Analysis of the k-Means Method

Arthur, David; Manthey, Bodo; Röglin, Heiko

doi:10.1145/2027216.2027217

Cited by 74 publications

(65 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For the k-means problem, Kanungo et al [37] showed that local search achieves a 9`ε-approximation in general metrics and this remains the best known approximation guarantee so far even for fixed d. There are also a variety of results for k-means and k-median when the input has some stability conditions (see for example [10,8,14,13,18,40,45]) or in the context of smoothed analysis (see for example [6,5]). …”

Section: Related Workmentioning

confidence: 99%

Local Search Yields Approximation Schemes for k-Means and k-Median in Euclidean and Minor-Free Metrics

Cohen-Addad

Klein

Mathieu

2016

2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS)

View full text Add to dashboard Cite

We give the first polynomial-time approximation schemes (PTASs) for the following problems: (1) uniform facility location in edge-weighted planar graphs; (2) k-median and k-means in edge-weighted planar graphs; (3) k-means in Euclidean space of bounded dimension. Our first and second results extend to minor-closed families of graphs. All our results extend to cost functions that are the p-th power of the shortest-path distance. The algorithm is local search where the local neighborhood of a solution S consists of all solutions obtained from S by removing and adding 1{εOp1q centers.

show abstract

Section: Related Workmentioning

confidence: 99%

Local Search Yields Approximation Schemes for k-Means and k-Median in Euclidean and Minor-Free Metrics

Cohen-Addad

Klein

Mathieu

2016

2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS)

View full text Add to dashboard Cite

show abstract

“…The computational complexity of Algorithm 1 is O(n·K · d · ω) with ω the number of iterations until satisfactory convergence is achieved in line 6. Even though ω can grow exponentially in n [18], it is in average (via smoothed analysis) polynomial in n [19]. For real data, it often can be observed that ω does not grow that fast and is considered proportional to n.…”

Section: K-means Clusteringmentioning

confidence: 99%

Identification of nonlinear behavior with clustering techniques in car crash simulations for better model reduction

Grunert

Fehr

2016

Adv. Model. and Simul. in Eng. Sci.

View full text Add to dashboard Cite

Background: Car crash simulations need a lot of computation time. Model reduction can be applied in order to gain time-savings. Due to the highly nonlinear nature of a crash, an automatic separation in parts behaving linearly and nonlinearly is valuable for the subsequent model reduction. Methods: We analyze existing preprocessing and clustering methods like k-means and spectral clustering for their suitability in identifying nonlinear behavior. Based on these results, we improve existing and develop new algorithms which are especially suited for crash simulations. They are objectively rated with measures and compared with engineering experience. In future work, this analysis can be used to choose appropriate model reduction techniques for specific parts of a car. A crossmember of a 2001 Ford Taurus finite element model serves as an industrial-sized example. Results: Since a non-intrusive black box approach is assumed, only heuristic approaches are possible. We show that our methods are superior in terms of simplicity, quality and speed. They also free the user from arbitrarily setting parameters in clustering algorithms. Conclusion: Though we improved existing methods by an order of magnitude, preparing them for the use with a full car model, they still remain heuristic approaches that need to supervised by experienced engineers.

show abstract

“…The overall cost is therefore O(kn 2 ) for each iteration, which, in our case, can be simplified to O(n 2 ). Though it has been shown that, in the very worst case, the algorithm requires an exponential number of iterations [36], it has been recognized since long that the performance of the K-means algorithm exhibits a stark contrast between practical observations and theoretical bounds [37]. In our case, we have observed an approximately constant number of iterations, so that we can safely assume that the overall cost of the K-means algorithm is O(n 2 ).…”

Section: K-meansmentioning

confidence: 63%

A traffic-based evolutionary algorithm for network clustering

Naldi

Salcedo‐Sanz

Carro-Calvo

et al. 2013

Applied Soft Computing

View full text Add to dashboard Cite

a b s t r a c tNetwork clustering algorithms are typically based only on the topology information of the network. In this paper, we introduce traffic as a quantity representing the intensity of the relationship among nodes in the network, regardless of their connectivity, and propose an evolutionary clustering algorithm, based on the application of genetic operators and capable of exploiting the traffic information. In a comparative evaluation based on synthetic instances and two real world datasets, we show that our approach outperforms a selection of well established evolutionary and non-evolutionary clustering algorithms.

show abstract

Smoothed Analysis of the k-Means Method

Cited by 74 publications

References 23 publications

Local Search Yields Approximation Schemes for k-Means and k-Median in Euclidean and Minor-Free Metrics

Local Search Yields Approximation Schemes for k-Means and k-Median in Euclidean and Minor-Free Metrics

Identification of nonlinear behavior with clustering techniques in car crash simulations for better model reduction

A traffic-based evolutionary algorithm for network clustering

Contact Info

Product

Resources

About