2009
DOI: 10.1590/s0101-74382009000300002

A branch-and-cut SDP-based algorithm for minimum sum-of-squares clustering

Abstract: Minimum sum-of-squares clustering (MSSC) consists in partitioning a given set of n points into k clusters in order to minimize the sum of squared distances from the points to the centroid of their cluster. Recently, Peng & Xia (2005) established the equivalence between 0-1 semidefinite programming (SDP) and MSSC. In this paper, we propose a branch-and-cut algorithm for the underlying 0-1 SDP model. The algorithm obtains exact solutions for fairly large data sets with computing times comparable with those of t…
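For reference, the objective described in the abstract can be written out explicitly; the notation below is my own shorthand and is not taken verbatim from the paper. Given points p_1, …, p_n and a partition into clusters C_1, …, C_k,

\min_{C_1,\dots,C_k} \; \sum_{j=1}^{k} \sum_{p_i \in C_j} \lVert p_i - c_j \rVert^2,
\qquad \text{where } c_j = \frac{1}{\lvert C_j \rvert} \sum_{p_i \in C_j} p_i,

i.e., each point is charged the squared Euclidean distance to the centroid of its own cluster.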

Cited by 14 publications (22 citation statements)
References 28 publications
“…So, we do not refer to its results in the subsequent tables. As empirically observed in [2], the performance of algorithm bb-sdp deteriorates as k decreases, in contrast with…”
Section: Results in the Plane
confidence: 73%
“…The results are also compared to those of two other methods proposed in the literature, i.e., the repetitive branch-and-bound algorithm (rbba) of Brusco [9] and the best branch-and-cut SDP-based algorithm (bb-sdp) of [2]. Tables 2-7 show results for data sets in the plane.…”
Section: Results in the Plane
confidence: 99%
“…In such a problem, each object can be considered as a point in an n-dimensional space and each cluster can be identified by its center, called centroid, a non-observable object calculated by taking the mean of all the objects assigned to this cluster [18,32,34]. To express similarity between objects, i.e., homogeneity inside a cluster, several similarity measures have been proposed, such as a metric defined on the data set [2,7]. One of the most used (dis)similarity measures is the squared Euclidean distance [3,4,14,18,34].…”
Section: Introduction
confidence: 99%
“…Most of the sophisticated computation on clustering problems has addressed the MSS; see, e.g., the most recent exact methods (Aloise and Hansen, ; Aloise et al, ). We do not discuss these approaches in detail because (i) MSS is computationally rather different from RASS and (ii) we focus here on spectral bounds (for which there is not much computation in the MSS special case).…”
Section: Introduction
confidence: 99%
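The introduction excerpt above describes the two ingredients of MSSC informally: the centroid as the mean of the objects assigned to a cluster, and the squared Euclidean distance as the dissimilarity measure. The following is a minimal sketch of those two notions and of the resulting objective value; the function names and the NumPy-based implementation are my own illustration and do not come from any of the cited papers.

import numpy as np

def centroid(points: np.ndarray) -> np.ndarray:
    # Centroid of a cluster: the (non-observable) mean of the objects assigned to it.
    return points.mean(axis=0)

def squared_euclidean(x: np.ndarray, y: np.ndarray) -> float:
    # Squared Euclidean distance, the dissimilarity measure used in MSSC.
    d = x - y
    return float(np.dot(d, d))

def mssc_objective(points: np.ndarray, labels: np.ndarray) -> float:
    # Sum of squared distances from each point to the centroid of its cluster.
    total = 0.0
    for c in np.unique(labels):
        cluster = points[labels == c]
        center = centroid(cluster)
        total += sum(squared_euclidean(p, center) for p in cluster)
    return total

# Example: five points in the plane partitioned into k = 2 clusters.
pts = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0], [6.0, 5.0]])
print(mssc_objective(pts, np.array([0, 0, 0, 1, 1])))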