A robust and efficient clustering algorithm based on cohesion self-merging

Lin, Cheng-Ru; Chen, Ming-Syan

doi:10.1145/775047.775133

Cited by 18 publications

(11 citation statements)

References 15 publications

“…If it does not exist (empty network) or if it does not satisfy the condition of equation 1, a new node is added with $w_{new} = p(t)$. In the first layer, if the network is not empty the threshold is adjusted by the formula (2). Otherwise the threshold is infinite:…”

Section: Proposed Methods (mentioning)

confidence: 99%

“…Figure 2. 2D noisy artificial data set used for the experiment. As stated in [2] and [19], neither clustering algorithm can correctly partition such a data set nor eliminate noise from clusters.…”

Section: Proposed Methods (mentioning)

confidence: 99%

“…A salient advantage of unsupervised learning is their ability to automatically partition a set of data into clusters [1] without any prior knowledge of classes. Most partition clustering algorithms run in linear time and work on large data sets [2]. The K-means algorithm is a local search procedure; it depends heavily on the initial starting conditions.…”

Section: Introduction (mentioning)

confidence: 99%

An incremental parallel neural network for unsupervised classification

Hebboul, Hacini, Hachouf

2011

International Workshop on Systems, Signal Processing and Their Applications, WOSSPA

This paper presents a novel unsupervised and parallel learning technique, based on neural network approaches, for clustering data that are polluted by noise. The proposed approach is based on a self-organizing incremental neural network. The design of a two-layer neural network enables this system to represent the topological structure of unsupervised on-line data, report a reasonable number of clusters, and give typical prototype patterns of every cluster without prior conditions such as a suitable number of nodes. To confirm the efficiency of the proposed learning mechanism, we present a set of experiments with artificial and real-world data sets.

“…There are other fast algorithms designed for clustering large numerical data sets, such as CLARANS [24], BIRCH [25], DBSCAN [26], CURE [27], and CSM [28]. In addition, several approaches in [29][30][31] are proposed to solve the high dimensionality and data sparsity problems of numerical data.…”

Section: Related Work (mentioning)

confidence: 99%

Adherence clustering: an efficient method for mining market-basket clusters

Yun, Chuang, Chen

2006

Information Systems

Self Cite

“…Data clustering is a useful technique for many applications, including similarity search, pattern recognition, trend analysis, marketing analysis, grouping, classification of documents, and so forth [3][7] [10]. In data clustering, similar data points are grouped together in a cluster.…”

Section: Introduction (mentioning)

confidence: 99%

On the Techniques for Data Clustering with Numerical Constraints

Dai, Lin, Chen

2003

Proceedings of the 2003 SIAM International Conference on Data Mining

Self Cite

In this paper, the attributes employed to model the constraints are called constraint attributes, and the attributes involved in the objective function to be optimized are called cost-optimal attributes. The constrained clustering considered is conducted in such a way that the objective function of the cost-optimal attributes is optimized subject to the condition that the imposed constraint is satisfied. Explicitly, we address the problem of constrained clustering with numerical constraints, in which the constraint attribute values of any two data items in the same cluster are required to be within the corresponding constraint range. We devise an effective and efficient complete-link algorithm to solve this clustering problem. Because of the intrinsic nature of numerical constrained clustering, the process of attaining the clustering is order dependent, which in many cases degrades the clustering results. In view of this, we devise a progressive constraint relaxation technique to remedy this drawback and improve the overall quality of the clustering results. Explicitly, by using a smaller (tighter) constraint range in earlier merge iterations, we leave more room to relax the constraint and seek better solutions in subsequent iterations. It is empirically shown that the progressive constraint relaxation technique improves not only the execution efficiency but also the clustering quality.
