2009 IEEE Symposium on Computational Intelligence and Data Mining 2009
DOI: 10.1109/cidm.2009.4938664
|View full text |Cite
|
Sign up to set email alerts
|

An architecture and algorithms for multi-run clustering

Abstract: Abstract-This paper addresses two main challenges for clustering which require extensive human effort: selecting appropriate parameters for an arbitrary clustering algorithm and identifying alternative clusters. We propose an architecture and a concrete system MR-CLEVER for multi-run clustering that integrates active learning with clustering algorithms. The key hypothesis of this work is that better clustering results can be obtained by combining clusters that originate from multiple runs of clustering algorit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2009
2009
2014
2014

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 15 publications
(15 reference statements)
0
10
0
Order By: Relevance
“…For this experiment we used a water well dataset called Arsenic_10_avg [9] which was created from a database provided by the Texas Water Development Board (TWDB) [10]. In this paper, we use a subset of this dataset containing 3 spatial attributes longitude, latitude and aquifer and the following 8 chemical concentrations for each water well: Arsenic (As), Molybdenum (Mo), Vanadium (V), Boron (B), Fluoride (F -), Chloride (Cl -), Sulfate (SO 4 2-) and Total Dissolved Solids (TDS). We used CLEVER (CLustEring using representatives and Randomized hill climbing), introduced in [5], as the clustering algorithm in the experiments.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…For this experiment we used a water well dataset called Arsenic_10_avg [9] which was created from a database provided by the Texas Water Development Board (TWDB) [10]. In this paper, we use a subset of this dataset containing 3 spatial attributes longitude, latitude and aquifer and the following 8 chemical concentrations for each water well: Arsenic (As), Molybdenum (Mo), Vanadium (V), Boron (B), Fluoride (F -), Chloride (Cl -), Sulfate (SO 4 2-) and Total Dissolved Solids (TDS). We used CLEVER (CLustEring using representatives and Randomized hill climbing), introduced in [5], as the clustering algorithm in the experiments.…”
Section: Resultsmentioning
confidence: 99%
“…Our clustering approach to cope with multi-objective problem is similar to the ensemble clustering approach as it is based on the key hypothesis that better clustering results can be obtained by combining clusters that originate from multiple runs of clustering algorithms [4]. However, our approach is an incremental approach that collects and refines clusters on the fly, and the search for alternative clusters takes into consideration what clusters already have been generated, rewarding novelty.…”
Section: Introductionmentioning
confidence: 99%
“…The result of voice services using segmentation and data services using segmentation among customers in Group 1are presented by figure1 and figure 2 [3] [5]. We conduct a further segmentation among customers in Group 1 as follows mainly in accordance with the consuming characteristics, which covers 98% of the total population while the rest has no obvious characteristics.…”
Section: B Maintaining the Integrity Of The Specificationsmentioning
confidence: 99%
“…To further demonstrate the advantages of our mechanism, several simulation scenarios are designed in OPNET sim ulation platform [13]. As hybird routing protocols are most suitable for the large scale network with high node density, the principle of the scenario design is to compare the whole With simulations of different number of nodes, we have proved that in random moving and big scale scenarios, C AHR performs better in end-to-end latency, routing overhead, throughput and packet delivery ratio [14] [15].…”
Section: Simulat Ion and Analy Sismentioning
confidence: 99%