2021
DOI: 10.1007/s12530-021-09408-y
|View full text |Cite
|
Sign up to set email alerts
|

EvolveCluster: an evolutionary clustering algorithm for streaming data

Abstract: Data has become an integral part of our society in the past years, arriving faster and in larger quantities than before. Traditional clustering algorithms rely on the availability of entire datasets to model them correctly and efficiently. Such requirements are not possible in the data stream clustering scenario, where data arrives and needs to be analyzed continuously. This paper proposes a novel evolutionary clustering algorithm, entitled EvolveCluster, capable of modeling evolving data streams. We compare E… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 49 publications
0
8
0
Order By: Relevance
“…EvolveCluster [1] is a sequential data stream clustering algorithm capable of modeling consistent and changing behaviors in a data stream. It builds upon dividing a stream into specified segments, or windows, and clusters them sequentially as they arrive.…”
Section: Ms-ec Modulesmentioning
confidence: 99%
See 1 more Smart Citation
“…EvolveCluster [1] is a sequential data stream clustering algorithm capable of modeling consistent and changing behaviors in a data stream. It builds upon dividing a stream into specified segments, or windows, and clusters them sequentially as they arrive.…”
Section: Ms-ec Modulesmentioning
confidence: 99%
“…This paper proposes a novel distributed multi-stream (multi-view) clustering algorithm entitled MultiStream EvolveCluster (MS-EC). A multi-source extension of our previously introduced algorithm EvolveCluster [1]. We design and conduct several experiments to investigate how the MS-EC algorithms perform, using synthetic and real-world data, multiple consensus clustering algorithms, and against a baseline algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…There exist several data stream clustering algorithms in the literature [12][13][14][15][16][17]. Most of the data stream clustering algorithms use a two-phase approach [16].…”
Section: Related Workmentioning
confidence: 99%
“…Cluster Mapping Measure [42] is another data stream clustering validation metric. However, all the aforementioned metrics are for clustering algorithms that process streaming instances individually [15]. On the contrary, EmCStream processes the data stream window by window.…”
Section: Metricsmentioning
confidence: 99%
“…1) dynamic unsupervised and semi-supervised learning models that are robust to the appearance of drifting context and additionally enable to learn from multiple data sources by distributed training, and continual updating and evolving of the model [4], [6], [7], [8]; 2) development of dynamic techniques for automatic annotation (labeling) of the data; 3) usage of transfer learning techniques enabling reuse of knowledge from training in earlier tasks to subsequent tasks. The other research ambition of DAISeN is the design of distributed/composable data mining models.…”
Section: Research Objectivesmentioning
confidence: 99%