2020
DOI: 10.1186/s40537-020-00354-1
|View full text |Cite
|
Sign up to set email alerts
|

Learning in the presence of concept recurrence in data stream clustering

Abstract: In the case of real-world data streams, the underlying data distribution will not be static; it is subject to variation over time, which is known as the primary reason for concept drift. Concept drift poses severe problems to the accuracy of a model in online learning scenarios. The recurring concept is a particular case of concept drift where the concepts already seen in the past reappear as the stream evolves. This problem is not yet studied in the context of stream clustering. This paper proposes a novel al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…This type of drift may be evident and detectable after a while from the start point. ❼ Recurrent or recurring drift refers to the reoccurrence of some foregone distributions or concepts after a while (Namitha and Santhosh Kumar, 2020).…”
Section: Concept Driftmentioning
confidence: 99%
“…This type of drift may be evident and detectable after a while from the start point. ❼ Recurrent or recurring drift refers to the reoccurrence of some foregone distributions or concepts after a while (Namitha and Santhosh Kumar, 2020).…”
Section: Concept Driftmentioning
confidence: 99%
“…e algorithm uses too many parameters that are difficult to tune. Namitha [11] proposed a novel algorithm to identify recurring concepts in data stream clustering. If concept drift is detected, the algorithm retrieves the most matching model from the repository.…”
Section: Related Workmentioning
confidence: 99%
“…This concept is in conflict with the goal of detecting evolving cluster structures over time. A recent approach from [12] solves this issue by introducing a two-level architecture with an online component, which clusters the stream data, and an offline component, which saves recurring cluster structures (called concepts) for later use. A drawback stems from the lack of support for arbitrarily shaped clusters, which decreases the usability in applications where no prior information about shape of clusters and data distribution is known.…”
Section: State Of the Art In Data Stream Clustering And Cluster Tracking Applicationsmentioning
confidence: 99%