2022
DOI: 10.1109/tcyb.2020.3023973
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Sparse Subspace Clustering for Evolving High-Dimensional Data Streams

Abstract: In an era of ubiquitous large-scale evolving data streams, data stream clustering (DSC) has received lots of attention because the scale of the data streams far exceeds the ability of expert human analysts. It has been observed that high-dimensional data are usually distributed in a union of low-dimensional subspaces. In this article, we propose a novel sparse representation-based DSC algorithm, called evolutionary dynamic sparse subspace clustering (EDSSC). It can cope with the time-varying nature of subspace… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 66 publications
(141 reference statements)
0
5
0
Order By: Relevance
“…LRR is built on the premise of union sampling on multiple low rank linear subsets, and LRR also has the property of self-expression, that is, each sample is expressed in its own form. Unlike SSC, which selects the least number of features from multiple expressions, LRR aims to find the minimum rank of all data [14][15]. Therefore, compared with SSC, LRR can better control the overall structure of sample points, that is, by applying low rank constraints to the expression matrix, it aggregates data points with high correlation together, thereby achieving overall clustering.…”
Section: Lrr Algorithmmentioning
confidence: 99%
“…LRR is built on the premise of union sampling on multiple low rank linear subsets, and LRR also has the property of self-expression, that is, each sample is expressed in its own form. Unlike SSC, which selects the least number of features from multiple expressions, LRR aims to find the minimum rank of all data [14][15]. Therefore, compared with SSC, LRR can better control the overall structure of sample points, that is, by applying low rank constraints to the expression matrix, it aggregates data points with high correlation together, thereby achieving overall clustering.…”
Section: Lrr Algorithmmentioning
confidence: 99%
“…Moreover, these techniques are naturally adapted to the evolving environment of data streams. In addition, EDSSC introduces a subspace structure evolution detection model to detect the appearing, disappearing, and recurring subspaces [22]. In particular, it uses a singular-based Laplacian matrix decomposition to automatically estimate the number of subspaces.…”
Section: B Data Stream Clustering Techniquesmentioning
confidence: 99%
“…However, it is insufficient to characterize the intrinsic structures of high-dimensional data objects in data streams. This motivates the development of alternative techniques for exploiting the intrinsic characteristics of the highdimensional data objects [10], [22], [23], [23], [24]. For example, Krleža1 et al proposed a statistical hierarchical clustering (SHC) algorithm that uses statistical inference based on statistical distances to estimate statistical distributions on data streams [10].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Data stream mining is widely used in various fields such as weather forecasting, power prediction, stock trading, and network intrusion detection. In recent years, research on data streams has received considerable attention with the aim of improving the generalization performance of online learning models to adapt to the real-time distribution of data flow [1]. Since the data distribution of data streams may change over time, that is, there is concept drift, the model needs to be continuously updated to adapt to the new data stream environment [2].…”
Section: Introductionmentioning
confidence: 99%