Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2003
DOI: 10.1145/956750.956778
|View full text |Cite
|
Sign up to set email alerts
|

Mining concept-drifting data streams using ensemble classifiers

Abstract: Recently, mining data streams with concept drifts for actionable insights has become an important and challenging task for a wide range of applications including credit card fraud protection, target marketing, network intrusion detection, etc. Conventional knowledge discovery tools are facing two challenges, the overwhelming volume of the streaming data, and the concept drifts. In this paper, we propose a general framework for mining concept-drifting data streams using weighted ensemble classifiers. We train a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
444
0
5

Year Published

2006
2006
2017
2017

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 787 publications
(449 citation statements)
references
References 21 publications
0
444
0
5
Order By: Relevance
“…In a concept-drifting data stream, a new member of the ensemble family is built on a chunk of recent data samples and an outdated member is removed. By assigning weights to ensemble members depending on the estimated error rate, concept drift can be dealt with [15]. In [16], two ensemble methods, bagging by ADWIN (ADaptive WINdowing) [14] and bagging by adaptive-size Hoeffding trees, are proposed.…”
Section: Related Workmentioning
confidence: 99%
“…In a concept-drifting data stream, a new member of the ensemble family is built on a chunk of recent data samples and an outdated member is removed. By assigning weights to ensemble members depending on the estimated error rate, concept drift can be dealt with [15]. In [16], two ensemble methods, bagging by ADWIN (ADaptive WINdowing) [14] and bagging by adaptive-size Hoeffding trees, are proposed.…”
Section: Related Workmentioning
confidence: 99%
“…Multivariate time series data classification methods were studied in [4,5,6,7,8], including On-demand Classifier [4], HMM (Hidden Markov Models) [5], RNN (Recurrent Neural Network), Dynamic Time Warping [5], weighted ensemble classifier [6] and SAX [7]. These methods involve large numbers of parameters and complex preprocessing step that need to be tuned.…”
Section: Related Workmentioning
confidence: 99%
“…So, supervised classification techniques suffer from the scarcity of labeled data for learning, resulting in a poorly built classifier. Most existing data stream classification techniques address only the infinite length, and concept-drift problems [1][2][3]. Our previous work MineClass [4] addresses the concept-evolution problem in addition to the infinite length and conceptdrift problems.…”
Section: Introductionmentioning
confidence: 99%
“…In order to cope with concept-drift, a classifier must be continuously updated to be consistent with the most recent concept. ActMiner applies a hybrid batch-incremental process [2,5] to solve the infinite length and concept-drift problems. It divides the data stream into equal sized chunks and trains a classification model from each chunk.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation