2012
DOI: 10.1007/s13748-011-0008-0
|View full text |Cite
|
Sign up to set email alerts
|

Learning from streaming data with concept drift and imbalance: an overview

Abstract: The primary focus of machine learning has traditionally been on learning from data assumed to be sufficient and representative of the underlying fixed, yet unknown, distribution. Such restrictions on the problem domain paved the way for development of elegant algorithms with theoretically provable performance guarantees. As is often the case, however, real-world problems rarely fit neatly into such restricted models. For instance class distributions are often skewed, resulting in the "class imbalance" problem.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
134
0
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 240 publications
(143 citation statements)
references
References 81 publications
(82 reference statements)
0
134
0
1
Order By: Relevance
“…Dynamical nature of data that arrive either in batches or online poses new challenges when imbalanced distributions are to be expected [26]. Whether dealing with stationary or evolving streams we require adaptive methods that are able to deal with skewed objects coming in real time.…”
Section: Learning From Imbalanced Data Streamsmentioning
confidence: 99%
“…Dynamical nature of data that arrive either in batches or online poses new challenges when imbalanced distributions are to be expected [26]. Whether dealing with stationary or evolving streams we require adaptive methods that are able to deal with skewed objects coming in real time.…”
Section: Learning From Imbalanced Data Streamsmentioning
confidence: 99%
“…One of the possible future directions in addressing this challenge is the utilisation of so-called ``anytime algorithms'' [170] that can learn from streaming data (e.g., time-dependent Bayesian classifiers) [171] and that still return a valuable result if their execution is interrupted at any time. Moreover, in the future, we will have access to more and more time series data.…”
Section: Challenges and Perspectivesmentioning
confidence: 99%
“…When concept drifts occur in a data stream, certain amount of labeled data samples are needed for training new supervised or semi-supervised learning models [3]. A request for labels on selected data samples will be made prior to the training.…”
Section: Delayed Labeling Problemmentioning
confidence: 99%
“…Examples of concept drifting data streams are weather data stream, financial data stream, and online-opinion data stream. Concept drifting data streams require the data mining framework to be able to detect changes in the stream, and adapt to them so that the learning model is kept upto-date [3]. Numerous studies have been done on designing such adapting data mining frameworks [4][5][6][7][8][9][10][11].…”
Section: Introductionmentioning
confidence: 99%