2021
DOI: 10.1007/s10115-021-01560-w
|View full text |Cite
|
Sign up to set email alerts
|

The impact of data difficulty factors on classification of imbalanced and concept drifting data streams

Abstract: Class imbalance introduces additional challenges when learning classifiers from concept drifting data streams. Most existing work focuses on designing new algorithms for dealing with the global imbalance ratio and does not consider other data complexities. Independent research on static imbalanced data has highlighted the influential role of local data difficulty factors such as minority class decomposition and presence of unsafe types of examples. Despite often being present in real-world data, the interactio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(33 citation statements)
references
References 78 publications
0
29
0
Order By: Relevance
“…To evaluate the classifiers in specific and controlled scenarios, we prepared data streams generators under different imbalanced and drifting settings. Nine generators in MOA (Bifet et al, 2010b) plus one generator proposed by Brzeziński (Brzeziński et al, 2021) were used. Those generators are presented in Table 3, with their number of attributes, classes, and whether they can generate internal concept drifts.…”
Section: Generatorsmentioning
confidence: 99%
See 3 more Smart Citations
“…To evaluate the classifiers in specific and controlled scenarios, we prepared data streams generators under different imbalanced and drifting settings. Nine generators in MOA (Bifet et al, 2010b) plus one generator proposed by Brzeziński (Brzeziński et al, 2021) were used. Those generators are presented in Table 3, with their number of attributes, classes, and whether they can generate internal concept drifts.…”
Section: Generatorsmentioning
confidence: 99%
“…Goal of the experiment. This experiment addresses RQ3 and evaluates the robustness of the classifiers to instance-level difficulties (Brzeziński et al, 2021). We evaluated the Brzeziński generator with borderline or rare instances, and combining both at the same time.…”
Section: Instance-level Difficultiesmentioning
confidence: 99%
See 2 more Smart Citations
“…There are already many papers dealing with the challenge of class imbalance for the classification task [10,19,20]. Many difficulty factors for this problem have been investigated [10,20] and various solutions proposed [21]. However, none of these solutions is based on hypergraph modeling of categorical features and focused on optimizing the random undersampling.…”
Section: Related Workmentioning
confidence: 99%