2020
DOI: 10.1016/j.procs.2020.07.018
|View full text |Cite
|
Sign up to set email alerts
|

Unbalanced data processing using oversampling: Machine Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(15 citation statements)
references
References 16 publications
0
13
0
2
Order By: Relevance
“…As one may observe, there were only 2 lane detection errors; for vehicle classification, in turn, there was a bias towards classifying vehicles as cars. We tried to circumvent this problem using techniques for dealing with unbalanced training sets (i.e., training sets in which classes have distinct frequencies), such as oversampling the least frequent classes [28], but the results did not change significantly.…”
Section: Resultsmentioning
confidence: 99%
“…As one may observe, there were only 2 lane detection errors; for vehicle classification, in turn, there was a bias towards classifying vehicles as cars. We tried to circumvent this problem using techniques for dealing with unbalanced training sets (i.e., training sets in which classes have distinct frequencies), such as oversampling the least frequent classes [28], but the results did not change significantly.…”
Section: Resultsmentioning
confidence: 99%
“…Explicitly, only 950 Eurobarometer survey participants admitted their participation in the undeclared economy from the supply side, which introduces a substantial risk of models being biased towards the negative outcome. 7 To address this issue, during the training phase, we applied the random oversampling scheme with weights inversely proportional to class frequencies (see Fernández Hilario et al 2018 ; Viloria et al 2020 ).…”
Section: Methodsmentioning
confidence: 99%
“…The imbalanced proportion of normal and DR images in Big Data has been identi ed as one of the main challenges for the algorithms. This can commonly cause over tting problems [11], as there is a high performance of DR grading in training data, but low performance in the testing data.…”
Section: A Datasetmentioning
confidence: 99%