2022
DOI: 10.1016/j.simpa.2022.100272
|View full text |Cite
|
Sign up to set email alerts
|

K Nearest Neighbor OveRsampling approach: An open source python package for data augmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 14 publications
0
5
0
2
Order By: Relevance
“…The Naive Bayes method has the advantage of not requiring a large amount of training data to determine the estimated parameters needed in the classification process, which makes the classification process more effective and efficient [25] [26]. While the K-Nearest Neighbor method is easier to implement, experiments with this method show that it can provide good performance for independent data (which does not have word dependence) [27].…”
Section: Resultsmentioning
confidence: 99%
“…The Naive Bayes method has the advantage of not requiring a large amount of training data to determine the estimated parameters needed in the classification process, which makes the classification process more effective and efficient [25] [26]. While the K-Nearest Neighbor method is easier to implement, experiments with this method show that it can provide good performance for independent data (which does not have word dependence) [27].…”
Section: Resultsmentioning
confidence: 99%
“…The time taken by the KNNOR method is more compared to the SMOTE method, as has been showcased in the works of Islam et al [32]. This is because KNNOR comprises a pre-augmentation step to calculate the optimized distance and proportion of the population used while oversampling.…”
Section: A Limitationmentioning
confidence: 99%
“…As multiple neighbors are used to generating a single point, the time taken to create a new point is also impacted by the number of neighbors used. This additional time is justified by the greater accuracy achieved by the proposed method as shown in Table 1 for image data and in the works of [13], [32] for tabular data.…”
Section: A Limitationmentioning
confidence: 99%
“…Data hadir dalam jumlah besar, tetapi masalah kumpulan data yang tidak seimbang muncul berulang kali, mengganggu pengklasifikasi dan mengurangi akurasi [12]. Augmentasi data adalah proses memodifikasi atau memanipulasi suatu citra sehingga citra asli dalam bentuk yang telah disiapkan berubah bentuk dan posisinya.…”
Section: Data Augmentasiunclassified