2013
DOI: 10.12785/amis/071l50
|View full text |Cite
|
Sign up to set email alerts
|

Classification for Imbalanced and Overlapping Classes Using Outlier Detection and Sampling Techniques

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(16 citation statements)
references
References 1 publication
0
16
0
Order By: Relevance
“…On the contrary, adding synthetic instances to the minority class (SMOTE) may introduce noise along the borderlines, leading to the loss of activity cliffs and mislabeling of the synthetic instances [ 66 ]. The ENN algorithm may effectively remove those synthetic outliers and restore the activity cliffs and class boundaries, leading to enhanced prediction performance for SMOTEENN (SMN) [ 67 ]…”
Section: Resultsmentioning
confidence: 99%
“…On the contrary, adding synthetic instances to the minority class (SMOTE) may introduce noise along the borderlines, leading to the loss of activity cliffs and mislabeling of the synthetic instances [ 66 ]. The ENN algorithm may effectively remove those synthetic outliers and restore the activity cliffs and class boundaries, leading to enhanced prediction performance for SMOTEENN (SMN) [ 67 ]…”
Section: Resultsmentioning
confidence: 99%
“…In addition to the PRISM algorithm [51], other outlier and/or noise detection methods have been recently introduced demonstrating their ability to improve the prediction accuracy of machine learning models [53,54]. For example, Byeon et al introduced a novel technique to enhance the quality of training data with a noisy dependent variable for binary classification [53].…”
Section: Activity-cliff-related Medicinal Chemistry Task Approach Refsmentioning
confidence: 99%
“…More recently, based on the borderline noise factor, Yang and Gao applied datacleaning techniques to remove the classification borderline noise. This work compared three under-sampling methods to select the representative majority class examples and remove the distant samples, which are useless to form the decision boundary [54]. The experimental results on bench datasets showed that the proposed method can effectively improve the classification accuracy of minority classes while achieving better overall classification.…”
Section: Activity-cliff-related Medicinal Chemistry Task Approach Refsmentioning
confidence: 99%
See 1 more Smart Citation
“…Another reason of overlapping is because the samples in both classes share almost that same value of attributes. Such overlapping causes difficulties for a classifier to classify the samples and may eventually lead to a low classification rate [49], [50]. In the next section, solutions to the unbalanced class distribution and the overlapping class samples shall be discussed based on the previous work conducted by other researchers.…”
Section: Overlapping Of Classesmentioning
confidence: 99%