2020
DOI: 10.11591/ijece.v10i4.pp4331-4339
|View full text |Cite
|
Sign up to set email alerts
|

Feature selection for multiple water quality status: Integrated bootstrapping and SMOTE approach in imbalance classes

Abstract: STORET is one method to determine the river water quality into four classes (very good , good, medium and bad) based on the data of water for each attribute or feature. The success of the formation of pattern recognition model much depends on the quality of data. There are two issues as the concern of this research as follows: the data having disproportionate amount among the classes (imbalance class) and the finding of noise on its attribute. Therefore, this research integrates the SMOTE Technique and bootstr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 24 publications
0
11
0
Order By: Relevance
“…In [6], the Author used SMOTE and boostrapping to handle unbalanced data. They used several feature selection methods with decision tree, k-NN and Bayes classification model.…”
Section: Introductionmentioning
confidence: 99%
“…In [6], the Author used SMOTE and boostrapping to handle unbalanced data. They used several feature selection methods with decision tree, k-NN and Bayes classification model.…”
Section: Introductionmentioning
confidence: 99%
“…This paper proposes and conducts a feature selection because the classifier can provide better classification results [8]. t-distributed stochastic neighbor embedding (t-SNE) is a non-linear technique to map multidimensional data to lower dimensional space [9].…”
Section: Feature Selectionmentioning
confidence: 99%
“…The reason why feature selection is effective in dealing with overlapping is because of its ability to eliminate uninformative predictors and reduce dimensionality of feature space [8]. However, on the other hand with the noise, the performance given by feature selection can decrease [9] and noise basically has an influence on classification performance [10]. Noise handling in general uses the method of resampling, but often encounters obstacles if there is a state of overlapping [6].…”
Section: Introductionmentioning
confidence: 99%