2022
DOI: 10.3390/su142114398
|View full text |Cite
|
Sign up to set email alerts
|

Customized Instance Random Undersampling to Increase Knowledge Management for Multiclass Imbalanced Data Classification

Abstract: Imbalanced data constitutes a challenge for knowledge management. This problem is even more complex in the presence of hybrid (numeric and categorical data) having missing values and multiple decision classes. Unfortunately, health-related information is often multiclass, hybrid, and imbalanced. This paper introduces a novel undersampling procedure that deals with multiclass hybrid data. We explore its impact on the performance of the recently proposed customized naïve associative classifier (CNAC). The experi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 60 publications
0
1
0
Order By: Relevance
“…Such methods are not relevant for mixed and partial data. Because there are more instances in the dataset due to oversampling processes, the computational execution time of the decision-making algorithms increases [37]. Moreover, the biggest drawback of oversampling is that it increases the likelihood of overfitting by using identical replicas of previous cases.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Such methods are not relevant for mixed and partial data. Because there are more instances in the dataset due to oversampling processes, the computational execution time of the decision-making algorithms increases [37]. Moreover, the biggest drawback of oversampling is that it increases the likelihood of overfitting by using identical replicas of previous cases.…”
Section: Proposed Methodsmentioning
confidence: 99%