2020
DOI: 10.21512/comtech.v11i2.6452
|View full text |Cite
|
Sign up to set email alerts
|

Finding Biomarkers from a High-Dimensional Imbalanced Dataset Using the Hybrid Method of Random Undersampling and Lasso

Abstract: The research conducted undersampling and gene selection as a starting point for cancer classification in gene expression datasets with a high-dimensional and imbalanced class. It investigated whether implementing undersampling before gene selection gave better results than without implementing undersampling. The used undersampling method was Random Undersampling (RUS), and for gene selection, it was Lasso. Then, the selected genes based on theory were validated. To explore the effectiveness of applying RUS bef… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 26 publications
0
2
0
Order By: Relevance
“…Research [22] used the hybrid RUS and Lasso methods to find biomarkers in gene expression data that have imbalanced classes. In that study, two data were used, i.e.…”
Section: Problem With Imbalanced Classmentioning
confidence: 99%
See 1 more Smart Citation
“…Research [22] used the hybrid RUS and Lasso methods to find biomarkers in gene expression data that have imbalanced classes. In that study, two data were used, i.e.…”
Section: Problem With Imbalanced Classmentioning
confidence: 99%
“…The purpose of this study is to examine the effectiveness of using undersampling before feature selection on high-dimensional data with imbalanced classes. The method used is the RUS-Lasso-CART hybrid method [22], [26]. For this purpose, we used 30 simulated datasets generated from real gene expression data parameters.…”
Section: Problem With Model Interpretationmentioning
confidence: 99%