2018
DOI: 10.1186/s12863-018-0633-8
|View full text |Cite
|
Sign up to set email alerts
|

Using recursive feature elimination in random forest to account for correlated variables in high dimensional data

Abstract: BackgroundRandom forest (RF) is a machine-learning method that generally works well with high-dimensional problems and allows for nonlinear relationships between predictors; however, the presence of correlated predictors has been shown to impact its ability to identify strong predictors. The Random Forest-Recursive Feature Elimination algorithm (RF-RFE) mitigates this problem in smaller data sets, but this approach has not been tested in high-dimensional omics data sets.ResultsWe integrated 202,919 genotypes a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
114
0
3

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 256 publications
(117 citation statements)
references
References 16 publications
0
114
0
3
Order By: Relevance
“…To select the optimal subset of clinical features, an RFE processed with 10-fold cross-validation was used to select the best subset. The RFE could eliminate the redundant and irrelative information from the COVID-19 patients and enhance the performance of the RF classification model ( Darst, Malecki & Engelman, 2018 ). The results selected 11 clinical characteristics, Myo, CD8, age, LDH, LMR, CD45, Th/Ts, dyspnea, NLR, D-Dimer and CK with the highest accuracy at 0.9905 ( Fig.…”
Section: Resultsmentioning
confidence: 99%
“…To select the optimal subset of clinical features, an RFE processed with 10-fold cross-validation was used to select the best subset. The RFE could eliminate the redundant and irrelative information from the COVID-19 patients and enhance the performance of the RF classification model ( Darst, Malecki & Engelman, 2018 ). The results selected 11 clinical characteristics, Myo, CD8, age, LDH, LMR, CD45, Th/Ts, dyspnea, NLR, D-Dimer and CK with the highest accuracy at 0.9905 ( Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Recursive feature elimination (RFE) is a method that follows an iterative procedure where features are ranked based upon their importance in classifying the training set and then the feature-(s) with smallest ranking criterion is(are) removed. The RFE was implemented from Support Vector Machines 28 or Random Forest 29 algorithms. Besides that, both algorithms can be used to rank the feature importance and also remove correlated predictors.…”
Section: (Ii) Wrapper Methodsmentioning
confidence: 99%
“…The Recursive Feature Elimination combined with Random Forest (RFE-RF) classification has been reported as a promising technique to effectively handling the fusion of diverse data sources, such as HS and LiDAR data, at the same time generating unbiased and stable classification results in different application fields [64][65][66][67][68]. One of the objectives of this paper was therefore to develop an RFE-RF system, capable of combining the potentiality of RF classification and RFE feature selection, at the same time automatizing the analysis of HS and ALS data for our habitats mapping purpose.…”
Section: Feature Selection and The Recursive Feature Elimination-randmentioning
confidence: 99%