2023
DOI: 10.1002/qre.3398
|View full text |Cite
|
Sign up to set email alerts
|

Variable selection wrapper in presence of correlated input variables for random forest models

Marta Rotari,
Murat Kulahci

Abstract: In most data analytic applications in manufacturing, understanding the data‐driven models plays a crucial role in complementing the engineering knowledge about the production process. Identifying relevant input variables, rather than only predicting the response through some “black‐box” model, is of great interest in many applications. There is, therefore, a growing focus on describing the contributions of the input variables to the model in the form of “variable importance”, which is readily available in cert… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 49 publications
0
3
0
Order By: Relevance
“…Misai et al 5 mix parametric, non-parametric, and machine learning inference methods to optimize a maintenance policy for a partially observed multi-component process. The following two articles by Calzarossa et al 6 and Rotari et al 7 share the same objective of variable selection when using random forest for classification or regression. The paper by Calzarossa et al 6 focuses on the application of detecting phishing sites based on URL characteristics; variables are selected in two stages: first by a univariate exploratory analysis, then by a Lorenz Zonoid-based selection procedure.…”
mentioning
confidence: 99%
See 2 more Smart Citations
“…Misai et al 5 mix parametric, non-parametric, and machine learning inference methods to optimize a maintenance policy for a partially observed multi-component process. The following two articles by Calzarossa et al 6 and Rotari et al 7 share the same objective of variable selection when using random forest for classification or regression. The paper by Calzarossa et al 6 focuses on the application of detecting phishing sites based on URL characteristics; variables are selected in two stages: first by a univariate exploratory analysis, then by a Lorenz Zonoid-based selection procedure.…”
mentioning
confidence: 99%
“…The following two articles by Calzarossa et al 6 . and Rotari et al 7 . share the same objective of variable selection when using random forest for classification or regression.…”
mentioning
confidence: 99%
See 1 more Smart Citation