2016
DOI: 10.21105/joss.00092
|View full text |Cite
|
Sign up to set email alerts
|

edarf: Exploratory Data Analysis using Random Forests

Abstract: SummaryThis package contains functions useful for exploratory data analysis using random forests, which can be fit using the randomForest, randomForestSRC, or party packages (Liaw and Wiener 2002;Ishwaran and Kogalur 2013;Hothorn, Hornik, and Zeileis 2006). These functions can compute the partial dependence of covariates (individually or in combination) on the fitted forests' predictions, the permutation importance of covariates, as well as the distance between data points according to the fitted model.Random … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
83
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 79 publications
(83 citation statements)
references
References 4 publications
(4 reference statements)
0
83
0
Order By: Relevance
“…Partial dependence plots allow RF models to be evaluated and to confirm how the explanatory variables are being used in the models for prediction (Jones and Linder, 2015). For the application presented here, there are general physical and chemical processes which should be confirmed in the RF models.…”
Section: Explaining the Observed Trendsmentioning
confidence: 91%
See 1 more Smart Citation
“…Partial dependence plots allow RF models to be evaluated and to confirm how the explanatory variables are being used in the models for prediction (Jones and Linder, 2015). For the application presented here, there are general physical and chemical processes which should be confirmed in the RF models.…”
Section: Explaining the Observed Trendsmentioning
confidence: 91%
“…This allows RF to produce predictive models which generalise well and predictive performance is generally considered among the best of any ML technique (Caruana and Niculescu-Mizil, 2006). RF also has the advantage of not being a "black-box" method (Jones and Linder, 2015 the few ML techniques where the learning process can be explained, investigated, and interpreted. In the case of artificial neural networks or kernel based learning methods, this is much more difficult to do (Kotsiantis, 2013;Tong et al, 2003).…”
Section: Decision Trees and Random Forestmentioning
confidence: 99%
“…Here, this advantage will be leveraged to 10 help explain some of the features in the PM 10 trends in Switzerland between 1997 and 2016. Partial dependence plots allow RF models to be evaluated and to confirm how the explanatory variables are being used in the models for prediction (Jones and Linder, 2015). For the application presented here, there are general physical and chemical processes which should be confirmed in the RF models.…”
Section: Explaining the Observed Trendsmentioning
confidence: 91%
“…This allows RF to produce predictive models 15 which generalise well and predictive performance is generally considered among the best of any ML technique (Caruana and Niculescu-Mizil, 2006). RF also has the advantage of not being a "black-box" method (Jones and Linder, 2015). Decision trees are one of the few ML techniques where the learning process can be explained, investigated, and interpreted.…”
Section: Machine Learningmentioning
confidence: 99%
“…For discussion on these important differences in conceptualisation see (Straus 2007;Finkel and Straus 2012) inants of state-sponsored atrocities. Our approach is similar to Hegre and Sambanis (2006) seminal analysis on the causes of civil war onset, but we provide additional tests to verify whether complex interactions and nonlinearities are driving the statistical results (Bell 2015;Jones and Linder 2015;Jones and Lupu 2018;Muchlinski et al 2015). In conducting this analysis, we address three debates in the mass violence literature:…”
mentioning
confidence: 99%