2016
DOI: 10.1139/cjfr-2014-0562
|View full text |Cite
|
Sign up to set email alerts
|

Random forests and stochastic gradient boosting for predicting tree canopy cover: comparing tuning processes and model performance

Abstract: 1As part of the development of the 2011 National Land Cover Database (NLCD) tree canopy 2 cover layer, a pilot project was launched to test the use of high resolution photography coupled 3 with extensive ancillary data to map the distribution of tree canopy cover over four study regions 4 in the conterminous US. Two stochastic modeling techniques, Random Forests (RF) and 5Stochastic Gradient Boosting (SGB), are compared. The objectives of this study were first to 6 explore the sensitivity of RF and SGB to choi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
114
1

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 145 publications
(117 citation statements)
references
References 34 publications
2
114
1
Order By: Relevance
“…Consequently, there were a total of 1350 samples with 70% of them used as a training dataset and 30% used for validation. Similar to other stochastic techniques, the RF model may introduce uncertainty to the final predictions since model runs can include highly correlated variables and results in spurious variation [74,75]. To reduce model uncertainty, multicollinearity among predictors was first examined using a variance inflation factor (VIF).…”
Section: Modeling Post-fire Forest Patternmentioning
confidence: 99%
See 1 more Smart Citation
“…Consequently, there were a total of 1350 samples with 70% of them used as a training dataset and 30% used for validation. Similar to other stochastic techniques, the RF model may introduce uncertainty to the final predictions since model runs can include highly correlated variables and results in spurious variation [74,75]. To reduce model uncertainty, multicollinearity among predictors was first examined using a variance inflation factor (VIF).…”
Section: Modeling Post-fire Forest Patternmentioning
confidence: 99%
“…All proposed predictor variables (Table 2) were independent and thus included in the RF model. Second, the process of RF model tuning [74] was applied to the 70% training samples where the ntrees (i.e., number of trees) parameter in the model was selected based on building different RF models with increasing numbers of trees ranging from 100 to 2000 in 100 tree increments. Based on the 30% testing dataset, the numbers of trees that stabilized the role of predictor variables in determining response variables was then selected as an optimal ntrees parameter in the final RF model.…”
Section: Modeling Post-fire Forest Patternmentioning
confidence: 99%
“…The weaknesses of the RF method are that the decision rules are unknown (black box), it is computationally intense and it requires input parameters [69,70]. We used RF regression tree implemented through the R package ModelMap [71] by calling the R package random forest [72] to identify important predictor variables, to model the relationship between the predicted variables and AGB, and to apply the model over the study area for mapping AGB based on single or composited Landsat surface reflectance images and mosaic PALSAR data.…”
Section: Rf Modeling and Implementationmentioning
confidence: 99%
“…This approach was previously used for sub-pixel imperviousness assessment based on Landsat (Walton, 2008); (Bernat and Drzewiecki, 2014) and MODIS images (Deng and Wu, 2013;Tsutsumida, 2016). It was also chosen by U.S. Forest Service to produce the 2011 NLCD percent tree canopy cover dataset (Freeman et al, 2016).…”
Section: Machine Learning Algorithmsmentioning
confidence: 99%
“…In case of stochastic gradient boosting, the models in each iteration are based only on the randomly selected fraction of training data (Kuhn and Johnson, 2013). This method was used to estimate aboveground biomass based on regression with Landsat and SPOT or ALOS PALSAR derived predictors (Carreiras, 2013) and predicting tree canopy cover based on Landsat-5 images (Freeman, 2016). To my best knowledge, it has not been used for sub-pixel imperviousness evaluation so far.…”
Section: Machine Learning Algorithmsmentioning
confidence: 99%