2016
DOI: 10.1002/2016wr019034
|View full text |Cite
|
Sign up to set email alerts
|

Spatial downscaling of precipitation using adaptable random forests

Abstract: This paper introduces Prec‐DWARF (Precipitation Downscaling With Adaptable Random Forests), a novel machine‐learning based method for statistical downscaling of precipitation. Prec‐DWARF sets up a nonlinear relationship between precipitation at fine resolution and covariates at coarse/fine resolution, based on the advanced binary tree method known as Random Forests (RF). In addition to a single RF, we also consider a more advanced implementation based on two independent RFs which yield better results for extre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
108
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 174 publications
(128 citation statements)
references
References 70 publications
(70 reference statements)
1
108
0
Order By: Relevance
“…Random forests are robust against overfitting (i.e., explaining the training data instead of finding general patterns) even in case of small data sets (Breiman, ). Applications of random forests in hydrology include empirical simulation of monthly streamflow (Shortridge et al, ), downscaling of precipitation data (He et al, ), and evaluation of flood hazard risk (Wang et al, ). In this study, we used the random forest implementation in the R statistical software (package “randomForest”) to quantify the prediction strength, expressed as variable importance, of the following predictors for F yw , α , and β : mean catchment slope, median surface flow path length, catchment area, percentage of agricultural land, forest and urban areas in 2012, soil fractions of sand, silt, and clay, and the mean values of annual precipitation, annual PET, annual runoff coefficient (determined from modeled discharge), and daily baseflow index (determined from event separation of modeled discharge) between 2013 and 2015.…”
Section: Methodsmentioning
confidence: 99%
“…Random forests are robust against overfitting (i.e., explaining the training data instead of finding general patterns) even in case of small data sets (Breiman, ). Applications of random forests in hydrology include empirical simulation of monthly streamflow (Shortridge et al, ), downscaling of precipitation data (He et al, ), and evaluation of flood hazard risk (Wang et al, ). In this study, we used the random forest implementation in the R statistical software (package “randomForest”) to quantify the prediction strength, expressed as variable importance, of the following predictors for F yw , α , and β : mean catchment slope, median surface flow path length, catchment area, percentage of agricultural land, forest and urban areas in 2012, soil fractions of sand, silt, and clay, and the mean values of annual precipitation, annual PET, annual runoff coefficient (determined from modeled discharge), and daily baseflow index (determined from event separation of modeled discharge) between 2013 and 2015.…”
Section: Methodsmentioning
confidence: 99%
“…Ensemble learning has been applied in many disciplines and areas and displays advantages over traditional algorithms (Deville et al, ; X. He et al, ; Stevens et al, ). Among the ensemble learning methods, RF and GB are widely used.…”
Section: Study Area and Methodsmentioning
confidence: 99%
“…Machine learning explores the relation between the response and its relevant predictors using one or multiple algorithms, with no need to consider the explicit mathematical form of the model (Elith et al, ; X. He et al, ). Examples of machine learning algorithms include nearest neighbor (Cover & Hart, ), naïve Bayes (Jensen, ; Lewis, ), decision trees (Breiman et al, ), support vector machines (SVMs; Vapnik, ), and artificial neural networks (ANNs) (Hopfield, ).…”
Section: Introductionmentioning
confidence: 99%
“…This method is suitable for both regression and classification problems. Due to randomized and decorrelated features of RF, it is able to build the connection between the input and output variables when their relationship is very complex and nonlinear ( He et al 2016, Hong et al 2016.…”
Section: Random Forestmentioning
confidence: 99%