Spatial downscaling of precipitation using adaptable random forests

He, Xu; Chaney, Nathaniel W.; Schleiss, Marc; Sheffield, Justin

doi:10.1002/2016wr019034

Cited by 174 publications

(128 citation statements)

References 70 publications

(70 reference statements)


“…Random forests are robust against overfitting (i.e., explaining the training data instead of finding general patterns) even in case of small data sets (Breiman, ). Applications of random forests in hydrology include empirical simulation of monthly streamflow (Shortridge et al, ), downscaling of precipitation data (He et al, ), and evaluation of flood hazard risk (Wang et al, ). In this study, we used the random forest implementation in the R statistical software (package “randomForest”) to quantify the prediction strength, expressed as variable importance, of the following predictors for F_yw, α, and β: mean catchment slope, median surface flow path length, catchment area, percentage of agricultural land, forest and urban areas in 2012, soil fractions of sand, silt, and clay, and the mean values of annual precipitation, annual PET, annual runoff coefficient (determined from modeled discharge), and daily baseflow index (determined from event separation of modeled discharge) between 2013 and 2015.…”

Section: Methods (mentioning)

confidence: 99%

Spatial Patterns of Water Age: Using Young Water Fractions to Improve the Characterization of Transit Times in Contrasting Catchments

Lutz

Krieg

Müller

et al. 2018

Water Resources Research


Transit time distributions (TTDs) are crucial descriptors of flow and transport processes in catchments, which can be determined from stable water isotope data. Recently, the young water fraction (F_yw) has been introduced as an additional metric derivable from seasonal isotope cycles. In this study, we calculated F_yw and TTDs using monthly isotope data from 24 contrasting subcatchments in a mesoscale catchment (3,300 km²) in Germany. F_yw ranged from 0.01 to 0.27 (mean = 0.11) and was smallest in mountainous catchments. Assuming gamma-shaped TTDs, we determined stationary TTDs with the convolution integral method for each subcatchment. The convolution integral was first calibrated against the isotope data only (i.e., traditional calibration) and, second, using a multiobjective calibration with the F_yw estimates as an additional constraint. This yielded largely differing TTD parameters even for neighboring catchments, with F_yw values below 0.1 generally involving a delayed peak in TTDs (i.e., gamma-distribution shape parameter > 1). While the traditional calibration resulted in large uncertainties in TTD parameters, these uncertainties were reduced with the multiobjective calibration, thereby improving the assessment of mean transit times (2 years on average, ranging between 9.6 months and 5.6 years). This highlights the need for uncertainty assessment when using simple isotope models and shows that the traditional calibration might not yield an optimum solution in that it may give a TTD nonconsistent with F_yw. Given the robustness of F_yw estimates, isotope models should thus aim at accurately describing both F_yw and measured isotope data in order to improve the description of flow and transport in catchments.

Plain Language Summary: Information on the age of river water is crucial for assessing the vulnerability of rivers to weather extremes and pollution. The age of river water is defined as the time that water has spent underground after rainfall infiltration and before ending up in the river. The probability distribution of river water age can be determined using environmental tracers, which are tracers that naturally occur in the system such as stable water isotopes. In this study, we used isotope models to analyze time series of stable water isotopes in rainfall and streamwater measured in 24 subcatchments of the Bode catchment in central Germany. We found that the mean age of river water ranges between 9.6 months and 5.6 years depending on catchment characteristics such as climate and soil type. Moreover, river water with an age of below 2 to 3 months accounts for between 1% and 27% of the entire age distribution. We demonstrate how to use this information on young river water to constrain other metrics such as the mean water age. We suggest that this method is valuable for future studies using environmental tracers and models to determine water age in catchments.


“…Ensemble learning has been applied in many disciplines and areas and displays advantages over traditional algorithms (Deville et al, ; X. He et al, ; Stevens et al, ). Among the ensemble learning methods, RF and GB are widely used.…”

Section: Study Area and Methods (mentioning)

confidence: 99%

“…Machine learning explores the relation between the response and its relevant predictors using one or multiple algorithms, with no need to consider the explicit mathematical form of the model (Elith et al, ; X. He et al, ). Examples of machine learning algorithms include nearest neighbor (Cover & Hart, ), naïve Bayes (Jensen, ; Lewis, ), decision trees (Breiman et al, ), support vector machines (SVMs; Vapnik, ), and artificial neural networks (ANNs) (Hopfield, ).…”

Section: Introduction (mentioning)

confidence: 99%

Water Resources Assessment of China's Transboundary River Basins Using a Machine Learning Approach

Yan

Jia

2019

Water Resources Research


A comprehensive and reliable assessment of the water resources in China's transboundary river basins is vital for water resources management and peaceful development. In this study, we built machine learning (random forest, gradient boosting, and stacking) and traditional linear models to identify the relation between the runoff coefficient and its influencing factors, including topography, climate, land cover, and soil. The cross‐validation results show that the machine learning models greatly outperform the traditional linear model in predicting runoff coefficient. High‐resolution (0.1°) runoff coefficient and runoff maps for the riparian countries of China's transboundary river basins were produced and compared with other estimates at the country level. The best water resources estimates achieved from the machine learning model are consistent with the Food and Agriculture Organization of the United Nations AQUASTAT database (root‐mean‐square error = 76.97 km³/year, normalized root‐mean‐square error = 12%) at the country level. This outperformed two currently available runoff products: the UNH/GRDC Global Composite Runoff Fields and the Global Streamflow Characteristics Dataset. The study also demonstrated that accurate precipitation data can improve runoff and water resources estimation accuracy and that climate and topographic factors have a controlling role in prediction, whereas the influences of land cover and soils are weak. Finally, China's transboundary water resources were calculated and thoroughly assessed at basin and country levels.


“…This method is suitable for both regression and classification problems. Due to randomized and decorrelated features of RF, it is able to build the connection between the input and output variables when their relationship is very complex and nonlinear (He et al 2016, Hong et al 2016).…”

Section: Random Forest (mentioning)

confidence: 99%

Leveraging machine learning for predicting flash flood damage in the Southeast US

Alipour

Ahmadalipour

Abbaszadeh

et al. 2020

Environ. Res. Lett.


Flash flooding is a recurrent natural hazard with substantial impacts in the Southeast US (SEUS) due to the frequent torrential rainfalls that occur in the region, which are triggered by tropical storms, thunderstorms, and hurricanes. Flash floods are costly natural hazards, primarily due to their rapid onset. Therefore, predicting property damage of flash floods is imperative for proactive disaster management. Here, we present a systematic framework that considers a variety of features explaining different components of risk (i.e. hazard, vulnerability, and exposure), and examine multiple machine learning methods to predict flash flood damage. A large database of flash flood events consisting of more than 14 000 events is assessed for training and testing the methodology, while a multitude of data sources are utilized to acquire reliable information related to each event. A variable selection approach was employed to alleviate the complexity of the dataset and facilitate the model development process. The random forest (RF) method was then used to map the identified input covariates to a target variable (i.e. property damage). The RF model was implemented in two modes: first, as a binary classifier to estimate if a region of interest was damaged in any particular flood event, and then as a regression model to predict the amount of property damage associated with each event. The results indicate that the proposed approach is successful not only for classifying damaging events (with an accuracy of 81%), but also for predicting flash flood damage in good agreement with the observed property damage. This study is among the few efforts for predicting flash flood damage across a large domain using mesoscale input variables, and the findings demonstrate the effectiveness of the proposed methodology.
