2019
DOI: 10.3390/w11050910
|View full text |Cite
|
Sign up to set email alerts
|

A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources

Abstract: Random forests (RF) is a supervised machine learning algorithm, which has recently started to gain prominence in water resources applications. However, existing applications are generally restricted to the implementation of Breiman’s original algorithm for regression and classification problems, while numerous developments could be also useful in solving diverse practical problems in the water sector. Here we popularize RF and their variants for the practicing water scientist, and discuss related concepts and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
243
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 428 publications
(292 citation statements)
references
References 328 publications
(137 reference statements)
2
243
0
Order By: Relevance
“…Furthermore, RF is straightforward to use compared to ANN (e.g., no need for stopping rules, no need for scaling, and less parameter requirements; cf. Tyralis, Papacharalampous, & Langousis, ).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, RF is straightforward to use compared to ANN (e.g., no need for stopping rules, no need for scaling, and less parameter requirements; cf. Tyralis, Papacharalampous, & Langousis, ).…”
Section: Discussionmentioning
confidence: 99%
“…The first row shows the results when machine learning (ML) algorithms were trained using data from all years. The second row is the results when they were trained separately for each year [Colour figure can be viewed at wileyonlinelibrary.com] no need for stopping rules, no need for scaling, and less parameter requirements; cf Tyralis, Papacharalampous, & Langousis, 2019)…”
mentioning
confidence: 99%
“…The well-known machine learning algorithm called Random Forests (RF) (Breiman, 2001) averages the result of many independent individual trees with a random subset of data and predictors in order to mitigate the poor performance of any single tree. A comprehensive review of its implementation in the water science field has been recently published (Tyralis et al, 2019). Details about the implementation of decision trees in this work are given in the Annex (section A.2), in case the reader is not familiar with RF.…”
Section: Non-linear Modelsmentioning
confidence: 99%
“…The same applies for investigations using large datasets. In fact, only such investigations allow the inspection of the impact of the various modelling 'tricks' to the quantification of predictive uncertainty, as suggested by well-established guidelines for machine learning-based approaches (see e.g., Tyralis et al 2019a, section 1). For extensive discussions on the subject, the interested reader is referred to Papacharalampous et al (2019b, section 5) and the references therein.…”
Section: Summary Discussion and Conclusionmentioning
confidence: 99%
“…The proposed methodology is characterized by some additional strengths that are also particularly important from a predictive modelling point of view. First, it is computationally convenient in the sense that it can be easily expressed in algorithmic Tyralis et al 2019a). Lastly, it allows the exploitation of the total amount of available information, in the sense that each sister prediction is herein converted into a probabilistic prediction (consisted of several quantile predictions) instead of a single simulation (randomly extracted from its predictive PDF; see the utilization of the meta-Gaussian bivariate distribution model in Montanari and Koutsoyiannis 2012; see also Kelly and Krzysztofowicz 1997).…”
Section: Summary Discussion and Conclusionmentioning
confidence: 99%