Making Steppingstones out of Stumbling Blocks: A Bayesian Model Evidence Estimator with Application to Groundwater Transport Model Selection

Elshall, Ahmed S.; Ye, Ming

doi:10.3390/w11081579

Cited by 7 publications

(7 citation statements)

References 95 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Redefining the first-order and total-effect process sensitivity indices using the posterior distributions and weights is straightforward, as discussed in . The key challenge is to estimate the posterior parameter distributions and posterior process weights due to high computational cost of Markov chain Monte Carlo simulations (Elshall and Ye, 2019;Liu et al, 2016). Estimating the posterior process weights are particularly challenging because it is different from estimating posterior system model weights and has not been attempted.…”

Section: Limitations and Future Researchmentioning

confidence: 99%

Process Interactions Can Change Process Ranking in a Coupled Complex System Under Process Model and Parametric Uncertainty

Jian

Dai

et al. 2022

Water Resources Research

View full text Add to dashboard Cite

For a complex hydrologic system with multiple processes and process interactions, global sensitivity analysis is often used to identify important or influential parameters for model development and improvement. The identification is complicated by process model uncertainty, when a system process can be represented by multiple process models. This study develops a new total‐effect process sensitivity index to identify influential processes under model uncertainty. This is done by extending Sobol's total‐effect parameter sensitivity index for one system model to total‐effect process sensitivity index for multiple system models to account for uncertainty in process models and model parameters. The total‐effect process sensitivity index includes not only the first‐order process sensitivity index for measuring the importance of individual processes but also higher‐order indices that account for process interactions. The total‐effect process sensitivity index can identify an influential process that itself and its interactions with other processes influence a model output. The total‐effect process sensitivity index is applied to two numerical examples: (a) Sobol's G*‐functions with analytical solutions of first‐order and total‐effect process sensitivity indices, and (b) groundwater flow models with interactions between recharge, geology, and snowmelt processes. The second evaluation shows that, due to second‐order and higher‐order process interactions, the first‐order and total‐effect process sensitivity indices give different process ranking. It is thus necessary to estimate both first‐order and total‐effect process sensitivity indices to appreciate the difference between the first‐order impact of a process alone and the overall total‐effect impact of the process itself and its interactions with other processes on a model output.

show abstract

Section: Limitations and Future Researchmentioning

confidence: 99%

Process Interactions Can Change Process Ranking in a Coupled Complex System Under Process Model and Parametric Uncertainty

Jian

Dai

et al. 2022

Water Resources Research

View full text Add to dashboard Cite

show abstract

“…As argued by Schöniger et al (2014), Monte Carlo is superior to other numerical schemes in that it is an unbiased scheme that is known to converge to the correct limit, and its convergence can be easily monitored. In our chosen test cases, the computational burden of Monte Carlo is bearable; for computationally heavier practical applications, alternative numerical methods could be used to improve on computational efficiency, such as nested sampling (Elsheikh et al, 2014;Skilling, 2006), thermodynamic integration (Lartillot & Philippe, 2006;Liu et al, 2016), stepping stone sampling (Elshall & Ye, 2019;Xie et al, 2011), or Gaussian mixture importance sampling (Volpi et al, 2017), to name a few examples. However, these methods are less straightforward to implement and bear the risk of introducing biases into the BME estimation.…”

Section: Bayesian Model Evidencementioning

confidence: 99%

“…Due to its statistical rigor and its elegance in accounting for uncertainty, BMS has become popular in water resources research. It has been applied in various different contexts, such as evaluation of hydrological models (Marshall et al., 2005), frequency analysis of hydrological extremes (Laio et al., 2009), climate change impact studies (Najafi et al., 2011), model complexity analysis (Höge et al., 2018; Schöniger, Illman, et al., 2015), optimal design for model choice (Nowak & Guthke, 2016), as well as hydrogeophysical (Brunetti et al., 2017), hydro‐morphodynamic (Mohammadi et al., 2018), and groundwater transport modeling (Elshall & Ye, 2019).…”

Section: Introductionmentioning

confidence: 99%

“…cal models (Marshall et al, 2005), frequency analysis of hydrological extremes (Laio et al, 2009), climate change impact studies (Najafi et al, 2011), model complexity analysis (Höge et al, 2018;Schöniger, Illman, et al, 2015), optimal design for model choice (Nowak & Guthke, 2016), as well as hydrogeophysical (Brunetti et al, 2017), hydro-morphodynamic (Mohammadi et al, 2018), and groundwater transport modeling (Elshall & Ye, 2019).…”

mentioning

confidence: 99%

See 1 more Smart Citation

The Four Ways to Consider Measurement Noise in Bayesian Model Selection—And Which One to Choose

Reuschen

Nowak

Guthke

2021

Water Resources Research

View full text Add to dashboard Cite

Models are used to predict and/or investigate and explain phenomena in nature. Often, many hypotheses exist for these two tasks. Naturally, the question arises, which of the competing modeling approaches predicts or explains nature best. Bayesian model selection (BMS, e.g., Wasserman, 2000) is a statistical method that uses observed data to select between competing models. BMS is settled in a rigorous probabilistic framework and follows the scheme of Bayesian updating: A prior belief about the plausibility of each candidate model is updated to a posterior model weight in the light of measured data (i.e., the probability of the model to have generated the data, given the model set). Posterior model weights are then used as a basis for Bayesian model ranking, selection, or averaging (BMA, Hoeting et al., 1999).To help with the interpretation of posterior model weights, the so-called model confusion matrix (MCM) has been introduced by Schöniger, Illman, et al. (2015). It reveals whether a lack of confidence in model choice is due to similarity between the candidate models or due to weakly informative data. The MCM is a purely synthetic analysis that can be used as a scale of reference for model weights obtained from real data. Schäfer Rodrigues Silva et al. (2020) have recently extended the MCM analysis to identify the best surrogate model from a set of candidates to replace an expensive full-complexity model in stochastic analysis.Technically, the Bayesian updating procedure requires calculating the so-called Bayesian model evidence (BME). BME is the likelihood of a model to have generated the data, integrated over its whole parameter space and all involved probability distributions. While the likelihood accounts for uncertainty in measured data, the integration considers parameter uncertainty, and potentially also uncertainty in model drivers or boundary conditions. In some cases, the integration even accounts for statistical representations of model errors (Leube et al., 2012;Nowak et al., 2012), which is perceived by many studies to be part of the likelihood.

show abstract

“…Traditionally, model ranking in the BMS framework is based on the values of Bayesian model evidence (BME), which are defined as the probability of a model of reproducing the available data (Raftery, 1995;Draper, 1995). Such BME-based model selection approaches have been used in many fields for model ranking, and/or selection purposes, for example: Schöniger, Illman, et al (2015) and Elshall and Ye (2019) for groundwater modelling, Wöhling et al (2015) for crop modelling, Marshall et al (2005) for hydrological models, Brunetti et al (2017) in hydrogeophysical modelling and Schäfer Rodrigues Silva et al (2020) in reactive groundwater transport models, to name a few. Additionally, Mohammadi et al (2018) and Scheurer et al (2021) apply BMS using surrogate models for sediment transport in rivers and to biochemical processes in the subsurface, respectively.…”

Section: Introductionmentioning

confidence: 99%

Information-Theoretic Scores for Bayesian Model Selection and Similarity Analysis: Concept and Application to a Groundwater Problem

Oreamuno

Oladyshkin

Nowak

2022

Preprint

View full text Add to dashboard Cite

Bayesian model selection (BMS) and Bayesian model justifiability analysis (BMJ) provide a statistically rigorous framework to compare competing conceptual models through the use of Bayesian model evidence (BME). However, BME-based analysis has two main limitations: (1) it's powerless when comparing models with different data set sizes and/or types of data and(2) doesn't allow to judge a model's performance based on its posterior predictive capabilities. Thus, traditional BME-based approaches ignore useful data or models due to issue (1) or disregards Bayesian updating because of issue (2). To address these limitations, we advocate to include additional information-theoretic scores into BMS and BMJ analysis: expected log-predictive density (ELPD), relative entropy (RE) and information entropy (IE). Exploring the connection between Bayesian inference and information theory, we explicitly link BME and ELPD together with RE and IE to indicate the information flow in BMS and BMJ analysis. We show how to compute and interpret these scores alongside BME, and apply it in a model selection and similarity analysis framework. We test the methodology on a controlled 2D groundwater setup considering five competing conceptual models accompanied with different data sets. The results show how the information-theoretic scores complement BME by providing a more complete picture concerning the Bayesian updating process. Additionally, we present how both RE and IE can be used to objectively compare models that feature different data sets. Overall, the introduced Bayesian information-theoretic framework helps to avoid any potential loss of information and leads to an informed decision for model selection and similarity.

show abstract

Making Steppingstones out of Stumbling Blocks: A Bayesian Model Evidence Estimator with Application to Groundwater Transport Model Selection

Cited by 7 publications

References 95 publications

Process Interactions Can Change Process Ranking in a Coupled Complex System Under Process Model and Parametric Uncertainty

Process Interactions Can Change Process Ranking in a Coupled Complex System Under Process Model and Parametric Uncertainty

The Four Ways to Consider Measurement Noise in Bayesian Model Selection—And Which One to Choose

Information-Theoretic Scores for Bayesian Model Selection and Similarity Analysis: Concept and Application to a Groundwater Problem

Contact Info

Product

Resources

About