2022
DOI: 10.1146/annurev-statistics-040220-091727
Postprocessing of MCMC

Abstract: Markov chain Monte Carlo is the engine of modern Bayesian statistics, being used to approximate the posterior and derived quantities of interest. Despite this, the issue of how the output from a Markov chain is postprocessed and reported is often overlooked. Convergence diagnostics can be used to control bias via burn-in removal, but these do not account for (common) situations where a limited computational budget engenders a bias-variance trade-off. The aim of this article is to review state-of-the-art techniques…

Cited by 16 publications (10 citation statements) · References 74 publications
“…For this problem, we trained all CVs through stochastic optimisation and used m = (50, 50) MC samples. This synthetic example was originally used by South et al. (2022) to show one of the drawbacks of kernel-based CVs, namely that the fitted CV will usually tend to β in parts of the domain where we do not have any function evaluations. This phenomenon can be observed in the red lines in Figure 2 (left and center), which give a CV based on a squared-exponential kernel.…”
Section: Synthetic Example
mentioning, confidence: 99%
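The collapse-to-β behaviour quoted above is easy to reproduce. Below is a hypothetical toy sketch (my own illustration, not the cited authors' code): a fitted object of the form g(x) = β + Σᵢ wᵢ k(x, xᵢ) with a squared-exponential kernel k reverts to the constant β wherever x is far from every evaluation point xᵢ, because each k(x, xᵢ) decays to zero there.

```python
# Toy illustration (hypothetical, not from the cited paper): a kernel
# interpolant plus a constant beta collapses to beta far from the data.
import numpy as np

rng = np.random.default_rng(0)
x_train = rng.uniform(-1.0, 1.0, size=20)   # points with function evaluations
y_train = np.sin(3.0 * x_train)             # toy integrand values

def k(a, b, ell=0.2):
    """Squared-exponential kernel between two 1-D point sets."""
    return np.exp(-0.5 * ((a[:, None] - b[None, :]) / ell) ** 2)

beta = y_train.mean()                        # constant term of the fit
K = k(x_train, x_train) + 1e-8 * np.eye(20)  # jitter for numerical stability
w = np.linalg.solve(K, y_train - beta)       # interpolation weights

x_grid = np.linspace(-5.0, 5.0, 9)           # extends far beyond the data
g = beta + k(x_grid, x_train) @ w
print(np.round(g, 3))  # values at |x| >> 1 are numerically equal to beta
```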
“…Furthermore, if g is chosen appropriately, the variance of f − g will be much smaller than that of f, and a smaller number of samples will be required for the estimator to attain a given level of accuracy. The reader is referred to Si et al. (2021) and South et al. (2022) for two recent reviews of this literature.…”
Section: Introduction
mentioning, confidence: 99%
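To make the variance-reduction argument concrete, here is a minimal sketch (my own toy example, not taken from the cited reviews): estimate E[f(X)] for X ~ N(0, 1) with f(x) = x² + x, using the control variate g(x) = x, whose mean is known in closed form.

```python
# Minimal control-variate demonstration (toy example).
import numpy as np

rng = np.random.default_rng(1)
x = rng.standard_normal(100_000)

f = x**2 + x   # integrand; E[f(X)] = 1 for X ~ N(0, 1)
g = x          # control variate with known mean E[g] = 0

plain = f.mean()
with_cv = (f - g).mean() + 0.0   # add back the known expectation of g

print(f"plain MC estimate   : {plain:.4f}  (sample variance {f.var():.2f})")
print(f"control-variate est.: {with_cv:.4f}  (sample variance {(f - g).var():.2f})")
# Subtracting g removes the linear component of f, so Var(f - g) = 2
# while Var(f) = 3: fewer samples are needed for the same accuracy.
```

In practice g is chosen or learned so that its expectation is available in closed form, for instance via Stein operators, as in the kernel-based CVs discussed in the excerpt above.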
“…An effective control variate is one for which the difference f − g has smaller MC variance than f (or asymptotic variance, in the case of MCMC). CVs have proved successful in a range of challenging tasks in statistical physics [Assaraf and Caffarel, 1999], Bayesian statistics [Dellaportas and Kontoyiannis, 2012, Mira et al., 2013, Oates et al., 2017, South et al., 2022c], gradient estimation in variational inference [Grathwohl et al., 2018, Shi et al., 2022] and MCMC [Baker et al., 2019], reinforcement learning [Liu et al., 2018, 2019], and computer graphics [Müller et al., 2020].…”
Section: Introduction
mentioning, confidence: 99%
“…On the other hand, thinning the Markov chain allows for compressing the MCMC output and may also reduce the correlation between the iteratively selected points. More recently, promising kernel-based procedures were proposed to automatically remove the burn-in period, compress the output, and reduce the asymptotic bias (South et al., 2022). These approaches consist of minimizing a kernel-based discrepancy measure D(P, Q_m) between the empirical distribution Q_m of a subsample of the MCMC output of size m and the target distribution P. In this respect, minimization of the maximum mean discrepancy (MMD) was investigated by several authors, but these strategies require full knowledge of the target distribution P, whose density is not tractable in non-conjugate Bayesian inference.…”
Section: Introduction
mentioning, confidence: 99%
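The sketch below makes the kernel-discrepancy idea concrete with a simplified 1-D greedy thinning rule in the spirit of the procedures cited (an illustrative reconstruction under my own assumptions, not the authors' implementation). Each point is chosen to greedily reduce a kernel Stein discrepancy; the Stein kernel depends on the target only through its score ∇ log p, so the intractable normalising constant is never needed.

```python
# Simplified greedy kernel thinning sketch (illustrative, 1-D,
# standard normal target); not the cited authors' code.
import numpy as np

def score(x):
    """Score of a standard normal target: grad log p(x) = -x."""
    return -x

def stein_kernel(x, y):
    """Langevin Stein kernel built from the IMQ base kernel
    k(x, y) = (1 + (x - y)^2)^(-1/2); it has zero mean under the
    target, so discrepancies need only the score, not p itself."""
    d = x - y
    u = 1.0 + d**2
    return (u**-1.5 - 3.0 * d**2 * u**-2.5            # dx dy k
            + d * u**-1.5 * (score(x) - score(y))     # cross terms
            + u**-0.5 * score(x) * score(y))          # k * s(x) s(y)

def greedy_thin(samples, m):
    """Greedily select m indices minimising the kernel Stein
    discrepancy of the selected subsample (repeats allowed)."""
    diag = stein_kernel(samples, samples)  # k0(x, x) for each candidate
    running = np.zeros(len(samples))       # sum of k0(x, selected so far)
    idx = []
    for _ in range(m):
        j = int(np.argmin(0.5 * diag + running))
        idx.append(j)
        running += stein_kernel(samples, samples[j])
    return np.array(idx)

rng = np.random.default_rng(2)
chain = rng.standard_normal(2_000) + 0.3   # stand-in for biased MCMC output
keep = greedy_thin(chain, m=20)
print(np.sort(chain[keep]).round(2))       # compressed, bias-corrected subsample
```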