“…Moreover, our Wasserstein gradient descent using the SGE approximation can also be derived using an alternative formulation as a gradient flow with smoothed test functions [44]. A projected version of WGD has been studied in [65], which could also be readily applied in our framework. Besides particle methods, Bayesian neural networks MacKay [49], Neal [54] have gained popularity recently [69,18,16,32], using modern MCMC [54,69,18,20,17] and variational inference techniques [4,63,14,30].…”