Algorithms exploiting parallelization are all but essential to make full use of modern computing architectures, leveraging multicore, multinode, and specialized hardware such as graphics processing units (GPUs). Methodology for statistical computing has been slow to adapt, although there are some notable success stories. Except in cases where models have statistical independence built in, computational independence for the purposes of distribution across computing elements implies approximation, which may have (at worst) deleterious and (at best) unknown effects on inference and prediction. In this article, we consider nonlinear regression by Gaussian process (GP) models as a representative exemplar of the situation. GPs are popular in geo/spatial statistics, computer surrogate modeling, and machine learning for their smooth, accurate predictions with excellent coverage properties, but they suffer from run times that are cubic, and storage demands that are quadratic, in the size of the training data. Strong spatial interdependence in canonical GP setups makes parallelization hard, but careful divide and conquer can facilitate massive distribution across the full span of contemporary hardware capabilities without compromising accuracy and uncertainty quantification properties. The specific idea and general recipe are ripe for extension to other large-scale modeling applications.
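As an illustrative sketch (not the paper's method), the following NumPy code shows where the cubic run time and quadratic storage arise in canonical GP prediction, and how a crude divide-and-conquer variant trades exactness for independent, embarrassingly parallel subproblems; the kernel choice, partitioning scheme, and prediction-averaging rule here are all assumptions for illustration only.

```python
import numpy as np

def gp_predict(X, y, Xstar, lengthscale=0.1, noise=1e-6):
    """Zero-mean GP posterior predictive mean with a squared-exponential
    kernel. The n x n covariance matrix K is the quadratic storage cost;
    its Cholesky factorization is the cubic run-time cost."""
    def k(A, B):
        d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
        return np.exp(-0.5 * d2 / lengthscale**2)

    K = k(X, X) + noise * np.eye(len(X))   # n x n: quadratic storage
    L = np.linalg.cholesky(K)              # O(n^3): cubic run time
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return k(Xstar, X) @ alpha             # posterior predictive mean

def partitioned_gp_predict(X, y, Xstar, m=4, **kw):
    """Naive divide and conquer: fit independent GPs on m data partitions
    and average their predictions. Each subproblem costs O((n/m)^3) and
    can run on a separate computing element, but the induced independence
    is an approximation of the full joint model."""
    parts = np.array_split(np.arange(len(X)), m)
    preds = [gp_predict(X[p], y[p], Xstar, **kw) for p in parts]
    return np.mean(preds, axis=0)
```

For example, on a smooth one-dimensional function both predictors are usable, but the partitioned version reaches its answer via small independent solves, at the price of the kind of approximation error whose effect on inference the abstract cautions about.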