2019
DOI: 10.48550/arxiv.1905.04180
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Large scale in transit computation of quantiles for ensemble runs

Alejandro Ribes,
Théophile Terraz,
Bertrand Iooss
et al.

Abstract: The classical approach for quantiles computation requires availability of the full sample before ranking it. In uncertainty quantification of numerical simulation models, this approach is not suitable at exascale as large ensembles of simulation runs would need to gather a prohibitively large amount of data. This problem is solved thanks to an on-the-fly and iterative approach based on the Robbins-Monro algorithm. This approach relies on Melissa, a file avoiding, adaptive, fault-tolerant and elastic framework.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…Slightly different linear profiles can be proposed as γ(n) = 0.5[1+(n−1)/(N −1)]. Several tests on simple analytical functions (where the true quantile can be known) have been performed in order to calibrate and validate these γ-profiles [40,20]. Other algorithmic developments are currently under study to further improve the robustness of the Robbins-Monro estimate.…”
Section: Order Statistics: Quantilesmentioning
confidence: 99%
See 1 more Smart Citation
“…Slightly different linear profiles can be proposed as γ(n) = 0.5[1+(n−1)/(N −1)]. Several tests on simple analytical functions (where the true quantile can be known) have been performed in order to calibrate and validate these γ-profiles [40,20]. Other algorithmic developments are currently under study to further improve the robustness of the Robbins-Monro estimate.…”
Section: Order Statistics: Quantilesmentioning
confidence: 99%
“…The goal is to reduce the data to store to disk and to avoid the time penalty to write and then read back the raw data set as required by the classical post hoc analysis approach. In recent works, we proposed the Melissa framework for the on-line data aggregation of high-resolution ensemble runs [46,40]. As soon as each available simulation provides the results produced to a set of staging nodes, these nodes process them to update the statistics on a first-come-first-served basis thanks to one-pass algorithms.…”
Section: Introductionmentioning
confidence: 99%