2014 IEEE 28th International Parallel and Distributed Processing Symposium 2014
DOI: 10.1109/ipdps.2014.127
|View full text |Cite
|
Sign up to set email alerts
|

Designing Bit-Reproducible Portable High-Performance Applications

Abstract: Abstract-Bit-reproducibility has many advantages in the context of high-performance computing. Besides simplifying and making more accurate the process of debugging and testing the code, it can allow the deployment of applications on heterogeneous systems, maintaining the consistency of the computations. In this work we analyze the basic operations performed by scientific applications and identify the possible sources of non-reproducibility. In particular, we consider the tasks of evaluating transcendental fun… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(24 citation statements)
references
References 16 publications
0
24
0
Order By: Relevance
“…This richness forces us to adopt an approach that is as generic as possible and consists of compiling different files at different optimizations and drawing a final linked image from this mixture. The granularity of mixing versions in our case is either at a file level, or (by using weak symbols and overriding) at a function level 3 . When we encounter a numerical result difference during our bisection search, we allow existing tools to help with rootcausing.…”
Section: Bisect Problemmentioning
confidence: 99%
See 1 more Smart Citation
“…This richness forces us to adopt an approach that is as generic as possible and consists of compiling different files at different optimizations and drawing a final linked image from this mixture. The granularity of mixing versions in our case is either at a file level, or (by using weak symbols and overriding) at a function level 3 . When we encounter a numerical result difference during our bisection search, we allow existing tools to help with rootcausing.…”
Section: Bisect Problemmentioning
confidence: 99%
“…There is little external control one can exert on GPU warp schedulers 3. The approach of searching by overriding symbols is one that potentially creates "Frankenbinaries. "…”
mentioning
confidence: 99%
“…6 Sequential Reproducible Summation [22], (referred as ReproBLAS) from the ReproBLAS library 6 ; 5. The single-sweep reduction [23] with two and three levels (cited as bitrep2 and bitrep3, accordingly) from the bitrep library 7 .…”
Section: Baseline Algorithms and Experimental Setupmentioning
confidence: 99%
“…Demmel and Nguyen have extended their concept to reproducible BLAS routines, distributed in their ReproBLAS library. Arteaga et al [23] used Demmel's and Nguyen's algorithms with improved communication between nodes based on both one-and two-reductions. They were able to reach the same accuracy with better performance results (the overhead is within 10 %) compared to the conventional summation in case of the single reduction algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…The remaining functions offered by the Oracle database (see https://docs.oracle.com/database/121/SQLRF/functions003.htm) can be computed with SUM. 3 If arithmetic expressions are computed in their entirety only once all operands are available, then executing them always in the same order is trivial. However, if expressions are broken up and partially pushed through joins, then their execution order may depend on the join order, which may change even if the logical input has not changed.…”
Section: Introductionmentioning
confidence: 99%