IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN 2012) 2012
DOI: 10.1109/dsnw.2012.6264672
|View full text |Cite
|
Sign up to set email alerts
|

ROSE::FTTransform - A source-to-source translation framework for exascale fault-tolerance research

Abstract: Abstract-Exascale computing systems will require sufficient resilience to tolerate numerous types of hardware faults while still assuring correct program execution. Such extreme-scale machines are expected to be dominated by processors driven at lower voltages (near the minimum 0.5 volts for current transistors). At these voltage levels, the rate of transient errors increases dramatically due to the sensitivity to transient and geographically localized voltage drops on parts of the processor chip. To achieve p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 27 publications
(17 citation statements)
references
References 24 publications
0
16
0
Order By: Relevance
“…Last, some important source to source annotated tools/frameworks are found in the literature that help the user to apply code optimizations such as Rose [31], Orio [18] and POET [45].…”
Section: Related Workmentioning
confidence: 99%
“…Last, some important source to source annotated tools/frameworks are found in the literature that help the user to apply code optimizations such as Rose [31], Orio [18] and POET [45].…”
Section: Related Workmentioning
confidence: 99%
“…The tile and cache partition sizes as well as data array layouts which are different to those the proposed equations provide are discarded (they are inefficient), reducing the search space. We have implemented an automated C to C tool just for the studied algorithms, but a general tool can be implemented by using Rose [Lidman et al 2012] for loop tiling and [Henretty et al 2009] for data layout transformation.…”
Section: Search Spacementioning
confidence: 99%
“…Memory scrubbing is employed in systems with greatest need, but is expensive [20,23]. Software solutions employ redundancy and specialize fault-tolerant algorithms [7,17], application-level error checking [23], and critical MPI message validation [15]. However few general techniques have been developed, and the known general techniques introduce significant overhead for parallel applications [15,16], and consequently are rarely used.…”
Section: Introductionmentioning
confidence: 99%
“…Memory scrubbing is employed in systems with greatest need, but is expensive [20,23]. Software solutions employ redundancy and specialize fault-tolerant algorithms [7,17], application-level error checking [23], and critical MPI message validation [15].…”
Section: Introductionmentioning
confidence: 99%