2007
DOI: 10.1109/pact.2007.4336201
|View full text |Cite
|
Sign up to set email alerts
|

Unified Architectural Support for Soft-Error Protection or Software Bug Detection

Abstract: In this paper we propose a unified architectural support that can be used flexibly for either soft-error protection or software bug detection. Our approach is based on dynamically detecting and enforcing instructionlevel invariants. A hardware table is designed to keep track of run-time invariant information. During program execution, instructions access this table and compare their produced results against the stored invariants. Any violation of the predicted invariant suggests a potential abnormal behavior, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 21 publications
0
18
0
Order By: Relevance
“…For brevity, we present the cost vs. SDC reduction curves for a representative subset from our workload and metric combinations in Figure 9. 7 In the remainder of this section we use the term gap to signify the difference in the Y axis (SDC) for a given value on the X axis between the different curves.…”
Section: Cost Vs Sdc Reductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For brevity, we present the cost vs. SDC reduction curves for a representative subset from our workload and metric combinations in Figure 9. 7 In the remainder of this section we use the term gap to signify the difference in the Y axis (SDC) for a given value on the X axis between the different curves.…”
Section: Cost Vs Sdc Reductionmentioning
confidence: 99%
“…Software anomaly based error detection has emerged as an attractive approach that detects only those hardware faults that propagate to software [7,10,13,16,21,24,27]. This approach places near-zero cost error monitors that watch for anomalous software behavior.…”
Section: Introductionmentioning
confidence: 99%
“…Racunas et al dynamically predict the valid set of values that an instruction will produce, and consider a departure from this prediction as a symptom of a (transient) fault [14]. Dimitrov and Zhou monitor the variance between the two most recent results produced by two dynamic instructions of the same PC, and any large variance indicates a possible soft error [5]. We have recently explored various symptoms -fatal traps, hangs, high OS activity, and program based invariant violations -as the mechanisms for detecting permanent and transient faults in our SWAT system [6], [15].…”
Section: This Work Is Supported In Part By the Gigascale Systems Resementioning
confidence: 99%
“…To this end, increasing the reliability of systems is becoming critically important. In spite of tremendous improvements in software engineering, testing and software reliability [34,29,30,22,31,24,33,16,27,14,13,12,7,26,36,38,40,21,15,11], many software bugs still escape testing and enter production. As others have noted [36], performing off-site analysis of production run failures at development sites has several limitations: 1) it is difficult to reproduce failures at the development site due to differences in the environment, 2) customers have privacy concerns over what information can be released for off-site diagnosis, and 3) the same bug may generate a different failure at multiple production sites; it is cumbersome for the development team to investigate every failure that occurs without any automated feedback regarding the root cause of the failure.…”
Section: Motivationmentioning
confidence: 99%