Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing
DOI: 10.1109/ftcs.1997.614109

Discriminating fault rate and persistency to improve fault treatment

Abstract: In this paper the consolidate identification of faults, distinguished as transient or permanent/intermittent, is approached. Transient faults discrimination has long been performed in commercial systems: threshold-based techniques have been practiced for several years for this purpose. The present work aims to contribute to the usefulness of the count-and-threshold scheme, through the analysis of its behaviour and the exploration of its effects on the system. To this goal, the scheme is mechanized as a device n…

Citations: cited by 23 publications (37 citation statements)
References: 12 publications
“…In the general case, as discussed in [4,11], increasing values of T decrease the number of wrongly removed healthy processors (as shown by Table 5), while the probability of erroneous computation tends to increase because faulty processors stay longer on-line. In the present setting, where wrong outputs (even if detected) cause costly mission failures, the second effect dominates.…”
mentioning
confidence: 98%
“…The behaviour of α-count has been analysed in [4,9,11] as a stand-alone mechanism. Two figures of merit have been evaluated: i) the time, D, between a (permanent or intermittent) fault occurrence in a processor u_i, and its recognition by the threshold crossing of the pertinent count α_i; ii) the wasted time, NU, spent by a processor idled after being wrongly signalled as faulty (normalized to the expected processor's life).…”
Section: The Diagnosis Mechanism
mentioning
confidence: 99%
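
The count-and-threshold idea discussed in the statements above can be illustrated with a small sketch. It is an assumption for illustration only, not the definition used in the cited paper: each error judgment adds 1 to a per-processor score, each error-free judgment multiplies the score by a decay factor between 0 and 1, and the processor is flagged once the score reaches the threshold. The names `alpha_count`, `decay` and `threshold` are hypothetical.

```python
from typing import Iterable, Optional


def alpha_count(judgments: Iterable[bool], decay: float, threshold: float) -> Optional[int]:
    """Return the index of the first step at which the score reaches
    the threshold, or None if it never does.

    Assumed update rule (illustrative, not taken from the source):
      - an error judgment (True) adds 1 to the score;
      - an error-free judgment (False) multiplies the score by `decay`.
    """
    score = 0.0
    for step, is_error in enumerate(judgments):
        if is_error:
            score += 1.0
        else:
            score *= decay
        if score >= threshold:
            return step
    return None


# Isolated errors decay away before they can accumulate.
transient = [True, False, False, False, True, False]
# Errors that recur close together push the score over the threshold.
recurring = [True, False, True, True, True]

print(alpha_count(transient, decay=0.5, threshold=2.0))
print(alpha_count(recurring, decay=0.5, threshold=2.0))
print(alpha_count(recurring, decay=0.5, threshold=3.0))
```

Under these assumptions, raising the threshold from 2.0 to 3.0 delays the flagging of the recurring pattern, which matches the trade-off described in the first statement: a larger T removes fewer healthy processors but keeps faulty ones on-line longer.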
“…Heuristics are suggested by intuitive reasoning, and then validated by experiment or modelling (see e.g. [1] for an assessment of a heuristic via modelling). Most on-line techniques [1,5,8,12,13] use thresholding schemes.…”
Section: Introduction
mentioning
confidence: 99%