A class of count-and threshold mechanisms, collectively dubbed α-count, able to discriminate between transient faults and intermittent faults in computing systems is presented. Transient faults discrimination has long been pursued in commercial systems: threshold-based techniques have been practiced for several years for this purpose. The present work aims to contribute to the usefulness of count-andthreshold schemes, through analysis of the behaviour and exploration of the effects on the system. A mathematically defined structure simple enough to be analysed by means of standard tools is adopted. α-count is equipped with internal parameters, designed to be tuned to suit environmental variables (such as transient fault rate, intermittent fault occurrence patterns). Extensive behaviour analysis for two embodiments of the scheme, both under the usual assumption of exponentially distributed fault rates and with more realistic fault patterns is carried out.
In this paper the consolidate identification of faults, distinguished as transient or permanerdintermittent, is approached. Transient faults discrimination has long been performed in commercial systems: threshold-based techniques have been practiced for several years for this purpose. The present work aims to contribute to the usefulness of the count-and-threshold scheme, through the analysis of its behaviour and the exploration of its effects on the system. To this goal, the scheme is mechanized as a device named acount, endowed with a few controllable parameters. a-count tries to balance between two conflicting requirements: to keep in the system those components that have experienced just transient faults; and to remove quickly those affected by permanent or intermittent faults. Analytical models are derived, allowing detailed study of a-count's behaviour; the actual evaluation, in a range of configurations, is performed by standard tools, in terms of the delay in spotting faulty components and the probability of improperly blaming correct ones.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.