Proceedings 1993 IEEE Workshop on Advances in Parallel and Distributed Systems
DOI: 10.1109/apads.1993.588861

Adaptive fault tolerance

Abstract: The goal of adaptive fault tolerance (AFT) is to expand the envelope of dependable system operation in distributed, real-time systems. Such systems often experience substantial run-time changes in the types and distributions of faults, in the availability of resources, in data distribution, and in users' requirements for dependability and performance. Preliminary examples, such as Adaptable Distributed Recovery Blocks (Kim) and distributed crash recovery, illustrate how adaptive fault tolerance can provide usef…

Citations: cited by 16 publications (5 citation statements)
References: 4 publications (1 reference statement)
“…This structure employs a novel fault-tolerance mechanism known as adaptive fault tolerance (AFT) to overcome fixed fault conditions in the adaptive filter coefficients. It should be noted that the AFT mechanism discussed here is based on the inherent adaptive process of the system and is different from the AFT mechanism used in [21]. In addition to faults in filter coefficients, faults may also arise at the outputs of adders, multipliers, and registers that hold intermediate values.…”
Section: Introduction (mentioning)
confidence: 92%
“…The occurrence of a fault in the th coefficient and removal of th row and column in is equivalent to removing the th row and column of each of the submatrices in . Hence, can be written as (21) The operation in (21) is equivalent to finding the eigenvalues of a circulant matrix having the same form as in (19). This will result in zero eigenvalues and eigenvalues each equal to two.…”
Section: F Convergence Rates (mentioning)
confidence: 98%
“…Adaptivity: The controller should be able to remain functional under previously unseen faults (Goldberg et al. (1993)). Speedy convergence: The agent's behaviour should be responsive to faults as they occur.…”
Section: Reinforcement Learning-based Controller Design (mentioning)
confidence: 99%
“…AFT is gaining more importance with the increasing concern for lowering the amount of energy consumed by cyber-physical systems and the amount of heat they generate [25]. Conceptual frameworks for adaptive fault tolerance (AFT) describing algorithms and target systems are presented in [26,27]. Several CORBA-based middleware exist [28,9,8] but evolution is tackled differently: adaptation has a parametric form (e.g., number of replicas) or it is performed off-line or, if done on-line, has a coarse-grained nature.…”
Section: Related Work (mentioning)
confidence: 99%