2004
DOI: 10.1109/tpds.2004.1278102
|View full text |Cite
|
Sign up to set email alerts
|

Distributed diagnosis in dynamic fault environments

Abstract: Abstract-The problem of distributed diagnosis in the presence of dynamic failures and repairs is considered. To address this problem, the notion of bounded correctness is defined. Bounded correctness is made up of three properties: bounded diagnostic latency, which ensures that information about state changes of nodes in the system reaches working nodes with a bounded delay, bounded start-up time, which guarantees that working nodes determine valid states for every other node in the system within bounded time … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
33
0
8

Year Published

2004
2004
2014
2014

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 50 publications
(41 citation statements)
references
References 25 publications
0
33
0
8
Order By: Relevance
“…Such approaches are susceptible to single point failure, lack scalability over a large network of nodes, have large overheads, and require large disk storage. These drawbacks can be minimised or avoided when the control of the approaches is distributed (for example, distributed diagnosis [50], distributed checkpointing [41] and diskless checkpointing [51]). …”
Section: Discussionmentioning
confidence: 99%
“…Such approaches are susceptible to single point failure, lack scalability over a large network of nodes, have large overheads, and require large disk storage. These drawbacks can be minimised or avoided when the control of the approaches is distributed (for example, distributed diagnosis [50], distributed checkpointing [41] and diskless checkpointing [51]). …”
Section: Discussionmentioning
confidence: 99%
“…A lot of research in distributed diagnosis has considered computer systems connected with a network, for example in [12], a Hierarchical Adaptive Distributed System-level Diagnosis algorithm is presented which allows every fault-free node to achieve diagnosis in maximum (log 2 N) 2 testing rounds. In [13], a diagnostic Algorithm HeartbeatComplete is presented, which offers bounded correctness in fullyconnected systems while simultaneously minimizing diagnostic latency, start-up time, and state holding time. In [14], a rule based diagnosis through a monitor system for diagnosis in large scale network protocols is discussed.…”
Section: Contextmentioning
confidence: 99%
“…The set of proofs presented in (DUARTE-JR; WEBER; FONSECA, 2012) is based on the theoretical framework known as Bounded Correctness (SUB-BIAH;BLOUGH, 2004). In this paper we employ a set of proofs which does not rely on that framework, and may be more intuitive for readers which are not familiar with it.…”
Section: Introductionmentioning
confidence: 99%
“…Subbiah and Blough introduced in (SUBBIAH;BLOUGH, 2004) a formal model of the dynamic behavior of diagnosis algorithms, called Bounded Correctness, which allows a diagnosis algorithm to be rigorously proven to be correct under a dynamic fault situation. Bounded Correctness has three goals.…”
Section: Introductionmentioning
confidence: 99%