Twenty-Fifth International Symposium on Fault-Tolerant Computing, 1995, ' Highlights From Twenty-Five Years'.
DOI: 10.1109/ftcsh.1995.532612
|View full text |Cite
|
Sign up to set email alerts
|

The Delta-4 Approach to Dependability in Open Distributed Computing Systems

Abstract: As part of the European Strategic Programme for Research in Information Technology (ESPRIT), the Delta-4 project is seeking to define an open, faulttolerant, distributed computing architecture. This paper presents the overall Delta-4 framework for open, fault-tolerant, distributed computing systems and sketches the current implementation which is based on a local area network with specific atomic multicasting and error-processing protocols for communicating between replicated software components. Keywords: Fau… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
26
0

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(26 citation statements)
references
References 11 publications
0
26
0
Order By: Relevance
“…Systems that are designed and implemented so that they fail only in specific modes of failure described in the dependability & security specification and only to an acceptable extent, are fail-controlled systems, e.g., with stuck output as opposed to delivering erratic values, silence as opposed to babbling, consistent as opposed to inconsistent failures. A system whose failures are to an acceptable extent halting failures only, is a fail-halt (or fail-stop) system; the situations of stuck service and of silence lead respectively to fail-passive systems and failsilent systems [Powell et al 1988]. A system whose failures are, to an acceptable extent, all minor ones is a failsafe system.…”
Section: Failures 331 Service Failuresmentioning
confidence: 99%
“…Systems that are designed and implemented so that they fail only in specific modes of failure described in the dependability & security specification and only to an acceptable extent, are fail-controlled systems, e.g., with stuck output as opposed to delivering erratic values, silence as opposed to babbling, consistent as opposed to inconsistent failures. A system whose failures are to an acceptable extent halting failures only, is a fail-halt (or fail-stop) system; the situations of stuck service and of silence lead respectively to fail-passive systems and failsilent systems [Powell et al 1988]. A system whose failures are, to an acceptable extent, all minor ones is a failsafe system.…”
Section: Failures 331 Service Failuresmentioning
confidence: 99%
“…Had the system been written for a dedicated purpose, correct diagnosis and recovery might have been possible, and the crash avoided. A crash can also be desirable if the OS is to implement "fail silent" behavior, i.e., when the system fails it does so by stopping to respond and without any other side effects [Powell et al, 1988].…”
Section: Failure Classesmentioning
confidence: 99%
“…Research in the Delta-4 system [4] asserted that the total message traffic between components of a dynamically evolving system (e.g, a shared communication inter-link as the ethernet) cannot in practice be assumed determenistic and varies from an instance to the other. To accommodate this variation, a background task was implemented to dynamically calculate the round-trip time (rtt) of inter-node messages with relation to the current network traffic.…”
Section: Detecting Node Failuresmentioning
confidence: 99%