Proceedings of 1995 IEEE International Computer Performance and Dependability Symposium
DOI: 10.1109/ipds.1995.395830
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the effects of communication faults on parallel applications

Abstract: This paper addresses the problem of injection of faults in the communication system of disjoint menuuy parallel computers and presents .fuult injection resultsshowing that 5% to 3Wo of the fuults injected in the conzmunication subsystem of a commercial purullel computer caused undetected errors thut l e d the upplicution to generate erroneous results. All these C U .~ correspond to situations in which it would be virtuidly inzpossible to detect that the benchmark output wus erroneous, us the size of the result… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 13 publications
0
7
0
Order By: Relevance
“…We do not consider faults caused by transmission errors in the physical layer -injection for this type of communication fault is already addressed by previous work such as [8,3,9,14]. That means messages are not corrupted or lost in the communication channel, but are indeed generated containing incorrect data or are not sent.…”
Section: Interaction Faults and Inconsistent Valuesmentioning
confidence: 99%
See 4 more Smart Citations
“…We do not consider faults caused by transmission errors in the physical layer -injection for this type of communication fault is already addressed by previous work such as [8,3,9,14]. That means messages are not corrupted or lost in the communication channel, but are indeed generated containing incorrect data or are not sent.…”
Section: Interaction Faults and Inconsistent Valuesmentioning
confidence: 99%
“…Fault injection has been used in several domains, such as fault-tolerant systems [15,13], parallel computers [3,14], distributed systems [8,9] and databases [19]. Different types of faults have been addressed, including hardware [15], software [12,16,17,13], operator [4,19] and even communication faults in a more general sense [8,3,9,14].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations