2007 IEEE 13th International Symposium on High Performance Computer Architecture 2007
DOI: 10.1109/hpca.2007.346193
|View full text |Cite
|
Sign up to set email alerts
|

Error Detection via Online Checking of Cache Coherence with Token Coherence Signatures

Abstract: To provide high dependability in a multithreaded system despite hardware faults, the system must detect and correct errors in its shared memory system. Recent IntroductionTwo trends motivate increased interest in fault tolerance for multithreaded shared-memory computer architectures. First, multithreaded systems-including traditional multiprocessors, chip multiprocessors, and simultaneously multithreaded processors-have come to dominate the commodity computing market. Second, the industrial roadmap [7] and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2008
2008
2012
2012

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 23 publications
(20 citation statements)
references
References 22 publications
(38 reference statements)
0
20
0
Order By: Relevance
“…Therefore, it is important that on-chip communication protocols also be designed to handle message loss. This is a problem that is starting to gain prominence [6,16,27,40].…”
Section: Designing Reliable Protocolsmentioning
confidence: 99%
See 3 more Smart Citations
“…Therefore, it is important that on-chip communication protocols also be designed to handle message loss. This is a problem that is starting to gain prominence [6,16,27,40].…”
Section: Designing Reliable Protocolsmentioning
confidence: 99%
“…These messages are part of the baseline cache coherence protocol. A separate solution (such as those described in [16,27]) is required to handle message loss in the cache coherence protocol and is orthogonal to the focus of this paper (the algorithm to determine when a transaction can proceed with propagating its updates) 4 . The above changes to the protocol do not introduce any additional traffic when there is no message loss (assuming that the timeout window is long enough and there are few false alarms).…”
Section: Protocol Extensions To Handle Message Lossmentioning
confidence: 99%
See 2 more Smart Citations
“…Other proposals have explored token counting in the context of multi-socket multi-core systems [23], virtual hierarchical coherence [25], fault-tolerant coherence [9,26], and multicast interconnection networks [14].…”
Section: Background On Token Countingmentioning
confidence: 99%