2005
DOI: 10.1109/mm.2005.122
|View full text |Cite
|
Sign up to set email alerts
|

Truss: A Reliable, Scalable Server Architecture

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
19
0

Year Published

2009
2009
2012
2012

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(19 citation statements)
references
References 16 publications
0
19
0
Order By: Relevance
“…TRUSS introduces a distributed shared memory architecture with no single point of failure [10]. To avoid common-mode failure, redundant operations are carried out by cores on different chips; however, this leads to performance losses due to long delays waiting for data to be checked.…”
Section: Related Workmentioning
confidence: 99%
“…TRUSS introduces a distributed shared memory architecture with no single point of failure [10]. To avoid common-mode failure, redundant operations are carried out by cores on different chips; however, this leads to performance losses due to long delays waiting for data to be checked.…”
Section: Related Workmentioning
confidence: 99%
“…Hardware and software combined approaches [24], [25], [29], [26], [30], [27] use the parallel processing capacity of chip multiprocessors (CMPs) and redundant multi threading to detect and recover the problem. Mohamed et al [62] shows Chip Level Redundantly Threaded Multiprocessor with Recovery (CRTR), where the basic idea is to run each program twice, as two identical threads, on a simultaneous multithreaded processor.…”
Section: Related Workmentioning
confidence: 99%
“…We propose to decouple error checking from the DSM coherence protocol. Unlike our previous design [7], decoupled checking requires no modification to the existing coherence controller. Although the checking latency increases slightly with a decoupled design, the effectiveness of the checking filter is sufficiently high that overall performance overhead not affected.…”
Section: Introductionmentioning
confidence: 99%
“…Our prior proposal [7] for a chip-level redundant DSM suffers from unacceptable performance overhead, particularly in commercial workloads, due to frequent, long-latency checking on the critical path of execution. Previously proposed mechanisms [7] that reduce the latency of checking fail to provide adequate improvement and require impractical changes to the cache coherence protocol and its implementation.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation