Proceedings of the Ninth International Conference on Architectural Support for Programming Languages and Operating Systems 2000
DOI: 10.1145/378993.378997
|View full text |Cite
|
Sign up to set email alerts
|

Architecture and design of AlphaServer GS320

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0

Year Published

2003
2003
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 83 publications
(39 citation statements)
references
References 26 publications
0
39
0
Order By: Relevance
“…The main source of increased latency for the tree barrier using coherent variables, is that a barrier Receivers polling for these signals introduce multiple round-trips through the directory to each signal propagation. For both MCS barriers and locks, one can expect that aggressive non-blocking coherence protocols [8] and migratory sharing optimizations can reduce the latency of contended flag update-reclaim interactions (atomic or not), but communication operations in these algorithms are dependent on each other and will introduce serialization of miss overhead. Explicit communication advocated here can significantly reduce such overheads.…”
Section: Performance On the Hardware Prototypementioning
confidence: 99%
“…The main source of increased latency for the tree barrier using coherent variables, is that a barrier Receivers polling for these signals introduce multiple round-trips through the directory to each signal propagation. For both MCS barriers and locks, one can expect that aggressive non-blocking coherence protocols [8] and migratory sharing optimizations can reduce the latency of contended flag update-reclaim interactions (atomic or not), but communication operations in these algorithms are dependent on each other and will introduce serialization of miss overhead. Explicit communication advocated here can significantly reduce such overheads.…”
Section: Performance On the Hardware Prototypementioning
confidence: 99%
“…All modern DSM multiprocessors with hardware cache coherence that we are aware of provide a NUMA-aware data placement interface to the user, so that the user can explicitly place pages on specific nodes. For example, on the Compaq GS320 [7], a system which does not have hardware page access counters, it is possible to use the NUMA APIs of Tru64 UNIX and obtain accurate information on the placement of ranges of virtual addresses in physical memory. Given this information, software instrumentation for access tracing is possible.…”
Section: Implementation Issuesmentioning
confidence: 99%
“…Note that this is necessary only because the Stanford FLASH protocol collects invalidation acknowledgments at the home node. The AlphaServer GS320 [7] uses a three-lane protocol. The virtual lanes are named Q0, Q1, and Q2.…”
Section: Introductionmentioning
confidence: 99%