1996
DOI: 10.1145/232974.232999
|View full text |Cite
|
Sign up to set email alerts
|

Coherent network interfaces for fine-grain communication

Abstract: Historically, processor accesses to memory-mapped device registers have been marked uncachable to insure their visibility to the device. The ubiquity of snooping cache coherence, however, makes it possible for processors and devices to interact with cachable, coherent memory operations. Using coherence can improve performance by facilitating burst transfers of whole cache blocks and reducing control overheads (e.g., for polling).This paper begins an exploration of network interfaces (NIs) that use coherence---… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
35
0

Year Published

2001
2001
2014
2014

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(35 citation statements)
references
References 27 publications
0
35
0
Order By: Relevance
“…The work closest to ours is Coherent Network Interface (CNI), which was proposed by Mukherjee et al [5][6][7]. CNI allows a coherent, cacheable memory block implemented as a Cacheable Device Register (CDR) to be shared between the host processor and NI.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The work closest to ours is Coherent Network Interface (CNI), which was proposed by Mukherjee et al [5][6][7]. CNI allows a coherent, cacheable memory block implemented as a Cacheable Device Register (CDR) to be shared between the host processor and NI.…”
Section: Related Workmentioning
confidence: 99%
“…Since both head and tail pointers are needed to determine whether the queue is full or empty, updating of the tail pointer by the host processor and head pointer by the CNI causes invalidations to ping-pong back and forth increasing the traffic on the bus. CNI employs three optimizations based on Lazy pointers, valid bits, and sense reverse to reduce the bus traffic [6]. The idea behind lazy pointers is to remove the host processor's dependency on the head pointer.…”
Section: Edt Vs Cni Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…This has the advantage of removing the need for asynchronous interrupts in foreign nodes and also allows us to execute the protocol in the requesting processor that most likely would be idle waiting for the data. A further advantage is that the protocol execution is divided between all the processors in the node, not just one processor at a time as suggested in some other proposals, for example by Mukherjee et al [29].…”
Section: Blocking Directory Protocol Overviewmentioning
confidence: 99%
“…While other work has proposed elaborate schemes for cutting down on the overhead associated with interrupting and/or polling caused by the asynchronous communication between the agents [5], [29], our implementation has completely eliminated the protocol-agent interactions. In DSZOOM the entire coherence protocol is implemented in the protocol handler running in the requesting processor.…”
Section: Introductionmentioning
confidence: 99%