2013
DOI: 10.1049/iet-cdt.2013.0031
|View full text |Cite
|
Sign up to set email alerts
|

A case for three‐dimensional stacking of tightly coupled data memories over multi‐core clusters using low‐latency interconnects

Abstract: Shared tightly coupled data memories are key architectural elements for building multi-core clusters in programmable accelerators and embedded systems, as they provide a convenient shared memory abstraction while avoiding cache coherence overheads. The performance of these memories largely depends on the architecture of the interconnect used between processing elements (PEs) and memory banks. The advent of three-dimensional (3D) technology has provided new opportunities to increase design modularity and reduce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
8
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 32 publications
0
8
0
Order By: Relevance
“…RE is responsible for decoding request packets and issuing them to the memory pipeline (only aligned access is supported). ATs are combinational binary trees adopted from the logarithmic interconnect [13], and finally, response engines receive response packets from RB and serialize them through multiple flits to the NIs. All components, which perform arbitration, have been designed carefully to avoid starvation.…”
Section: B Flow Control Componentsmentioning
confidence: 99%
See 3 more Smart Citations
“…RE is responsible for decoding request packets and issuing them to the memory pipeline (only aligned access is supported). ATs are combinational binary trees adopted from the logarithmic interconnect [13], and finally, response engines receive response packets from RB and serialize them through multiple flits to the NIs. All components, which perform arbitration, have been designed carefully to avoid starvation.…”
Section: B Flow Control Componentsmentioning
confidence: 99%
“…L1 SPMs offer very low latency access (1-2 Cycles) to a cluster of tightly coupled processors. However, their 3-D stacking is not so beneficial with current TSV technologies, because their access latency directly affects processor pipeline, and moving toward, the third dimension requires propagation of combinational signals through stacked TSVs [13], which are not yet much better in terms of speed than global on-chip wires [14]. Therefore, lower sensitivity of L2 SPMs to access latency and its variations makes them a more interesting option for going toward the third dimension.…”
mentioning
confidence: 99%
See 2 more Smart Citations
“…The latter are getting more complex demanding a higher performance system to handle parallel computing and provide higher bandwidth. At the same time, semiconductor industries are exploiting three dimensional integrated circuits (3D IC) which provide short global interconnects, lower power consumption, and higher performance [5]. Bringing together 2D NoC architecture with 3D IC technology, makes the design of 3D NoC possible which is a stack multiple die in the vertical axis that are interconnected through silicon via (TSV) [6].…”
Section: Introductionmentioning
confidence: 99%