2011
DOI: 10.1109/tcsi.2010.2073832
|View full text |Cite
|
Sign up to set email alerts
|

A Cost-Efficient L1–L2 Multicore Interconnect: Performance, Power, and Area Considerations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2013
2013
2016
2016

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 17 publications
0
11
0
Order By: Relevance
“…If the total die area is around 400 mm2, then the area overhead for an acceptable latency is around 46.8% for full sharing (nearly half the chip!). For calculating on-chip memory sizes, it is determined to be 1 bit per square micron, or 0.125MB per mm2 for 65 nm technology [2]. If the cache banks were private, and an SBF is used to interconnect these banks to the 8 cores, there would be 3MB of cache memory available for each core.…”
Section: Noc Connecting Shared Cache Memorymentioning
confidence: 99%
See 3 more Smart Citations
“…If the total die area is around 400 mm2, then the area overhead for an acceptable latency is around 46.8% for full sharing (nearly half the chip!). For calculating on-chip memory sizes, it is determined to be 1 bit per square micron, or 0.125MB per mm2 for 65 nm technology [2]. If the cache banks were private, and an SBF is used to interconnect these banks to the 8 cores, there would be 3MB of cache memory available for each core.…”
Section: Noc Connecting Shared Cache Memorymentioning
confidence: 99%
“…Where A0 is the wire pitch, W(i) is the i link width (number of wires) and L(i) is the length of link i [2]. Considering 16 bit data signals from one port to another, then, the total area occupied by 24 links that are implemented in 8X metal plane with a wire pitch of 1.6 µm (for 65 nm technology) is estimated to be ~3 mm2 [2].…”
Section: Noc Connecting Shared Cache Memorymentioning
confidence: 99%
See 2 more Smart Citations
“…PowerEN offers up to 64 hardware threads per chip and 256 per a 4-way SMP coherent system. It has a unique hierarchical chip multiprocessor (CMP [4]) structure that efficiently shares cache and interconnect resources [5]. A chip has four compute clusters, each compute cluster is constructed of four 64bit PowerPC general-purpose cores that share an L2 cache.…”
Section: A General Purpose Coresmentioning
confidence: 99%