2019
DOI: 10.1109/access.2019.2923855
|View full text |Cite
|
Sign up to set email alerts
|

Simulating the Network Activity of Modern Manycores

Abstract: Manycore architectures are one of the most promising candidates to reach the exascale. However, the increase in the number of cores on a single die exacerbates the memory wall problem. Modern manycore architectures integrate increasingly complex and heterogeneous memory systems to work around the memory bottleneck while increasing computational power. The Intel Mesh Interconnect architecture is the latest interconnect designed by Intel for its HPC product lines. Processors are organized in a rectangular networ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…Going forward, it would be desirable to improve this design, coupling the directory distribution that avoids bottlenecks in the NoC with a more regular and predictable mapping of the memory blocks to enable programmers, particularly in the high-performance computing domain, to have full control over coherence traffic. Horro et al [13] developed a simulator for the traffic on the NoC of distributed directory architectures based on the Tejas architectural simulator [25], predicting that codes with coherence traffic control would experiment a 20% decrease in overall traffic over the NoC, yielding more than 50% latency improvement for the coherence packets.…”
Section: Discussion and Related Workmentioning
confidence: 99%
“…Going forward, it would be desirable to improve this design, coupling the directory distribution that avoids bottlenecks in the NoC with a more regular and predictable mapping of the memory blocks to enable programmers, particularly in the high-performance computing domain, to have full control over coherence traffic. Horro et al [13] developed a simulator for the traffic on the NoC of distributed directory architectures based on the Tejas architectural simulator [25], predicting that codes with coherence traffic control would experiment a 20% decrease in overall traffic over the NoC, yielding more than 50% latency improvement for the coherence packets.…”
Section: Discussion and Related Workmentioning
confidence: 99%
“…On-chip networks, also known as networks on chip (NoCs), typically consist of a set of routers that connect to each other and to endpoints in a manner defined by the topology [33]. NoCs have been commercially adopted [34], [35] and may use deterministic, oblivious, or adaptive routing. Moreover, the flow control defines how packets progress through the network.…”
Section: B Nocs and Deflection Flow Controlmentioning
confidence: 99%
“…Interestingly, a ring topology can outperform mesh topology for workloads exhibiting moderate to high memory locality accesses [46]. Rings also have been used in commercial systems (e.g., the first generation of Intel Xeon Phi co-processor uses a dual ring topology, while second generation, as well as Intel SkyLake-SP Xeon processor, use a ring-based mesh [22,24,51]). The main advantage of using the ring topology is the low latency offered to the data packets to reach their destinations.…”
Section: Introductionmentioning
confidence: 99%