2022
DOI: 10.1364/jocn.451760

Performance trade-offs in reconfigurable networks for HPC

Abstract: Designing efficient interconnects to support high-bandwidth and low-latency communication is critical toward realizing high performance computing (HPC) and data center (DC) systems in the exascale era. At extreme computing scales, providing the requisite bandwidth through overprovisioning becomes impractical. These challenges have motivated studies exploring reconfigurable network architectures that can adapt to traffic patterns at runtime using optical circuit switching. Des…

Cited by 14 publications (5 citation statements)
References 60 publications

“…Of late, the development of interconnects to facilitate low-latency and high-bandwidth is crucial in the direction of HPC and data center systems. To resolve this problem, Teh et al [19] investigated reconfigurable network architectures that can adapt to traffic patterns at runtime using optical circuit switching. The study of how network performance, cost, scalability, and power consumption differ based on optical circuit switch (OCS) placement in the physical topology is presented.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…Optical switching technology has developed rapidly, and there has been a great interest in using them in DCNs in recent years [4,8,10,15,21,23,28,29,32,33,38,39,42,47,50,51,54]. To fully utilize the high capacity and power efficiency of optical switching, one trend is to connect racks with optical switches directly [4,10,21,32,33,51].…”
Section: Background and Motivation
Citation type: mentioning
confidence: 99%
“…Meanwhile, the detouring also damages mice flow FCT, particularly when elephant flows are spread across the network and block the mice ones at intermediate nodes. The performance downgrade worsens under heavier loads, which is a critical concern for HPC tasks like large-scale ML training where large amounts of flows are synchronously released to the network [25,27,47]. NegotiaToR is designed to meet these needs, offering a practical solution that can accommodate the high-performance requirements of modern DCNs where fast optical switching technology is ready.…”
Section: Background and Motivation
Citation type: mentioning
confidence: 99%
“…• Scalability: Using pods with hundreds of uplinks to the OCSs, our architecture could support up to about 100 pods. Since each pod could support Θ(1000) servers, our architecture can easily scale up to over 100k servers [43].…”
Section: B Network Architecture
Citation type: mentioning
confidence: 99%