SC14: International Conference for High Performance Computing, Networking, Storage and Analysis 2014
DOI: 10.1109/sc.2014.33
|View full text |Cite
|
Sign up to set email alerts
|

Maximizing Throughput on a Dragonfly Network

Abstract: Abstract-Interconnection networks are a critical resource for large supercomputers. The dragonfly topology, which provides a low network diameter and large bisection bandwidth, is being explored as a promising option for building multi-Petaflop/s and Exaflop/s systems. Unlike the extensively studied torus networks, the best choices of message routing and job placement strategies for the dragonfly topology are not well understood. This paper aims at analyzing the behavior of an interconnect based on the dragonf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
17
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 47 publications
(20 citation statements)
references
References 25 publications
1
17
0
Order By: Relevance
“…Figure 1 demonstrates runtime variability when running fast Fourier Transform programs [9,10] for solving the Navier-Stokes equations on Hazelhen and Shaheen II. Both these computers use adaptive routing and job placement to speed up job throughput [18], but this can result in some run time variability, a subject of current research [7,28,35,38,39].…”
Section: Lessons Learnedmentioning
confidence: 99%
“…Figure 1 demonstrates runtime variability when running fast Fourier Transform programs [9,10] for solving the Navier-Stokes equations on Hazelhen and Shaheen II. Both these computers use adaptive routing and job placement to speed up job throughput [18], but this can result in some run time variability, a subject of current research [7,28,35,38,39].…”
Section: Lessons Learnedmentioning
confidence: 99%
“…Instead, algorithms like the local variant of Universal Globally-Adaptive Load-balanced routing [30,29] are used in practice. The complexity of this issue is exacerbated by the fact that the adaptive routing algorithm's performance depends in part on the workload [31].…”
Section: Topologies and Routingmentioning
confidence: 99%
“…is paper focuses on fat-tree because of the multiplicity of design options available for it [20,34]. For analysis and prediction of performance on HPC networks, several approaches ranging from analytical modeling [12,26,36,38] to it and packet-level simulations [18,28,35,41] have been proposed. We deploy trace-driven packet-level simulations because they allow for use of existing codes for simulating their communication complexity.…”
Section: Related Workmentioning
confidence: 99%