Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07) 2007
DOI: 10.1109/ccgrid.2007.60
|View full text |Cite
|
Sign up to set email alerts
|

Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An MPI Perspective

Abstract: Large scale InfiniBand clusters are becoming increasingly popular, as reflected by the TOP 500 Supercomputer rankings. At the same time, fat tree has become a popular interconnection topology for these clusters, since it allows multiple paths to be available in between a pair of nodes. However, even with fat tree, hot-spots may occur in the network depending upon the route configuration between end nodes and communication pattern(s) in the application. To make matters worse, the deterministic routing nature of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
23
0

Year Published

2007
2007
2015
2015

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 34 publications
(24 citation statements)
references
References 10 publications
(17 reference statements)
1
23
0
Order By: Relevance
“…Thus, it is comparable to the solution discussed by Vishnu et al in [19]. However, Vishnu et al only discuss a particular worst-case pattern.…”
Section: Resultssupporting
confidence: 72%
See 2 more Smart Citations
“…Thus, it is comparable to the solution discussed by Vishnu et al in [19]. However, Vishnu et al only discuss a particular worst-case pattern.…”
Section: Resultssupporting
confidence: 72%
“…Martinez et al proposed a fully adaptive routing scheme for InfiniBand networks [10] that requires significant changes to the crossbar switches and thus is currently not implemented. A simpler scheme for InfiniBand which uses multiple endpoint addresses (LIDs) per node has been proposed in [9] and implemented in [19]. Another scheme to optimize routing in fat-tree networks that requires global routing information and a specialized hardware has been proposed in [5].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…One way is for nodes to send their partial sum (a single variable) to a single node to perform the summation and generate a single result. This only occupies a single node's processor but completion time grows linearly with the number of operands and can also create network hotspots [17]. The other approach (used in most MPI implementations) performs the summation in a tree fashion, where each node adds a number of operands and passes the partial sum (also a single variable) to the next level of the tree.…”
Section: Department Of Energy Under Contract No De-ac02-05ch11231mentioning
confidence: 99%
“…As high-end computing (HEC) systems continue to increase rapidly in size, their network subsystems must scale as well, particularly to address issues such as hot-spot congestion [22], [26] and hardware faults [8]. Multi-path communication, supported by InfiniBand (IB) [1] and 10-Gigabit Ethernet (10GE) [16], provides a way to address these issues.…”
Section: Introductionmentioning
confidence: 99%