2012 International Conference for High Performance Computing, Networking, Storage and Analysis 2012
DOI: 10.1109/sc.2012.75
|View full text |Cite
|
Sign up to set email alerts
|

Mapping applications with collectives over sub-communicators on torus networks

Abstract: Abstract-The placement of tasks in a parallel application on specific nodes of a supercomputer can significantly impact performance. Traditionally, this task mapping has focused on reducing the distance between communicating tasks on the physical network. This minimizes the number of hops that point-to-point messages travel and thus reduces link sharing between messages and contention. However, for applications that use collectives over sub-communicators, this heuristic may not be optimal. Many collectives can… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 39 publications
(24 citation statements)
references
References 19 publications
0
24
0
Order By: Relevance
“…pF3D is a communication-heavy application and has been shown to benefit significantly from task mapping on Blue Gene/P [6]. This is the first attempt at mapping pF3D on Blue Gene/Q.…”
Section: Results With Pf3dmentioning
confidence: 99%
See 1 more Smart Citation
“…pF3D is a communication-heavy application and has been shown to benefit significantly from task mapping on Blue Gene/P [6]. This is the first attempt at mapping pF3D on Blue Gene/Q.…”
Section: Results With Pf3dmentioning
confidence: 99%
“…We use a Python mapping tool, Rubik [6,13] Rest of the tree AB -average bytes MB -maximum bytes (a) Decision tree. Based on the training set and the learning scheme, conditions are computed to guide prediction based on features, e.g., maximum bytes and average bytes.…”
Section: Input Datamentioning
confidence: 99%
“…All the experimental data for this study has been collected on Vulcan, an IBM Blue Gene/Q installation at LLNL. We use Rubik [12] to generate many different task mappings of the code running on a 5D torus. Based on the list of hardware components that could contribute to network congestion (Table I), we gather communication data from three network hardware counters: the number of packets sent on each link, the receive buffer length and the number of packets received on each link.…”
Section: A Gathering Data For Supervised Learningmentioning
confidence: 99%
“…For example, Blue Gene/Q (BG/Q) uses a five-dimensional (5D) torus interconnect as the network between processing nodes. Different mappings of the 3D logical process grid of pF3D onto the 5D torus of BG/Q can make a significant difference in how messages are routed on the network and in turn, performance [6]. The effect was previously studied on a 3D torus interconnect with the aid of a visualization specific to that interconnect topology [26].…”
Section: Laser-plasma Interaction Simulationsmentioning
confidence: 99%