2010 18th Euromicro Conference on Parallel, Distributed and Network-Based Processing 2010
DOI: 10.1109/pdp.2010.67
|View full text |Cite
|
Sign up to set email alerts
|

hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications

Abstract: Abstract-The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications now have to carefully adapt their placement and behavior according to the underlying hierarchy of hardware resources and their software affinities.We introduce the Hardware Locality (hwloc) software which gathers hardware information about processors, caches, memory nodes and more, and exposes it to applications and runtime systems in a abstrac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
197
0
3

Year Published

2010
2010
2014
2014

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 299 publications
(209 citation statements)
references
References 13 publications
0
197
0
3
Order By: Relevance
“…We reused the hwloc library [2] to obtain the topology of the underlying computation node. Once discovered, the runtime decides how to decompose the hardware resources (processing elements, caches, etc.)…”
Section: Methodsmentioning
confidence: 99%
“…We reused the hwloc library [2] to obtain the topology of the underlying computation node. Once discovered, the runtime decides how to decompose the hardware resources (processing elements, caches, etc.)…”
Section: Methodsmentioning
confidence: 99%
“…If we also know the node's hardware topology, we can associate a memory sample with a specific portion of it. Tools such as hwloc [5] and likwid [6] exist to detect the available hardware topology on a system. We use a modification of likwid suited to integrate with our existing codebase.…”
Section: A Node Hardware Topologymentioning
confidence: 99%
“…It is also completely transparent: applications cannot determine whether they are running under CPU speed degradation unless they read the inria-00535534, version 2 -14 Jul 2011 Figure 1: Architecture of a dual Intel L5420 machine, as shown by hwloc [6]. Cores sharing their L2 cache cannot change speed independently.…”
Section: Related Workmentioning
confidence: 99%