2016
DOI: 10.1007/978-3-319-43659-3_39
|View full text |Cite
|
Sign up to set email alerts
|

Using Data Dependencies to Improve Task-Based Scheduling Strategies on NUMA Architectures

Abstract: The recent addition of data dependencies to the OpenMP 4.0 standard provides the application programmer with a more flexible way of synchronizing tasks. Using such an approach allows both the compiler and the runtime system to know exactly which data are read or written by a given task, and how these data will be used through the program lifetime. Data placement and task scheduling strategies have a significant impact on performances when considering NUMA architectures. While numerous papers focus on these top… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0
2

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 22 publications
(20 citation statements)
references
References 14 publications
0
18
0
2
Order By: Relevance
“…Scheduling to improve data locality and minimizing NUMA effects in shared memory task parallel execution is an active research area [6,21,22,23,24,25,26,27] and can also be coupled to energy considerations [28,29]. : Left: A small task graph where all accesses (the type is indicated for each task) are assumed to be to the same shared data.…”
Section: Tracking Dependencies Through Data Versioningmentioning
confidence: 99%
“…Scheduling to improve data locality and minimizing NUMA effects in shared memory task parallel execution is an active research area [6,21,22,23,24,25,26,27] and can also be coupled to energy considerations [28,29]. : Left: A small task graph where all accesses (the type is indicated for each task) are assumed to be to the same shared data.…”
Section: Tracking Dependencies Through Data Versioningmentioning
confidence: 99%
“…Some of them are designed with memory migration heuristics or strategies to reduce remote memory access. Some target specific programming environments and runtime systems and reschedule application‐specific tasks to run on cores for load balancing on NUMA systems. Some adopt user‐level mechanisms, whereas most of them adopt kernel‐level mechanisms due to their efficiency in directly managing and allocating system resources.…”
Section: Technology Background and Related Workmentioning
confidence: 99%
“…To reduce remote memory access and/or resource contention, existing studies use various strategies. Most of them focus on either the scheduling techniques and thread mapping strategies or memory placement and data mapping policies.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Both approaches partition an initial subgraph containing the firstly created tasks but propagate this partition in different ways: RIP-DEP exploits information regarding the allocation of tasks' input data while RIP-MW repartitions the initial TDG subgraph as new tasks are added. • A complete performance evaluation of the proposed techniques against 3 other methods: an expert programmerdriven policy, a locality-unaware distributed first-in-first-out (DFIFO) approach and an implementation of a state-of-theart technique [17,18,42], dependency easy placement (DEP), that automatically schedules tasks depending on where their input and output data are allocated. Our evaluations consider 8 different OpenMP codes and 2 different parallel systems with up to 288 cores.…”
Section: Introductionmentioning
confidence: 99%