2014
DOI: 10.1016/j.procs.2014.05.101
|View full text |Cite
|
Sign up to set email alerts
|

Generating Code and Memory Buffers to Reorganize Data on Many-core Architectures

Abstract: International audienceThe dataflow programming model has shown to be a relevant approach to efficiently run mas-sively parallel applications over many-core architectures. In this model, some particular builtin agents are in charge of data reorganizations between user agents. Such agents can Split, Join and Duplicate data onto their communication ports. They are widely used in signal processing for example. These system agents, and their associated implementations, are of major impor-tance when it comes to perf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…The RDMA-based tiler operates at runtime (graph execution) inside each compute cluster concurrently, distributing the execution of each OpenVX node across the entire matrix of compute clusters. This technique is essential to achieve low-latency execution and contrasts with classic dataflow graph execution where actors map to clusters [2], [21]. Algorithm 1 receives commands through the job queues as seen in Figure 2 when the host application calls vxProcess-Graph.…”
Section: Runtime Optimization Rdma-based Tiling and Fusionmentioning
confidence: 99%
“…The RDMA-based tiler operates at runtime (graph execution) inside each compute cluster concurrently, distributing the execution of each OpenVX node across the entire matrix of compute clusters. This technique is essential to achieve low-latency execution and contrasts with classic dataflow graph execution where actors map to clusters [2], [21]. Algorithm 1 receives commands through the job queues as seen in Figure 2 when the host application calls vxProcess-Graph.…”
Section: Runtime Optimization Rdma-based Tiling and Fusionmentioning
confidence: 99%
“…Indeed, by using contiguous memory spaces, the developer of an application avoids the multiple jumps in memory that would have a negative impact on the system performance. By doing so, the developer also avoids writing complex pointer operations that would decrease the source code readability [Cudennec et al 2014].…”
Section: Graymentioning
confidence: 99%
“…In [Cudennec et al 2014], a technique is proposed to enable buffer merging for a set of actors with pre-defined behavior. In contrast to the method presented in this article, this technique does not allow buffer merging for actors with a user-defined behavior.…”
Section: Dataflow Optimizationsmentioning
confidence: 99%
“…In [5], a technique is proposed to enable buffer merging for a set of actors with pre-defined behavior. Contrary to the method presented in this paper, this technique does not allow buffer merging for actors with a user-defined behavior.…”
Section: Related Workmentioning
confidence: 99%
“…SDF actors are considered as "black boxes" within the model whose internal behavior can be implemented in any programming language. To simplify the description of this internal behavior, it is convenient to assume that the memory consumed and produced on each FIFO during the firing of an actor constitutes a contiguous memory space called a buffer [5]. To reveal these buffers, an SDF graph can be transformed into an equivalent single-rate graph where each FIFO is replaced with single-rate FIFOs whose consumption and production rates are equal ( Figure 2).…”
Section: Introductionmentioning
confidence: 99%