2016
DOI: 10.1109/tpds.2015.2487346
|View full text |Cite
|
Sign up to set email alerts
|

Hardware Implementation on FPGA for Task-Level Parallel Dataflow Execution Engine

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…The implementation throughput was 40 GB/s with an efficiency of 5.27 Mb/slice and a powerful performance of 286 GB/w. In [20], techniques to perform task-level out-of-order execution were proposed and implemented using the Xilinx Virtex-5 FPGA device to improve flexibility. The implementation results showed a better efficiency in terms of performance and resource usage.…”
Section: Related Workmentioning
confidence: 99%
“…The implementation throughput was 40 GB/s with an efficiency of 5.27 Mb/slice and a powerful performance of 286 GB/w. In [20], techniques to perform task-level out-of-order execution were proposed and implemented using the Xilinx Virtex-5 FPGA device to improve flexibility. The implementation results showed a better efficiency in terms of performance and resource usage.…”
Section: Related Workmentioning
confidence: 99%
“…Refs. [9,10] propose task-level schedulers to accelerate the modules running in parallel on FPGA. HRES is a task partitioning method based on DPR (dynamic partial reconfiguration) [11].…”
Section: Heterogeneous Computing Workload Partioningmentioning
confidence: 99%
“…On the other side, task partitioning does not need to deploy FPGA resources to all sub-tasks, because part of sub-tasks entirely runs on the GPU or CPU. [9,10] propose tasklevel schedulers to accelerate the modules running in parallel on FPGA. HRES is a task partitioning method based on DPR (dynamic partial reconfiguration) [11].…”
Section: Heterogeneous Computing Workload Partioningmentioning
confidence: 99%
“…Step 5: After receiving the Adder and Minus invoked by the user, the OS performs task scheduling to determine the dependency between the tasks and the execution timing of the tasks [15], [16], and checks whether a suitable black box can be used as an execution component. If yes, go to Step 7, otherwise go to Step 6.…”
Section: B Software Stackmentioning
confidence: 99%