2018
DOI: 10.1007/978-3-319-78890-6_58
|View full text |Cite
|
Sign up to set email alerts
|

HLS Algorithmic Explorations for HPC Execution on Reconfigurable Hardware - ECOSCALE

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 5 publications
0
8
0
Order By: Relevance
“…The deviation from the ideal 4x mainly appears as we move from 3 to 4 accelerators, as the maximum sustained memory bandwidth gets reached, revealing a memory bandwidth limit of the deployed MPSoC devices at about 9 GBytes/sec. The best performance we can achieve is 7.97 sec (equivalent to 73.3 GFLOPS), which is reported in [49] and is about 2.5 times better than our prior reported results in [72], demonstrating a significant improvement over our previous single-FPGA implementation of the UNILOGIC architecture. accelerators that are not so data hungry, would benefit from higher frequencies, as would be the case when mixing data-bound and computation-bound accelerators.…”
Section: Optimizations On a Single Fpgamentioning
confidence: 49%
See 4 more Smart Citations
“…The deviation from the ideal 4x mainly appears as we move from 3 to 4 accelerators, as the maximum sustained memory bandwidth gets reached, revealing a memory bandwidth limit of the deployed MPSoC devices at about 9 GBytes/sec. The best performance we can achieve is 7.97 sec (equivalent to 73.3 GFLOPS), which is reported in [49] and is about 2.5 times better than our prior reported results in [72], demonstrating a significant improvement over our previous single-FPGA implementation of the UNILOGIC architecture. accelerators that are not so data hungry, would benefit from higher frequencies, as would be the case when mixing data-bound and computation-bound accelerators.…”
Section: Optimizations On a Single Fpgamentioning
confidence: 49%
“…This oneto-many solution has been already implemented and tested. We have successfully used it on our first, proof-of-concept, single FPGA prototype [72]. Some analysis on this will be presented in the evaluation section 6, however it is important to note here that we have even analyzed hardware architecture realizations with up to 12 accelerators controlled by a single scheduler, with encouraging results.…”
Section: Accelerator Controller: Virtualization Of the Accelerator Slotsmentioning
confidence: 97%
See 3 more Smart Citations