2013 International Conference on Field-Programmable Technology (FPT) 2013
DOI: 10.1109/fpt.2013.6718365
|View full text |Cite
|
Sign up to set email alerts
|

From software threads to parallel hardware in high-level synthesis for FPGAs

Abstract: Abstract-We describe the support within high-level hardware synthesis (HLS) for two standard software parallelization paradigms: Pthreads and OpenMP. Parallel code segments, as specified in the software, are automatically synthesized by our HLS tool into parallel-operating hardware sub-circuits. Both data parallelism and task-level parallelism are supported, as is the combined use of both Pthreads and OpenMP. Moreover, our work also provides automated synthesis for commonly occurring synchronization constructs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
46
0
1

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 69 publications
(47 citation statements)
references
References 15 publications
0
46
0
1
Order By: Relevance
“…The proposed methodology has been verified on a set of parallel benchmarks distributed with Legup [8]. In OpenMP benchmarks each #pragma omp for has been replaced with #pragma omp simd, while pthread benchmarks have to be re-factorized to replace pthread parallelism with #pragma omp simd.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…The proposed methodology has been verified on a set of parallel benchmarks distributed with Legup [8]. In OpenMP benchmarks each #pragma omp for has been replaced with #pragma omp simd, while pthread benchmarks have to be re-factorized to replace pthread parallelism with #pragma omp simd.…”
Section: Resultsmentioning
confidence: 99%
“…The synthesis of Blackschoels on the Stratix V with P = 3,4,8 was not been possible because the device does not contain enough DSPs to implement the vectorized version of the benchmarks. The area results refer only to the synthesized accelerators since the produced parallel hardware architectures, differently from the ones presented in [8], do not require any external processor nor external controller to be integrated in a system. Memory utilization has not been reported since it is independent from the degree of parallelism.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations