2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) 2011
DOI: 10.1109/iccad.2011.6105349
|View full text |Cite
|
Sign up to set email alerts
|

Massively parallel programming models used as hardware description languages: The OpenCL case

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 11 publications
0
7
0
Order By: Relevance
“…Owaida et al [19] describe an automatic methodology to synthesize OpenCL workgroups into hardware accelerators for FPGA targets. The internal core of the accelerators works in a streaming-like fashion on data that is prefetched by a request generator that can directly access system memory.…”
Section: Related Workmentioning
confidence: 99%
“…Owaida et al [19] describe an automatic methodology to synthesize OpenCL workgroups into hardware accelerators for FPGA targets. The internal core of the accelerators works in a streaming-like fashion on data that is prefetched by a request generator that can directly access system memory.…”
Section: Related Workmentioning
confidence: 99%
“…The FPGA Brook implementation uses 18,909 LEs, 63 9-bit multipliers, and 33 4-Kbit memory blocks. Another related paper [Owaida et al 2011] uses matrix multiplication as a benchmark, but uses floating point arithmetic, making it difficult to perform a meaningful comparison. Altera [2012b] reported 60X speedup over the Nios II processor when implementing the Mandelbrot application using manually tuned C2H code that was written in a way that allowed the hardware implementation to take advantage of data parallelism.…”
Section: Mandelbrot Setmentioning
confidence: 99%
“…Their results also show that the circuits generated by the Optimus compiler significantly outperform the embedded PowerPC processor, which is similar to our results relative to the Nios II soft processor. Other related projects include FCUDA [Papakonstantinou et al 2009], which compiles CUDA kernels to FPGA logic, and compilers mapping applications described in OpenCL to FPGA logic [Altera 2012e;Lin et al 2010;Owaida et al 2011]. …”
Section: Related Workmentioning
confidence: 99%
“…In contrast to this platform, we propose a workgroup synthesis techniques to efficiently map OpenCL applications that process large amount of data on a very resourcerestricted FPGA in the Xilinx Zynq SoC. Owaida et al [3] propose a source-to-source converter to generate a C code from the OpenCL kernel code. They utilise compiler techniques such as prediction, code slicing and modulo scheduling to convert the C code into an HDL code to be synthesised by synthesis tools considering a hardware architecture.…”
Section: Previous Workmentioning
confidence: 99%
“…This critical innovation could make FPGA devices accessible to software programmers as an additional computing resource without changes to the algorithm source code. A few commercial and research OpenCL frameworks have been proposed by industry and researchers [3][4][5][6][7]. The commercially available OpenCL frameworks recommend that designers should structure their OpenCL kernel as a single workitem kernel (instead of NDRange kernels) if it is possible to improve performance [8].…”
Section: Introductionmentioning
confidence: 99%