2019
DOI: 10.1016/j.sysarc.2019.02.013
|View full text |Cite
|
Sign up to set email alerts
|

Optimized implementation of OpenCL kernels on FPGAs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 6 publications
0
10
0
Order By: Relevance
“…In the OpenCL execution architecture, the host-side program is used to uniformly manage and schedule multiple computing devices that support OpenCL [31]. When the host side submits the kernel to the computing device, OpenCL defines the organizational structure of the work-item through the index space and defines how the kernel operates on the computing device in a mapping manner on the computing device [32], [33]. In the OpenCL abstract model, each instance of the execution kernel is called a work-item, which is represented by its coordinates in the NDRange.…”
Section: A Opencl Parallel Computing Platformmentioning
confidence: 99%
“…In the OpenCL execution architecture, the host-side program is used to uniformly manage and schedule multiple computing devices that support OpenCL [31]. When the host side submits the kernel to the computing device, OpenCL defines the organizational structure of the work-item through the index space and defines how the kernel operates on the computing device in a mapping manner on the computing device [32], [33]. In the OpenCL abstract model, each instance of the execution kernel is called a work-item, which is represented by its coordinates in the NDRange.…”
Section: A Opencl Parallel Computing Platformmentioning
confidence: 99%
“…Nowadays, the Matlab based high level coding is advantageous for user prospective [19]. But the Matlab converted verilog/VHDL code is generalized in nature and is not optimal [21]. Such implementation uses more resources and introduces computational complexities as compare to the same algorithm implemented with low level Verilog/VHDL coding.…”
Section: Proposed Hardwarementioning
confidence: 99%
“…Recently, the FPGA vendors presented an open computing language (OpenCL) [20] to get two-fold benefits: (i) A generalized coding environment for different vendors' FPGAs, and (ii) Abstract (high-level) coding environment for an FPGA. Although the programming of an FPGA by this high level language/software significantly reduces the development time compared to the traditional lowlevel languages (e.g., Verilog or VHDL) but results in significantly under-utilizing the computing capabilities of the device [21]. Thus, a trade-off exist between ease of coding and optimized coding.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Shata et al present a lengthy study of several generic code optimizations for OpenCL kernels [33]. These include avoiding the use of global variables (e.g., for reduce operations), studying the effect of suggested or enforced work-group size for different kernels, and employing inter-kernel channels.…”
Section: Related Workmentioning
confidence: 99%