2020 30th International Conference on Field-Programmable Logic and Applications (FPL) 2020
DOI: 10.1109/fpl50879.2020.00032
|View full text |Cite
|
Sign up to set email alerts
|

Exploring FPGA Optimizations in OpenCL for Breadth-First Search on Sparse Graph Datasets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 22 publications
0
3
0
Order By: Relevance
“…Running various signal processing functions (e.g., different waveform shaping techniques from other running slices) on shared heterogeneous SDR architectures brings the challenge of irregular runtime adaptation (IRA) to the iRAN-S scene. IRA was examined for graph algorithms design in high-performance computing (HPC) [13]. In [13], authors discussed the runtime adaptation between various parallelizing techniques for irregular applications that run across shared heterogeneous architectures.…”
Section: Directions For Future Researchmentioning
confidence: 99%
See 1 more Smart Citation
“…Running various signal processing functions (e.g., different waveform shaping techniques from other running slices) on shared heterogeneous SDR architectures brings the challenge of irregular runtime adaptation (IRA) to the iRAN-S scene. IRA was examined for graph algorithms design in high-performance computing (HPC) [13]. In [13], authors discussed the runtime adaptation between various parallelizing techniques for irregular applications that run across shared heterogeneous architectures.…”
Section: Directions For Future Researchmentioning
confidence: 99%
“…IRA was examined for graph algorithms design in high-performance computing (HPC) [13]. In [13], authors discussed the runtime adaptation between various parallelizing techniques for irregular applications that run across shared heterogeneous architectures. Irregularity at runtime could arise due to several reasons, such as irregular memory access patterns (IMA) and irregular workload distribution (IWD).…”
Section: Directions For Future Researchmentioning
confidence: 99%
“…Here, OpenCL uses synchronous queue execution, in which the program waits until the executed command ends. Synchronous queue execution has the limitation of delaying the processing speed of a host, because the right to control the command that is executed in a computing device does not return to the host until the command ends, and this induces kernel launch overhead [50]. Therefore, our framework processes each OpenCL command in an asynchronous manner, thus maximizing GPU utilization.…”
Section: Asynchronous Queue Executionmentioning
confidence: 99%