The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays 2021
DOI: 10.1145/3431920.3439301
|View full text |Cite
|
Sign up to set email alerts
|

HBM Connect: High-Performance HLS Interconnect for FPGA HBM

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 52 publications
(25 citation statements)
references
References 16 publications
0
23
0
Order By: Relevance
“…Finally, their microbenchmark is developed in RTL, which still leaves a gap for software programmers who use HLS. More recently, Choi et al further proposed HBM Connect [9], a fully customized HBM crossbar to better utilize HBM bandwidth when multiple PEs access multiple HBM banks, which is orthogonal to our work. Characterization of CPU-FPGA Communication.…”
Section: Related Workmentioning
confidence: 99%
“…Finally, their microbenchmark is developed in RTL, which still leaves a gap for software programmers who use HLS. More recently, Choi et al further proposed HBM Connect [9], a fully customized HBM crossbar to better utilize HBM bandwidth when multiple PEs access multiple HBM banks, which is orthogonal to our work. Characterization of CPU-FPGA Communication.…”
Section: Related Workmentioning
confidence: 99%
“…To enable concurrent processing, we partition all the three major components internally. We use multi-stage switch networks [34] to improve the clock frequency without sacrificing the throughput [11] when all-to-all concurrent communication is required. Besides the three major components, the SPLAG accelerator also contains a dispatcher responsible for injecting the first active vertex, controlling program termination, and collecting statistics.…”
Section: The Splag Acceleratormentioning
confidence: 99%
“…The bandwidth of each pseudo channel is 14.375 GB/s, for a total bandwidth of 460 GB/s. Because HBM is a new feature to FPGAs, existing studies of FPGA HBM mainly focus on tool development [17,18,37] and benchmarking [19,50], but very few applications. SpMM, a memory-intensive application which is distinguished from typical computation-intensive FPGA applications [5,23,31,77,87,88], is a good fit for HBM.…”
Section: High Bandwidth Memorymentioning
confidence: 99%
“…C2 -The irregular column index shown as colored square numbers in Figure 1 (b) and (c) lead to irregular memory read requests, whereas the irregular row destination of PEs in Figure 1 (c) leads to irregular memory write requests. Although our accelerators are equipped with HBM which has higher memory bandwidth, the latency of accessing HBM is still high (up to 100 cycles) [18]. Inspired by the idea of caching random accessing on a higher memory hierarchy in graph processing [70,94], we partition the random memory read and write into a specific window, so random memory accessing is limited to on-chip fast memory.…”
Section: Motivationmentioning
confidence: 99%