2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) 2021
DOI: 10.1109/isca52012.2021.00054
|View full text |Cite
|
Sign up to set email alerts
|

Large-Scale Graph Processing on FPGAs with Caches for Thousands of Simultaneous Misses

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
25
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(25 citation statements)
references
References 43 publications
0
25
0
Order By: Relevance
“…: uncertain. 2 Required programming language for development with the system; 3 Whether the system supports automated design low for developers; 4 Whether the system has been optimized for FPGAs with multiple SLRs (from Xilinx); 5 Whether the system has been optimized for HBM-enabled FPGA platforms; 6 Evaluation is based on simulation analysis (SIM) or real hardware implementation (HW); 7 Number of evaluated graph applications with the system in corresponding papers; 8 Whether the code of the system is publicly available.…”
Section: Existing Workmentioning
confidence: 99%
See 2 more Smart Citations
“…: uncertain. 2 Required programming language for development with the system; 3 Whether the system supports automated design low for developers; 4 Whether the system has been optimized for FPGAs with multiple SLRs (from Xilinx); 5 Whether the system has been optimized for HBM-enabled FPGA platforms; 6 Evaluation is based on simulation analysis (SIM) or real hardware implementation (HW); 7 Number of evaluated graph applications with the system in corresponding papers; 8 Whether the code of the system is publicly available.…”
Section: Existing Workmentioning
confidence: 99%
“…delivers an additional 2× speedup by enabling two-level caching to ForeGraph. Nevertheless, the two works above adopt the interval-shard based partitioning method and bufers both source and destination vertices, resulting in a signiicant data replication factor and heavy preprocessing overhead [3]. Moreover, these two works are simulations that are not publicly available.…”
Section: Hbm On Fpgasmentioning
confidence: 99%
See 1 more Smart Citation
“…As a result, efficient graph processing is becoming increasingly important, especially as the amount of graph data grows [35]. Allowing efficient customization on the hardware logic to computation/memory access patterns, FPGA usually delivers better memory efficiency and energy efficiency than CPUs/G-PUs [1], [3], [4], [9], [29], [36], [50]- [52]. Furthermore, highlevel synthesis (HLS) that translates kernels written in highlevel languages to low-level RTL modules alleviates the poor programmability issue of FPGAs, providing high usability to efficient graph processing systems [4], [14], [28].…”
Section: Introductionmentioning
confidence: 99%
“…Graph processing explores the irregular structure of a graph rather than performing large numbers of computations, resulting in poor data locality and high communication to computation ratio [25], [53]. Memory 1 ReGraph is open-sourced at https://anonymous.4open.science/r/ReGraph/. bandwidth is therefore the major bottleneck to the system performance.…”
Section: Introductionmentioning
confidence: 99%