2019
DOI: 10.1109/tpds.2019.2910068
|View full text |Cite
|
Sign up to set email alerts
|

HitGraph: High-throughput Graph Processing Framework on FPGA

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
64
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 81 publications
(66 citation statements)
references
References 33 publications
1
64
0
1
Order By: Relevance
“…Bellman-Ford Algorithm Accelerators. HitGraph [58] and its earlier version [57] implement an edge-centric graph accelerator. Leveraging the larger sequential bandwidth, HitGraph writes the intermediate relaxation results to DRAM when generated and reads them back when needed.…”
Section: 32mentioning
confidence: 99%
See 1 more Smart Citation
“…Bellman-Ford Algorithm Accelerators. HitGraph [58] and its earlier version [57] implement an edge-centric graph accelerator. Leveraging the larger sequential bandwidth, HitGraph writes the intermediate relaxation results to DRAM when generated and reads them back when needed.…”
Section: 32mentioning
confidence: 99%
“…Another challenge to efficiently implement priority queue-based SSSP algorithms is that priority-order graph traversal prohibits many reordering techniques used in many graph accelerators [5,16,17,28,49,58], which are vitally important to reducing external memory traffic and achieving high performance. As such, many graph accelerators [5,28,58] implement the Bellman-Ford algorithm [50] that does not require a priority queue at all. However, these accelerators work best for algorithms whose amount of work is insensitive to the traversal order (e.g., SpMV and PageRank).…”
Section: Introductionmentioning
confidence: 99%
“…• A general-purpose and user-friendly SpMM accelerator. Domain specific architectures [21,22,27,45] have been designed for boosting computing performance and efficiency in many application domains such as deep learning [5, 11, 12, 23, 31, 35, 47, 64-69, 77, 87, 88], dense linear algebra [23,29,30,35,77], graph processing [4,7,17,25,26,39,48,56,70,89,91,92,95], genomic and bio analysis [8,8,9,13,14,33,38,51,76,81], and data sorting [10,52,60,63]. However, most accelerators are designed for one specific problem with fixed input and output size.…”
Section: Motivationmentioning
confidence: 99%
“…• Challenge 3 -How to design a general-purpose accelerator which does not need to be rerun the time-consuming flow of synthesis/place/route. While many accelerators have been designed for boosting computing performance and efficiency in many application domains such as deep learning [5, 11, 12, 23, 31, 35, 64-69, 77, 87, 88], dense linear algebra [23,29,30,35,77], graph processing [4,17,25,26,39,70,89,91,92,95], genomic and bio analysis [8,9,13,14,33,38,51,76,81], data sorting [10,52,60,63], most are designed for one specific problem with fixed input and output size. For FPGA accelerators even with improved tools such as [17,77], a new design will still consume many hours or even a few days due to long synthesis and place/route time.…”
Section: Introductionmentioning
confidence: 99%
“…规则应用 编程范式的核心难点是如何针对规则应用中的复杂数据流访存行为, 结合动态重构存储系统的特性扩 展现有的流式处理编程范式 [58] . 而在处理面向以图计算为代表的非规则应用时, 流式处理会带来大 范围的随机访存, 严重影响系统性能, 需要考虑采用对数据进行分块处理的方法 [59] . 非规则应用的程 范式的核心难点是如何利用数据分块减少随机访存的范围, 通过存储系统的动态重构充分复用分块.…”
Section: 可重构计算模型的提出unclassified