2020
DOI: 10.1587/transinf.2019edp7144
|View full text |Cite
|
Sign up to set email alerts
|

Daisy-Chained Systolic Array and Reconfigurable Memory Space for Narrow Memory Bandwidth

Abstract: Jun IWAMOTO †a) , Yuma KIKUTANI †b) , Renyuan ZHANG †c) , Members, and Yasuhiko NAKASHIMA †d) , Fellow SUMMARY A paradigm shift toward edge computing infrastructures that prioritize small footprint and scalable/easy-to-estimate performance is increasing. In this paper, we propose the following to improve the footprint and the scalability of systolic arrays: (1) column multithreading for reducing the number of physical units and maintaining the performance even for back-to-back floating-point accumulations; (2)… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 20 publications
(19 reference statements)
0
4
0
Order By: Relevance
“…Despite these disadvantages, multicore CPUs and GPUs are currently considered to be the most applicable hardware platforms for calculating SHA-256 in Bitcoin mining and other blockchain networks. In another approach, the systolic array-based accelerator named EMAXVR in [13] and its improved version in [14] were applied to reduce the data access time by implementing local memory near the ALU. Although this platform achieves high performance on image processing and AI learning [13], [14], its performance for computing SHA-256 is very poor [15].…”
Section: Background a Sha-256 Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…Despite these disadvantages, multicore CPUs and GPUs are currently considered to be the most applicable hardware platforms for calculating SHA-256 in Bitcoin mining and other blockchain networks. In another approach, the systolic array-based accelerator named EMAXVR in [13] and its improved version in [14] were applied to reduce the data access time by implementing local memory near the ALU. Although this platform achieves high performance on image processing and AI learning [13], [14], its performance for computing SHA-256 is very poor [15].…”
Section: Background a Sha-256 Algorithmmentioning
confidence: 99%
“…In another approach, the systolic array-based accelerator named EMAXVR in [13] and its improved version in [14] were applied to reduce the data access time by implementing local memory near the ALU. Although this platform achieves high performance on image processing and AI learning [13], [14], its performance for computing SHA-256 is very poor [15]. The key reason for the low processing rate is the data-dependent characteristic of the SHA-256 algorithm, as mentioned above.…”
Section: Background a Sha-256 Algorithmmentioning
confidence: 99%
“…We believe that with their high computational and memory resources, reprogrammable hardware design, low power consumption, and high optimization for parallel pipeline processes, FPGAs are well suited for Scrypt implementation. There are several high-performance architectures that can be applied on FPGAs to reduce the memory access time, such as the systolic-array-based accelerator called EMAXVR [41], [42] used in near-memory computing. However, despite exhibiting high performance in machine learning and image processing applications, they can achieve only poor performance when performing low-cost operator hash functions [43].…”
Section: Preliminary Idea and Motivation For The High-performance Multi Romix Scrypt Acceleratormentioning
confidence: 99%
“…FPGAs are suitable for computationally intensive algorithms that result in a faster speed and efficient energy. A few highlights of these approaches include parameter reduction, binary weight quantization, memory bandwidth optimization, and data-flow optimization [24]- [27]. A highly flexible architecture that can mold itself into the given CNNs and achieve a higher resource utilization reduction is essential.…”
Section: Introductionmentioning
confidence: 99%