2022
DOI: 10.1109/tnnls.2021.3055814
|View full text |Cite
|
Sign up to set email alerts
|

FPGA-Based High-Throughput CNN Hardware Accelerator With High Computing Resource Utilization Ratio

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 45 publications
(23 citation statements)
references
References 28 publications
0
18
0
Order By: Relevance
“…Extensive experiments demonstrate that the DL-CSNet shows clear superiority over most classical CS algorithms. At last, the DL-CSNet could be accelerated on hardware such as FPGA [19].…”
Section: Discussionmentioning
confidence: 99%
“…Extensive experiments demonstrate that the DL-CSNet shows clear superiority over most classical CS algorithms. At last, the DL-CSNet could be accelerated on hardware such as FPGA [19].…”
Section: Discussionmentioning
confidence: 99%
“…The architecture they mentioned has been implemented on Artix-7 FPGA and attained a significant improvement in speed when compared to existing architecture working at 300 MHz. Huang et al proposed a novel composite hardware CNN accelerator architecture to solve the problem of the inefficient computing resource mapping mechanism and data supply [23]. They proposed a multi-CE architecture based on a row-level pipelined streaming strategy for convolution layers and a single-CE architecture based on a batch-based computing method for full-connection layers.…”
Section: Related Workmentioning
confidence: 99%
“…The performance bottleneck of the off-chip memory is the data transfer delay, which can slow the data supply. During the operation of a CNN, frequent readings of the parameters in the memory are required, and the mismatch between the rates of data reading and calculation can cause the computational module to fail to achieve the expected efficiency and affect the system performance [ 10 ]. The huge amount of computation also leads to the challenge of deploying algorithms in smart chips with limited computational resources and I/O ports.…”
Section: Introductionmentioning
confidence: 99%