2021
DOI: 10.1109/access.2021.3126838
|View full text |Cite
|
Sign up to set email alerts
|

Power Efficient Design of High-Performance Convolutional Neural Networks Hardware Accelerator on FPGA: A Case Study With GoogLeNet

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(3 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…It seems impossible to make the real-time inference of CNN-based object detection on mobile FPGA devices which have limited hardware resources such as memory size and lower processor performance. In these power and hardware resource-limited circumstances, to improve performance and reduce power consumption, many researchers have proposed CNN accelerators at various design levels including system level, application level, architecture level, and transistor level [7], [8], [9]. Recent studies have proposed a flexible CNN accelerator design for FPGA implementation at the system level and a flexible FPGA accelerator for various CNN architectures from lightweight CNN to large-scale CNN [11], [12], [13], [14], [15], [16].…”
Section: Convolutional Neural Network(cnn)-based Object Detection App...mentioning
confidence: 99%
“…It seems impossible to make the real-time inference of CNN-based object detection on mobile FPGA devices which have limited hardware resources such as memory size and lower processor performance. In these power and hardware resource-limited circumstances, to improve performance and reduce power consumption, many researchers have proposed CNN accelerators at various design levels including system level, application level, architecture level, and transistor level [7], [8], [9]. Recent studies have proposed a flexible CNN accelerator design for FPGA implementation at the system level and a flexible FPGA accelerator for various CNN architectures from lightweight CNN to large-scale CNN [11], [12], [13], [14], [15], [16].…”
Section: Convolutional Neural Network(cnn)-based Object Detection App...mentioning
confidence: 99%
“…The proposed architecture is implemented on the Xilinx Virtex-7 FPGA platform and achieves the performance of 1.34 GOP/s. Ahmed et al [81] proposed an FPGA-based Low Power CNN (LP-CNN) accelerator based on GoogLeNet CNN. The proposed accelerator uses quantization and weight pruning techniques to reduce memory size.…”
Section: B Accelerators For a Specific Algorithmmentioning
confidence: 99%
“…They significantly reduce the hardware requirements with a minimal impact on the accuracy [8,9]. An interesting, practical use of FPGA systems to implement CNN is presented in the works [10][11][12][13]. Several algorithmic optimization strategies used in FPGA hardware for CNNs are discussed, and a few of neural network FPGA-based accelerators architectures are presented.…”
Section: Introductionmentioning
confidence: 99%