Wu‐Shiung Feng scite author profile

Accelerating deep learning networks in edge computing based on power-efficient and highly parallel FPGA platforms is an important goal. Combined with deep learning theory, an accelerator design method based on the Winograd algorithm for the deep learning object detection model YOLO under the PYNQ architecture is proposed. A Zynq FPGA is used to build the hardware acceleration platform of a YOLO network. The Winograd algorithm is used to improve traditional convolution. In the FPGA, the numerous multiplication operations in the YOLO network are converted into addition operations, reducing the computational complexity of the model. The data of the original model are processed at a low fixed point, reducing the resource consumption of the FPGA. To optimize memory, a buffer pipeline method is proposed, which further improves the efficiency of the designed accelerator. Experiments show that compared with the acceleration of the YOLO model based on GPUs and other FPGA platforms, the proposed method not only optimizes FPGA resource usage but also reduces power consumption to 2.7 W. Additionally, the detection accuracy loss is less than 3%. INDEX TERMS FPGA, deep learning, Winograd, YOLO, buffer pipeline.

show abstract

Multilayer Traffic Network Optimized by Multiobjective Genetic Clustering Algorithm

Feng

Gen

2009

IEICE Trans. Fundamentals

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Wu‐Shiung Feng

An adaptive-order rational Arnoldi method for model-order reductions of linear time-invariant systems

New efficient designs for XOR and XNOR functions on the transistor level

A hybrid temporal association rules mining method for traffic congestion prediction

A Power-Efficient Optimizing Framework FPGA Accelerator Based on Winograd for YOLO

Multilayer Traffic Network Optimized by Multiobjective Genetic Clustering Algorithm

Contact Info

Product

Resources

About