2017 27th International Conference on Field Programmable Logic and Applications (FPL) 2017
DOI: 10.23919/fpl.2017.8056850
|View full text |Cite
|
Sign up to set email alerts
|

Scalable high-performance architecture for convolutional ternary neural networks on FPGA

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
53
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 61 publications
(54 citation statements)
references
References 16 publications
1
53
0
Order By: Relevance
“…For example, Qiu et al [22] proposed a CNN accelerator supporting 8 and 4-bit data, implemented on a Xilinx Zynq platform. On this trail, even extreme quantization approaches have been presented, exploiting ternary or binary networks [23], [24]. While most DSP-capable FPGAs currently do not offer a low enough power envelope to be used in IoT end-nodes, Lattice recently announced SenseAI class of FPGAs [25] providing a comprehensive hardware and software solutions for always-on artificial intelligence (AI) within a power budget between 1 mW and 1 W. However these ultra-low power FPGAs are currently too expensive for many applications where MCUs are traditionally chosen because of their low cost.…”
Section: Related Workmentioning
confidence: 99%
“…For example, Qiu et al [22] proposed a CNN accelerator supporting 8 and 4-bit data, implemented on a Xilinx Zynq platform. On this trail, even extreme quantization approaches have been presented, exploiting ternary or binary networks [23], [24]. While most DSP-capable FPGAs currently do not offer a low enough power envelope to be used in IoT end-nodes, Lattice recently announced SenseAI class of FPGAs [25] providing a comprehensive hardware and software solutions for always-on artificial intelligence (AI) within a power budget between 1 mW and 1 W. However these ultra-low power FPGAs are currently too expensive for many applications where MCUs are traditionally chosen because of their low cost.…”
Section: Related Workmentioning
confidence: 99%
“…The method proposed by Prost-Boucle et al [Prost-Boucle et al 2017] achieves the previously best reported low precision throughput on an FPGA for CIFAR10. This work implements a VGG-7 style network with ternary weights and activations.…”
Section: Comparison With Previous Workmentioning
confidence: 90%
“…Prost-Boucle et al [Prost-Boucle et al 2017]. This paper adopts the scheme used by these authors to buffer the pixels, so the entire set of inputs is available simultaneously.…”
Section: Bufferingmentioning
confidence: 99%
“…Prost-Boucle et al implemented ternary CNNs on a Xilinx Virtex-7 VC709 FPGA, presenting both high-performance-and low-power-targe ing designs [110]. eir experiments with the CNV model classifying CIFAR-10 demonstrated a 6.6 pp accuracy improvement compared to FINN's binarised inference.…”
Section: Binarisation and Ternarisationmentioning
confidence: 99%