Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays 2018
DOI: 10.1145/3174243.3174257
|View full text |Cite
|
Sign up to set email alerts
|

Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 77 publications
(45 citation statements)
references
References 18 publications
0
44
0
1
Order By: Relevance
“…As shown in [17], the Winograd algorithm reduces the computation complexity significantly in CNNs. However, there are still some limitations in terms of flexibility.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…As shown in [17], the Winograd algorithm reduces the computation complexity significantly in CNNs. However, there are still some limitations in terms of flexibility.…”
Section: Resultsmentioning
confidence: 99%
“…The additions are executed with LUTs instead of DSP slices. That is why the implementation in [17] achieves the best performance density among all the listed implementations. In terms of throughput, our accelerator achieves state-of-the-art performance on both VGG and C3D.…”
Section: Methodsmentioning
confidence: 95%
See 1 more Smart Citation
“…Many existing frameworks [9], [25], [31], [23], [33], [39], that map CNN models to FPGAs generate a large homogeneous processing core that is temporally shared among layers. This common design is flexible, as by sequentially carrying out convolutions, it is less constrained by the amount of resources available on FPGAs.…”
Section: Related Workmentioning
confidence: 99%
“…Due to the recent popularity of 2D CNNs in image recognition and related tasks, there have been a plethora of works proposing new architectures to accelerate 2D CNNs [8], [10], [11], [49]. There have been no works that accelerate 3D CNNs in ASICs, although we note several recent works [50], [51] which have explored hardware acceleration of 3D CNNs on FPGAs. On one hand, FPGAs are reconfigurable and thus can directly adapt to different CNN configurations.…”
Section: F Hardware Implementationmentioning
confidence: 99%