Proceedings of the 37th Annual International Symposium on Computer Architecture 2010
DOI: 10.1145/1815961.1815993
|View full text |Cite
|
Sign up to set email alerts
|

A dynamically configurable coprocessor for convolutional neural networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
118
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 251 publications
(118 citation statements)
references
References 22 publications
0
118
0
Order By: Relevance
“…Other recent works propose different CNN acceleration hardware. For example, [3,[10][11][12]22] focus on 2D-convolvers, which play the roles of both compute modules and data caches. Meanwhile, [18,19] use FMA units for computation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Other recent works propose different CNN acceleration hardware. For example, [3,[10][11][12]22] focus on 2D-convolvers, which play the roles of both compute modules and data caches. Meanwhile, [18,19] use FMA units for computation.…”
Section: Related Workmentioning
confidence: 99%
“…Several key similarities cause these methods to suffer from the underutilization problem we observe in our Single-CLP design. For example, the 2D-convolvers used in [3,10,12,22] must be provisioned for the largest filter across layers; they will necessarily be underutilized when computing layers with smaller filters. In [19], the organization of the compute modules depends on the number of output feature maps and their number of rows.…”
Section: Related Workmentioning
confidence: 99%
“…Chakradhar et al proposed mapping a CNN into a coprocessor with the dynamic reconfiguration techniques [14].…”
Section: Related Workmentioning
confidence: 99%
“…[1,2,3,4] Due to the specific computation pattern of CNN, general purpose processors hardly meet the implementation requirement, which encourages the proposal of various hardware implementations based on FPGA, GPU and ASIC [5,6,7]. CNN contains numerous 2D convolutions, which are responsible for more than 90% of the whole computation [8].…”
Section: Introductionmentioning
confidence: 99%