2019 IEEE International Symposium on Circuits and Systems (ISCAS) 2019
DOI: 10.1109/iscas.2019.8702458
|View full text |Cite
|
Sign up to set email alerts
|

An In-DRAM Neural Network Processing Engine

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…Since this publication, applications ranging from genome string processing to database management systems have all found successful implementations in NDP [29,59,63,69,127,128,129], which increasingly motivates the move to a NDPenabled architecture. All the potential shown by such proposals motivated the creation of commercially available NDP architecture, such as the UPMEM's [82] and the Samsungs' NDP-enabled HBM for artificial intelligence 1 .…”
Section: Related Workmentioning
confidence: 99%
“…Since this publication, applications ranging from genome string processing to database management systems have all found successful implementations in NDP [29,59,63,69,127,128,129], which increasingly motivates the move to a NDPenabled architecture. All the potential shown by such proposals motivated the creation of commercially available NDP architecture, such as the UPMEM's [82] and the Samsungs' NDP-enabled HBM for artificial intelligence 1 .…”
Section: Related Workmentioning
confidence: 99%
“…For example, Neurocube [54] executes DNNs using a 3D stacked memory. One major stream of work tries to modify the cell array structure to perform massively parallel DNNs operations within DRAM [34], [48], [65], [92], [97], SRAM [12], [13], [51], [103], and emerging memories [19], [20], [29], [42], [46], [66], [91], [95]. However, these technologies all require changing the cell structure itself, and implementing on top of the legacy technologies is complicated (e.g., DDR4/5 protocol).…”
Section: Related Workmentioning
confidence: 99%
“…Compared to the solutions above, GradPIM is a cheaper, easily realizable solution. Solutions such as [34], [48], [65], [92], [97] involve re-designing the DRAM core cell array. While these are better at performance, they are much harder to realize as a product.…”
Section: Related Workmentioning
confidence: 99%
“…In addition, various in-DRAM architectures [25], [52] can support binary networks; however, it is still not possible to quantize into the binary level without a significant quality loss when quantizing large DNN models such as the convolution-based ResNet [10], transformer-based BERT language model [1], and NCF recommendation system [29] over large datasets. DRISA [25] adds simple Boolean logic circuits near the bit line and the local sense amp of the DRAM and runs binary CNNs, but with a prohibitive area overhead of 91% of the base LPDDR4 area.…”
Section: Related Work a Near-memory Processing Dnn Acceleratormentioning
confidence: 99%