2019 Symposium on VLSI Circuits 2019
DOI: 10.23919/vlsic.2019.8778056
|View full text |Cite
|
Sign up to set email alerts
|

A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-based Deep Neural Network Accelerator with Ground-Reference Signaling in 16nm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 41 publications
(21 citation statements)
references
References 1 publication
0
21
0
Order By: Relevance
“…A common peculiarity in all of these architectures is that the power consumption required to support FP operations is too demanding for their adoption in IoT end nodes or as ultra-low-power architectures. Recent trend is to equip IoT platforms e.g., microcontroller units like M4 and M7 [2] with FP unit, this is because with the scaling of technology below 40nm, the cost of an FP operation is getting near 1pJ/op [19], [25], so it has become affordable in terms of absolute power to use FP in IoT.…”
Section: Related Workmentioning
confidence: 99%
“…A common peculiarity in all of these architectures is that the power consumption required to support FP operations is too demanding for their adoption in IoT end nodes or as ultra-low-power architectures. Recent trend is to equip IoT platforms e.g., microcontroller units like M4 and M7 [2] with FP unit, this is because with the scaling of technology below 40nm, the cost of an FP operation is getting near 1pJ/op [19], [25], so it has become affordable in terms of absolute power to use FP in IoT.…”
Section: Related Workmentioning
confidence: 99%
“…Core Placement Optimization for Many-Core Systems with RL 11:5 In Figure 3, we take Tianjic as an example to illustrate the typical multi-chip many-core architecture. Usually, multiple chips (e.g., 4 × 4 in Figure 3(a)) can be interconnected through offchip links such as low-voltage differential signaling (LVDS) [9], SerDes [7] and ground-referenced signaling (GRS) [89,96]. As illustrated in Figure 3(b), each chip includes an array of functional cores arranged by a 2D mesh network-on-chip (NoC), an on-chip router for off-chip communication and essential chip peripherals.…”
Section: Multi-chip Many-core Architecturesmentioning
confidence: 99%
“…We build an in-house simulator for the typical multi-chip many-core architecture illustrated in Figure 3. The overall system consists of a 4 × 4 chip array with 16 × 16 cores per chip, with the off-chip interconnect assumed as GRS [89,96]. Generally, the routing is based on the minimal path, with X-Y routing for both NoC and off-chip communication.…”
Section: Experiments Setupmentioning
confidence: 99%
See 1 more Smart Citation
“…The error backpropagation method is known as an efficient method for calculating gradients in the field of deep neural network machine learning for updating parameters using the gradient descent method [24]. Further speed improvement can be easily realized through using the GPGPU technique, which is again well established and under significant development in the field of deep learning [25].…”
Section: Introductionmentioning
confidence: 99%