2024
DOI: 10.1109/access.2024.3369889
|View full text |Cite
|
Sign up to set email alerts
|

CANET: Quantized Neural Network Inference With 8-bit Carry-Aware Accumulator

Jingxuan Yang,
Xiaoqin Wang,
Yiying Jiang

Abstract: Neural network quantization represents weights and activations with few bits, greatly reducing the overhead of multiplications. However, due to the recursive accumulation operations, high-precision accumulators are still required in multiply-accumulate (MAC) units to avoid overflow, incurring significant computational overhead. This constraint limits the efficient deployment of quantized NNs on resourceconstrained platforms. To address this problem, we present a novel framework named CANET, which adapts the 8-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 26 publications
(37 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?