2021
DOI: 10.48550/arxiv.2106.08295
Preprint

A White Paper on Neural Network Quantization

Abstract: While neural networks have advanced the frontiers in many applications, they often come at a high computational cost. Reducing the power and latency of neural network inference is key if we want to integrate modern networks into edge devices with strict power and compute requirements. Neural network quantization is one of the most effective ways of achieving these savings, but the additional noise it induces can lead to accuracy degradation. In this white paper, we introduce state-of-the-art algorithms for mitigating…



Cited by 64 publications (112 citation statements)
References 12 publications (27 reference statements)
“…When v is the activation output, the second term can be pre-computed and absorbed into the convolution bias, so it introduces no extra computation at inference. Therefore, Nagel et al. (2021) recommend applying asymmetric quantization to activations and symmetric quantization to weights. Nagel et al. (2021) also suggest adopting per-channel quantization for weights (Krishnamoorthi, 2018; Li et al., 2019).…”
Section: Preliminary (mentioning)
confidence: 99%
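To make the quoted scheme concrete, here is a minimal NumPy sketch of asymmetric activation quantization, symmetric per-channel weight quantization, and the input-independent offset term being pre-computed and absorbed into the layer bias. The function names and the toy linear layer are illustrative assumptions, not code from Nagel et al. (2021).

```python
import numpy as np

def quantize_asymmetric(x, n_bits=8):
    """Asymmetric (affine) quantization: x ~= scale * (x_q - zero_point)."""
    qmin, qmax = 0, 2**n_bits - 1
    scale = (x.max() - x.min()) / (qmax - qmin)
    zero_point = int(round(qmin - x.min() / scale))
    x_q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.int32)
    return x_q, scale, zero_point

def quantize_symmetric_per_channel(w, n_bits=8):
    """Symmetric quantization, one scale per output channel: w ~= scale[c] * w_q[c]."""
    qmax = 2**(n_bits - 1) - 1
    scale = np.abs(w).max(axis=1) / qmax
    w_q = np.clip(np.round(w / scale[:, None]), -qmax - 1, qmax).astype(np.int32)
    return w_q, scale

# Linear layer y = W @ x + b.  With x ~= s_x * (x_q - z_x), the product expands to
#   W @ x ~= s_x * s_w * (W_q @ x_q) - s_x * z_x * s_w * W_q.sum(axis=1)
# The second term does not depend on the input, so it is folded into the bias.
rng = np.random.default_rng(0)
W, b, x = rng.normal(size=(16, 32)), rng.normal(size=16), rng.normal(size=32)

x_q, s_x, z_x = quantize_asymmetric(x)
W_q, s_w = quantize_symmetric_per_channel(W)

folded_bias = b - s_x * z_x * s_w * W_q.sum(axis=1)   # absorbed offset term
y_hat = s_x * s_w * (W_q @ x_q) + folded_bias         # integer matmul, then rescale
print(np.abs(y_hat - (W @ x + b)).max())              # small quantization error
```

Note the design consequence visible in the last two lines: the only tensor-sized work is the integer matrix multiply `W_q @ x_q`; the zero-point correction costs one pre-computed vector.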
“…Real-time inference on resource-constrained and efficiency-demanding platforms has long been desired and extensively studied over the last decades, yielding significant improvements in the trade-off between efficiency and accuracy (Han et al., 2015; Mei et al., 2019; Tanaka et al., 2020; Ma et al., 2020; Mishra et al., 2020; Liang et al., 2021; Liu et al., 2021). As a model compression technique, quantization is promising compared to other methods such as network pruning (Tanaka et al., 2020; Ma et al., 2020) and slimming (Liu et al., 2017; 2018), as it achieves a large compression ratio (Krishnamoorthi, 2018; Nagel et al., 2021) and is computationally beneficial for integer-only hardware. The latter point is especially important because many hardware platforms (e.g., most brands of DSPs; Ho, 2015; QCOM, 2019) only support integer or fixed-point arithmetic for accelerated implementation and cannot deploy models with floating-point operations.…”
Section: Introduction (mentioning)
confidence: 99%
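As a brief illustration of why quantized models suit integer-only hardware: the per-layer floating-point rescale factor (e.g., s_x·s_w/s_y) is conventionally approximated by a fixed-point multiplier and a bit shift, so no float operations remain at inference. The sketch below shows that standard trick under stated assumptions; the helper names and the example multiplier are hypothetical, not code from the cited papers.

```python
import numpy as np

def quantize_multiplier(m, bits=31):
    """Approximate a real multiplier 0 < m < 1 as m ~= m0 * 2**(-shift),
    where m0 is a `bits`-bit integer, so rescaling needs only integer ops."""
    shift = 0
    while m < 0.5:            # normalize m into [0.5, 1)
        m *= 2.0
        shift += 1
    m0 = int(round(m * (1 << bits)))
    return m0, shift + bits

def rescale(acc, m0, shift):
    """Integer-only rescale with rounding: round(acc * m0 / 2**shift)."""
    prod = acc.astype(np.int64) * m0
    return ((prod + (1 << (shift - 1))) >> shift).astype(np.int32)

# Example: requantize int32 accumulators with a hypothetical factor s_x * s_w / s_y.
m0, shift = quantize_multiplier(0.00732)
acc = np.array([12345, -6789], dtype=np.int32)    # int32 accumulator values
print(rescale(acc, m0, shift))                    # -> [ 90 -50]
print(0.00732 * acc)                              # float reference: [90.37 -49.69]
```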