SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep Quantized Training

Elthakeb, Ahmed T.; Pilligundla, Prannoy; Esmaeilzadeh, Hadi

doi:10.48550/arxiv.1905.01416

Cited by 1 publication

(3 citation statements)

References 8 publications

(11 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Such an approach, however, does not lend itself well to non-binary representations and also non-linear quantization schemes. Elthakeb et al [9] alleviate the former issue by using a sinusoidal regularizer on the quantized weights.…”

Section: Related Workmentioning

confidence: 99%

“…There is no dependence on the use of STEs, which greatly improves ease of implementation. Also, compared to previous similar regularizer-based approaches [9,12,33], since in QGT the regularizer is applied on the weight values directly rather than the quantized values, there is no need to learn the scale of the quantized weights separately. Using regularizers, QGT can enforce properties such as clustering of weight values into quantized bins, which can accommodate non-linear, hardwarespecific quantizers.…”

Section: Related Workmentioning

confidence: 99%

“…Regularizers have been used in DNN training [1,13], even to target DNNs with binary-value weights [12,33]. The main novelties of our QGT approach are that, unlike these previous approaches that target the quantized weights [9], we focus on the dequantized weights, which circumvents the need to learn scale and intercept separately. Further, our approach can accommodate custom nonlinear quantizations appropriate for custom hardware.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Quantization-Guided Training for Compact TinyML Models

Ghamari,

Ozcan,

Dinh

et al. 2021

Preprint

View full text Add to dashboard Cite

We propose a Quantization Guided Training (QGT) method to guide DNN training towards optimized low-bit-precision targets and reach extreme compression levels below 8-bit precision. Unlike standard quantization-aware training (QAT) approaches, QGT uses customized regularization to encourage weight values towards a distribution that maximizes accuracy while reducing quantization errors. One of the main benefits of this approach is the ability to identify compression bottlenecks. We validate QGT using state-ofthe-art model architectures on vision datasets. We also demonstrate the effectiveness of QGT with an 81KB tiny model for person detection down to 2-bit precision (representing 17.7x size reduction), while maintaining an accuracy drop of only 3% compared to a floating-point baseline.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%