2022
DOI: 10.48550/arxiv.2201.11113
Preprint

Post-training Quantization for Neural Networks with Provable Guarantees

Abstract: While neural networks have been remarkably successful in a wide array of applications, implementing them in resource-constrained hardware remains an area of intense research. By replacing the weights of a neural network with quantized (e.g., 4-bit, or binary) counterparts, massive savings in computation cost, memory, and power consumption are attained. We modify a post-training neural-network quantization method, GPFQ, that is based on a greedy path-following mechanism, and rigorously analyze its error. We pro…
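
For intuition, the greedy path-following mechanism behind GPFQ quantizes a neuron's weights one at a time, at each step picking the alphabet element that best corrects the error accumulated so far in the pre-activations. Below is a minimal NumPy sketch of that basic mechanism, not the paper's modified variant; the function name, the ternary alphabet, and the scaling heuristic are illustrative assumptions.

```python
import numpy as np

def gpfq_neuron(w, X, alphabet):
    """Greedily quantize one neuron's weights so that the quantized
    pre-activations X @ q track the full-precision ones X @ w."""
    m, N = X.shape
    q = np.zeros(N)
    u = np.zeros(m)                       # running error: X[:, :t] @ (w - q)[:t]
    for t in range(N):
        x_t = X[:, t]
        v = u + w[t] * x_t                # error if weight t were kept exact
        # Real minimizer of ||v - p * x_t||_2, then rounded to the alphabet.
        p_star = (x_t @ v) / max(x_t @ x_t, 1e-12)
        q[t] = alphabet[np.argmin(np.abs(alphabet - p_star))]
        u = v - q[t] * x_t                # update the running error
    return q

# Illustrative use: a ternary alphabet scaled to the weights' magnitude.
rng = np.random.default_rng(0)
X = rng.standard_normal((256, 64))        # m = 256 samples, N = 64 inputs
w = 0.1 * rng.standard_normal(64)
alphabet = np.abs(w).max() * np.array([-1.0, 0.0, 1.0])
q = gpfq_neuron(w, X, alphabet)
print(np.linalg.norm(X @ (w - q)) / np.linalg.norm(X @ w))  # relative error
```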

Cited by 2 publications (2 citation statements)
References 24 publications

“…Notably, the resulting compression of sparse models is significantly higher with relatively small degradation in accuracy.

  Method                            Quantized acc.   Full-precision acc.   Drop
  (Choukroun et al, 2019)           73.39            76.01                 2.62
  AdaRound (Nagel et al, 2020)      75.23            76.07                 0.84
  S-AdaQuant (Hubara et al, 2021)   75.10            77.20                 2.10
  BRECQ (Li et al, 2021)            76.29            77.00                 0.71
  GPFQ (Zhang et al, 2022)          74…”

Section: Results
confidence: 99%

“…The second approach is to perform over-sampling using the artificial images generated by the proposed GAN. Finally, each trial is subjected to post-training dynamic range quantization, which converts the weights to 8-bit precision to compress the model size and decrease the inference time to fit a real-time system on edge devices [50]. Table 4 shows the results of the proposed classifiers' trials with respect to the size of the model file in megabytes (MB), inference time in milliseconds (ms), AUC, the precision of normal (Norm) and malignant (Mal), the recall of Norm and Mal, the F1-score of Norm and Mal, and accuracy.…”

Section: Fig 6: Comparison Between U-net Training Loss and Validation...
confidence: 99%
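
For reference, post-training dynamic range quantization as described in this snippet matches the standard TensorFlow Lite conversion path, in which weights are stored as 8-bit integers while activations remain in floating point. A minimal sketch, assuming a trained Keras model; the small placeholder model below is illustrative, not the classifier from [50].

```python
import tensorflow as tf

# Placeholder for a trained classifier; substitute your own tf.keras.Model.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(224, 224, 3)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(2, activation="softmax"),
])

converter = tf.lite.TFLiteConverter.from_keras_model(model)
# With no representative dataset supplied, Optimize.DEFAULT applies dynamic
# range quantization: weights become 8-bit integers, activations stay float.
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

with open("model_dynamic_range.tflite", "wb") as f:
    f.write(tflite_model)
```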