2018 IEEE High Performance Extreme Computing Conference (HPEC)
DOI: 10.1109/hpec.2018.8547604

SlimNets: An Exploration of Deep Model Compression and Acceleration

Abstract: Deep neural networks have achieved increasingly accurate results on a wide variety of complex tasks. However, much of this improvement is due to the growing use and availability of computational resources (e.g., use of GPUs, more layers, more parameters, etc.). Most state-of-the-art deep networks, despite performing well, over-parameterize approximate functions and take a significant amount of time to train. With increased focus on deploying deep neural networks on resource-constrained devices like smart phones, …

Cited by 10 publications (9 citation statements)
References 3 publications (3 reference statements)
“…Representative works include SqueezeNet [243] and MobileNet [240]. Another method is to compress an existing network to decrease the number of parameters and the required computational resources while preserving reconstruction accuracy [244,245,246,247,248]. For example, the model cutting method compresses the model by cutting unimportant connections of a trained model according to some effective evaluation [249].…”
Section: Designing Light and Efficient Architectures
Mentioning confidence: 99%
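The "model cutting" approach quoted above removes connections that some evaluation deems unimportant. As an illustration only, the sketch below assumes that evaluation is plain weight magnitude and uses PyTorch's torch.nn.utils.prune utilities; the two-layer model and the 80% sparsity level are hypothetical stand-ins, not values from the cited works.

# Minimal sketch of magnitude-based connection pruning (assumption: the
# "effective evaluation" is simply the absolute value of each weight).
import torch.nn as nn
import torch.nn.utils.prune as prune

# Hypothetical small trained model standing in for a real network.
model = nn.Sequential(nn.Linear(784, 300), nn.ReLU(), nn.Linear(300, 10))

for module in model.modules():
    if isinstance(module, nn.Linear):
        # Zero out the 80% of connections with the smallest |weight|.
        prune.l1_unstructured(module, name="weight", amount=0.8)
        # Fold the mask back into the weight tensor permanently.
        prune.remove(module, "weight")

zeros = sum((p == 0).sum().item() for p in model.parameters())
total = sum(p.numel() for p in model.parameters())
print(f"fraction of zeroed parameters: {zeros / total:.2f}")

In practice the pruned network is then fine-tuned for a few epochs to recover the accuracy lost by removing connections.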
“…Teacher-Student training paradigm: Knowledge Distillation (KD), developed by Hinton et al. (2015), is a popular technique for compressing deep and wide networks into sparser ones, where the compressed model mimics the distribution learned by the complex model. Oguntola et al. (2018) show that compression techniques such as pruning and low-rank decomposition can be combined with KD to significantly improve the compression rate while maintaining accuracy. KD usually optimizes a weighted average of two different objective functions.…”
Section: Related Work
Mentioning confidence: 99%
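The quote notes that KD optimizes a weighted average of two objectives. Below is a minimal sketch of that weighted loss, assuming the usual Hinton-style formulation (cross-entropy on the hard labels plus KL divergence between temperature-softened student and teacher outputs); the temperature T and weight alpha are illustrative defaults, not values from the cited papers.

# Sketch of a standard knowledge-distillation loss (Hinton et al., 2015 style).
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    # Soft-target term: KL divergence between temperature-softened
    # distributions, scaled by T^2 so its gradient magnitude matches the
    # hard-label term.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits.detach() / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Hard-target term: ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    # Weighted average of the two objectives.
    return alpha * soft + (1.0 - alpha) * hard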
“…Pruning+KD_s combines pruning with knowledge distillation using KL-divergence as described in SlimNets (Oguntola et al., 2018). Pruning+KD_o applies pruning together with our version of knowledge distillation (KD_o) that combines KL-divergence with MSE loss.…”
Section: Techniques In Comparison
Mentioning confidence: 99%
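The KD_o variant described here mixes KL-divergence with an MSE term. The excerpt does not give the exact formulation, so the sketch below is only a hedged guess that applies the MSE directly to the logits and balances the two terms with a hypothetical weight beta.

# Hedged sketch of a KL + MSE distillation objective; the MSE-on-logits
# choice and the beta weighting are assumptions for illustration only.
import torch.nn.functional as F

def kd_kl_mse_loss(student_logits, teacher_logits, T=4.0, beta=0.5):
    teacher_logits = teacher_logits.detach()  # no gradient through the teacher
    kl = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    mse = F.mse_loss(student_logits, teacher_logits)
    return beta * kl + (1.0 - beta) * mse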
“…The success of deep convolutional neural networks (CNNs) has been well demonstrated in several real-world applications, e.g., image classification [20,29], object detection [35], semantic segmentation [28], and low-level computer vision [37]. Massive parameters and huge computational complexity are usually required for achieving the desired high performance, which limits the application of these models to portable devices such as mobile phones and smart cameras.…”
Section: Introduction
Mentioning confidence: 99%