Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

He, Yang; Kang, Guoliang; Dong, Xuanyi; Fu, Yanwei; Yang, Yi

doi:10.24963/ijcai.2018/309

Cited by 728 publications

(550 citation statements)

References 28 publications

Supporting

Mentioning

503

Contrasting

Unclassified


“…We compare our results to those reported by Li's method [26] [37] and by Soft Filter [17] for ResNet in Table 2 and those reported by Zhang's method [18] for ResNeXt in Table 4. As shown, ResNet models trained using the proposed method achieve up to 0.31% accuracy improvement with a pruning ratio up to 65.7%-79.5% when compared to the baseline.…”

Section: Comparison To Existing Approaches (mentioning)

confidence: 84%

VACL: Variance-Aware Cross-Layer Regularization for Pruning Deep Residual Networks

Gao, Liu, Chien, et al. 2019

2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)


Improving weight sparsity is a common strategy for producing light-weight deep neural networks. However, pruning models with residual learning is more challenging. In this paper, we introduce Variance-Aware Cross-Layer (VACL), a novel approach to address this problem. VACL consists of two parts, a Cross-Layer grouping and a Variance-Aware regularization. In Cross-Layer grouping, the i-th filters of layers connected by skip-connections are grouped into one regularization group. Then, the Variance-Aware regularization term takes into account both the first and second-order statistics of the connected layers to constrain the variance within a group. Our approach can effectively improve the structural sparsity of residual models. For CIFAR10, the proposed method reduces a ResNet model by up to 79.5% with no accuracy drop, and reduces a ResNeXt model by up to 82% with < 1% accuracy drop. For ImageNet, it yields a pruned ratio of up to 63.3% with < 1% top-5 accuracy drop. Our experimental results show that the proposed approach significantly outperforms other state-of-the-art methods in terms of overall model size and accuracy.
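As a rough illustration of the Cross-Layer grouping described in this abstract, the sketch below stacks the per-filter L2 norms of several skip-connected layers so that column i holds the i-th filter of every layer, then reports the mean and variance inside each group. The function names, the use of filter L2 norms as the grouped quantity, and the choice of mean and variance as the "first and second-order statistics" are assumptions made for illustration; this is not the VACL regularization term itself.

```python
import numpy as np

def cross_layer_groups(layer_weights):
    """Stack per-filter L2 norms of skip-connected layers.

    layer_weights: list of arrays shaped (num_filters, in_channels, kH, kW),
    all with the same num_filters (assumed, since they share a skip path).
    Returns an array of shape (num_layers, num_filters); column i is group i.
    """
    norms = []
    for w in layer_weights:
        # Flatten each filter and take its L2 norm.
        norms.append(np.linalg.norm(w.reshape(w.shape[0], -1), axis=1))
    return np.stack(norms)

def group_statistics(layer_weights):
    """Mean and (population) variance of filter norms within each group."""
    norms = cross_layer_groups(layer_weights)
    return norms.mean(axis=0), norms.var(axis=0)

# Two toy layers with two filters each (hypothetical values).
layer_a = np.array([3.0, 0.0]).reshape(2, 1, 1, 1)
layer_b = np.array([1.0, 0.0]).reshape(2, 1, 1, 1)
group_mean, group_var = group_statistics([layer_a, layer_b])
```

A group whose mean and variance are both near zero is one where the i-th filter is small in every connected layer, which is the situation in which that filter index could be removed from all of them together.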


“…
Method          FLOPs↓    Top-1 Acc↓   Top-5 Acc↓
LcP [3]         25.00%    -0.09%       -0.19%
NISP [36]       27.31%    0.21%        -
SSS [18]        31.08%    1.94%        0.95%
ThiNet [26]     36.79%    0.84%        0.47%
OICSR-GL        37.30%    -0.22%       -0.16%
He et al [11]   41.80%    1.54%        0.81%
GDP [23]        42.00%    2.52%        1.25%
LcP [3]         42.00%    0.85%        0.26%
NISP [36]       44.41%    0.89%        -
OICSR-GL        44.43%    0.01%        0.08%
He et al [13]   50.00%    -            1.40%
LcP [3]         50.00%    0.96%        0.42%
OICSR-GL        50.00%    0.37%        0.34%
(c) ResNet-50 on ImageNet-1K
Table 2: Comparison with existing methods. FLOPs↓ and Params↓ denote the reduction of FLOPs and parameters.…”

Section: Methods (mentioning)

confidence: 99%

“…Chin et al [3] considered channel pruning as a global ranking problem and compensated the layerwise approximation error that improved the performance for various heuristic metrics. To reduce accuracy loss caused by incorrect channel pruning, redundant channels were pruned in a dynamic way in [11,23]. Furthermore, Huang et al [17] and Huang & Wang [18] trained pruning agents and removed redundant structure in a data-driven way.…”

Section: Related Work (mentioning)

confidence: 99%

OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks

Wang et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)


Channel pruning can significantly accelerate and compress deep neural networks. Many channel pruning works utilize structured sparsity regularization to zero out all the weights in some channels and automatically obtain a structure-sparse network in the training stage. However, these methods apply structured sparsity regularization on each layer separately, where the correlations between consecutive layers are omitted. In this paper, we first combine one out-channel in the current layer and the corresponding in-channel in the next layer as a regularization group, namely an out-in-channel. Our proposed Out-In-Channel Sparsity Regularization (OICSR) considers correlations between successive layers to further retain the predictive power of the compact network. Training with OICSR thoroughly transfers discriminative features into a fraction of out-in-channels. Correspondingly, OICSR measures channel importance based on statistics computed from two consecutive layers, not an individual layer. Finally, a global greedy pruning algorithm is designed to remove redundant out-in-channels in an iterative way. Our method is comprehensively evaluated with various CNN architectures including CifarNet, AlexNet, ResNet, DenseNet and PreActSeNet on CIFAR-10, CIFAR-100 and ImageNet-1K datasets. Notably, on ImageNet-1K, we reduce 37.2% FLOPs on ResNet-50 while outperforming the original model by 0.22% top-1 accuracy.
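The out-in-channel grouping in this abstract can be made concrete with a small sketch: for two consecutive convolution weight tensors, group i gathers the i-th out-channel of the current layer and the i-th in-channel of the next layer, and a norm is computed over the combined weights. The function names are hypothetical, and summing group L2 norms (a group-lasso-style penalty) is an assumed form of the structured sparsity regularizer, since the abstract does not spell out the exact term.

```python
import numpy as np

def out_in_channel_norms(w_cur, w_next):
    """L2 norm of each out-in-channel group.

    w_cur:  (C_out, C_in, kH, kW)   weights of the current layer
    w_next: (C_next, C_out, kH, kW) weights of the next layer
    Group i = w_cur[i, :, :, :] together with w_next[:, i, :, :].
    """
    c_out = w_cur.shape[0]
    assert w_next.shape[1] == c_out, "layers must be consecutive"
    # Squared mass of out-channel i in the current layer.
    out_part = (w_cur ** 2).reshape(c_out, -1).sum(axis=1)
    # Squared mass of in-channel i in the next layer.
    in_part = (w_next ** 2).transpose(1, 0, 2, 3).reshape(c_out, -1).sum(axis=1)
    return np.sqrt(out_part + in_part)

def group_penalty(w_cur, w_next):
    """Sum of group norms: an assumed group-lasso-style sparsity penalty."""
    return out_in_channel_norms(w_cur, w_next).sum()

# Toy layers: channel 1 is already zero in both layers (hypothetical values).
w_cur = np.ones((2, 1, 1, 1))
w_cur[1] = 0.0
w_next = np.ones((3, 2, 1, 1))
w_next[:, 1] = 0.0
norms = out_in_channel_norms(w_cur, w_next)
```

Because each group spans both layers, a group norm of zero means the channel can be dropped from the current layer's output and the next layer's input at once, which is the consistency that per-layer regularization does not guarantee.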


“…Liu et al impose channel sparsity by imposing ℓ1 regularization on the scaling factors in batch normalization. In [11], He et al propose a soft filter pruning method which allows the pruned filters to be updated during the training procedure.…”

Section: Related Work (mentioning)

confidence: 99%
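The statement above summarizes soft filter pruning as zeroing filters while still letting them be updated. A minimal sketch of that loop follows, assuming filters are ranked by L2 norm and a fixed fraction is zeroed after every epoch; the function name, the 25% ratio, and the random "gradient" standing in for real training are illustrative assumptions, not details given in the statement.

```python
import numpy as np

def soft_prune(weights, prune_ratio):
    """Zero the filters with the smallest L2 norm.

    weights: (out_channels, in_channels, kH, kW)
    Returns a pruned copy and the indices of the zeroed filters.
    The filters stay in the tensor, so later updates can revive them.
    """
    num_filters = weights.shape[0]
    num_pruned = int(num_filters * prune_ratio)
    norms = np.linalg.norm(weights.reshape(num_filters, -1), axis=1)
    pruned_idx = np.argsort(norms)[:num_pruned]
    pruned = weights.copy()
    pruned[pruned_idx] = 0.0
    return pruned, pruned_idx

rng = np.random.default_rng(0)
weights = rng.normal(size=(8, 3, 3, 3))

for epoch in range(3):
    # Stand-in for one epoch of training: the update touches every filter,
    # including those zeroed at the end of the previous epoch.
    grad = rng.normal(size=weights.shape)
    weights = weights - 0.1 * grad
    # Re-select and zero the weakest filters after the epoch.
    weights, pruned_idx = soft_prune(weights, prune_ratio=0.25)
```

The contrast with hard pruning is that nothing is removed from the tensor during training: the set of zeroed filters is re-chosen each epoch, so a filter zeroed early can return if its weights grow again.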

Progressive Learning of Low-Precision Networks for Image Classification

Zhou, Lv, et al. 2021

IEEE Trans. Multimedia


Recent years have witnessed the great advance of deep learning in a variety of vision tasks. Many state-of-the-art deep neural networks suffer from large size and high complexity, which makes them difficult to deploy on resource-limited platforms such as mobile devices. To this end, low-precision neural networks, which quantize weights or activations into a low-bit format, are widely studied. Though efficient, low-precision networks are usually hard to train and encounter severe accuracy degradation. In this paper, we propose a new training strategy through expanding low-precision networks during training and removing the expanded parts for network inference. First, we equip each low-precision convolutional layer with an ancillary full-precision convolutional layer based on a low-precision network structure, which could guide the network to good local minima. Second, a decay method is introduced to reduce the output of the added full-precision convolution gradually, which keeps the resulting topology the same as the original low-precision one. Experiments on SVHN, CIFAR and ILSVRC-2012 datasets prove that the proposed method can bring faster convergence and higher accuracy for low-precision neural networks.
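The expand-then-decay strategy in this abstract can be sketched as a layer whose output is the low-precision branch plus a scaled full-precision branch, with the scale shrinking to zero over training. Everything specific below is an assumption for illustration: a dense layer stands in for a convolution, the quantizer is a simple uniform symmetric one, the two branches are combined by addition, and the decay schedule is linear.

```python
import numpy as np

def quantize(w, bits=2):
    """Uniform symmetric quantizer (an assumed stand-in for the paper's)."""
    levels = 2 ** (bits - 1) - 1
    scale = np.abs(w).max() / levels
    if scale == 0:
        return np.zeros_like(w)
    return np.round(w / scale) * scale

def decay_factor(step, total_steps):
    """Linear decay from 1 to 0 (assumed schedule)."""
    return max(0.0, 1.0 - step / total_steps)

def expanded_layer(x, w_low, w_full, alpha):
    """Low-precision branch plus an ancillary full-precision branch.

    alpha scales the ancillary branch; at alpha == 0 the layer reduces
    to the plain low-precision layer used for inference.
    """
    return x @ quantize(w_low).T + alpha * (x @ w_full.T)

x = np.ones((1, 2))
w_low = np.array([[0.9, -0.2]])   # hypothetical low-precision weights
w_full = np.array([[1.0, 1.0]])   # hypothetical ancillary weights

total_steps = 10
outputs = [expanded_layer(x, w_low, w_full, decay_factor(s, total_steps))
           for s in range(total_steps + 1)]
```

Once the factor reaches zero, the ancillary weights no longer affect the output and can be discarded, leaving the original low-precision topology.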

