2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018)
DOI: 10.1109/cvpr.2018.00958

NISP: Pruning Networks Using Neuron Importance Score Propagation

Abstract: To reduce the significant redundancy in deep Convolutional Neural Networks (CNNs), most existing methods prune neurons by considering only the statistics of an individual layer or two consecutive layers (e.g., prune one layer to minimize the reconstruction error of the next layer), ignoring the effect of error propagation in deep networks. In contrast, we argue that it is essential to prune neurons in the entire network jointly based on a unified goal: minimizing the reconstruction error of important respon…
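The abstract is cut off above, but the core idea it states, scoring neurons by how much they contribute to the network's final responses and propagating that importance backward through the layers, can be illustrated with a minimal sketch. The propagation rule, layer shapes, and random weights below are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def propagate_importance(weights, final_scores):
    """Minimal sketch of backward importance propagation.

    weights[k] is the (out_dim, in_dim) weight matrix of layer k+1;
    final_scores are importance scores of the final response layer.
    Each neuron's score is the |weight|-weighted sum of the scores
    of the neurons it feeds in the next layer (an assumed rule).
    """
    scores = [final_scores]
    for W in reversed(weights):
        scores.append(np.abs(W).T @ scores[-1])
    return list(reversed(scores))  # scores[k] aligns with layer k's inputs

# Illustrative example with random weights (shapes are assumptions).
rng = np.random.default_rng(0)
weights = [rng.standard_normal((64, 128)), rng.standard_normal((10, 64))]
final_scores = np.ones(10)                 # e.g., uniform importance on the final layer
layer_scores = propagate_importance(weights, final_scores)
keep = np.argsort(layer_scores[1])[-32:]   # keep the 32 highest-scoring hidden neurons
```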

Citations: cited by 669 publications (443 citation statements)
References: 32 publications
“…Luo et al. [26] propose to use a greedy per-layer procedure to find the subset of neurons that minimizes a reconstruction loss, at a significant computational cost. Yu et al. [34] estimate the importance of input features to a linear classifier and propagate their importance assuming Lipschitz continuity, requiring additional computational cost and a nontrivial implementation of the feature-score computation. Our proposed method is able to outperform these methods while requiring little additional computation and engineering.…”
Section: Related Work
confidence: 99%
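The greedy per-layer procedure described in this excerpt can be sketched as follows: input channels of a layer are dropped one at a time, always removing the channel whose absence increases the next layer's reconstruction error the least on a small calibration set. The function name, shapes, and calibration data below are assumptions for illustration, not the reference implementation.

```python
import numpy as np

def greedy_channel_selection(inputs, weights, keep_ratio=0.5):
    """Sketch of greedy per-layer pruning by reconstruction error.

    inputs:  (n_samples, c_in) activations feeding the next layer
    weights: (c_out, c_in) weights of the next layer
    Greedily drops the input channel whose removal increases the
    reconstruction error of the next layer's output the least.
    """
    target = inputs @ weights.T                      # original next-layer output
    kept = list(range(inputs.shape[1]))
    n_keep = max(1, int(keep_ratio * len(kept)))
    while len(kept) > n_keep:
        errors = []
        for c in kept:
            trial = [i for i in kept if i != c]
            approx = inputs[:, trial] @ weights[:, trial].T
            errors.append(((target - approx) ** 2).sum())
        kept.remove(kept[int(np.argmin(errors))])    # drop the cheapest channel
    return kept

# Illustrative usage with random calibration data (an assumption).
rng = np.random.default_rng(0)
kept = greedy_channel_selection(rng.standard_normal((256, 64)), rng.standard_normal((128, 64)))
```

The inner loop re-evaluates every remaining channel at each step, which is why the excerpt notes a significant computational cost for this style of selection.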
“…Pruning is a common method to derive a compact network: after training, some structural portion of the parameters is removed, along with its associated computations. A variety of pruning methods have been proposed, based on greedy algorithms [26,34], sparse regularization [20,22,33], and reinforcement learning [12]. Many of them rely on the belief that the magnitude of a weight and its importance are strongly correlated.…”
Section: Introduction
confidence: 99%
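The magnitude-importance belief mentioned in this excerpt is most easily seen in L1-norm filter pruning: filters are ranked by the sum of their absolute weights and the smallest are removed. A minimal sketch, with assumed tensor shapes:

```python
import numpy as np

def l1_filter_ranking(conv_weights):
    """Rank a conv layer's filters by the L1 norm of their weights.

    conv_weights: (out_channels, in_channels, k, k)
    Returns filter indices ordered from least to most 'important',
    under the assumption that small magnitude implies low importance.
    """
    norms = np.abs(conv_weights).reshape(conv_weights.shape[0], -1).sum(axis=1)
    return np.argsort(norms)                  # smallest-magnitude filters come first

# Illustrative usage with assumed shapes.
rng = np.random.default_rng(0)
w = rng.standard_normal((64, 32, 3, 3))       # 64 filters, 32 input channels, 3x3 kernels
prune = l1_filter_ranking(w)[: int(0.3 * 64)] # candidates: the 30% lowest-magnitude filters
```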
“…This is completely different from our method, which aims to get the complete pruned network in one step. While some methods [23,37] also measure the significance of all filters in one step via backpropagation, their estimation is biased since these approaches depend only on a mini-batch of samples.…”
Section: Discussion
confidence: 99%
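The "one step via backpropagation" estimates criticized here commonly take the form of a first-order (Taylor-style) score: a channel's importance is approximated by the mean |activation * gradient| over a single mini-batch, so the resulting ranking inherits that batch's sampling noise. The sketch below uses random arrays in place of a real forward/backward pass and is an illustrative assumption, not the cited papers' exact criterion.

```python
import numpy as np

def taylor_channel_scores(activations, gradients):
    """Sketch of first-order channel importance from one mini-batch.

    activations, gradients: (batch, channels, h, w) feature maps and the
    gradients of the loss w.r.t. them (from a single backward pass).
    Score per channel = mean |activation * gradient|; because it is computed
    on one mini-batch only, the estimate varies with the sampled batch.
    """
    return np.abs(activations * gradients).mean(axis=(0, 2, 3))

# Random stand-ins for a real forward/backward pass (an assumption).
rng = np.random.default_rng(0)
acts = rng.standard_normal((16, 64, 8, 8))
grads = rng.standard_normal((16, 64, 8, 8))
scores = taylor_channel_scores(acts, grads)
least_important = np.argsort(scores)[:8]   # candidates for one-step removal
```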
“…Channel pruning was regarded as an optimization problem by Luo et al. [26], and redundant channels were pruned using the statistics of the next layer. Yu et al. [36] conducted feature ranking to obtain neuron/channel importance scores and propagated them throughout the network. The neurons/channels with smaller importance scores were removed with negligible accuracy loss.…”
Section: Related Work
confidence: 99%
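However the per-channel importance scores are obtained, the removal step itself amounts to slicing the producing layer's filters and the consuming layer's matching input channels so that shapes stay consistent. A minimal sketch with assumed shapes and random scores:

```python
import numpy as np

def prune_channels(w_curr, w_next, scores, keep_ratio=0.75):
    """Sketch of structured channel removal given importance scores.

    w_curr: (c_out, c_in, k, k) weights whose output channels are scored
    w_next: (c_next, c_out, k, k) weights of the consuming layer
    Keeps the highest-scoring output channels and drops the matching
    input channels of the next layer so the two layers stay compatible.
    """
    n_keep = max(1, int(keep_ratio * w_curr.shape[0]))
    keep = np.sort(np.argsort(scores)[-n_keep:])
    return w_curr[keep], w_next[:, keep]

# Illustrative usage (shapes and scores are assumptions).
rng = np.random.default_rng(0)
w1 = rng.standard_normal((64, 32, 3, 3))
w2 = rng.standard_normal((128, 64, 3, 3))
pruned_w1, pruned_w2 = prune_channels(w1, w2, scores=rng.random(64))
```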