With the recent development of GPUs, the depth of convolutional neural networks (CNNs) has increased, and their structures have become complex. Hence, deploying them on hardware devices is challenging owing to their immense computational cost and the memory required to store parameters. To overcome this problem, we propose a method that grasps the density of the filter space in each layer and prunes the filters located near density peaks. The density of a filter is computed in the filter space from the number of neighboring filters within a certain distance, together with the distance to a denser region. Moreover, rather than removing all filters at once, we prune a fixed number of filters iteratively, so that filters are pruned evenly across multiple high-density regions of the filter space. Afterwards, we fine-tune the pruned network to restore its performance. The experimental results on the CIFAR-10 and ImageNet datasets with the VGGNet and ResNet architectures demonstrate the effectiveness of the proposed method compared with other methods. Notably, on CIFAR-10, our method reduces FLOPs by 60.8% on ResNet56 with a 0.31% improvement in validation accuracy. Moreover, we achieve up to 51.9% FLOPs reduction with only a small accuracy drop on ImageNet for ResNet34.
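The density-peak scoring described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `density_peak_scores`, the cutoff parameter `d_c`, and the use of Euclidean distance between flattened filter weights are assumptions for the sketch. For each filter it computes the local density (number of neighbors within the cutoff) and the distance to the nearest denser filter; filters with high density are candidates for iterative pruning.

```python
import numpy as np

def density_peak_scores(filters, d_c):
    """Illustrative density-peak scoring for conv filters.

    filters: (n, k) array, each row a flattened filter's weights.
    d_c: distance cutoff used to count neighbors (assumed hyperparameter).
    Returns (rho, delta): local density and distance to a denser filter.
    """
    n = filters.shape[0]
    # Pairwise Euclidean distances between filters in the filter space.
    dist = np.linalg.norm(filters[:, None, :] - filters[None, :, :], axis=-1)
    # Local density: number of neighboring filters within the cutoff
    # (subtract 1 to exclude the filter itself).
    rho = (dist < d_c).sum(axis=1) - 1
    # Delta: distance to the nearest filter with strictly higher density;
    # for the globally densest filter, fall back to its maximum distance.
    delta = np.empty(n)
    for i in range(n):
        higher = rho > rho[i]
        delta[i] = dist[i, higher].min() if higher.any() else dist[i].max()
    return rho, delta
```

In an iterative scheme matching the abstract, one would repeatedly recompute these scores, prune a small batch of the highest-density filters, and continue, so that pruning spreads over multiple density peaks rather than exhausting a single dense region.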
INDEX TERMS Convolutional neural networks, compressing CNNs, filter pruning, density peak.

YUNSEOK JANG received the B.S. degree in electrical and electronic engineering from Yonsei University, Seoul, South Korea, in 2015, where he is currently pursuing the Ph.D. degree with the IT-SOC Laboratory. His research interests include network compression and light object detection.

SANGYOUN LEE (Member, IEEE) received the B.S. and M.