2021
DOI: 10.48550/arxiv.2103.10858
Preprint

Toward Compact Deep Neural Networks via Energy-Aware Pruning

Abstract: Despite the remarkable performance, modern deep neural networks are inevitably accompanied by a significant amount of computational cost for learning and deployment, which may be incompatible with their usage on edge devices. Recent efforts to reduce these overheads involve pruning and decomposing the parameters of various layers without performance deterioration. Inspired by several decomposition studies, in this paper, we propose a novel energy-aware pruning method that quantifies the importance of eac…
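The abstract is truncated here, so the paper's exact criterion is not shown; as an illustration only, the sketch below computes an energy-style, feature-map-based filter score, using the nuclear norm (sum of singular values) of each filter's feature maps as the energy proxy — an assumption, not a detail confirmed by the excerpt. PyTorch is assumed, and names such as energy_scores and the layer sizes are illustrative.

import torch
import torch.nn as nn

def energy_scores(conv: nn.Conv2d, batch: torch.Tensor) -> torch.Tensor:
    # Forward a calibration batch and treat each HxW feature map as a matrix.
    with torch.no_grad():
        fmaps = conv(batch)                                   # (N, C_out, H, W)
    n, c, h, w = fmaps.shape
    svals = torch.linalg.svdvals(fmaps.reshape(n * c, h, w))  # batched SVD
    nuclear = svals.sum(dim=-1)                               # nuclear norm = "energy" per map
    return nuclear.reshape(n, c).mean(dim=0)                  # one score per filter

conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)
scores = energy_scores(conv, torch.randn(8, 3, 32, 32))       # hypothetical calibration batch
prune_first = torch.argsort(scores)[:4]                       # lowest-energy filters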

Cited by 2 publications (7 citation statements)
References 41 publications
“…However, these methods increase training time by 10 times [19,20] and can be computationally expensive while optimizing extra parameters such as a soft mask, particularly for large-scale networks. Other methods generate feature maps corresponding to a set of examples and then apply metrics such as rank [13], energy [14], or the average percentage of zeros [21] to quantify the importance of filters, or use similarity measures such as clustering [22] on feature maps to eliminate filters corresponding to redundant feature maps. However, generating feature maps corresponding to a set of examples takes extra memory resources.…”
Section: Methods To Compute CNN Filter Importance
confidence: 99%
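As a concrete instance of the feature-map metrics this excerpt lists, the sketch below scores filters by the Average Percentage of Zeros (APoZ) of their post-ReLU feature maps, in the spirit of the criterion cited as [21] in the quote. PyTorch is assumed; the layer shapes and the name apoz_scores are illustrative.

import torch
import torch.nn as nn

def apoz_scores(conv: nn.Conv2d, batch: torch.Tensor) -> torch.Tensor:
    # Fraction of zero activations per output filter, averaged over the batch
    # and spatial positions; a high APoZ suggests a rarely-used filter.
    with torch.no_grad():
        fmaps = torch.relu(conv(batch))                       # (N, C_out, H, W)
    return (fmaps == 0).float().mean(dim=(0, 2, 3))           # one score per filter

conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)
scores = apoz_scores(conv, torch.randn(8, 3, 32, 32))         # hypothetical calibration batch
prune_first = torch.argsort(scores, descending=True)[:4]      # mostly-zero filters first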
“…In filter pruning, the importance of the filters is measured using either active or passive methods. Active methods [13,14] use a dataset to generate feature maps from the filters and then compute filter importance using measures such as entropy and the average percentage of zeros on the feature maps. Some active methods even identify important filters during the training of CNNs by introducing extra parameters, such as a soft mask for each filter, and then jointly optimising the CNN parameters and the soft mask [11,12].…”
Section: Introduction
confidence: 99%
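A minimal sketch of the soft-mask idea mentioned above, assuming PyTorch: a learnable gate per filter is multiplied onto the conv output and trained jointly with the weights under a sparsity penalty, so filters whose gates shrink toward zero become pruning candidates. The MaskedConv class and the penalty weight are illustrative, not the cited methods' exact formulation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class MaskedConv(nn.Module):
    # A conv layer whose per-filter soft mask is learned jointly with the weights.
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1)
        self.mask = nn.Parameter(torch.ones(out_ch))          # one soft gate per filter

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.conv(x) * self.mask.view(1, -1, 1, 1)     # gate each output channel

layer = MaskedConv(3, 16)
opt = torch.optim.SGD(layer.parameters(), lr=0.01)
x, target = torch.randn(4, 3, 32, 32), torch.randn(4, 16, 32, 32)
loss = F.mse_loss(layer(x), target) + 1e-3 * layer.mask.abs().sum()  # task loss + sparsity
loss.backward()
opt.step()   # filters whose mask decays toward zero become pruning candidates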
“…Active filter pruning: Active filter pruning methods use a dataset to compute the importance of the filters. For example, Luo et al [15], Lin et al [16], and Yeom et al [21] proposed feature-map-based pruning methods, where a dataset is used to produce feature maps in CNNs, and then metrics such as entropy, variance, the average rank of feature maps, and the average percentage of zeros are applied to the feature maps to quantify the importance of the filters.…”
Section: B. Filter Pruning Methods
confidence: 99%
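For the "average rank of feature maps" criterion attributed to Lin et al [16], a minimal sketch (PyTorch assumed) is below: each filter is scored by the average matrix rank of its feature maps over a calibration batch, and low-rank (less informative) filters are pruned first. Shapes and names are illustrative.

import torch
import torch.nn as nn

def average_rank_scores(conv: nn.Conv2d, batch: torch.Tensor) -> torch.Tensor:
    # Average matrix rank of each filter's HxW feature maps over the batch.
    with torch.no_grad():
        fmaps = conv(batch)                                   # (N, C_out, H, W)
    n, c, h, w = fmaps.shape
    ranks = torch.linalg.matrix_rank(fmaps.reshape(n * c, h, w))  # batched rank
    return ranks.reshape(n, c).float().mean(dim=0)            # one score per filter

conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)
scores = average_rank_scores(conv, torch.randn(8, 3, 32, 32))
keep = torch.argsort(scores, descending=True)[:12]            # retain high-rank filters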
“…Other methods used for comparison: We compare the proposed operator-norm-based pruning method with the entry-wise norm-based methods: (a) the l1-norm method, which eliminates filters with a smaller entry-wise l1-norm [11], and (b) the geometric median (GM) method, which eliminates filters with a smaller l2-norm measured from the geometric median of all filters [19]. We also compare the proposed pruning method with existing active filter pruning methods, including HRank [16] and Energy-aware pruning [21]. The HRank method adopts three steps to obtain a pruned network.…”
Section: Methods
confidence: 99%
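The two passive baselines named here can be sketched compactly (PyTorch assumed; all names illustrative): (a) score each filter by the entry-wise l1-norm of its weights and prune the smallest [11]; (b) score each filter by its total l2 distance to all other filters and prune those closest to the set's geometric median, i.e. the most replaceable ones [19]. The geometric median is approximated here by total pairwise distance rather than computed exactly.

import torch
import torch.nn as nn

conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)
w = conv.weight.detach().flatten(start_dim=1)     # (C_out, in_ch*k*k): one row per filter

# (a) entry-wise l1-norm: filters with small norms contribute little.
l1_scores = w.abs().sum(dim=1)
prune_l1 = torch.argsort(l1_scores)[:4]

# (b) geometric-median style: filters with a small total distance to all other
# filters sit near the set's geometric median and are the most replaceable.
gm_scores = torch.cdist(w, w).sum(dim=1)
prune_gm = torch.argsort(gm_scores)[:4]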