2020
DOI: 10.48550/arxiv.2006.12139
Preprint
Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning

Abstract: As deep neural networks are growing in size and being increasingly deployed to more resource-limited devices, there has been a recent surge of interest in network pruning methods, which aim to remove less important weights or activations of a given network. A common limitation of most existing pruning techniques is that they require pre-training of the network at least once before pruning, and thus we can benefit from the reduction in memory and computation only at inference time. However, reducing the traini…
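As a point of reference for the pruning setting the abstract describes, below is a minimal PyTorch sketch of structural (channel-level) pruning with a simple L1 filter-magnitude criterion. The layer, the criterion, and the 50% keep ratio are illustrative assumptions only; the paper itself replaces such hand-crafted criteria with set-based, task-adaptive meta-learned masks.

```python
import torch
import torch.nn as nn

# A toy convolutional layer; shapes are illustrative, not from the paper.
conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3)

# Score each output channel by the L1 norm of its filter weights.
scores = conv.weight.detach().abs().sum(dim=(1, 2, 3))  # shape: (16,)

# Keep roughly the top half of channels; zero out the rest with a binary mask,
# i.e. prune entire filters rather than individual weights.
keep = scores >= scores.median()
with torch.no_grad():
    conv.weight.mul_(keep.float().view(-1, 1, 1, 1))
    conv.bias.mul_(keep.float())
```

Zeroing whole filters like this keeps the layer's shape but removes entire channels' contributions, which is what makes structural pruning translate into real memory and compute savings.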

Cited by 1 publication (2 citation statements)
References 22 publications (30 reference statements)
“…Note that an independent and concurrent work [56] also proposes to utilize meta-learning for rapid structural pruning of neural networks. We highlight the main differences below: 1) [56] relies on a centralized meta-learning method in which the nodes are required to submit data to a central platform, whereas we consider a more realistic distributed setup and propose a new federated meta-learning approach tailored to the specific efficiency problem in our work. 2) [56] takes a stochastic approach and learns a task-specific Bernoulli distribution for mask generation, which, however, could generate masks that lead to significant performance degradation.…”
Section: Related Work
confidence: 99%
“…We highlight the main differences below: 1) [56] relies on a centralized meta-learning method in which the nodes are required to submit data to a central platform, whereas we consider a more realistic distributed setup and propose a new federated meta-learning approach tailored to the specific efficiency problem in our work. 2) [56] takes a stochastic approach and learns a task-specific Bernoulli distribution for mask generation, which, however, could generate masks that lead to significant performance degradation. In stark contrast, we develop a deterministic approach by learning a task-specific channel gating module, and we also provide theoretical foundations by carrying out a thorough convergence analysis of the proposed federated meta-learning algorithm.…”
Section: Related Work
confidence: 99%
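To make the contrast drawn in these citation statements concrete, here is a minimal sketch of the two mask-generation strategies. The variable names, channel count, and 0.5 threshold are illustrative assumptions, not either paper's actual implementation; in practice the Bernoulli sampling would also need a relaxation (e.g. a straight-through or Gumbel-style estimator) to stay trainable.

```python
import torch
import torch.nn as nn

num_channels = 16
# Task-specific parameters that a meta-learner would produce per task.
logits = nn.Parameter(torch.zeros(num_channels))

# Stochastic masking in the spirit of [56]: sample a per-channel Bernoulli
# mask from learned probabilities. Each draw may prune different channels,
# so an unlucky sample can remove channels the task actually needs.
probs = torch.sigmoid(logits.detach())
stochastic_mask = torch.bernoulli(probs)

# Deterministic gating in the spirit of the citing work: threshold a learned
# gate once, so the same channels are pruned on every forward pass.
deterministic_mask = (probs > 0.5).float()
```

Because the deterministic gate yields the same sub-network on every forward pass, its behavior is reproducible and more amenable to analysis, which is consistent with the citing work's emphasis on proving convergence of its federated meta-learning algorithm.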