Compressing Convolutional Neural Networks in the Frequency Domain

Chen, Wenlin; Wilson, James T.; Tyree, Stephen; Weinberger, Kilian Q.; Chen, Yixin

doi:10.1145/2939672.2939839

Cited by 145 publications

(104 citation statements)

References 35 publications

(40 reference statements)

Supporting

Mentioning

104

Contrasting

Order By: Relevance

“…Much has been done to minimize the memory requirements of neural networks [445,[493][494][495][496]506,507], but there is also growing interest in specialized hardware, such as field-programmable gate arrays (FPGAs) [502,508] and application-specific integrated circuits (ASICs) [509]. Less software is available for such highly specialized hardware [508].…”

Section: Data Limitationsmentioning

confidence: 99%

Opportunities and obstacles for deep learning in biology and medicine

Ching

Himmelstein

Beaulieu‐Jones

et al. 2017

Preprint

226

248

View full text Add to dashboard Cite

Deep learning, which describes a class of machine learning algorithms, has recently showed impressive results across a variety of domains. Biology and medicine are data rich, but the data are complex and often ill-understood. Problems of this nature may be particularly well-suited to deep learning techniques. We examine applications of deep learning to a variety of biomedical problems—patient classification, fundamental biological processes, and treatment of patients—and discuss whether deep learning will transform these tasks or if the biomedical sphere poses unique challenges. We find that deep learning has yet to revolutionize or definitively resolve any of these problems, but promising advances have been made on the prior state of the art. Even when improvement over a previous baseline has been modest, we have seen signs that deep learning methods may speed or aid human investigation. More work is needed to address concerns related to interpretability and how to best model each problem. Furthermore, the limited amount of labeled data for training presents problems in some domains, as do legal and privacy constraints on work with sensitive health records. Nonetheless, we foresee deep learning powering changes at both bench and bedside with the potential to transform several areas of biology and medicine.

show abstract

Section: Data Limitationsmentioning

confidence: 99%

Opportunities and obstacles for deep learning in biology and medicine

Ching

Himmelstein

Beaulieu‐Jones

et al. 2017

Preprint

226

248

View full text Add to dashboard Cite

show abstract

“…One stream focuses on designing efficient network architectures [30,28,15,40,13], including depthwise separable convolution [30], point-wise group convolution with channel shuffling [39], and learned group convolution [15], to name a few. The other line of research explores methods to prune [8,23,11] or quantize [4,8,17] neural network weights. These strategies are effective when neural networks have a substantial amount of redundant weights, which can be safely removed or quantized without sacrificing accuracy.…”

Section: Related Workmentioning

confidence: 99%

“…Extensive efforts have been made to improve the inference efficiency of deep CNNs in recent years. Popular approaches include efficient architecture design [30,28,15,40], network pruning [8,23,26], weight quantiza- * First two authors contributed equally † Corresponding author tion [4,8,17] and adaptive inference [7,14,2,35,6,34]. Among them, adaptive inference is gaining increasing attention recently, due to its remarkable advantages.…”

Section: Introductionmentioning

confidence: 99%

Improved Techniques for Training Adaptive Deep Networks

Zhang

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

105

View full text Add to dashboard Cite

Adaptive inference is a promising technique to improve the computational efficiency of deep models at test time. In contrast to static models which use the same computation graph for all instances, adaptive networks can dynamically adjust their structure conditioned on each input. While existing research on adaptive inference mainly focuses on designing more advanced architectures, this paper investigates how to train such networks more effectively. Specifically, we consider a typical adaptive deep network with multiple intermediate classifiers. We present three techniques to improve its training efficacy from two aspects: 1) a Gradient Equilibrium algorithm to resolve the conflict of learning of different classifiers; 2) an Inline Subnetwork Collaboration approach and a One-for-all Knowledge Distillation algorithm to enhance the collaboration among classifiers. On multiple datasets (CIFAR-10, CIFAR-100 and ImageNet), we show that the proposed approach consistently leads to further improved efficiency on top of stateof-the-art adaptive deep networks.

show abstract

“…Yoon and Hwang enforce sparsity on the filter through regularization. Chen and others also explored the frequency domain for compression . However, all of these methods are more complex when implemented using standard deep‐learning tools, which was actually the reason why we did not choose them as candidates for our approach.…”

Section: Related Workmentioning

confidence: 99%

Deep compression of convolutional neural networks with low-rank approximation

Astrid

Lee

2018

ETRI Journal

View full text Add to dashboard Cite

The application of deep neural networks (DNNs) to connect the world with cyber physical systems (CPSs) has attracted much attention. However, DNNs require a large amount of memory and computational cost, which hinders their use in the relatively low‐end smart devices that are widely used in CPSs. In this paper, we aim to determine whether DNNs can be efficiently deployed and operated in low‐end smart devices. To do this, we develop a method to reduce the memory requirement of DNNs and increase the inference speed, while maintaining the performance (for example, accuracy) close to the original level. The parameters of DNNs are decomposed using a hybrid of canonical polyadic–singular value decomposition, approximated using a tensor power method, and fine‐tuned by performing iterative one‐shot hybrid fine‐tuning to recover from a decreased accuracy. In this study, we evaluate our method on frequently used networks. We also present results from extensive experiments on the effects of several fine‐tuning methods, the importance of iterative fine‐tuning, and decomposition techniques. We demonstrate the effectiveness of the proposed method by deploying compressed networks in smartphones.

show abstract

Compressing Convolutional Neural Networks in the Frequency Domain

Cited by 145 publications

References 35 publications

Opportunities and obstacles for deep learning in biology and medicine

Opportunities and obstacles for deep learning in biology and medicine

Improved Techniques for Training Adaptive Deep Networks

Deep compression of convolutional neural networks with low-rank approximation

Contact Info

Product

Resources

About