Igor Durdanovic scite author profile

The success of CNNs in various applications is accompanied by a significant increase in the computation and parameter storage costs. Recent efforts toward reducing these overheads involve pruning and compressing the weights of various layers without hurting original accuracy. However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks. We present an acceleration method for CNNs, where we prune filters from CNNs that are identified as having a small effect on the output accuracy. By removing whole filters in the network together with their connecting feature maps, the computation costs are reduced significantly. In contrast to pruning weights, this approach does not result in sparse connectivity patterns. Hence, it does not need the support of sparse convolution libraries and can work with existing efficient BLAS libraries for dense matrix multiplications. We show that even simple filter pruning techniques can reduce inference costs for VGG-16 by up to 34% and ResNet-110 by up to 38% on CIFAR10 while regaining close to the original accuracy by retraining the networks.

show abstract

A Massively Parallel Coprocessor for Convolutional Neural Networks

Sankaradas¹,

Jakkula²,

Cadambi³

et al. 2009

201

View full text Add to dashboard Cite

A Massively Parallel FPGA-Based Coprocessor for Support Vector Machines

Cadambi¹,

Durdanovic²,

Jakkula³

et al. 2009

View full text Add to dashboard Cite

Evolution of Cooperative Problem Solving in an Artificial Economy

Baum

Durdanovic

2000

Neural Computation

View full text Add to dashboard Cite

We address the problem of how to reinforce learning in ultracomplex environments, with huge state-spaces, where one must learn to exploit a compact structure of the problem domain. The approach we propose is to simulate the evolution of an artificial economy of computer programs. The economy is constructed based on two simple principles so as to assign credit to the individual programs for collaborating on problem solutions. We find empirically that starting from programs that are random computer code, we can develop systems that solve hard problems. In particular, our economy learned to solve almost all random Blocks World problems with goal stacks that are 200 blocks high. Competing methods solve such problems only up to goal stacks of at most 8 blocks. Our economy has also learned to unscramble about half a randomly scrambled Rubik's cube and to solve several commercially sold puzzles.

show abstract

An Artificial Economy of Post Production Systems

Baum

Durdanovic

2001

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Igor Durdanovic

Pruning Filters for Efficient ConvNets

A Massively Parallel Coprocessor for Convolutional Neural Networks

A Massively Parallel FPGA-Based Coprocessor for Support Vector Machines

Evolution of Cooperative Problem Solving in an Artificial Economy

An Artificial Economy of Post Production Systems

Contact Info

Product

Resources

About