Kartik Lakhotia scite author profile

To accelerate inference of Convolutional Neural Networks (CNNs), various techniques have been proposed to reduce computational redundancy. Converting convolutional layers to the frequency domain significantly reduces the computational complexity of the sliding-window operations in the spatial domain. Weight pruning techniques, on the other hand, address the redundancy in model parameters by converting dense convolutional kernels into sparse ones. To obtain a high-throughput FPGA implementation, we propose SPEC2, the first work to prune and accelerate spectral CNNs. First, we propose a systematic pruning algorithm based on the Alternating Direction Method of Multipliers (ADMM). The offline pruning iteratively sets the majority of spectral weights to zero, without using any handcrafted heuristics. Then, we design an optimized pipeline architecture on FPGA that has efficient random access into the sparse kernels and exploits various dimensions of parallelism in convolutional layers. Overall, SPEC2 achieves high inference throughput with extremely low computational complexity and negligible accuracy degradation. We demonstrate SPEC2 by pruning and implementing LeNet and VGG16 on the Xilinx Virtex platform. After pruning 75% of the spectral weights, SPEC2 achieves 0% accuracy loss for LeNet and < 1% accuracy loss for VGG16. The resulting accelerators achieve up to 24× higher throughput than state-of-the-art FPGA implementations for VGG16.
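The ADMM-based pruning mentioned in the abstract can be illustrated with a minimal sketch. This is not the SPEC2 procedure itself: it assumes a toy least-squares loss in place of the network loss, real-valued weights in place of complex spectral weights, and a fixed penalty `rho`. The names `project_topk` and `admm_prune` are hypothetical. It only shows the general pattern of alternating a loss-minimizing update with a projection onto a sparsity constraint that keeps 25% of the weights.

```python
import numpy as np


def project_topk(v, k):
    """Project v onto vectors with at most k nonzeros (keep the k largest magnitudes)."""
    z = np.zeros_like(v)
    if k > 0:
        keep = np.argsort(np.abs(v))[-k:]
        z[keep] = v[keep]
    return z


def admm_prune(X, y, keep_ratio=0.25, rho=1.0, iters=50):
    """Toy ADMM: minimize 0.5*||Xw - y||^2 subject to w having few nonzeros.

    Assumption: a least-squares loss stands in for the CNN training loss.
    """
    n = X.shape[1]
    k = int(round(keep_ratio * n))
    z = np.zeros(n)  # sparse copy of the weights
    u = np.zeros(n)  # scaled dual variable
    A = X.T @ X + rho * np.eye(n)
    Xty = X.T @ y
    for _ in range(iters):
        # w-update: minimize the loss plus the quadratic penalty (closed form here)
        w = np.linalg.solve(A, Xty + rho * (z - u))
        # z-update: projection onto the sparsity constraint set
        z = project_topk(w + u, k)
        # dual update: accumulate the disagreement between w and z
        u = u + w - z
    return z


# Synthetic data: 16 weights, of which only 4 are nonzero
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 16))
w_true = np.zeros(16)
w_true[[1, 5, 9, 14]] = [2.0, -3.0, 1.5, 4.0]
y = X @ w_true

w_sparse = admm_prune(X, y)  # at most 4 of 16 weights remain (75% pruned)
```

The projection step is what removes the need for a handcrafted pruning heuristic: the sparsity pattern follows from the optimization rather than from a fixed rule chosen in advance.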


A Fast and Efficient Parallel Algorithm for Pruned Landmark Labeling

Dong, Lakhotia, Zeng et al. 2018


Design and implementation of parallel PageRank on multicore platforms

Zhou, Lakhotia, Singapura et al. 2017


Quickly finding a truss in a haystack

Green, Fox, Kim et al. 2017


Low-Power, Low-Latency Hermite Polynomial Characterization of Heartbeats Using a Field-Programmable Gate Array

Lakhotia, Caffarena, Gil et al. 2016


Parallel edge-based sampling for static and dynamic graphs

Lakhotia, Kannan, Gaur et al. 2019



Kartik Lakhotia

PolarFly: A Cost-Effective and Flexible Low-Diameter Topology

ReCALL: Reordered Cache Aware Locality Based Graph Processing

SPEC2: SPECtral SParsE CNN Accelerator on FPGAs

