The demand to process vast amounts of data generated by state-of-the-art high-resolution cameras has motivated novel energy-efficient on-device AI solutions. Visual data in such cameras are usually captured as analog voltages by a sensor pixel array and then converted to the digital domain for subsequent AI processing using analog-to-digital converters (ADCs). Recent research has tried to take advantage of massively parallel low-power analog/digital computing in the form of near- and in-sensor processing, in which the AI computation is performed partly in the periphery of the pixel array and partly in a separate on-board CPU/accelerator. Unfortunately, high-resolution input images still need to be streamed between the camera and the AI processing unit, frame by frame, causing energy, bandwidth, and security bottlenecks. To mitigate this problem, we propose a novel Processing-in-Pixel-in-Memory (P²M) paradigm that customizes the pixel array by adding support for analog multi-channel, multi-bit convolution, batch normalization, and rectified linear units (ReLU). Our solution includes a holistic algorithm-circuit co-design approach, and the resulting P²M paradigm can be used as a drop-in replacement for embedding the memory-intensive first few layers of convolutional neural network (CNN) models within foundry-manufacturable CMOS image sensor platforms. Our experimental results indicate that P²M reduces data transfer bandwidth from sensors and analog-to-digital conversions by ∼21×, and the energy-delay product (EDP) incurred in processing a MobileNetV2 model on a TinyML use case for the visual wake words (VWW) dataset by up to ∼11× compared to standard near-processing or in-sensor implementations, without any significant drop in test accuracy.
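As a minimal digital reference for the fused first-layer computation described above (multi-channel convolution with batch normalization folded into the weights, followed by ReLU), the sketch below shows the arithmetic the in-pixel analog circuitry would approximate. All function names, shapes, and parameters are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def fold_batchnorm(weight, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold batch-norm parameters into conv weight/bias so each output
    reduces to a single weighted sum (hypothetical helper)."""
    scale = gamma / np.sqrt(var + eps)           # per-output-channel scale
    w = weight * scale[:, None, None, None]      # scale each output channel
    b = (bias - mean) * scale + beta
    return w, b

def conv2d_relu(x, w, b, stride=1):
    """Naive valid convolution followed by ReLU; a digital stand-in for
    the analog multiply-accumulate in the pixel periphery."""
    C_out, C_in, kh, kw = w.shape
    H, W = x.shape[1], x.shape[2]
    oh, ow = (H - kh) // stride + 1, (W - kw) // stride + 1
    y = np.zeros((C_out, oh, ow))
    for co in range(C_out):
        for i in range(oh):
            for j in range(ow):
                patch = x[:, i*stride:i*stride+kh, j*stride:j*stride+kw]
                y[co, i, j] = np.sum(patch * w[co]) + b[co]
    return np.maximum(y, 0.0)                    # ReLU
```

Folding batch norm into the convolution is what makes the layer expressible as one weighted sum per output, which is the form an analog in-pixel circuit can realize directly.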
Spiking neural networks (SNNs), which operate via binary spikes distributed over time, have emerged as a promising energy-efficient ML paradigm for resource-constrained devices. However, current state-of-the-art (SOTA) SNNs require multiple time steps for acceptable inference accuracy, increasing spiking activity and, consequently, energy consumption. SOTA training strategies for SNNs involve conversion from a non-spiking deep neural network (DNN). In this paper, we determine that SOTA conversion strategies cannot yield ultra-low latency because they incorrectly assume that the DNN and SNN pre-activation values are uniformly distributed. We propose a new training algorithm that accurately captures these distributions, minimizing the error between the DNN and the converted SNN. The resulting SNNs have ultra-low latency and high activation sparsity, yielding significant improvements in compute efficiency. In particular, we evaluate our framework on image recognition tasks from the CIFAR-10 and CIFAR-100 datasets on several VGG and ResNet architectures. We obtain top-1 accuracy of 64.19% with only 2 time steps on CIFAR-100, with ∼159.2× lower compute energy compared to an iso-architecture standard DNN. Compared to other SOTA SNN models, our models perform inference 2.5-8× faster (i.e., with fewer time steps).
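To illustrate why conversion quality hinges on the relationship between DNN activations and SNN spike rates, the sketch below simulates a rate-coded integrate-and-fire (IF) neuron: over many time steps its spike rate approximates the source DNN's ReLU output, but at the ultra-low step counts targeted above the approximation error depends heavily on the pre-activation distribution. This is a generic textbook model of DNN-to-SNN conversion, not the paper's specific algorithm, and all names are illustrative.

```python
def if_neuron_rate(z, T, v_th=1.0):
    """Simulate an integrate-and-fire neuron driven by a constant input
    current `z` for T time steps, with soft reset on firing. Returns the
    rate-coded activation estimate (spike rate scaled by threshold)."""
    v = 0.0       # membrane potential
    spikes = 0
    for _ in range(T):
        v += z                   # integrate input
        if v >= v_th:            # fire and soft-reset (subtract threshold)
            spikes += 1
            v -= v_th
    return spikes / T * v_th     # approximates min(ReLU(z), v_th)
```

For large T the rate converges to the clipped ReLU of the input (e.g. `if_neuron_rate(0.3, 1000)` is close to 0.3, and any negative input never fires), but with only 2 time steps the quantization of the rate is severe, which is why matching the DNN pre-activation distribution matters for ultra-low-latency conversion.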