Shiv Surya scite author profile

We propose a novel crowd counting model that maps a given crowd scene to its density. Crowd analysis is compounded by myriad of factors like inter-occlusion between people due to extreme crowding, high similarity of appearance between people and background elements, and large variability of camera view-points. Current state-of-the art approaches tackle these factors by using multi-scale CNN architectures, recurrent networks and late fusion of features from multi-column CNN with different receptive fields. We propose switching convolutional neural network that leverages variation of crowd density within an image to improve the accuracy and localization of the predicted crowd count. Patches from a grid within a crowd scene are relayed to independent CNN regressors based on crowd count prediction quality of the CNN established during training. The independent CNN regressors are designed to have different receptive fields and a switch classifier is trained to relay the crowd scene patch to the best CNN regressor. We perform extensive experiments on all major crowd counting datasets and evidence better performance compared to current stateof-the-art methods. We provide interpretable representations of the multichotomy of space of crowd scene patches inferred from the switch. It is observed that the switch relays an image patch to a particular CNN column based on density of crowd.

show abstract

Switching Convolutional Neural Network for Crowd Counting

Sam¹,

Surya²,

Babu³

2017

Preprint

View full text Add to dashboard Cite

Pictionary-Style Word Guessing on Hand-Drawn Object Sketches: Dataset, Analysis and Deep Network Models

Sarvadevabhatla

Surya

Mittal

et al. 2020

IEEE Trans. Pattern Anal. Mach. Intell.

View full text Add to dashboard Cite

ReStGAN: A step towards visually guided shopper experience via text-to-image synthesis

Surya

Biswas

Negi

2020

View full text Add to dashboard Cite

Game of Sketches: Deep Recurrent Models of Pictionary-Style Word Guessing

Sarvadevabhatla

Surya

Mittal

et al. 2018

AAAI

View full text Add to dashboard Cite

The ability of machine-based agents to play games in human-like fashion is considered a benchmark of progress in AI. In this paper, we introduce the first computational model aimed at Pictionary, the popular word-guessing social game. We first introduce Sketch-QA, an elementary version of Visual Question Answering task. Styled after Pictionary, Sketch-QA uses incrementally accumulated sketch stroke sequences as visual data. Notably, Sketch-QA involves asking a fixed question ("What object is being drawn?") and gathering open-ended guess-words from human guessers. To mimic Pictionary-style guessing, we propose a deep neural model which generates guess-words in response to temporally evolving human-drawn sketches. Our model even makes human-like mistakes while guessing, thus amplifying the human mimicry factor. We evaluate our model on the large-scale guess-word dataset generated via Sketch-QA task and compare with various baselines. We also conduct a Visual Turing Test to obtain human impressions of the guess-words generated by humans and our model. Experimental results demonstrate the promise of our approach for Pictionary and similarly themed games.

show abstract

SwiDeN

Sarvadevabhatla

Surya

Kruthiventi

et al. 2016

View full text Add to dashboard Cite

SwiDeN : Convolutional Neural Networks For Depiction Invariant Object Recognition

Sarvadevabhatla¹,

Surya²,

Kruthiventi³

et al. 2016

Preprint

View full text Add to dashboard Cite

TraCount

Surya

Babu

2016

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shiv Surya

Switching Convolutional Neural Network for Crowd Counting

Switching Convolutional Neural Network for Crowd Counting

Pictionary-Style Word Guessing on Hand-Drawn Object Sketches: Dataset, Analysis and Deep Network Models

ReStGAN: A step towards visually guided shopper experience via text-to-image synthesis

Game of Sketches: Deep Recurrent Models of Pictionary-Style Word Guessing

SwiDeN

SwiDeN : Convolutional Neural Networks For Depiction Invariant Object Recognition

TraCount

Contact Info

Product

Resources

About