Object localization is an important computer vision problem with a variety of applications. The lack of large-scale object-level annotations and the relative abundance of image-level labels make a compelling case for weak supervision in the object localization task. Deep Convolutional Neural Networks are a class of state-of-the-art methods for the related problem of object recognition. In this paper, we describe a novel object localization algorithm which uses classification networks trained on only image labels. This weakly supervised method leverages local spatial and semantic patterns captured in the convolutional layers of classification networks. We propose an efficient beam search based approach to detect and localize multiple objects in images. The proposed method significantly outperforms the state-of-the-art on standard object localization datasets with an 8-point increase in mAP scores.
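The abstract does not detail how the beam search is applied, but the general pattern is to keep only the top-scoring candidate regions at each expansion step. A minimal sketch of that skeleton, with a toy IoU-based score and box-shifting expansion standing in for the paper's actual region proposals and classifier scores (all names here are hypothetical):

```python
def beam_search(initial, expand, score, width=3, steps=4):
    """Generic beam search: keep only the top-`width` candidates per step."""
    beam = [initial]
    for _ in range(steps):
        candidates = [c for state in beam for c in expand(state)]
        if not candidates:
            break
        beam = sorted(candidates, key=score, reverse=True)[:width]
    return max(beam, key=score)

# Toy stand-in: states are (x, y, w, h) boxes; "expand" shifts the box,
# "score" rewards overlap (IoU) with a hypothetical target box.
TARGET = (20, 20, 40, 40)

def iou(a, b):
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    x1, y1 = max(ax, bx), max(ay, by)
    x2, y2 = min(ax + aw, bx + bw), min(ay + ah, by + bh)
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    return inter / (aw * ah + bw * bh - inter)

def expand(box):
    x, y, w, h = box
    return [(x + dx, y + dy, w, h) for dx in (-5, 0, 5) for dy in (-5, 0, 5)]

best = beam_search((0, 0, 40, 40), expand, lambda b: iou(b, TARGET),
                   width=5, steps=8)
```

In the paper's setting, `expand` would enumerate spatial refinements of a candidate region and `score` would come from the classification network's convolutional activations; the skeleton above only illustrates the pruning structure.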
In the future, the video-enabled camera will be the most pervasive type of sensor in the Internet of Things. Such cameras will enable continuous surveillance through heterogeneous camera networks consisting of fixed camera systems as well as cameras on mobile devices. The challenge in these networks is to enable efficient video analytics: the ability to process videos cheaply and quickly in order to search for specific events or sequences of events. In this paper, we discuss the design and implementation of Kestrel, a video analytics system that tracks the path of vehicles across a heterogeneous camera network. In Kestrel, fixed camera feeds are processed on the cloud, and mobile devices are invoked only to resolve ambiguities in vehicle tracks. Kestrel's mobile device pipeline detects objects using a deep neural network, extracts attributes using cheap visual features, and resolves path ambiguities by careful association of vehicle visual descriptors, while using several optimizations to conserve energy and reduce latency. Our evaluations show that Kestrel can achieve precision and recall comparable to a fixed camera network of the same size and topology, while reducing energy usage on mobile devices by more than an order of magnitude.
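The abstract mentions associating vehicle visual descriptors across cameras to resolve track ambiguities, without specifying the matching rule. One common baseline is greedy nearest-neighbor matching on descriptor similarity with a threshold; a minimal sketch under that assumption (track names and descriptors below are invented toy data, not Kestrel's actual pipeline):

```python
import math

def cosine(u, v):
    """Cosine similarity between two descriptor vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def associate(tracks_a, tracks_b, threshold=0.9):
    """Greedily match each track at camera A to its most similar
    unused track at camera B, if similarity clears the threshold."""
    matches, used = {}, set()
    for name_a, desc_a in tracks_a.items():
        best, best_sim = None, threshold
        for name_b, desc_b in tracks_b.items():
            if name_b in used:
                continue
            sim = cosine(desc_a, desc_b)
            if sim > best_sim:
                best, best_sim = name_b, sim
        if best is not None:
            matches[name_a] = best
            used.add(best)
    return matches

# Toy descriptors (e.g. cheap color features) for vehicles at two cameras.
cam_a = {"car1": [0.9, 0.1, 0.0], "car2": [0.1, 0.8, 0.1]}
cam_b = {"x": [0.12, 0.82, 0.06], "y": [0.88, 0.12, 0.02]}
matches = associate(cam_a, cam_b)
```

A real system would also gate matches on spatio-temporal plausibility (a vehicle cannot appear at a distant camera instantly), which the sketch omits.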
Humans use context and scene knowledge to easily localize moving objects in conditions of complex illumination changes, scene clutter, and occlusions. In this paper, we present a method to leverage human knowledge in the form of annotated video libraries in a novel search and retrieval-based setting to track objects in unseen video sequences. For every video sequence, a document that represents motion information is generated. Documents of the unseen video are queried against the library at multiple scales to find videos with similar motion characteristics. This provides us with coarse localization of objects in the unseen video. We further adapt these retrieved object locations to the new video using an efficient warping scheme. The proposed method is validated on in-the-wild video surveillance datasets where we outperform state-of-the-art appearance-based trackers. We also introduce a new challenging dataset with complex object appearance changes.

Index Terms—Data-driven methods, video search and retrieval, visual object tracking.
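The abstract describes building a "document" of motion information per video and retrieving library videos with similar motion, but not the representation itself. A minimal sketch under the assumption of a bag-of-words-style representation: motion vectors quantized into direction histograms, compared by histogram intersection (the binning scheme and toy data are illustrative, not the paper's method):

```python
import math
from collections import Counter

def motion_document(motion_vectors, bins=8):
    """Quantize (dx, dy) motion vectors into a direction histogram
    that serves as the video's motion 'document'."""
    doc = Counter()
    for dx, dy in motion_vectors:
        angle = math.atan2(dy, dx) % (2 * math.pi)
        doc[int(angle / (2 * math.pi) * bins)] += 1
    return doc

def histogram_intersection(a, b):
    return sum(min(a[k], b[k]) for k in set(a) | set(b))

def retrieve(query_doc, library):
    """Return the library video whose motion document best matches."""
    return max(library, key=lambda name: histogram_intersection(query_doc,
                                                                library[name]))

# Toy library: one clip with mostly rightward motion, one mostly upward.
library = {
    "rightward": motion_document([(1, 0)] * 10 + [(0, 1)] * 2),
    "upward": motion_document([(0, 1)] * 10 + [(1, 0)] * 2),
}
query = motion_document([(1, 0.1)] * 8)
best = retrieve(query, library)
```

The paper additionally queries at multiple scales and warps the retrieved object locations onto the new video; the sketch covers only the retrieval step.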