Guruprasad Somasundaram scite author profile

Recognizing actions is one of the important challenges in computer vision with respect to video data, with applications to surveillance, diagnostics of mental disorders, and video retrieval. Compared to other data modalities such as documents and images, processing video data demands orders of magnitude higher computational and storage resources. One way to alleviate this difficulty is to focus the computations to informative (salient) regions of the video. In this paper, we propose a novel global spatio-temporal selfsimilarity measure to score saliency using the ideas of dictionary learning and sparse coding. In contrast to existing methods that use local spatio-temporal feature detectors along with descriptors (such as HOG, HOG3D, HOF, etc.), dictionary learning helps consider the saliency in a global setting (on the entire video) in a computationally efficient way. We consider only a small percentage of the most salient (least self-similar) regions found using our algorithm, over which spatio-temporal descriptors such as HOG and region covariance descriptors are computed. The ensemble of such block descriptors in a bag-of-features framework provides a holistic description of the motion sequence which can be used in a classification setting. Experiments on several benchmark datasets in video based action classification demonstrate that our approach performs competitively to the state of the art.

show abstract

Counting pedestrians and bicycles in traffic scenes

Somasundaram

Morellas

Papanikolopoulos

2009

View full text Add to dashboard Cite

Optimal camera placement with adaptation to dynamic scenes

Fiore

Somasundaram

Drenner

et al. 2008

View full text Add to dashboard Cite

Sparse representation of point trajectories for action classification

Sivalingam

Somasundaram

Bhatawadekar

et al. 2012

View full text Add to dashboard Cite

Dictionary learning based object detection and counting in traffic scenes

Sivalingam

Somasundaram

Morellas

et al. 2010

View full text Add to dashboard Cite

Classification and Counting of Composite Objects in Traffic Scenes Using Global and Local Image Analysis

Somasundaram

Sivalingam

Morellas

et al. 2013

IEEE Trans. Intell. Transport. Syst.

View full text Add to dashboard Cite

Recognition of ballet micro-movements for use in choreography

Dancs

Sivalingam

Somasundaram

et al. 2013

View full text Add to dashboard Cite

Computer vision as an entire field has a wide and diverse range of applications. The specific application for this project was in the realm of dance, notably ballet and choreography. This project was proof-of-concept for a choreography assistance tool used to recognize and record dance movements demonstrated by a choreographer. Keeping the commercial arena in mind, the Kinect from Microsoft was chosen as the imaging hardware, and a pilot set chosen to verify recognition feasibility. Before implementing a classifier, all training and test data was transformed to a more applicable representation scheme to only pass the important aspects to the classifier to distinguish moves for the pilot set. In addition, several classification algorithms using the Nearest Neighbor (NN) and Support Vector Machine (SVM) methods were tested and compared from a single dictionary as well as on several different subjects. The results were promising given the framework of the project, and several new expansions of this work are proposed.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.