Group action induced distances for averaging and clustering Linear Dynamical Systems with applications to the analysis of dynamic scenes

Afsari, Bijan; Chaudhry, Rizwan; Ravichandran, Avinash; Vidal, René

doi:10.1109/cvpr.2012.6247929

Cited by 32 publications

(54 citation statements)

References 21 publications


“…Over the years, several metrics have been proposed e.g. [8,17,25,1,6]. Of these, the Martin distance has been the most extensively used as it is invariant to the noise statistics as well as initial state of the dynamical system.…”

Section: Time-series Modeling Using LDSs (mentioning)

confidence: 99%

Bio-inspired Dynamic 3D Discriminative Skeletal Features for Human Action Recognition

Chaudhry, Ofli, Kurillo, et al. 2013

2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops



Over the last few years, with the immense popularity of the Kinect, there has been renewed interest in developing methods for human gesture and action recognition from 3D data. A number of approaches have been proposed that extract representative features from 3D depth data, a reconstructed 3D surface mesh or more commonly from the recovered estimate of the human skeleton. Recent advances in neuroscience have discovered a neural encoding of static 3D shapes in primate infero-temporal cortex that can be represented as a hierarchy of medial axis and surface features. We hypothesize a similar neural encoding might also exist for 3D shapes in motion and propose a hierarchy of dynamic medial axis structures at several spatio-temporal scales that can be modeled using a set of Linear Dynamical Systems (LDSs). We then propose novel discriminative metrics for comparing these sets of LDSs for the task of human activity recognition. Combined with simple classification frameworks, our proposed features and corresponding hierarchical dynamical models provide the highest human activity recognition rates as compared to state-of-the-art methods on several skeletal datasets.



“…Hence, clustering algorithms used in the Euclidean space (such as k-means) cannot be applied directly, as discussed in prior work [7,5,6,1].…”

Section: Codebook Generation (mentioning)

confidence: 99%

Intrinsic Characterization of Dynamic Surfaces

Tung, Matsuyama 2013

2013 IEEE Conference on Computer Vision and Pattern Recognition


“…The main challenge of this dictionary learning problem is the difficulty of identifying the "centroid" of a collection of dynamic textures, due to the non-Euclidean nature of the space of linear dynamic systems. [20] bypasses this problem with resort to a somewhat heuristic combination of multi-dimensional scaling and k-means (denoted MDS-kM); while [1] presents a procedure to directly average dynamic models in the parameter space, the approach only works for LDS's. We propose an alternative principled solution, which is specifically designed for clustering attribute sequences, and has a number of advantages over MDS-kM.…”

Section: Related Work (mentioning)

confidence: 99%

“…This is related to the bag-of-systems framework of [20,1], where a set of dynamic textures (DTs) [5] were used to characterize dynamic scenes. The main challenge of this dictionary learning problem is the difficulty of identifying the "centroid" of a collection of dynamic textures, due to the non-Euclidean nature of the space of linear dynamic systems.…”

Section: Related Work (mentioning)

confidence: 99%

Recognizing Activities via Bag of Words for Attribute Dynamics

Sawhney et al. 2013

2013 IEEE Conference on Computer Vision and Pattern Recognition


In this work, we propose a novel video representation for activity recognition that models video dynamics with attributes of activities. A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes. These segments are modeled by a dictionary of attribute dynamics templates, which are implemented by a recently introduced generative model, the binary dynamic system (BDS). We propose methods for learning a dictionary of BDS's from a training corpus, and for quantizing attribute sequences extracted from videos into these BDS codewords. This procedure produces a representation of the video as a histogram of BDS codewords, which is denoted the bag-of-words for attribute dynamics (BoWAD). An extensive experimental evaluation reveals that this representation outperforms other state-of-the-art approaches in temporal structure modeling for complex activity recognition.

