2008 IEEE Conference on Computer Vision and Pattern Recognition 2008
DOI: 10.1109/cvpr.2008.4587727
|View full text |Cite
|
Sign up to set email alerts
|

Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition

Abstract: In this paper we introduce a template-based method for recognizing human actions called Action MACH. Our approach is based on a Maximum Average Correlation Height (MACH) filter. A common limitation of template-based methods is their inability to generate a single template using a collection of examples. MACH is capable of capturing intra-class variability by synthesizing a single ActionMACH filter for a given action class. We generalize the traditional MACH filter to video (3D spatiotemporal volume), and vecto… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

6
775
1
4

Year Published

2010
2010
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 998 publications
(786 citation statements)
references
References 19 publications
(19 reference statements)
6
775
1
4
Order By: Relevance
“…Recently, more challenging datasets were constructed by collecting realistic videos from movies [13,12,14]. These movie scenes are taken from varying view points with complex backgrounds, in contrast of the previous public datasets [16,8].…”
Section: Previous Datasetsmentioning
confidence: 99%
“…Recently, more challenging datasets were constructed by collecting realistic videos from movies [13,12,14]. These movie scenes are taken from varying view points with complex backgrounds, in contrast of the previous public datasets [16,8].…”
Section: Previous Datasetsmentioning
confidence: 99%
“…However, this approach is only capable of classifying, rather than detecting, activities. Other approaches include filtering techniques [29] and sampling of video patches [1]. Hierarchical techniques for activity recognition have been used as well, but these typically focus on neurologically-inspired visual cortex-type models [9,32,23,28].…”
Section: Introductionmentioning
confidence: 99%
“…The most notable recent foray into sports footage in the literature was a broadcast sports dataset collected by [1]. However the dataset was a mixture of many different sports captured at highly variable angles and the taks was limited to a categorization exercise.…”
Section: Related Workmentioning
confidence: 99%