Proceedings IEEE Workshop on Detection and Recognition of Events in Video
DOI: 10.1109/event.2001.938868

Recognizing action events from multiple viewpoints

Abstract: A first step towards an understanding of the semantic content in a video is the reliable detection and recognition of actions performed by objects. This is a difficult problem due to the enormous variability in an action's appearance when seen from different viewpoints and/or at different times. In this paper we address the recognition of actions by taking a novel approach that models actions as special types of 3D objects. Specifically, we observe that any action can be represented as a generalized cylinder,…

Cited by 74 publications
(49 citation statements)
References 17 publications
“…In addition to this viewpoint change, other factors that make the problem even more challenging are the perspective or affine distortions (depending on the model used), anthropometric variations, or the speed at which the action is performed. Therefore, to make the problem more tractable, various simplifications or restricted special cases have been considered over the years [3,4,5,6,7,8,9]. We aim at alleviating such constraints.…”
Section: Introduction (mentioning)
confidence: 99%
“…The reconstructions of 3-D plans for events or actions have been explored [103], [136]. We have mostly reviewed event detection from 2-D image sequences.…”
Section: Discussion (mentioning)
confidence: 99%
“…For example, Bobick & Davis (2001) proposed to capture the history of shape changes using temporal templates and Weinland et al (2006) extend these 2D templates to 3D action templates. Similarly, based on silhouettes, notions of action cylinders Syeda-Mahmood et al (2001), and space-time shapes Yilmaz & Shah (2005a); Gorelick et al (2007) have also been introduced. Recently, researchers have started analyzing video sequences as space-time volumes, built by various local features, such as intensities, gradients, optical flow etc Fathi & Mori (2008); Jhuang et al (2007); Filipovych & Ribeiro (2008).…”
Section: Introduction (mentioning)
confidence: 99%