2012
DOI: 10.1007/978-3-642-33712-3_51
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-Temporal Phrases for Activity Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
71
1
2

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 91 publications
(74 citation statements)
references
References 22 publications
0
71
1
2
Order By: Relevance
“…Laptev et al [19] Original split 91,80% Zhang et al [31] Original split 94,00% Wang et al [24] Original split 94,20% Proposed method…”
Section: Methodsmentioning
confidence: 99%
“…Laptev et al [19] Original split 91,80% Zhang et al [31] Original split 94,00% Wang et al [24] Original split 94,20% Proposed method…”
Section: Methodsmentioning
confidence: 99%
“…In order to compensate for the loss of structures in local representations, a lot of methods try to improve local representations by exploring spatio-temporal structural information [33], including context information of each interest point [34,35], relationships between/among spatio-temporal interest points [36,37,38,39] and neighborhood-based features [40]. The relationship among visual words in the BoW model and their semantic meaning have also be explored to encode higherlevel features [15,41,42,43]. New local descriptors have also be developed [44,45] to improve the performance of local methods.…”
Section: The Bow Modelmentioning
confidence: 99%
“…Aiming to encode rich temporal ordering and spatial geometry information of local visual words, Zhang et al [41] proposed to model the mutual relationships among visual words by a novel concept named the spatio-temporal phrase (ST phrase). A ST phrase is defined as a combination of k words in a certain spatial and temporal structure including their order and relative positions.…”
Section: The Bow Modelmentioning
confidence: 99%
“…Although the advantage of these approaches that use image descriptors is that they do not require skeleton or object tracks to describe the activity observed, they are unable to take into account spatiotemporal relations between the different relevant entities in the scene, which are important elements when learning and recognising human activities [25,17]. To address this issue the concept of a "spatio-temporal phrase" that is defined as a combination of local words in a certain spatial and temporal structure, including their order and relative positions is introduced [26]. This is a very similar approach to the graphs representation described before [12][13][14], however, the spatio-temporal phrase still does not include qualitative spatial relations and also the temporal relations are much fewer than the Allen's Interval Algebra used in the graphs method.…”
Section: Related Workmentioning
confidence: 99%