2012
DOI: 10.1007/978-3-642-33863-2_30

Spatio-temporal SIFT and Its Application to Human Action Classification

Abstract: This paper presents a space-time extension of scale-invariant feature transform (SIFT) originally applied to the 2-dimensional (2D) volumetric images. Most of the previous extensions dealt with 3-dimensional (3D) spatial information using a combination of a 2D detector and a 3D descriptor for applications such as medical image analysis. In this work we build a spatio-temporal difference-of-Gaussian (DoG) pyramid to detect the local extrema, aiming at processing video streams. Interest points are extr…
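The detection step named in the abstract (a spatio-temporal DoG followed by a local-extrema search) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the default sigmas, the scale factor `k`, the contrast threshold, and the use of a single octave without downsampling are all assumptions made for illustration. The video is assumed to be a float array indexed as (t, y, x).

```python
import numpy as np


def gaussian_kernel(sigma):
    """1D Gaussian kernel truncated at about 3 sigma and normalised to sum to 1."""
    radius = max(1, int(3 * sigma + 0.5))
    x = np.arange(-radius, radius + 1)
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()


def blur_axis(volume, sigma, axis):
    """Blur along one axis with edge padding so the output keeps its shape."""
    k = gaussian_kernel(sigma)
    pad = len(k) // 2

    def conv(v):
        return np.convolve(np.pad(v, pad, mode="edge"), k, mode="valid")

    return np.apply_along_axis(conv, axis, volume)


def blur_xyt(video, sigma_space, sigma_time):
    """Separable Gaussian blur over time (axis 0) and space (axes 1, 2)."""
    out = blur_axis(video, sigma_time, 0)
    out = blur_axis(out, sigma_space, 1)
    return blur_axis(out, sigma_space, 2)


def dog_stack(video, sigma_space=1.6, sigma_time=1.6, k=2 ** 0.5, levels=4):
    """Differences of successively blurred volumes; shape (levels - 1, T, H, W)."""
    blurred = [
        blur_xyt(video, sigma_space * k ** i, sigma_time * k ** i)
        for i in range(levels)
    ]
    return np.stack([b1 - b0 for b0, b1 in zip(blurred, blurred[1:])])


def local_extrema(dog, threshold=0.01):
    """Return (scale, t, y, x) of strict extrema over the 3x3x3x3 neighbourhood."""
    points = []
    n_scales, n_t, n_y, n_x = dog.shape
    for s in range(1, n_scales - 1):
        for t in range(1, n_t - 1):
            for y in range(1, n_y - 1):
                for x in range(1, n_x - 1):
                    v = dog[s, t, y, x]
                    if abs(v) < threshold:
                        continue  # skip low-contrast responses
                    patch = dog[s - 1:s + 2, t - 1:t + 2, y - 1:y + 2, x - 1:x + 2]
                    is_extreme = v == patch.max() or v == patch.min()
                    if is_extreme and (patch == v).sum() == 1:
                        points.append((s, t, y, x))
    return points
```

Treating time as a third blur axis with its own sigma is the simplest way to move from an image DoG to a video DoG; a full pyramid would also downsample between octaves, which this sketch omits.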

Citations: cited by 20 publications (15 citation statements)
References: 18 publications (33 reference statements)
“…From a computational cost point of view, SGSH

Method (on KTH)            Accuracy (%)
Al Ghamdi et al [27]       90.7
Liu et al [28]             91.3
Iosifidis et al [29]       92.1
Baumann et al [30]         92.1
Kläser [31]                92.6
Ji et al [32]              93.1
Wang et al [33]            94.2
Wu et al [34]              94.5
Raptis and Soatto [35]     94.8
Zhang et al [5]            94.8
Wang et al [21]            95.0
Yuan et al [4]             95.4
SGSH                       97.2

Table 3 Recognition accuracy comparisons on the UCF-Sports dataset…”
Section: Running Time (mentioning)
confidence: 99%
“…Another popular 3D feature description methodology is based on regular polyhedrons [1,7,11,13,25]. This technique approximates the orientation space by a regular polyhedron with congruent faces that are regular polygons, each of which serves as a bin.…”
Section: Description of 3D Features (mentioning)
confidence: 99%
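
The polyhedron-based binning described in the statement above can be sketched as follows. This is a simplified illustration, not the method of any cited paper: it assumes an icosahedron (20 faces), hard assignment of each gradient to the face whose normal is most aligned with it, and magnitude-weighted votes. The function names are hypothetical, and gradients are assumed to be (gx, gy, gt) triples.

```python
import itertools

import numpy as np

PHI = (1 + 5 ** 0.5) / 2  # golden ratio


def icosahedron_face_normals():
    """Unit normals of the 20 icosahedron faces.

    The face centres of a regular icosahedron coincide with the vertices of
    its dual, the regular dodecahedron, whose coordinates are used here.
    """
    pts = list(itertools.product((-1.0, 1.0), repeat=3))
    for a, b in itertools.product((-1 / PHI, 1 / PHI), (-PHI, PHI)):
        pts += [(0.0, a, b), (a, b, 0.0), (b, 0.0, a)]
    normals = np.array(pts)
    return normals / np.linalg.norm(normals, axis=1, keepdims=True)


def orientation_histogram(gradients, normals):
    """Vote each 3D gradient into the bin of its best-aligned face normal."""
    gradients = np.asarray(gradients, dtype=float).reshape(-1, 3)
    magnitudes = np.linalg.norm(gradients, axis=1)
    bins = np.argmax(gradients @ normals.T, axis=1)  # largest dot product wins
    hist = np.bincount(bins, weights=magnitudes, minlength=len(normals))
    total = hist.sum()
    return hist / total if total > 0 else hist
```

Because the faces are congruent, every bin covers the same solid angle, which avoids the uneven bin sizes near the poles that a latitude/longitude quantisation of orientations would produce.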
“…• 3D spatio-temporal features computed in xyt spatiotemporal space using a temporal sequence of images, including 3D SIFT [18,23], ST-SIFT [1], HOG3D [13], CHOG3D [11], 3D optical flow [10], etc.…”
Section: 3D Features for Action Recognition (mentioning)
confidence: 99%