2016
DOI: 10.1007/s11263-016-0917-2
|View full text |Cite
|
Sign up to set email alerts
|

Hollywood 3D: What are the Best 3D Features for Action Recognition?

Abstract: Action recognition "in the wild" is extremely challenging, particularly when complex 3D actions are projected down to the image plane, losing a great deal of information. The recent growth of 3D data in broadcast content and commercial depth sensors, makes it possible to overcome this. However, there is little work examining the best way to exploit this new modality. In this paper we introduce the Hollywood 3D benchmark, which is the first dataset containing "in the wild" action footage including 3D data. This… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 51 publications
(54 reference statements)
0
8
0
Order By: Relevance
“…This offers the possibility of tapping into millions of high-quality images from an ever-growing library of content. We note that 3D movies have been used in related tasks in isolation [49], [50]. We will show that their full potential is unlocked by combining them with other, complementary data sources.…”
Section: D Moviesmentioning
confidence: 91%
“…This offers the possibility of tapping into millions of high-quality images from an ever-growing library of content. We note that 3D movies have been used in related tasks in isolation [49], [50]. We will show that their full potential is unlocked by combining them with other, complementary data sources.…”
Section: D Moviesmentioning
confidence: 91%
“…2D TSD 78% 3D TSD 74% 2D TSD+3D TSD 85% [20] 20.8% [24] 21.8% signed to cope specifically with random camera motions and/or rotations, as they can degrade the trajectory extraction drastically. As it can be seen, our method still yields superior results compared to the trajectory aligned descriptors proposed in [20] and reported in [24]. Our method also outperforms the method proposed by [24] in terms of accuracy.…”
Section: Methods Accuracymentioning
confidence: 99%
“…Hadfield et al [24] used 3D Hollywood movies to create a challenging stereo dataset for human activity recognition. The authors estimated the calibration information using RANSAC method and repeating the process 100 times, before selecting the best estimation.…”
Section: Background Workmentioning
confidence: 99%
“…The DiDeMo dataset [6] has been introduced for temporal localization given natural language, but has also been used for the purpose of textto-clip video retrieval [317]. Recently, the Hollywood 3D dataset was proposed [93] which contains 650 stereo clips with 14 action classes, together with stereo calibration and depth reconstruction.…”
Section: Other Datasetsmentioning
confidence: 99%