2017
DOI: 10.1109/tpami.2016.2558148
|View full text |Cite
|
Sign up to set email alerts
|

Rank Pooling for Action Recognition

Abstract: We propose a function-based temporal pooling method that captures the latent structure of the video sequence data -e.g. how frame-level features evolve over time in a video. We show how the parameters of a function that has been fit to the video data can serve as a robust new video representation. As a specific example, we learn a pooling function via ranking machines. By learning to rank the frame-level features of a video in chronological order, we obtain a new representation that captures the video-wide tem… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
218
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 278 publications
(219 citation statements)
references
References 67 publications
1
218
0
Order By: Relevance
“…1, the dynamic image reflects different parts of the video; for instance, the rails that appear as a secondary motion contributor are superimposed on top of the horses and the jockeys who are the main actors. Such observations were also made in [7].…”
Section: Constructing Dynamic Imagessupporting
confidence: 64%
See 3 more Smart Citations
“…1, the dynamic image reflects different parts of the video; for instance, the rails that appear as a secondary motion contributor are superimposed on top of the horses and the jockeys who are the main actors. Such observations were also made in [7].…”
Section: Constructing Dynamic Imagessupporting
confidence: 64%
“…For example, the authors of [17,29,29] have shown that local motion patterns in short frame sequences can capture very well the short temporal structures in actions. The rank pooling idea, on which our dynamic images are based, was proposed in [6,7] using hand-crafted representation of the frames, while in [5] authors increase the capacity of rank pooling using a hierarchical approach.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…This dataset is meant for evaluation of HAR algorithms in real life scenarios. Many researchers have evaluated their algorithms on this dataset, the best accuracy achieved so far is 75.2% in [169] using rank pooling and CNN. Some example frames from Hollywood2 dataset are shown in Figure 15.…”
Section: Hollywood2mentioning
confidence: 99%