2016
DOI: 10.1007/978-3-319-46478-7_48
|View full text |Cite
|
Sign up to set email alerts
|

Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video

Abstract: Current approaches for activity recognition often ignore constraints on computational resources: 1) they rely on extensive feature computation to obtain rich descriptors on all frames, and 2) they assume batch-mode access to the entire test video at once. We propose a new active approach to activity recognition that prioritizes "what to compute when" in order to make timely predictions. The main idea is to learn a policy that dynamically schedules the sequence of features to compute on selected frames of a giv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
3

Relationship

3
6

Authors

Journals

citations
Cited by 32 publications
(18 citation statements)
references
References 47 publications
0
17
0
Order By: Relevance
“…Most existing work focuses on extending 2D convolution to the video domain and modeling motion information in videos [19,29,28,35,24]. Only a few methods consider efficient video classifica-tion [38,31,40,20,10]. However, these approaches perform mean-pooling of scores/features from multiple frames, either uniformly sampled or decided by an agent, to classify a video clip.…”
Section: Related Workmentioning
confidence: 99%
“…Most existing work focuses on extending 2D convolution to the video domain and modeling motion information in videos [19,29,28,35,24]. Only a few methods consider efficient video classifica-tion [38,31,40,20,10]. However, these approaches perform mean-pooling of scores/features from multiple frames, either uniformly sampled or decided by an agent, to classify a video clip.…”
Section: Related Workmentioning
confidence: 99%
“…In contrast, our policy network makes all routing decisions in a single step, resulting in lower overhead cost for the routing itself and thus larger computational savings. Reinforcement learning has also been applied for dynamic feature prioritization in images [26] and video [45,56], actively deciding which frames or image regions to visit next. These techniques could be used in tandem with our approach.…”
Section: Related Workmentioning
confidence: 99%
“…We found many research works addressing the issue of temporal action localization (Shou et al, 2016;Caba Heilbron et al, 2016;Escorcia et al, 2016;Karaman et al, 2014;Buch et al, 2017;Oneata et al, 2014;Gao et al, 2017;Su and Grauman, 2016;Sun et al, 2015;Wang et al, 2014;Yuan et al, 2015;Tran et al, 2015;Singh et al, 2016;Duchenne et al, 2009). A traditional way of performing temporal action detection is to densely apply action classifiers in a sliding window fashion (Duchenne et al, 2009).…”
Section: Temporal Action Detection and Proposalsmentioning
confidence: 99%