2016
DOI: 10.1007/978-3-319-46493-0_13
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Dynamic Parsing and Encoding for Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(13 citation statements)
references
References 32 publications
0
13
0
Order By: Relevance
“…Rank Pooling + IDT-FV [15] 66 Algorithm mAP(%) Interaction Part Mining [60] 72.4 Video Darwin [17] 72.0 Hier. Mid-Level Actions [45] 66.8 PCNN + IDT-FV [8] 71.4 GRP [6] 68.4 GRP + IDT-FV [6] 75.5 BRKP 66.3 IBKRP 68.7 IBKRP + IDT-FV 71.8 KRP-FS 70.0 KRP-FS + IDT-FV 76.1 Table 10. MPII Cooking Activities (7 splits) Algorithm Avg.…”
Section: Algorithmmentioning
confidence: 99%
“…Rank Pooling + IDT-FV [15] 66 Algorithm mAP(%) Interaction Part Mining [60] 72.4 Video Darwin [17] 72.0 Hier. Mid-Level Actions [45] 66.8 PCNN + IDT-FV [8] 71.4 GRP [6] 68.4 GRP + IDT-FV [6] 75.5 BRKP 66.3 IBKRP 68.7 IBKRP + IDT-FV 71.8 KRP-FS 70.0 KRP-FS + IDT-FV 76.1 Table 10. MPII Cooking Activities (7 splits) Algorithm Avg.…”
Section: Algorithmmentioning
confidence: 99%
“…Tran et al (2015) treated videos as cubes and performed convolutions and pooling with 3D kernels. Recent methods (Li et al, 2016;Zhu et al, 2016;Wang and Hoai, 2016;Zhang et al, 2016;Su et al, 2016;Wang et al, 2016a) emphasize on action recognition in large scale videos where the background context is also taken into account. Shahroudy et al (2016b) divided the actions into body parts and proposed a multimodal-multipart learning method to represent their dynamics and appearances.…”
Section: Ralated Workmentioning
confidence: 99%
“…On the representation learning front of our contribution, there are a few prior pooling schemes that are similar in the sense that they also use the parameters of an optimization functional as a representation. The most related work is rankpooling and its variants [22,21,20,47,4,11,53] that use a rank-SVM for capturing the video temporal evolution. Similar to ours, Cherian et al [10] propose to use a subspace to represent video sequences.…”
Section: Related Workmentioning
confidence: 99%