2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.161
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive RNN Tree for Large-Scale Human Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
59
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 102 publications
(59 citation statements)
references
References 24 publications
0
59
0
Order By: Relevance
“…For skeleton-based action recognition, the study of viewpoint influences is under-explored. To be view-invariant, the commonly used strategies are to perform pre-processing of skeletons [6], [17], [25], [27], [29], [38], [42], [45], [55], [62]. Unfortunately, frame-level pre-processing, where each frame is transformed to the body center with the upperbody orientation aligned, usually results in the partial loss of relative motion information.…”
Section: Viewpoints In Skeleton-based Action Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…For skeleton-based action recognition, the study of viewpoint influences is under-explored. To be view-invariant, the commonly used strategies are to perform pre-processing of skeletons [6], [17], [25], [27], [29], [38], [42], [45], [55], [62]. Unfortunately, frame-level pre-processing, where each frame is transformed to the body center with the upperbody orientation aligned, usually results in the partial loss of relative motion information.…”
Section: Viewpoints In Skeleton-based Action Recognitionmentioning
confidence: 99%
“…Similarly, Liu et al [29] use both global contextual information and local information to selectively focus on informative joints. Li et al [25] combine tree-like hierarchy RNNs with action category hierarchy to distinguish easy-tell actions in the low levels of networks and hard-tell actions in the high levels of networks. Different from the above works, we enhance the recognition performance from a new perspective.…”
Section: Rnn For Skeleton-based Action Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…The created architecture is combined with a 3-dimensional ConvNet by using a two-stream fusion of the RNN and ConvNet, with an SVM. The use of multiple recurrent networks has also been scaled to include tree structures (RNN-T) [76], to perform a hierarchical recognition process in which each RNN is responsible for learning an action instance based on an Action Category Hierarchy (ACH). This allows for the distinction between very dissimilar classes high in the hierarchy, while subtle differences between related classes such as a handshake and a fist bump are dealt with in the lower nodes.…”
Section: Recurrent Networkmentioning
confidence: 99%
“…Liu et al propose a spatio-temporal LSTM structure to explore the contextual dependency of joints in spatio-temporal domains [36]. Li et al propose an RNN tree network with a hierarchical structure which classifies the action classes that are easier to distinguish at the lower layers and the action classes that are harder to distinguish at higher layers [37]. To address the large view variation of the captured data, Zhang et al propose a view adaptive subnetwork which automatically selects the best observation viewpoints within an end-to-end network for recognition [38].…”
Section: Action Recognitionmentioning
confidence: 99%