2019 IEEE Winter Conference on Applications of Computer Vision (WACV) 2019
DOI: 10.1109/wacv.2019.00160
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Feature Learning of Human Actions As Trajectories in Pose Embedding Manifold

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(19 citation statements)
references
References 17 publications
0
19
0
Order By: Relevance
“…Considering the Se-BiReNet and all MLP layers used in our learning architecture, the learnable parameters in our method is about 0.27 million. As some details missed in several works, we can only estimate the lowest number of parameters in those methods, such as EnGAN-PoseRNN [8] and AGC-LSTM [20]. It can be seen that our method achieves a competitive result with the least parameters, which also shows the efficiency of our method from another perspective.…”
Section: Unsupervisedmentioning
confidence: 91%
See 1 more Smart Citation
“…Considering the Se-BiReNet and all MLP layers used in our learning architecture, the learnable parameters in our method is about 0.27 million. As some details missed in several works, we can only estimate the lowest number of parameters in those methods, such as EnGAN-PoseRNN [8] and AGC-LSTM [20]. It can be seen that our method achieves a competitive result with the least parameters, which also shows the efficiency of our method from another perspective.…”
Section: Unsupervisedmentioning
confidence: 91%
“…LongT GAN [35] 48.1* 40.18M EnGAN-PoseRNN [8] 77.8 >0.7M Ours (1-layer LSTM) 79.71 0.27M RGB+D dataset. Among those unsupervised methods on N-UCLA dataset, our method achieves the best performance with an increment of 18% compared to the work of [9].…”
Section: Unsupervisedmentioning
confidence: 99%
“…al. [25] learn the action sequence as a trajectory in the pose manifold for the downstream activity classification task. Caetano et al [26] use CNN-based feature representation over a temporal window containing skeleton dynamics.…”
Section: Related Workmentioning
confidence: 99%
“…It impels the exploration of learning skeleton-based action representation in an unsupervised manner [15,24,30,14]. Often unsupervised methods use pretext tasks to generate the supervision signals, such as reconstruction [7,44], autoregression [12,30] and jigsaw puzzles [22,36]. Consequently, the learning highly relies on the quality of the designed pretext tasks, and those tasks are hard to be generalized for different downstream tasks.…”
Section: Introductionmentioning
confidence: 99%