2018
DOI: 10.1609/aaai.v32i1.11853
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Representation Learning With Long-Term Dynamics for Skeleton Based Action Recognition

Abstract: In recent years, skeleton based action recognition is becoming an increasingly attractive alternative to existing video-based approaches, beneficial from its robust and comprehensive 3D information. In this paper, we explore an unsupervised representation learning approach for the first time to capture the long-term global motion dynamics in skeleton sequences. We design a conditional skeleton inpainting architecture for learning a fixed-dimensional representation, guided by additional adversarial training str… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 92 publications
(52 citation statements)
references
References 23 publications
(24 reference statements)
0
52
0
Order By: Relevance
“…Linear Evaluation Results on NTU-60. As shown in Table 4, for a single stream (i.e., joint stream), our AimCLR outperforms all other methods (Zheng et al 2018;Lin et al 2020;Rao et al 2021;Su, Liu, and Shlizerman 2020;Nie, Liu, and Liu 2020;Li et al 2021). For the performance of the Table 8: Finetuned results on NTU-60 and NTU-120 dataset. "…”
Section: Comparison With State-of-the-artmentioning
confidence: 88%
See 2 more Smart Citations
“…Linear Evaluation Results on NTU-60. As shown in Table 4, for a single stream (i.e., joint stream), our AimCLR outperforms all other methods (Zheng et al 2018;Lin et al 2020;Rao et al 2021;Su, Liu, and Shlizerman 2020;Nie, Liu, and Liu 2020;Li et al 2021). For the performance of the Table 8: Finetuned results on NTU-60 and NTU-120 dataset. "…”
Section: Comparison With State-of-the-artmentioning
confidence: 88%
“…Self-supervised Skeleton-based Action Recognition. LongT GAN (Zheng et al 2018) proposes to use the encoder-decoder to regenerate the input sequence to obtain useful feature representation. P&C (Su, Liu, and Shlizerman 2020) proposes a training strategy to weaken the decoder, forcing the encoder to learn more discriminative features.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…To classify activities, encoded features were given to the KNN classifier. Zheng et al [33] extracted deep features for classifying actions. They introduced a GAN autoencoder and extracted the dynamic motions from skeleton frames.…”
Section: Human Activity Recognition and Discoverymentioning
confidence: 99%
“…It impels the exploration of learning skeleton-based action representation in an unsupervised manner [15,24,30,14]. Often unsupervised methods use pretext tasks to generate the supervision signals, such as reconstruction [7,44], autoregression [12,30] and jigsaw puzzles [22,36]. Consequently, the learning highly relies on the quality of the designed pretext tasks, and those tasks are hard to be generalized for different downstream tasks.…”
Section: Introductionmentioning
confidence: 99%