Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475307
|View full text |Cite
|
Sign up to set email alerts
|

Skeleton-Contrastive 3D Action Representation Learning

Abstract: This paper strives for self-supervised learning of a feature space suitable for skeleton-based action recognition. Our proposal is built upon learning invariances to input skeleton representations and various skeleton augmentations via a noise contrastive estimation. In particular, we propose inter-skeleton contrastive learning, which learns from multiple different input skeleton representations in a cross-contrastive manner. In addition, we contribute several skeleton-specific spatial and temporal augmentatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 77 publications
(39 citation statements)
references
References 46 publications
0
18
0
Order By: Relevance
“…3S means the ensemble results of joint, bone and motion data. The obvious performance improvement compared with the recent advanced unsupervised counterparts [14,33] has been obtained and demonstrates the effectiveness of CPM. In addition, CPM (3S) outperforms the supervised ST-GCN [39] on both NTU and PKU-MMD datasets.…”
Section: Results and Comparisonmentioning
confidence: 73%
See 1 more Smart Citation
“…3S means the ensemble results of joint, bone and motion data. The obvious performance improvement compared with the recent advanced unsupervised counterparts [14,33] has been obtained and demonstrates the effectiveness of CPM. In addition, CPM (3S) outperforms the supervised ST-GCN [39] on both NTU and PKU-MMD datasets.…”
Section: Results and Comparisonmentioning
confidence: 73%
“…The results have shown the proposed CPM performs significantly better than the compared methods. Compared with MS 2 L [15] and ISC [33], CPM improves the performance by a large margin and shows its robustness when fewer labels are available for fine-tuning.…”
Section: Architecturesmentioning
confidence: 99%
“…X-Set(%) X-Sub(%) LongT GAN [43] 39.7 35.6 PCRP [45] 45.1 41.7 AS-CAL [46] 49.2 48.6 CRRL [48] 57.0 56.2 3s-CrossSCLR [47] 66.7 67.9 3s-AimCLR [49] 68.8 68.2 ISC [58] 67.1 67.9 BRL [57] 79.2 77.1 ConGT 80.5 78.6…”
Section: Methodsmentioning
confidence: 99%
“…Yang et al (2021b) design a novel skeleton cloud colorization technique to learn skeleton representations. AS-CAL (Rao et al 2021) and SkeletonCLR (Li et al 2021) use momentum encoder for contrastive learning with single-stream skeleton sequence while CrosSCLR (Li et al 2021) proposes cross-stream knowledge mining strategy to improve the performance and ISC (Thoker, Doughty, and Snoek 2021) proposes inter-skeleton contrastive learning to learn from multiple different input skeleton representations. In order to learn more general features, MS 2 L (Lin et al 2020) introduces multiple self-supervised tasks to learn more general representations.…”
Section: Related Workmentioning
confidence: 99%