2020 IEEE Winter Conference on Applications of Computer Vision (WACV) 2020
DOI: 10.1109/wacv45572.2020.9093639
|View full text |Cite
|
Sign up to set email alerts
|

Self-Attention Network for Skeleton-based Human Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
32
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 91 publications
(40 citation statements)
references
References 25 publications
0
32
0
Order By: Relevance
“…Self-Attention Network. There are 03 essential forms of the self-attention-network (SAN) these are SAN-Version 1 (V1), SAN-Version 2 (V2) & SAN-Version 3 (V3) [48] . The importance of the investigations are-03 SAN version, which is used for important correlations to form deep semantics, and accumulated Temporal Segment Network (TSN) thru SAN deviations by which performance can be measured.…”
Section: Motif-based Spatial Temporal Graph Convolutional Network (Motif-stgcn)mentioning
confidence: 99%
See 1 more Smart Citation
“…Self-Attention Network. There are 03 essential forms of the self-attention-network (SAN) these are SAN-Version 1 (V1), SAN-Version 2 (V2) & SAN-Version 3 (V3) [48] . The importance of the investigations are-03 SAN version, which is used for important correlations to form deep semantics, and accumulated Temporal Segment Network (TSN) thru SAN deviations by which performance can be measured.…”
Section: Motif-based Spatial Temporal Graph Convolutional Network (Motif-stgcn)mentioning
confidence: 99%
“…Figure 3. Class-score-fusion structure [48] Beyond Joints. The authors [49] worked on 03 primary factors such as joints, edges and surfaces.…”
Section: Motif-based Spatial Temporal Graph Convolutional Network (Motif-stgcn)mentioning
confidence: 99%
“…Human actions are composed of contemporary behaviors of human body parts. The objective of human action recognition is to recognize actions automatically from an unlabeled video [ 4 , 5 ]. To capture human actions, there are two broad categories of devices based on wearable sensors and video sensors.…”
Section: Introductionmentioning
confidence: 99%
“…Neural Network (RNN). Most existing CNN-based works [4,5,6,7] encode the skeleton sequence as an image with the image fed into a CNN-based model to extract features.…”
Section: Introductionmentioning
confidence: 99%