2022
DOI: 10.48550/arxiv.2208.08599
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Spatial Temporal Graph Attention Network for Skeleton-Based Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 0 publications
0
5
0
Order By: Relevance
“…Recognizing that these simplistic blocks cannot adequately capture the data representation of a skeletal sequence, AutoGCN embraces the integration of prior knowledge. This algorithm aims to exploit the full potential of skeletal sequence information by defining the search space using architectural components known to be suitable for skeletal data [7], [9], [16], [21], [24], [34]. This departure from traditional methodologies acknowledges the need for a more nuanced and informed exploration of possible architectures in this domain [12], [15].…”
Section: Search Spacementioning
confidence: 99%
“…Recognizing that these simplistic blocks cannot adequately capture the data representation of a skeletal sequence, AutoGCN embraces the integration of prior knowledge. This algorithm aims to exploit the full potential of skeletal sequence information by defining the search space using architectural components known to be suitable for skeletal data [7], [9], [16], [21], [24], [34]. This departure from traditional methodologies acknowledges the need for a more nuanced and informed exploration of possible architectures in this domain [12], [15].…”
Section: Search Spacementioning
confidence: 99%
“…The STGAT is proposed by Hu et al [77] to capture short-term dependencies of spatialtemporal modeling. STGAT uses the three simple modules to reduce local spatial-temporal feature redundancy and further release the potential.…”
Section: Gcn-basedmentioning
confidence: 99%
“…Therefore, it is very popular to use pose estimation algorithms on RGB videos. Because of the simple but precise expression of motion information of skeletons and its stability to kinds of clothing textures and noisy backgrounds, there have been many methods trying to use skeleton data to solve action recognition problems [4,20,21]. Among them, the graph convolutional network (GCN) method is the most popular to deal with the graph structure of the skeletons, which treats joints as nodes and limbs as edges.…”
Section: Skeleton-based Action Recognitionmentioning
confidence: 99%
“…I3D [39] RGB 63.2 71.1 R(2+1)D-TwoStream [40] RGB, Flow -75.4 TRN [41] RGB 68.7 -TSM [42] RGB 70.6 -TSM Two-stream [42] RGB, Flow 81.2 -RSANet-R50 [38] RGB 86.4 -MARS [43] RGB,Flow -74.9 TP-ViT [11] RGB, Skeleton -80.8 STGAT [20] Skeleton…”
Section: Modality Finegym99 Kinetics400mentioning
confidence: 99%