Online Skeleton-based Action Recognition with Continual Spatio-Temporal Graph Convolutional Networks

Hedegaard, Lukas; Heidari, Negar; Iosifidis, Alexandros

doi:10.48550/arxiv.2203.11009

Cited by 2 publications

(5 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Through a reformulation of the 3D convolution to compute inputs step-by-step rather than spatio-temporally, well-performing 3D CNNs such as X3D (Feichtenhofer, 2020), Slow (Feichtenhofer et al, 2019, and I3D (Carreira & Zisserman, 2017) trained for Trimmed Activity Recognition were re-implemented to execute step-by-step without any re-training. Likewise, Spatio-temporal Graph Convolutional Networks for Skeleton-based Action Recognition (Yan et al, 2018;Shi et al, 2019;Plizzari et al, 2021), which originally operated only on batches, were recently transformed to perform step-wise inference as well though a continual formulation of their Spatiotemporal Graph Convolution blocks (Hedegaard et al, 2022b).…”

Section: Continual Inference Networkmentioning

confidence: 99%

“…In general, non-continual networks, which are transformed to continual ones attain reductions in per-step computational complexity in proportion to the temporal receptive field of the network. In some cases, these savings can amount to multiple orders of magnitude (Hedegaard et al, 2022b). Still, the implementation of Continual Inference Networks with temporal convolutions and Multi-head Attention in frameworks such as PyTorch (Paszke et al, 2019) requires deep knowledge and practical experience with CINs.…”

Section: Continual Inference Networkmentioning

confidence: 99%

“…accuracy was improved by increasing model receptive fields through expansions of temporal global average pooling to 64 steps, and in Hedegaard et al (2022b), the stride of temporal convolutions was reduced to one to increase prediction rates. Sec.…”

Section: Residualmentioning

confidence: 99%

“…The noted metrics were originally presented inHedegaard & Iosifidis (2021);Hedegaard et al (2022a,b). and CoS-TR for Skeleton-based Action Recognition inHedegaard et al (2022b). While direct conversion from regular to continual versions of the above noted architectures works well in accelerating inference in itself, further improvements can be achieved by exploiting some core characteristics of CINs: in Hedegaard & Iosifidis (2021),…”

mentioning

confidence: 99%

See 3 more Smart Citations

Continual Inference: A Library for Efficient Online Inference with Deep Neural Networks in PyTorch

Hedegaard¹,

Iosifidis²

2022

Preprint

Self Cite

View full text Add to dashboard Cite

We present Continual Inference, a Python library for implementing Continual Inference Networks (CINs) in PyTorch, a class of Neural Networks designed specifically for efficient inference in both online and batch processing scenarios. We offer a comprehensive introduction and guide to CINs and their implementation in practice, and provide best-practices and code examples for composing complex modules for modern Deep Learning. Continual Inference is readily downloadable via the Python Package Index and at www.github.com/lukashedegaard/continual-inference.

show abstract

Section: Continual Inference Networkmentioning

confidence: 99%

Section: Continual Inference Networkmentioning

confidence: 99%

Section: Residualmentioning

confidence: 99%

mentioning

confidence: 99%

See 2 more Smart Citations

Continual Inference: A Library for Efficient Online Inference with Deep Neural Networks in PyTorch

Hedegaard¹,

Iosifidis²

2022

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Heidari and Iosifidis 20 proposed the TA-GCN model to select the key bones most conducive to activity recognition to perform spatiotemporal convolution operations on the skeleton sequence. In order to realize online human behavior recognition, Hedegaard and Heidari et al 21 proposed Continual Spatio-Temporal Graph Convolutional Network (CoST-GCN), which reorganized the spatio-temporal graph convolutional neural network as a continuous inference network, which was processed without frame repetition. In this case, the step-by-step prediction function on the time axis is implemented.…”

Section: Introductionmentioning

confidence: 99%

Skeleton based human action recognition with relative position encoding

Xuan

Wang

et al. 2022

Third International Conference on Computer Science and Communication Technology (ICCSCT 2022)

View full text Add to dashboard Cite

Aiming at the fact that the current graph convolution operation based on skeleton graph is limited in local adjacent nodes, or the overall relative position information of skeleton is omitted, an enhancement method of joint point position information based on relative position encoding of skeleton is proposed. The proposed method takes the central joint point of the human trunk as the root node, and all joint nodes form a tree structure according to the natural connection of the body, and the code of each joint node inherits the code of its parent node and also includes its own number in the sibling node. In addition, considering that the number of channels in the graph-based convolutional network model is generally larger, and the channel information itself has a strong correlation, the channel information frequency division and recombination operation is proposed to reflect the difference of information in different frequency bands in the channel. Experiments show that the proposed method has a certain effect on improving the effect of the embedded model.

show abstract

Online Skeleton-based Action Recognition with Continual Spatio-Temporal Graph Convolutional Networks

Cited by 2 publications

References 26 publications

Continual Inference: A Library for Efficient Online Inference with Deep Neural Networks in PyTorch

Continual Inference: A Library for Efficient Online Inference with Deep Neural Networks in PyTorch

Skeleton based human action recognition with relative position encoding

Contact Info

Product

Resources

About