Progressive Spatio-Temporal Graph Convolutional Network for Skeleton-Based Human Action Recognition

Heidari, Negar; Iosifidis, Alexandros

doi:10.1109/icassp39728.2021.9413860

Cited by 7 publications

(4 citation statements)

References 16 publications

“…GCN-based models for skeleton-based action recognition [15,16,18,22,23,27,28] operate on sequences of skeleton graphs. The spatio-temporal graph of skeletons G = (V, E) has the human body joint coordinates as nodes V and the spatial and temporal connections between them as edges E. Figure 2 (right) illustrates such a spatio-temporal graph where the spatial graph edges encode the human bones and the temporal edges connect the same joints in subsequent time-steps.…”

Section: A Spatio-temporal Graph Convolutional Network (mentioning)

confidence: 99%
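
As a rough illustration of the graph described in the citation statement above, the sketch below builds the node and edge sets of a spatio-temporal skeleton graph G = (V, E) in plain Python. The function name `build_st_graph` and the 5-joint toy skeleton are assumptions made for this example only; they do not come from the cited papers or from any real dataset layout.

```python
from itertools import product


def build_st_graph(bones, num_joints, num_frames):
    """Return (nodes, edges) of a spatio-temporal skeleton graph.

    Nodes are (frame, joint) pairs. Spatial edges copy the bone list into
    every frame; temporal edges link each joint to itself in the next frame.
    """
    nodes = [(t, j) for t, j in product(range(num_frames), range(num_joints))]
    spatial = [((t, a), (t, b)) for t in range(num_frames) for a, b in bones]
    temporal = [((t, j), (t + 1, j))
                for t in range(num_frames - 1)
                for j in range(num_joints)]
    return nodes, spatial + temporal


# Hypothetical toy skeleton (an assumption, not a dataset layout):
# 0 = torso, 1 = head, 2 = left hand, 3 = right hand, 4 = feet.
TOY_BONES = [(0, 1), (0, 2), (0, 3), (0, 4)]

nodes, edges = build_st_graph(TOY_BONES, num_joints=5, num_frames=3)
print(len(nodes), len(edges))  # 15 nodes; 12 spatial + 10 temporal = 22 edges
```

With 5 joints over 3 frames, the graph has 15 nodes, 4 bone edges per frame, and 5 temporal edges between each pair of consecutive frames.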

“…Unfortunately, the high computational complexity of these GCN-based methods makes them infeasible in real-time applications and resource-constrained online inference settings. Multiple approaches have been explored to increase the efficiency of skeleton-based action recognition recently: GCN-NAS [22] and PST-GCN [23] are neural architecture search based methods which try to find an optimized ST-GCN architecture to increase the efficiency of the classification task; ShiftGCN [24] replaces graph and temporal convolutions with a zero-FLOPs shift graph operation and pointwise convolutions as an efficient alternative to the feature-propagation rule for GCNs [25]; ShiftGCN++ [26] boost the efficiency of ShiftGCN further via progressive architecture search, knowledge-distillation, explicit spatial positional encodings, and a Dynamic Shift Graph Convolution; SGN [27] utilizes semantic information such as joint type and frame index as side information to design a compact semantics-guided neural network (SGN) for capturing both spatial and temporal correlations in joint and frame level; TA-GCN [28] tries to make inference more efficient by selecting a subset of key skeletons, which hold the most important features for action recognition, from a sequence to be processed by the spatio-temporal convolutions.…”

Section: Introduction (mentioning)

confidence: 99%

Online Skeleton-based Action Recognition with Continual Spatio-Temporal Graph Convolutional Networks

Hedegaard, Heidari, Iosifidis

2022

Preprint

Self Cite


Graph-based reasoning over skeleton data has emerged as a promising approach for human action recognition. However, the application of prior graph-based methods, which predominantly employ whole temporal sequences as their input, to the setting of online inference entails considerable computational redundancy. In this paper, we tackle this issue by reformulating the Spatio-Temporal Graph Convolutional Neural Network as a Continual Inference Network, which can perform step-by-step predictions in time without repeat frame processing. To evaluate our method, we create a continual version of ST-GCN, CoST-GCN, alongside two derived methods with different self-attention mechanisms, CoAGCN and CoS-TR. We investigate weight transfer strategies and architectural modifications for inference acceleration, and perform experiments on the NTU RGB+D 60, NTU RGB+D 120, and Kinetics Skeleton 400 datasets. Retaining similar predictive accuracy, we observe up to 109× reduction in time complexity, on-hardware accelerations of 26×, and reductions in maximum allocated memory of 52% during online inference.
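
The step-by-step prediction "without repeat frame processing" mentioned in this abstract can be pictured with a much simpler stand-in: a single temporal convolution that keeps only the most recent inputs in a buffer and emits one output per incoming frame. The sketch below is an illustration of that general caching idea under strong simplifications (one scalar per frame, one layer, no graph convolution); the class name `ContinualTemporalConv` is hypothetical and this is not the CoST-GCN implementation.

```python
from collections import deque


def full_temporal_conv(frames, kernel):
    """Valid 1D correlation over a whole, already-recorded sequence."""
    k = len(kernel)
    return [sum(w * x for w, x in zip(kernel, frames[i:i + k]))
            for i in range(len(frames) - k + 1)]


class ContinualTemporalConv:
    """Same operation, fed one frame at a time (hypothetical helper)."""

    def __init__(self, kernel):
        self.kernel = list(kernel)
        # Keep only the last len(kernel) frames; older ones are dropped.
        self.buffer = deque(maxlen=len(self.kernel))

    def step(self, frame):
        """Consume one new frame; return an output once the buffer is full."""
        self.buffer.append(frame)
        if len(self.buffer) < len(self.kernel):
            return None  # still warming up
        return sum(w * x for w, x in zip(self.kernel, self.buffer))


frames = [1.0, 2.0, 4.0, 7.0, 11.0]
kernel = [0.5, 0.0, -0.5]

conv = ContinualTemporalConv(kernel)
streamed = [y for y in map(conv.step, frames) if y is not None]
print(streamed == full_temporal_conv(frames, kernel))
```

Each call to `step` touches only the buffered frames, whereas re-running `full_temporal_conv` on a sliding window for every new frame would recompute outputs that were already available.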

“…Methods utilizing GCNs obviously need the movement represented as a graph. Popular encodings are spatiotemporal graphs [150, 151, 152]. Usually, the graph structure is a description of the skeleton structure, where each node represents a joint, and the edges indicate that two joints are connected by a limb.…”

Section: Machine Learning Algorithms For Human Motion Analysis (mentioning)

confidence: 99%

Using Artificial Intelligence for Assistance Systems to Bring Motor Learning Principles into Real World Motor Tasks

Vandevoorde, Vollenkemper, Schwan, et al.

2022

Sensors


Humans learn movements naturally, but it takes a lot of time and training to achieve expert performance in motor skills. In this review, we show how modern technologies can support people in learning new motor skills. First, we introduce important concepts in motor control, motor learning and motor skill learning. We also give an overview about the rapid expansion of machine learning algorithms and sensor technologies for human motion analysis. The integration between motor learning principles, machine learning algorithms and recent sensor technologies has the potential to develop AI-guided assistance systems for motor skill training. We give our perspective on this integration of different fields to transition from motor learning research in laboratory settings to real world environments and real world motor tasks and propose a stepwise approach to facilitate this transition.


“…There have been different approaches to reduce computational complexity when training deep neural networks, such as designing novel low-complexity network architectures (Kiranyaz et al., 2017; Tran et al., 2019c; Tran & Iosifidis, 2019; Tran et al., 2020; Kiranyaz et al., 2020; Heidari & Iosifidis, 2020), replacing existing ones with their low-rank counterparts (Denton et al., 2014; Jaderberg et al., 2014; Tran et al., 2018; Huang & Yu, 2018; Ruan et al., 2020), or adapting the pre-trained models to new tasks, i.e., performing Transfer Learning (TL) (Shao et al., 2014; Yang et al., 2015; Ding et al., 2016; Ding & Fu, 2018; Fons et al., 2020) or Domain Adaptation (DA) learning (Duan et al., 2012; Wang et al., 2019; Zhao et al., 2020; Hedegaard et al., 2021). Among these approaches, model adaptation is the most versatile since a method in this category is often architecture-agnostic, being complementary to other approaches.…”

Section: Introduction (mentioning)

confidence: 99%

Augmented Bilinear Network for Incremental Multi-Stock Time-Series Classification

Shabani, Tran, Kanniainen, et al.

2022

Preprint


Deep Learning models have become dominant in tackling financial time-series analysis problems, overturning conventional machine learning and statistical methods. Most often, a model trained for one market or security cannot be directly applied to another market or security due to differences inherent in the market conditions. In addition, as the market evolves through time, it is necessary to update the existing models or train new ones when new data is made available. This scenario, which is inherent in most financial forecasting applications, naturally raises the following research question: How to efficiently adapt a pre-trained model to a new set of data while retaining performance on the old data, especially when the old data is not accessible? In this paper, we propose a method to efficiently retain the knowledge available in a neural network pretrained on a set of securities and adapt it to achieve high performance in new ones. In our method, the prior knowledge encoded in a pre-trained neural network is maintained by keeping existing connections fixed, and this knowledge is adjusted for the new securities by a set of augmented connections, which are optimized using the new data. The auxiliary connections are constrained to be of low rank. This not only allows us to rapidly optimize for the new task but also reduces the storage and run-time complexity during the deployment phase. The efficiency of our approach is empirically validated in the stock midprice movement prediction problem using a large-scale limit order book dataset. Experimental results show that our approach enhances prediction performance as well as reduces the overall number of network parameters.

