2021
DOI: 10.3390/s21062051

Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition

Abstract: Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from videos that work with data represented in the form of skeleton poses. These methods are based on the most widely used techniques for this problem—Graph Convolutional Networks (GCNs), Temporal Convolutional Networks (TCN…

Cited by 28 publications (23 citation statements)
References 48 publications (61 reference statements)
“…Noise usually accompanies images during acquisition or transmission, resulting in contrast reduction, color shift, and poor visual quality. The interference of noise not only contaminates the naturalness of an image, but also damages the precision of various computer vision-based applications, such as semantic segmentation [ 1 , 2 ], motion tracking [ 3 , 4 ], action recognition [ 5 , 6 ], and object detection [ 7 , 8 , 9 , 10 , 11 , 12 ], to name a few. Consequently, noise removal for these applications has attracted great interest as a preprocessing task over the last two decades.…”
Section: Introduction
confidence: 99%
“…A spatial-temporal two-stream transformer network [ 32 ] is proposed to model dependencies between joints using the Transformer self-attention operator. Additionally, some work [ 34 ] has been done to explore and compare different ways of extracting human pose features, and to extend a TCN-like unit to extract the most relevant spatial and temporal characteristics for a sequence of frames.…”
Section: Related Work
confidence: 99%
“…A TCN addresses the caveats of recurrent sequence models such as the LSTM or the Gated Recurrent Unit (GRU) when learning very long sequences [36]. Its advantages include mitigation of the vanishing/exploding-gradient problem that arises when back-propagating through time, as often encountered with the LSTM; reduced memory usage, training time, and inference time compared with traditional RNN architectures [37]; and, compared to the LSTM, fewer trainable parameters needed to store intermediate results [35]. To elaborate, the 1D convolution adopted in the TCN shares its learned filters across the entire input feature map of length l for each input channel c. This can be attributed to the parallelism of the convolution operation.…”
Section: A. Principles of Temporal Convolutional Network
confidence: 99%
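The weight sharing described in the excerpt above can be illustrated with a minimal sketch of a dilated causal 1D convolution in plain Python. This is an illustration of the general technique, not code from the cited works; the function name and signature are invented for this example:

```python
def causal_conv1d(x, kernel, dilation=1):
    """Dilated causal 1D convolution over a sequence x.

    The same kernel (shared weights) slides over the whole input,
    so the parameter count is independent of the sequence length.
    output[t] depends only on x[t], x[t-d], x[t-2d], ... -- never
    on future timesteps, which makes the convolution causal.
    """
    k = len(kernel)
    pad = (k - 1) * dilation          # left-pad so output length == input length
    padded = [0] * pad + list(x)
    out = []
    for t in range(len(x)):
        # tap positions t, t-d, t-2d, ... spaced by the dilation factor;
        # each output step is independent, so the loop parallelizes trivially
        out.append(sum(kernel[i] * padded[t + pad - i * dilation]
                       for i in range(k)))
    return out
```

For example, `causal_conv1d([1, 2, 3, 4], [1, 1])` sums each element with its predecessor, while `dilation=2` reaches two steps back, widening the receptive field without adding parameters.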
“…The TCN, initially presented by [35], addresses the above shortcomings. A TCN performs dilated, causal convolution, transforming CNNs into highly efficient, auto-regressive models, as evidenced by [35]–[37]. Unlike, e.g., the LSTM, a TCN can be trained on input sequences of arbitrary length, as the number of trainable parameters per layer depends only on the number of input features, filters, and the kernel size.…”
Section: Introduction
confidence: 99%
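The ability to cover arbitrarily long input sequences with a fixed parameter budget comes from stacking dilated layers. Assuming the common dilation-doubling schedule (1, 2, 4, …), which the excerpt does not fix, the receptive field grows exponentially with depth; the helper below is an illustrative sketch, not from the cited works:

```python
def tcn_receptive_field(kernel_size, num_layers):
    """Receptive field (in timesteps) of a stack of dilated causal
    convolution layers with dilations 1, 2, 4, ..., 2**(num_layers - 1).

    Each layer with kernel size k and dilation d extends the receptive
    field by (k - 1) * d, so the total is 1 + (k - 1) * (2**L - 1).
    """
    return 1 + (kernel_size - 1) * sum(2 ** i for i in range(num_layers))
```

So four layers with kernel size 3 already see 31 timesteps, while the per-layer parameter count stays constant regardless of how long the input sequence is.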