Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/99
A Deep Bi-directional Attention Network for Human Motion Recovery

Abstract: Human motion capture (mocap) data, which record the movement of markers attached to specific joints, have gradually become the most popular solution for animation production. However, the raw motion data are often corrupted by joint occlusion, marker shedding, and limited equipment precision, which severely limits performance in real-world applications. Since human motion is essentially sequential data, the latest methods resort to variants of the long short-term memory network (LSTM) to solve related prob…
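The abstract describes recovering corrupted mocap frames by exploiting the sequence in both temporal directions. As a minimal illustration of that idea only — not the paper's network — the sketch below fills a missing joint value for one channel by averaging the nearest observed values looking backward and forward in time; the function name and interface are hypothetical:

```python
def bidirectional_fill(seq, observed):
    """Toy bi-directional recovery for one joint channel.

    seq:      list of floats (one coordinate of one joint over time)
    observed: list of bools, True where the frame was captured cleanly

    Missing entries are replaced by the mean of the nearest observed
    values in the past and in the future -- a hand-written stand-in
    for the learned bi-directional model described in the paper.
    """
    out = list(seq)
    obs_idx = [t for t, ok in enumerate(observed) if ok]
    for t in range(len(seq)):
        if observed[t]:
            continue
        past = [i for i in obs_idx if i < t]    # nearest clean frame before t
        future = [i for i in obs_idx if i > t]  # nearest clean frame after t
        vals = []
        if past:
            vals.append(seq[past[-1]])
        if future:
            vals.append(seq[future[0]])
        out[t] = sum(vals) / len(vals) if vals else 0.0
    return out
```

For a linearly moving joint with one dropped frame, the two-sided average recovers the true value exactly; a learned model generalizes this to nonlinear motion.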

Cited by 22 publications (23 citation statements). References 9 publications.
“…They are designed for specific problems and usually cannot solve other tasks without retraining mapping functions even if the problems are similar. For example, training a network for random joint missing samples is not easily re-applied to continuous joint corruption scenarios or to solve motion gaps [10]. Deep learning-based approaches typically generate the trained model from the training database.…”
Section: Preliminaries 2.1 Signal Inverse Problem
confidence: 99%
“…Nowadays, Deep Neural Networks (DNNs) have been widely exploited in various tasks related to human motion [3,7,26]. They are committed to training an optimal model from a vast amount of training samples, and then using this trained model to repair damaged sequences at testing time [10]. Despite remarkable achievements, however, training data can hardly cover all action types due to the non-enumerability of human motion.…”
Section: Introduction
confidence: 99%
“…Mall et al. [11] trained a set of filters using a deep, bidirectional, recurrent framework for clean, noisy and incomplete mocap data. In 2019, Cui et al. [44] proposed a bidirectional attention network for missing data recovery, and their embedded attention mechanism can decide where to borrow information from and use this information to recover corrupted frames. The above deep-learning-based methods are action agnostic but noise specific, i.e., these methods can be trained by large-scale data with a specified type of noise (such as Gaussian noise or missing data) and a heterogeneous mix of action types, and the network can refine any action with that noise type.…”
Section: Refinement Neural Network
confidence: 99%
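The statement above notes that the embedded attention mechanism "can decide where to borrow information from." As a hedged, self-contained illustration of that general idea — using a hypothetical scoring interface, not the paper's architecture — the sketch below combines observed frame values with softmax weights over similarity scores, so frames judged more relevant contribute more to the recovered value:

```python
import math

def attention_recover(values, scores):
    """Toy soft attention over observed frames.

    values: list of floats, candidate values borrowed from clean frames
    scores: list of floats, similarity of each clean frame to the
            corrupted one (how the scores are produced is left abstract)

    Returns the softmax-weighted combination of the candidate values,
    illustrating how attention "decides where to borrow from".
    """
    m = max(scores)                                # subtract max for stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    return sum(w * v for w, v in zip(weights, values))
```

With equal scores the result reduces to a plain average; sharper score differences concentrate the weight on the most relevant frames.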
“…Tang et al. 17 introduced an RNN-based motion prediction system by analyzing the observed motion sequences. Cui et al. 7 proposed a bidirectional RNN with an attention mechanism to accurately infer the missing joints. In contrast to ours, unsupervised motion retargeting systems have been proposed in References 18,19.…”
Section: Related Work
confidence: 99%
“…Against this backdrop, based on deep learning frameworks that can efficiently handle large amounts of motion data, 5‐7 here, we propose a novel deep autoencoder that combines the deep convolutional inverse graphics network (DC‐IGN) 8 and U‐Net 9 shown in Figure 1 to efficiently retarget human motion. Our approach handles various types of motions to be retargeted and is sufficiently fast to be used as a real‐time application.…”
Section: Introduction
confidence: 99%