2018
DOI: 10.48550/arxiv.1812.05478
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Human Motion Prediction via Spatio-Temporal Inpainting

Abstract: We propose a Generative Adversarial Network (GAN) to forecast 3D human motion given a sequence of past 3D skeleton poses. While recent GANs have shown promising results, they can only forecast plausible motion over relatively short periods of time (few hundred milliseconds) and typically ignore the absolute position of the skeleton w.r.t. the camera. Our scheme provides long term predictions (two seconds or more) for both the body pose and its absolute position. Our approach builds upon three main contribution… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…More related to our work, stochastic human motion prediction methods start to gain popularity with the development of deep generative models. These methods [68,46,6,61,42,71,3] often build upon popular generative models such as conditional generative adversarial networks (CGANs; [21]) or conditional variational autoencoders (CVAEs; [38]). The aforementioned methods differ in the design of their generative models, but at test time they follow the same sampling strategy -randomly and independently sampling trajectories from the pretrained generative model without considering the correlation between samples.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…More related to our work, stochastic human motion prediction methods start to gain popularity with the development of deep generative models. These methods [68,46,6,61,42,71,3] often build upon popular generative models such as conditional generative adversarial networks (CGANs; [21]) or conditional variational autoencoders (CVAEs; [38]). The aforementioned methods differ in the design of their generative models, but at test time they follow the same sampling strategy -randomly and independently sampling trajectories from the pretrained generative model without considering the correlation between samples.…”
Section: Related Workmentioning
confidence: 99%
“…Deep generative models, e.g., variational autoencoders (VAEs) [38], are effective tools to model multi-modal data distributions. Most existing work [68,46,6,61,42,71,3] using deep generative models for human motion prediction is focused on the design of the generative model to allow it to effectively learn the data distribution. After the generative model is learned, little attention has been paid to the sampling method used to produce motion samples (predicted future motions) from the pretrained generative model (weights kept fixed).…”
Section: Introductionmentioning
confidence: 99%
“…It learns a function based on samples of input data and desired outputs and approximates a function between input and output. Researchers have used machine learning successfully for recognition of human motions/behavior [17][18][19][20][21]. In a recent study conducted by Wang et al [22], supervised machine learning is used to learn structures of different three-dimensional (3D) human poses.…”
Section: Introductionmentioning
confidence: 99%