2021
DOI: 10.48550/arxiv.2109.05776
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning to Predict Diverse Human Motions from a Single Image via Mixture Density Networks

Abstract: Human motion prediction, which plays a key role in computer vision, generally requires a past motion sequence as input. However, in real applications, a complete and correct past motion sequence can be too expensive to achieve. In this paper, we propose a novel approach to predict future human motions from a much weaker condition, i.e., a single image, with mixture density networks (MDN) modeling. Contrary to most existing deep human motion prediction approaches, the multimodal nature of MDN enables the genera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 49 publications
0
0
0
Order By: Relevance
“…Motion prediction is a regression task, where a series of coordinates that correspond to an agent's future pose or location are predicted using their past pose or location, sometimes in combination with other features like ego video (e.g., Adeli et al, 2021), maps (e.g., Salzmann et al, 2021), head orientation (e.g., Haddad & Lam, 2021), body positioning (e.g., , GPS location, (e.g., Sadeghian et al, 2018b), and/or extracted visual features from cropped images (e.g., Haddad & Lam, 2021) of the agents in the scene. Multimodality in motion prediction is a large area of interest in both trajectory (e.g., Dong et al, 2021;Kosaraju et al, 2019;Gu et al, 2022) and pose (e.g., Fragkiadaki et al, 2017;Gu et al, 2021;Yan et al, 2018) 2020) and Korbmacher and Tordeux (2021).…”
Section: Motion Predictionmentioning
confidence: 99%
“…Motion prediction is a regression task, where a series of coordinates that correspond to an agent's future pose or location are predicted using their past pose or location, sometimes in combination with other features like ego video (e.g., Adeli et al, 2021), maps (e.g., Salzmann et al, 2021), head orientation (e.g., Haddad & Lam, 2021), body positioning (e.g., , GPS location, (e.g., Sadeghian et al, 2018b), and/or extracted visual features from cropped images (e.g., Haddad & Lam, 2021) of the agents in the scene. Multimodality in motion prediction is a large area of interest in both trajectory (e.g., Dong et al, 2021;Kosaraju et al, 2019;Gu et al, 2022) and pose (e.g., Fragkiadaki et al, 2017;Gu et al, 2021;Yan et al, 2018) 2020) and Korbmacher and Tordeux (2021).…”
Section: Motion Predictionmentioning
confidence: 99%