2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00229

Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes: The Importance of Multiple Scene Constraints

Cited by 295 publications (205 citation statements)
References 29 publications
“…In [20], a single image and corresponding landmarks are used to look up a similar human pose using a kd-tree containing about 4 million examples. A method intended for multi-instance model fitting from a single image is described in [21].…”
Section: Closely Related Work (mentioning)
confidence: 99%
“…Our method, on the other hand, avoids the use of such datasets and relies on 3D data only for the single person case. Further, Zanfir et al. [37] proposed a large-scale human sensing system for multiple people that estimates pose and shape using the top-down approach of person detection followed by pose estimation for each person. Recently, Zanfir et al. [38] proposed MubyNet, a bottom-up approach that performs joint association by formulating it as a binary integer programming problem.…”
Section: Related Work (mentioning)
confidence: 99%
“…Unfortunately, the run-time of this approach is likely to increase linearly with the number of people in the scene, making it inefficient for analysis in crowded scenes. Additionally, most existing multi-person pose estimation methods [27,20,28], with the exception of [37], estimate 3D pose configuration only relative to the root joint. However, relative spatial ordering of different people in the scene is also needed to facilitate reasoning about human interactions and provide a better understanding of the scene.…”
Section: Introduction (mentioning)
confidence: 99%
“…In [10], the authors predict a full body mesh using the skinned multi-person linear (SMPL) model [25]. First they predict an initial pose with the DMHS detector [26] and refine the prediction using multiple constraints, including reprojection error, a semantic loss involving body part segmentation and matching to ground plane.…”
Section: Https://githubcom/vegesm/depthpose (mentioning)
confidence: 99%
“…For example, detecting hand-shakes, object manipulation and passing all require more information than the root-relative pose. To our knowledge, the only solution for absolute pose estimation is finding an optimal translation vector that minimizes the reprojection error [9], [10]. The search for the optimal translation is performed as a…”
Section: Introduction (mentioning)
confidence: 99%