2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.51
|View full text |Cite
|
Sign up to set email alerts
|

Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach

Abstract: In this paper, we study the task of 3D human pose estimation in the wild. This task is challenging due to lack of training data, as existing datasets are either in the wild images with 2D pose or in the lab images with 3D pose.We propose a weakly-supervised transfer learning method that uses mixed 2D and 3D labels in a unified deep neutral network that presents two-stage cascaded structure. Our network augments a state-of-the-art 2D pose estimation sub-network with a 3D depth regression sub-network. Unlike pre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
578
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
3

Relationship

1
9

Authors

Journals

citations
Cited by 577 publications
(592 citation statements)
references
References 39 publications
5
578
0
Order By: Relevance
“…To leverage such cross-domain shared knowledge, domain adaptation [49,15] has been widely studied on different tasks, such as detection [7,26], classification [19,21,17,16], segmentation [59,54,16] and pose estimation [57,46]. But in previous works about keypoint detection or pose estimation [9,57,53,56], source domain and target domain face much slighter domain shift than when transferring from human dataset to animals or among different animal species. Besides, some extra information might be available for easier knowledge transfer, such as view consistency [57], attribute attached to samples [17] or morphological similarity [53,56].…”
Section: Related Workmentioning
confidence: 99%
“…To leverage such cross-domain shared knowledge, domain adaptation [49,15] has been widely studied on different tasks, such as detection [7,26], classification [19,21,17,16], segmentation [59,54,16] and pose estimation [57,46]. But in previous works about keypoint detection or pose estimation [9,57,53,56], source domain and target domain face much slighter domain shift than when transferring from human dataset to animals or among different animal species. Besides, some extra information might be available for easier knowledge transfer, such as view consistency [57], attribute attached to samples [17] or morphological similarity [53,56].…”
Section: Related Workmentioning
confidence: 99%
“…We compare our method with both task-oriented 3D pose state of the art [50,62,34,36,37,9,48,60,16] and four parametric body model based estimators [26,22,42,39]. We set up two baselines to validate the effectiveness of two key components in the proposed framework: renderand-compare and joint learning with synthetic data.…”
Section: Pose Estimationmentioning
confidence: 99%
“…Weak supervision provided by 2D annotations is typical, with different works employing 2D keypoints, silhouettes and semantic parts [18,31,35,46]. Non-parametric approaches typically use extra supervision from 2D keypoint annotation [12,45,55], while some recent works leverage ordinal depth relations of the joints [33,41]. Multi-view consistency is also well explored as discussed earlier [21,34,38,39,44].…”
Section: Related Workmentioning
confidence: 99%