2020
DOI: 10.1609/aaai.v34i07.6792
|View full text |Cite
|
Sign up to set email alerts
|

Kinematic-Structure-Preserved Representation for Unsupervised 3D Human Pose Estimation

Abstract: Estimation of 3D human pose from monocular image has gained considerable attention, as a key step to several human-centric applications. However, generalizability of human pose estimation models developed using supervision on large-scale in-studio datasets remains questionable, as these models often perform unsatisfactorily on unseen in-the-wild environments. Though weakly-supervised models have been proposed to address this shortcoming, performance of such models relies on availability of paired supervision o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(9 citation statements)
references
References 16 publications
0
9
0
Order By: Relevance
“…It is observed that our method is comparable with the SOTA methods. In addition, although our method shows improved performance over others that use kinematic constraints (Wandt and Rosenhahn 2019;Cheng et al 2019) because of our GCNs and TCNs, adding kinematic constraints could potentially improve our performance further (Akhter and Black 2015;Kundu et al 2020). Qualitative Results As shown in Figure 3, our full model can better handle occlusions and incorrect detection compared with the baselines and the relative positions among all persons are well captured without camera parameters.…”
Section: Methodsmentioning
confidence: 88%
“…It is observed that our method is comparable with the SOTA methods. In addition, although our method shows improved performance over others that use kinematic constraints (Wandt and Rosenhahn 2019;Cheng et al 2019) because of our GCNs and TCNs, adding kinematic constraints could potentially improve our performance further (Akhter and Black 2015;Kundu et al 2020). Qualitative Results As shown in Figure 3, our full model can better handle occlusions and incorrect detection compared with the baselines and the relative positions among all persons are well captured without camera parameters.…”
Section: Methodsmentioning
confidence: 88%
“…By contrast, unsupervised learning can directly derive the properties of the data from the data itself and then summarize them, thereby enabling researchers to use these properties to make data-driven decisions. Therefore, some unsupervised-learning-based human pose estimation methods have been studied extensively [113][114][115]. Unsupervised learning usually treats pose estimation as a template matching problem that can be learned.…”
Section: Unsupervised Methodsmentioning
confidence: 99%
“…Observing that human body parts have a distinct degree of freedom (DOF) based on the kinematic structure, Wang et al [151] and Nie et al [154] proposed bidirectional networks to model the kinematic and geometric dependencies of the human skeleton. Kundu et al [152] [157] designed a kinematic structure preservation approach by inferring local-kinematic parameters with energy-based loss and explored 2D part segments based on the parent-relative local limb kinematic model. Xu et al [153] demonstrated that noisy 2D joint is one of the key obstacles for accurate 3D pose estimation.…”
Section: A Single-person 3d Hpementioning
confidence: 99%