Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00339
|View full text |Cite
|
Sign up to set email alerts
|

HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation

Abstract: Recovering whole-body mesh by inferring the abstract pose and shape parameters from visual content can obtain 3D bodies with realistic structures. However, the inferring process is highly non-linear and suffers from image-mesh misalignment, resulting in inaccurate reconstruction. In contrast, 3D keypoint estimation methods utilize the volumetric representation to achieve pixel-level accuracy but may predict unrealistic body structures. To address these issues, this paper presents a novel hybrid inverse kinemat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
100
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 257 publications
(125 citation statements)
references
References 112 publications
0
100
0
Order By: Relevance
“…the camera. To address the lack of translation, recent methods start to estimate human meshes in the camera coordinates [33,36,53,58,74,77,84,98,106,108,110]. Several approaches recover the absolute translation of the person using an optimization framework [62-64, 80, 107].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…the camera. To address the lack of translation, recent methods start to estimate human meshes in the camera coordinates [33,36,53,58,74,77,84,98,106,108,110]. Several approaches recover the absolute translation of the person using an optimization framework [62-64, 80, 107].…”
Section: Related Workmentioning
confidence: 99%
“…Some approaches approximate the depth of the person using the bounding box size [36,65,110]. HybrIK [53] and KAMA [33] employ inverse kinematics to estimate human meshes with absolute translations in the camera coordinates. Several methods directly predict the absolute depth of each person using a heatmap representation [16,115].…”
Section: Related Workmentioning
confidence: 99%
“…In terms of motion representations, human pose estimation can be divided into 2D [8,32,45,52] and 3D [39,46,65,66,68] pose estimation, which output 2D or 3D positions of the target person. Meanwhile, 3D pose estimation can be further divided into model-free and model-based methods, where the latter ones are also referred to as human mesh recovery [9,22,24,29,30,33,37].…”
Section: Human Pose Estimationmentioning
confidence: 99%
“…Given the 3D supervision, the spatial relationship can be directly learned via supervised learning [6,20,55,59]. Various representations have been proposed to effectively encode the spatial relationship such as volumetric representation [43], graph structure [4,11,71,76], transformer architecture [31,34,74], compact designs for realtime reconstruction [37,38], and inverse kinematics [30]. These supervised learning approaches that rely on the 3D ground truth supervision, however, show limited generalization to images of out-of-distribution scenes and poses due to the domain gap.…”
Section: Related Workmentioning
confidence: 99%