2018
DOI: 10.1007/978-3-030-01249-6_49
|View full text |Cite
|
Sign up to set email alerts
|

Occlusion-Aware Hand Pose Estimation Using Hierarchical Mixture Density Network

Abstract: Learning and predicting the pose parameters of a 3D hand model given an image, such as locations of hand joints, is challenging due to large viewpoint changes and articulations, and severe self-occlusions exhibited particularly in egocentric views. Both feature learning and prediction modeling have been investigated to tackle the problem. Though effective, most existing discriminative methods yield a single deterministic estimation of target poses. Due to their single-value mapping intrinsic, they fail to adeq… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
47
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
4
1

Relationship

2
7

Authors

Journals

citations
Cited by 57 publications
(47 citation statements)
references
References 47 publications
(108 reference statements)
0
47
0
Order By: Relevance
“…The MDN can represent arbitrary conditional distributions by combining a conventional neural network with a mixture density model. Ye et al [26] used a hierarchical MDN to solve the occlusion problem in hand pose estimation. Inspired by the work of Ye et al, we use the MDN to solve the depth ambiguity and occlusion problem in 3D human pose estimation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The MDN can represent arbitrary conditional distributions by combining a conventional neural network with a mixture density model. Ye et al [26] used a hierarchical MDN to solve the occlusion problem in hand pose estimation. Inspired by the work of Ye et al, we use the MDN to solve the depth ambiguity and occlusion problem in 3D human pose estimation.…”
Section: Related Workmentioning
confidence: 99%
“…To this end, we introduce the mixture density networks (MDN) [3,26] to the 3D joint estimation module of the two-stage approach. Contrary to most existing works that generate a single 3D pose by minimizing the negative loglikelihood of an unimodal Gaussian, i.e.…”
Section: Introductionmentioning
confidence: 99%
“…3D hand pose and shape estimation in egocentric views has started receiving attention recently [7,22,37,51,70,72,73]. The problem is challenging with respect to thirdperson scenarios [58,74], due to self-occlusions and the limited amount of training data available [37,39,51,70]. Mueller et al [37] train CNNs on synthetic data and combine them with a generative hand model to track hands interacting with objects from egocentric RGB-D videos.…”
Section: Related Workmentioning
confidence: 99%
“…Zhou et al [40] regress a set of hand joint angles and feed the joint angles into an embedded kinematic layer to obtain the final pose. Ye et al [33] use a hierarchical mixture density network to handle the multi-modal distribution of occluded hand joints.…”
Section: Deep Learning For Hand Pose Estimationmentioning
confidence: 99%