2012
DOI: 10.1007/s11263-012-0557-0
|View full text |Cite
|
Sign up to set email alerts
|

Using Segmented 3D Point Clouds for Accurate Likelihood Approximation in Human Pose Tracking

Abstract: The observation likelihood approximation is a central problem in stochastic human pose tracking. In this paper, we present a new approach to quantify the correspondence between hypothetical and observed human poses in depth images. Our approach is based on segmented point clouds, enabling accurate approximations even under selfocclusion and in the absence of color or texture cues. The segmentation step extracts small regions of high saliency such as hands or arms and ensures that the information contained in t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 17 publications
(24 reference statements)
0
5
0
Order By: Relevance
“…Human point cloud segmentation is typically challenging and difficult to achieve correct approximations, especially when there is self-occlusion and no color or texture signals. Lehment et al's suggested segmentation model [16] attempted to address this issue by extracting tiny regions of high saliency, such as the hands or arms, and ensuring that the information contained in these regions is not overshadowed by larger, less salient parts, such as the chest. The use of a deep recurrent hierarchical network can also provide more flexibility by minimizing or eliminating posture detection issues caused by a limited visibility human torso in the frame, also known as the frame occlusion problem, which occurs when only a portion of the body is visible [17].…”
Section: Related Workmentioning
confidence: 99%
“…Human point cloud segmentation is typically challenging and difficult to achieve correct approximations, especially when there is self-occlusion and no color or texture signals. Lehment et al's suggested segmentation model [16] attempted to address this issue by extracting tiny regions of high saliency, such as the hands or arms, and ensuring that the information contained in these regions is not overshadowed by larger, less salient parts, such as the chest. The use of a deep recurrent hierarchical network can also provide more flexibility by minimizing or eliminating posture detection issues caused by a limited visibility human torso in the frame, also known as the frame occlusion problem, which occurs when only a portion of the body is visible [17].…”
Section: Related Workmentioning
confidence: 99%
“…Fouhey et al presented an estimation algorithm for the depth fading in Kinect depth image [7]. For human gesture recognition, Yan et al used Kinect to precisely recognize hand gestures, especially the gestures of fingers [8].Lehment et al proposed a pose tracking algorithm based on segmented 3D point clouds [9].…”
Section: Introductionmentioning
confidence: 99%
“…Ugolotti and Cagnoni (2013) parametrically describes a deformable body model with 42 parameters: 29 degrees-of-freedom to describe articulated skeletal joints, 7 parameters to specify limb lengths/thicknesses; and 6 parameters describing the relative pose of the model. A similar idea is explored by Lehment et al (2013) who searches a 22-DOF human pose space using an observation likelihood function approximation whereby point-cloud measurements from a Kinect sensor are compared to the expected point-cloud of a pose hypothesis. Both of these works use parallel processing to search the large pose space in real-time.…”
Section: Quality Control Is a Domain In Which 'mentioning
confidence: 99%
“…The most likely hypothesis is determined using a Bayesian filter that infers evidence towards hypotheses through the summation of likelihood weights based on measurements. Similarly Lehment et al (2013) determines the parametric pose of a human by comparing measurements against the expected measurements for 800 pose hypotheses. The most likely pose is determined using an approximation of the conditional measurement probability under each pose hypothesis.…”
Section: Maximum-sum-of-evidence Pose Estimationmentioning
confidence: 99%
See 1 more Smart Citation