CVPR 2011 2011
DOI: 10.1109/cvpr.2011.5995458
|View full text |Cite
|
Sign up to set email alerts
|

Real time head pose estimation with random regression forests

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
351
0
2

Year Published

2013
2013
2019
2019

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 392 publications
(353 citation statements)
references
References 23 publications
0
351
0
2
Order By: Relevance
“…A success of such methods in 3D full-body pose estimation is evident from recent results that use Microsoft Kinect sensor (Girshick et al, 2011;Sun et al, 2012); such discriminative methods have also proved effective for other problems, including image-based 3D pose (Bo and Sminchisescu, 2010;Kanaujia et al, 2007;Shakhnarovich et al, 2003;Sminchisescu et al, 2006;Urtasun and Darrell, 2008), head pose (Fanelli et al, 2011) and body shape (Chen et al, 2011;Sigal et al, 2007) estimation. The typical goal of discriminative regression methods is to learn a direct (and sometimes multi-modal) mapping, f : R dx → R dy , from features (e.g., computed from image or depth data) to pose (e.g., 3D position and orientation of the head, or full 3D pose of the body encoded by joint positions or joint angles).…”
Section: Introductionmentioning
confidence: 97%
“…A success of such methods in 3D full-body pose estimation is evident from recent results that use Microsoft Kinect sensor (Girshick et al, 2011;Sun et al, 2012); such discriminative methods have also proved effective for other problems, including image-based 3D pose (Bo and Sminchisescu, 2010;Kanaujia et al, 2007;Shakhnarovich et al, 2003;Sminchisescu et al, 2006;Urtasun and Darrell, 2008), head pose (Fanelli et al, 2011) and body shape (Chen et al, 2011;Sigal et al, 2007) estimation. The typical goal of discriminative regression methods is to learn a direct (and sometimes multi-modal) mapping, f : R dx → R dy , from features (e.g., computed from image or depth data) to pose (e.g., 3D position and orientation of the head, or full 3D pose of the body encoded by joint positions or joint angles).…”
Section: Introductionmentioning
confidence: 97%
“…Papazov et al [25] also used a random forest-based framework, in a similar way to the methods in Refs. [22][23][24]. They replaced depth features by more elaborate triangular surface patch (TSP) features to ensure view-invariance.…”
Section: Head Pose Estimationmentioning
confidence: 99%
“…Fanelli et al [22][23][24] adopted a voting method to directly determine head pose. However, their feature selection method for depth images degenerates into using 2D features, i.e., the RGB information used in 2D images was replaced by xyz-coordinate values in depth images.…”
Section: Head Pose Estimationmentioning
confidence: 99%
“…Similar to existing regression forests in literature including (Fanelli et al 2011;Shotton et al 2011;Denil et al 2014), at a split node, we randomly select a relatively small set of s distinct features Φ := {φ i } s i=1 from the d-dimensional space as candidate features (i.e. entries of the feature vector).…”
Section: The Split Criteriamentioning
confidence: 99%
“…The information gains and split criteria, the usage of whole hand image patch rather than individual pixels, as well as the DOT features to be detailed later are also quite different. Meanwhile, various related regression forest models have been investigated recently: in Fanelli et al (2011), the head pose has 6 degree-of-freedom (DoF), which is divided into 2 parts: 3D translation and 3D orientation. In each leaf node, the distribution is approximated by a 3D Gaussian.…”
Section: Related Workmentioning
confidence: 99%