2019 14th IEEE International Conference on Automatic Face &Amp; Gesture Recognition (FG 2019) 2019
DOI: 10.1109/fg.2019.8756536
|View full text |Cite
|
Sign up to set email alerts
|

A CNN Model for Head Pose Recognition using Wholes and Regions

Abstract: Head pose recognition and monitoring is key to many real-world applications, since it is a vital indicator for human attention and behavior. Currently, head pose is often computed by localizing landmarks on a targeted face and solving 2D to 3D correspondence problem with a mean head model. Recent research has shown that this is a brittle approach since it relies entirely on the accuracy of landmark detection, the extraneous head model and an ad-hoc alignment step. Recent work has also shown that the best-perfo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 37 publications
(108 reference statements)
0
10
0
Order By: Relevance
“…A common observation is that the overall performance of the baselines and ROI-CNN [50] is low in VGGFace2 [54], MTFL [20], and AFLW [55] in comparison to MultiLab [50]. This is mainly due to the clutter in images.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…A common observation is that the overall performance of the baselines and ROI-CNN [50] is low in VGGFace2 [54], MTFL [20], and AFLW [55] in comparison to MultiLab [50]. This is mainly due to the clutter in images.…”
Section: Resultsmentioning
confidence: 99%
“…This work builds on the published conference output [50], focusing on coarse head pose recognition from image intensities using ROIs. The proposed RAN makes a substantial advance to it in two aspects: (i) by integrating a novel attention mechanism to explore salient regions in images while making recognition decisions.…”
Section: Previous Work By Authorsmentioning
confidence: 99%
“…The HPE methods have attracted increasing attention of researchers, especially when deep learning-based methods have become more and more prevalent in computer visionrelated tasks. Behera et al [26] learned multi-level features by exploring different image regions, which combine multiple local regions with the whole image for discrete pose classification. Limitation of pure classification based method is that they only predict the approximate range of head pose intervals, and they lack the ability for fine-grained estimation, which would inevitably obstruct the way to wider applications.…”
Section: B Methods Focusing On Algorithm Performancementioning
confidence: 99%
“…There are a number of datasets produced so far for head pose estimation [54,55]. Often facial landmarks are used to generate the ground-truth head poses by fitting a mean 3D face with the POSIT algorithm [26] since it is difficult to precisely measure (or manually annotate) them.…”
Section: Datasets and Evaluation Strategiesmentioning
confidence: 99%