Improving Head Pose Estimation with a Combined Loss and Bounding Box Margin Adjustment

Shao, Mingzhen; Sun, Zhun; Özay, Mete; Okatani, Takayuki

doi:10.1109/fg.2019.8756605

Cited by 14 publications

(14 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, 300W-LP is a synthetic one, while the BIWI and AFLW2000 consist of real images. The deep learning-based landmark-free approaches such as Hopenet [10], SSR-Net-MD [12], ResNet-BBM [13], FSA-Net [11] and our RAFA-Net perform better than the landmark-based ones (Dlib [1], 3DDFA [31], FAN [2], KEPLER [5] and Two-stage [3]) tested on both the BIWI and AFLW2000 datasets. This is mainly since the landmark-free approaches can better accommodate the domain discrepancies between training and testing datasets.…”

Section: Comparison With the State-of-the-art (Sota) Methodsmentioning

confidence: 97%

“…The overall performance (MAE) of our RAFA-Net is inferior to the FSA-Caps-Fusion [11] and SSR-Net-MD [12] landmark-free approaches (Table 1). However, it is better than the ResNet-BBM [13] and Hopenet [10]. Moreover, the estimated average error in pitch is better (6.26) than the landmark-free approaches except for the FSA-Caps-Fusion [11] (4.96).…”

Section: Train On 300w-lp and Test On Biwimentioning

confidence: 95%

“…However, these approaches require manually annotated ground-truth, which is laborious, time-consuming, and often experts cannot accurately assign landmark locations in low-resolution images. Landmark-free approaches: To address the above drawback, recently, there is a significant interest in estimating head poses directly from the image intensities using deep networks [9][10][11][12][13][14]. Such approaches often encounter problems due to illumination variations or poor illumination during night time.…”

Section: Related Workmentioning

confidence: 99%

“…2) train the model using 70% of videos (16 videos) in the BIWI dataset and evaluate the rest 30% (8 videos). In all three datasets, we use the detected face bounding box provided by Shao et al [13]. The standard evaluation metric of mean absolute error (MAE) is used.…”

Section: Datasets and Evaluation Strategiesmentioning

confidence: 99%

“…We propose a novel data augmentation approach (Fig 4) and is inspired by the experiment carried out by Shao et al [13] to measure the accuracy of their We experimentally found that this randomization gives better generalization resulting in improved performance rather than using standard augmentation techniques such as random scaling, width and/or height sifting and cropping. For all our experiments, we have used 0 ≤ γ ≤ 0.5.…”

Section: Data Augmentationmentioning

confidence: 99%

See 4 more Smart Citations

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

Behera

Wharton

Hewage

et al. 2021

Computer Vision – ACCV 2020

View full text Add to dashboard Cite

Head pose is a vital indicator of human attention and behavior. Therefore, automatic estimation of head pose from images is key to many applications. In this paper, we propose a novel approach for head pose estimation from a single RGB image. Many existing approaches often predict head poses by localizing facial landmarks and then solve 2D to 3D correspondence problem with a mean head model. Such approaches rely entirely on the landmark detection accuracy, an ad-hoc alignment step, and the extraneous head model. To address this drawback, we present an end-to-end deep network, which explores rotation axis (yaw, pitch and roll) focused innovative attention mechanism to capture the subtle changes in images. The mechanism uses attentional spatial pooling from a self-attention layer and learns the importance over fine-grained to coarse spatial structures and combine them to capture rich semantic information concerning a given rotation axis. The evaluation of our approach using three benchmark datasets is very competitive to state-of-the-arts, including with and without landmark-based methods.

show abstract

Section: Comparison With the State-of-the-art (Sota) Methodsmentioning

confidence: 97%

Section: Train On 300w-lp and Test On Biwimentioning

confidence: 95%

Section: Related Workmentioning

confidence: 99%

Section: Datasets and Evaluation Strategiesmentioning

confidence: 99%

Section: Data Augmentationmentioning

confidence: 99%

See 3 more Smart Citations

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

Behera

Wharton

Hewage

et al. 2021

Computer Vision – ACCV 2020

View full text Add to dashboard Cite

show abstract

A Study of General Data Improvement for Large-Angle Head Pose Estimation

Bai

Peng

et al. 2021

Computer Analysis of Images and Patterns

View full text Add to dashboard Cite

Deep Learning for Head Pose Estimation: A Survey

Asperti

Filippini

2023

SN COMPUT. SCI.

View full text Add to dashboard Cite

Head pose estimation (HPE) is an active and popular area of research. Over the years, many approaches have constantly been developed, leading to a progressive improvement in accuracy; nevertheless, head pose estimation remains an open research topic, especially in unconstrained environments. In this paper, we will review the increasing amount of available datasets and the modern methodologies used to estimate orientation, with a special attention to deep learning techniques. We will discuss the evolution of the field by proposing a classification of head pose estimation methods, explaining their advantages and disadvantages, and highlighting the different ways deep learning techniques have been used in the context of HPE. An in-depth performance comparison and discussion is presented at the end of the work. We also highlight the most promising research directions for future investigations on the topic.

show abstract

Improving Head Pose Estimation with a Combined Loss and Bounding Box Margin Adjustment

Cited by 14 publications

References 22 publications

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

A Study of General Data Improvement for Large-Angle Head Pose Estimation

Deep Learning for Head Pose Estimation: A Survey

Contact Info

Product

Resources

About