2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.583
|View full text |Cite
|
Sign up to set email alerts
|

POSEidon: Face-from-Depth for Driver Pose Estimation

Abstract: Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regressive neural network, called POSEidon, which is composed of three independent convolutional nets followed by a fusion layer, specially conceived for under… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
140
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 152 publications
(143 citation statements)
references
References 56 publications
0
140
0
Order By: Relevance
“…However, no previous method that uses depth maps as the only input exploits CNNs in an effective way. In this work we propose a method based on [10] which uses depth maps to produce accurate head pose predictions by leveraging CNNs.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, no previous method that uses depth maps as the only input exploits CNNs in an effective way. In this work we propose a method based on [10] which uses depth maps to produce accurate head pose predictions by leveraging CNNs.…”
Section: Related Workmentioning
confidence: 99%
“…In the automotive field, vision-based systems are required to cooperate or even replace other traditional sensors, due to the increasing presence of cameras inside new car's cockpits and to the ease of capturing images and videos in a completely non-invasive manner. In the past, encouraging results for driver head pose estimation have been achieved using RGB images [1], [2], [3], [4], [5] as well as different camera types, such as infrared [6], thermal [7], or depth [8], [9], [10]. Among them, the last ones are very promising, since they allow robustness when facing strong illumination variations.…”
Section: Introductionmentioning
confidence: 99%
“…Experiments are conducted exploiting two publicly available datasets: Pandora [6] and MotorMark [13]. Pandora contains more than 250k frames, splitted into 110 annotated sequences of 22 different actors (10 males and 12 females), while MotorMark is composed of more than 30k frames of 35 different subjects, guaranteeing a great variety of face appearances.…”
Section: A Datasetsmentioning
confidence: 99%
“…The Pandora dataset was introduced in [3] for the head pose estimation task in depth images. It consists of more than Table 1.…”
Section: Pandora Datasetmentioning
confidence: 99%