2018
DOI: 10.1007/978-3-030-01261-8_44
|View full text |Cite
|
Sign up to set email alerts
|

Deep Pictorial Gaze Estimation

Abstract: Estimating human gaze from natural eye images only is a challenging task. Gaze direction can be defined by the pupil-and the eyeball center where the latter is unobservable in 2D images. Hence, achieving highly accurate gaze estimates is an ill-posed problem. In this paper, we introduce a novel deep neural network architecture specifically designed for the task of gaze estimation from single eye input. Instead of directly regressing two angles for the pitch and yaw of the eyeball, we regress to an intermediate… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
105
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
4
3
3

Relationship

1
9

Authors

Journals

citations
Cited by 163 publications
(117 citation statements)
references
References 44 publications
0
105
0
Order By: Relevance
“…Gaze Estimation Error. For the assessment of gaze redirection accuracy, we employ a state-of-the-art gaze estimator proposed by Park et al [24] which was pre-trained on MPIIGaze [34]. The estimator predicts the gaze direction of the generated gaze images.…”
Section: Metricsmentioning
confidence: 99%
“…Gaze Estimation Error. For the assessment of gaze redirection accuracy, we employ a state-of-the-art gaze estimator proposed by Park et al [24] which was pre-trained on MPIIGaze [34]. The estimator predicts the gaze direction of the generated gaze images.…”
Section: Metricsmentioning
confidence: 99%
“…Note that all the aforementioned methods are used for semantic segmentation. Recently, however, the FCN-like network structure has been also applied successfully to other keypoint detection problems such as human pose estimation [58], facial landmark detection [59] and eye gaze estimation [60,61]. They all have an encoder-decoder architecture and used a FCN-like network structure called hourglass network which borrows the idea from FCN.…”
Section: Related Workmentioning
confidence: 99%
“…Rows from top to bottom show input test images, ground truth labels, predictions from mSegNet w/BR [4] and predictions from RITnet, respectively. semantic segmentation of these regions enables the extrac- tion of region-specific features (e.g., iridial feature tracking [2])and mathematical models which summarize the region structures (e.g., iris ellipse [17,1,13], or pupil ellipse [7]) used to derive a measure of gaze orientation.…”
Section: Introductionmentioning
confidence: 99%