2019
DOI: 10.1109/tpami.2018.2858783
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

Abstract: Panoramic video provides immersive and interactive experience by enabling humans to control the field of view (FoV) through head movement (HM). Thus, HM plays a key role in modeling human attention on panoramic video. This paper establishes a database collecting subjects' HM in panoramic video sequences. From this database, we find that the HM data are highly consistent across subjects. Furthermore, we find that deep reinforcement learning (DRL) can be applied to predict HM positions, via maximizing the reward… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
97
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 170 publications
(97 citation statements)
references
References 45 publications
0
97
0
Order By: Relevance
“…In practice, both the HM and EM data are not available for objective VQA on omnidirectional video. Therefore, the predicted HM maps [48] and EM maps [23] are used in our VQA approach. Also, we test the performance of our approach with the ground truth HM and EM data, to show the upper bound performance of our approach.…”
Section: Discussionmentioning
confidence: 99%
“…In practice, both the HM and EM data are not available for objective VQA on omnidirectional video. Therefore, the predicted HM maps [48] and EM maps [23] are used in our VQA approach. Also, we test the performance of our approach with the ground truth HM and EM data, to show the upper bound performance of our approach.…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, measuring the distance between states is another important direction to explore, where better representations of states may lead to improvements. Finally, DEHRL may be a promising solution for visual tasks (Xu et al 2018b) with diverse representation and mixed reward functions.…”
Section: Discussionmentioning
confidence: 99%
“…A deep reinforcement learning based viewpoint prediction approach is proposed in [18], in which the reinforcement learning model is established to track the long-term head movement behaviors of humans.…”
Section: Related Workmentioning
confidence: 99%