2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR) 2019
DOI: 10.1109/mipr.2019.00060
|View full text |Cite
|
Sign up to set email alerts
|

Very Long Term Field of View Prediction for 360-Degree Video Streaming

Abstract: 360-degree videos have gained increasing popularity in recent years with the developments and advances in Virtual Reality (VR) and Augmented Reality (AR) technologies. In such applications, a user only watches a video scene within a field of view (FoV) centered in a certain direction. Predicting the future FoV in a long time horizon (more than seconds ahead) can help save bandwidth resources in on-demand video streaming while minimizing video freezing in networks with significant bandwidth variations. In this … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 54 publications
(36 citation statements)
references
References 15 publications
(24 reference statements)
0
36
0
Order By: Relevance
“…Petrangeli et al [11] first identify user clusters with a kind of spectral clustering algorithm, then fit a regression model for each cluster, finally predict with the regression model from the user's corresponding cluster. Li et al [12] utilize LSTM and Attentive Mixture Experts (AME) techniques to train a model to predict based on both the target user's historical fixations and other users' fixations. Nasrabadi et al [13] first cluster users based on their quaternion rotations, and then classify the target user to the corresponding cluster and estimate the future fixation as the cluster center.…”
Section: B Viewport Predictionmentioning
confidence: 99%
“…Petrangeli et al [11] first identify user clusters with a kind of spectral clustering algorithm, then fit a regression model for each cluster, finally predict with the regression model from the user's corresponding cluster. Li et al [12] utilize LSTM and Attentive Mixture Experts (AME) techniques to train a model to predict based on both the target user's historical fixations and other users' fixations. Nasrabadi et al [13] first cluster users based on their quaternion rotations, and then classify the target user to the corresponding cluster and estimate the future fixation as the cluster center.…”
Section: B Viewport Predictionmentioning
confidence: 99%
“…FoV prediction is critical to the performance of FoV-adaptive streaming. In the past, linear regression, weighted linear regression, and truncated linear prediction [5,17,18] as well as neural network based methods [2,6,12] have been proposed. Most of these methods can predict the short-term FoV (within the future 1 second) well with an accuracy of more than 90% [17][12] [6].…”
Section: Predicting Fov and Bandwidthmentioning
confidence: 99%
“…As such,the major part of the available resources can be allocated to this particular part of the video hemisphere, resulting in the perceived quality being optimized. Two major types of viewport prediction can be identified: content-based [22] and content-agnostic [23] pre-…”
Section: Accurate and Real-time Viewport Prediction: Knowing The Umentioning
confidence: 99%
“…Content-based prediction aims at characterizing the given content, independent from the user, in terms of saliency maps, motion detection, Regions-Of-Interest (ROIs) etc. Based on this information, density functions can be created that estimate the probability of a generic user looking at a particular part of the hemisphere at a given time instant [22]. Contentagnostic approaches, on the other hand, do not take the content into consideration, but try to predict the user's future fixation point based on this and other user's historical movement.…”
Section: Authorsmentioning
confidence: 99%