Panoramic streaming is a particular way of video streaming where an arbitrary Region-of-Interest (RoI) of high-spatial resolution videos is transmitted. It allows users to navigate interactively around the video and thus select anytime the portion of it they are interested in. The most basic approach consists of each user interacting with the system indicating the desired RoI. Then an encoder associated with each user encodes the desired RoI. However, such a system does not scale well. Instead, we consider tile based panoramic streaming, where users receive a set of tiles that match their RoI, and propose a low-complexity compressed domain video processing technique for tile based panoramic streaming using H.265/HEVC that generates a single video bitstream out of the selected tiles so that single hardware decoders can be used to decode the RoI video stream
With the continuously increasing attention that 360°video streaming is drawing, several solutions have been developed lately. They can be grouped into two main categories, with the first one being the viewport-independent category that consists of encoding the whole 360°video content using a particular projection, e.g., Equirectangular Projection or Cubemap Projection, without taking any viewport orientation into account. Such approaches waste resources since content not being watched by the user is encoded with the same visual fidelity as the content actually watched. The second category, the viewport-dependent category, relies on techniques that allow for viewport adaptivity. One approach is to apply viewport-dependent projections, wherein the spherical 360°video is mapped onto a rectangular frame in such a way that a specific viewport is mapped to a comparatively larger part of the rectangular frame than the rest of the content. Another approach, as specified in MPEG OMAF, is to use tile-based streaming, with the 360°video being offered in a tiled manner at various resolutions. Thus, each user can retrieve the tiles at different resolutions so that the high-resolution tiles match its viewport. Although viewport-dependent approaches allow providing better visual quality at the viewport, the end-toend delay is critical, since it has an impact on the time needed by clients to adapt to user movements so that movements are reflected on the retrieved content. In this paper, we analyze the impact of the delay and introduce various means to reduce the impact on the observed fidelity.
Versatile Video Coding (VVC), a.k.a. ITU-T H.266 | ISO/IEC 23090-3, is the new generation video coding standard that has just been finalized by the Joint Video Experts Team (JVET) of ITU-T VCEG and ISO/IEC MPEG at its 19 th meeting ending on July 1, 2020. This paper gives an overview of the VVC high-level syntax (HLS), which forms its system and transport interface. Comparisons to the HLS designs in High Efficiency Video Coding (HEVC) and Advanced Video Coding (AVC), the previous major video coding standards, are included. When discussing new HLS features introduced into VVC or differences relative to HEVC and AVC, the reasoning behind the design differences and the benefits they bring are described. The HLS of VVC enables newer and more versatile use cases such as video region extraction, composition and merging of content from multiple coded video bitstreams, and viewport-adaptive 360 • immersive media.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.