This paper describes an extension of the High Efficiency Video Coding (HEVC) standard for coding multi-view video and depth data. In addition to the known concept of disparity-compensated prediction, inter-view motion parameter prediction and inter-view residual prediction for coding the dependent video views have been developed and integrated. Furthermore, for depth coding, new intra coding modes, modified motion compensation and motion vector coding, and the concept of motion parameter inheritance are part of the HEVC extension. A novel encoder control uses view synthesis optimization, which guarantees that high-quality intermediate views can be generated from the decoded data. The bitstream format supports the extraction of partial bitstreams, so that conventional 2D video, stereo video, and the full multi-view video plus depth format can be decoded from a single bitstream. Objective and subjective results are presented, demonstrating that the proposed approach provides 50% bit rate savings in comparison with HEVC simulcast and 20% in comparison with a straightforward multi-view extension of HEVC without the newly developed coding tools.
This paper presents results for full and mixed resolution stereo video coding, including subjective and objective evaluation for 3DTV on mobile devices. Objective and subjective tests were carried out with different sequences at different image sizes and evaluated on two different stereoscopic displays. The subjective tests showed that full resolution was rated better for uncoded sequences, whereas mixed resolution was rated better for coded sequences at low bit rates. An unsharp masking algorithm for up-sampling the lower resolution view only partially improved the overall quality, because coding artifacts were strengthened as well. Objective tests showed that the optimum bit rate distribution for mixed resolution stereo video coding is 30% to 35% of the total bit rate for the lower resolution view (half horizontal and half vertical resolution). Finally, tests showed that mixed resolution stereo video decoding with up-sampling is less complex than decoding full resolution stereo video, which is important for implementation on mobile devices.
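The bit rate distribution reported in this abstract can be illustrated with a small sketch. The helper names `split_bit_rate` and `sample_ratio` and the 1000 kbit/s total are assumptions chosen for illustration; they do not come from the paper, and only the 30% to 35% share and the half-horizontal, half-vertical down-sampling are taken from the abstract.

```python
def split_bit_rate(total_kbps, low_res_share):
    """Split a total bit rate between the full and the lower resolution view.

    low_res_share is the fraction of the total assigned to the lower
    resolution view (the abstract reports an optimum of 0.30 to 0.35).
    Returns (full_res_kbps, low_res_kbps).
    """
    if not 0.0 <= low_res_share <= 1.0:
        raise ValueError("low_res_share must lie in [0, 1]")
    low_res_kbps = total_kbps * low_res_share
    return total_kbps - low_res_kbps, low_res_kbps


def sample_ratio(h_factor=0.5, v_factor=0.5):
    """Fraction of samples kept after down-sampling by the given factors."""
    return h_factor * v_factor


if __name__ == "__main__":
    # Hypothetical total of 1000 kbit/s, evaluated at both ends of the range.
    for share in (0.30, 0.35):
        full, low = split_bit_rate(1000.0, share)
        print(f"share={share:.2f}: full={full:.0f} kbit/s, low={low:.0f} kbit/s")
    print("samples kept in the lower resolution view:", sample_ratio())
```

Under these assumptions the lower resolution view carries a quarter of the samples of the full resolution view while receiving roughly a third of the total bit rate.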
The presented approach for 3D video coding uses the multiview video plus depth format, in which a small number of video views and their associated depth maps are coded. Based on the coded signals, additional views required for displaying the 3D video on an autostereoscopic display can be generated by depth image based rendering techniques. The developed coding scheme represents an extension of HEVC, similar to the MVC extension of H.264/AVC. However, in addition to the well-known disparity-compensated prediction, advanced techniques for inter-view and inter-component prediction, the representation of depth blocks, and the encoder control for depth signals have been integrated. In comparison to simulcasting the different signals using HEVC, the proposed approach provides about 40% and 50% bit rate savings for the tested configurations with 2 and 3 views, respectively. Bit rate reductions of about 20% have been obtained in comparison to a straightforward multiview extension of HEVC without the newly developed coding tools.
This paper presents a study on different techniques for stereo video compression and its optimization for mobile 3D services. Stereo video enables 3D television, but because mobile services are subject to various limitations, including bandwidth, memory, and processing power, efficient compression is required. Three of the currently available MPEG coding methods are applicable to stereo video coding, namely H.264/AVC with and without the stereo SEI message, and H.264/MVC. These methods are evaluated with respect to the limitations of mobile services. The results clearly indicate that, at a given bitrate, inter-view prediction as well as temporal prediction with hierarchical B pictures lead to significantly increased subjective and objective quality. Although both techniques require more complex processing at the encoder side, their coding efficiency offers the chance to realize 3D stereo at the bitrate of conventional video for mobile services.
This paper presents results on mixed resolution stereo video coding including inter-view prediction. The objective tests show that coding one view of a stereo pair at a lower resolution outperforms coding a low-pass filtered view at base resolution. For both coding methods, inter-view prediction produces additional coding gains. Two different prediction approaches have been compared: one uses the lower resolution view as the base view, from which the second view is predicted; the other uses the higher resolution view as the base view. The prediction is realized by up-sampling or down-sampling the decoded base view before predicting. The two approaches achieve equal results at low bit rates, while predicting from the high resolution view to the low resolution view performs better at medium and high bit rates. During coding, different QP combinations for the left and right views have been compared to achieve the maximum combined PSNR.
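The QP comparison mentioned at the end of this abstract can be sketched as a small search over candidate QP pairs. The abstract does not define "combined PSNR", so the sketch assumes it is the PSNR of the mean squared error averaged equally over both views, with the lower resolution view measured after up-sampling. The function names, the bit rate budget, and the candidate measurements are all hypothetical.

```python
import math


def psnr_from_mse(mse, peak=255.0):
    """PSNR in dB for a given mean squared error (8-bit peak by default)."""
    return 10.0 * math.log10(peak * peak / mse)


def combined_psnr(mse_left, mse_right, peak=255.0):
    """Assumed definition: PSNR of the MSE averaged over both views."""
    return psnr_from_mse((mse_left + mse_right) / 2.0, peak)


def best_qp_pair(candidates, max_kbps):
    """Return the candidate with the highest combined PSNR within the budget.

    Each candidate is a dict with keys "qp" (left QP, right QP), "kbps",
    "mse_left" and "mse_right". Returns None if no candidate fits.
    """
    feasible = [c for c in candidates if c["kbps"] <= max_kbps]
    if not feasible:
        return None
    return max(
        feasible,
        key=lambda c: combined_psnr(c["mse_left"], c["mse_right"]),
    )


if __name__ == "__main__":
    # Made-up measurements, for illustration only.
    measured = [
        {"qp": (30, 30), "kbps": 500, "mse_left": 20.0, "mse_right": 20.0},
        {"qp": (28, 34), "kbps": 480, "mse_left": 14.0, "mse_right": 30.0},
        {"qp": (26, 30), "kbps": 620, "mse_left": 10.0, "mse_right": 20.0},
    ]
    choice = best_qp_pair(measured, max_kbps=600)
    print("selected QP pair:", choice["qp"])
```

Averaging the MSE before converting to decibels, rather than averaging the two per-view PSNR values, is a design choice of this sketch; the paper may have used a different weighting.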