Abstract-The multimedia signal processing community has recently identified the need to design depth map compression algorithms which preserve depth discontinuities in order to improve the rendering quality of virtual views for Free Viewpoint Video (FVV) services. This paper adopts contour detection with surround suppression on the color video to approximate the foreground edges present in the depth image. Displacement estimation and compensation is then used to improve this prediction and reduce the amount of side information required by the decoder. Simulation results indicate that the proposed method manages to accurately predict around 64% of the blocks. Moreover, the proposed scheme achieves a Peak Signal-to-Noise Ratio (PSNR) gain of around 4.9 -6.6 dB relative to the JPEG standard and manages to outperform other state of the art depth map compression algorithms found in literature.Index Terms-3D television, depth map compression, displacement estimation and compensation, edge detection, surround suppression I. INTRODUCTION Advances in technology have contributed to the development of new applications such as 3D Television (3DTV) and Free Viewpoint Video (FVV), which have recently started to attract interest within the marketplace. The multi-view video plus depth (MVD) [1] format enables the generation of an infinite set of videos using a finite set of color and corresponding depth videos [2]. The MVD format significantly reduces the amount of information needed to deploy these services. Moreover, in order to make these applications more viable, both color and depth videos need to be compressed. However, the quality of the rendered views is significantly dependent on the quality of the depth video.Traditional image and video compression standards produce blurring artifacts along depth discontinuities, which negatively affect the Quality of Experience (QoE) of the rendered virtual views [3], [4]. The authors in [5] have demonstrated that higher rendering quality can be perceived when employing depth map compression strategies which preserve edge boundaries. Following this observation the same authors have proposed to reshape the dynamic range and use Region of Interest (RoI) coding to improve the performance of traditional image compression standards. Directional transforms were considered in [3], [6] which are highly computational intensive.
An FPGA-based design of an electronic cochlear model is presented with the target FPGA being the Xilinx XC3S500E from the Spartan-3E family of devices. The design adopted consists of the traditional cascade of 2n d order lIR low-pass filter stages tuned at frequencies following an exponential distribution, covering the human auditory range from 20 Hz to 20 KHz. Filter stages within the implementation employ dual fixed-point arithmetic and make use of the FPGA's dedicated on-board hardware multiplier blocks for their computations. Such an approach for the implementation of the filters' transfer function leads to the use of a time-division multiplexed scheme for the realization of the 24-stage filter chain designed, in view of limited FPGA hardware resources. Furthermore, a design for an ADC interface module using the Spartan-3E Starter Kit Board's analogue capture circuit is presented to feed audio signals to the model. Index Terms-digital silicon cochlea, digital filter design, field programmable gate array (FPGA).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.