2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) 2015
DOI: 10.1109/icapr.2015.7050696
|View full text |Cite
|
Sign up to set email alerts
|

A multimodal approach for image de-fencing and depth inpainting

Abstract: Low cost RGB-D sensors such as the Microsoft Kinect have enabled the use of depth data along with color images. In this work, we propose a multi-modal approach to address the problem of removal of fences/occlusions from images captured using a Kinect camera. We also perform depth completion by fusing data from multiple recorded depth maps affected by occlusions. The availability of aligned image and depth data from Kinect aids us in the detection of the fence locations. However, accurate estimation of the rela… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 20 publications
0
9
0
Order By: Relevance
“…In this work, we propose a depth-based technique to detect the fence mask from the disparity map obtained from a stereo image pair. The works of Wang et al [17] and Jonna et al [18] are most closely related to our method. The algorithm in [17] leveraged advantages of stereo image pair for joint completion of missing texture and depth.…”
Section: Related Workmentioning
confidence: 90%
See 1 more Smart Citation
“…In this work, we propose a depth-based technique to detect the fence mask from the disparity map obtained from a stereo image pair. The works of Wang et al [17] and Jonna et al [18] are most closely related to our method. The algorithm in [17] leveraged advantages of stereo image pair for joint completion of missing texture and depth.…”
Section: Related Workmentioning
confidence: 90%
“…The limitation of [17] is that the region to be inpainted has to be specified manually. The technique of [18] proposed a multimodal approach for image de-fencing where the fence masks are extracted automatically from depth maps corresponding to the color images obtained using Kinect sensor. One limitation of the Kinect sensor is that it provides the depth map of a scene in a constrained laboratory environment and only works accurately if the scene is within a particular distance from sensor (viewing distance range is 1.2m to 3.5m).…”
Section: Related Workmentioning
confidence: 99%
“…Their proposed algorithms achieve great performance in video-based de-fencing. In addition to RGB video-based methods, method [10] incorporates depth map to enhance the estimated fence mask.…”
Section: Related Work a Video-based Fence Removalmentioning
confidence: 99%
“…This method uses an image matting (Zheng and Kambhamettu [2009]) for fence segmentation, but the main drawback of this approach is that it involves significant user interaction and is therefore not very suitable for practical purposes. Jonna et al [2015b] proposed a multimodal approach for image de-fencing in video frames, in which the fence mask was first extracted in each frame with the aid of depth maps corresponding to the color images obtained from a Kinect sensor, and next an optical flow algorithm was used to find correspondences between adjacent frames. Finally, estimation of the de-fenced image was done by modeling it as a Markov Random Field, and obtaining its maximum a-posteriori estimate by applying loopy belief propagation.…”
Section: Image De-fencingmentioning
confidence: 99%