Don’t Forget The Past: Recurrent Depth Estimation from Monocular Video

Patil, Vaishakh; Gansbeke, Wouter Van; Dai, Dengxin; Gool, Luc Van

doi:10.1109/lra.2020.3017478

Cited by 95 publications

(57 citation statements)

References 42 publications

(109 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…where α = 0.85 is a balancing weight and SSIM is a method of comparing and evaluating the quality of the predicted image with the original image. It is an index frequently used for depth estimation [17,21,23,33,37]. The SSIM between two images I x and I y is defined by:…”

Section: Image Reconstruction Lossmentioning

confidence: 99%

“…Since the model trained by the general self-supervised monocular depth estimation method predicts the relative depth for a single frame, flicker may occur when applied to consecutive images [22]. Patil et al [23] improves the depth accuracy based on spatiotemporal information by concatenating the encoding output of the previous frame with the encoding output of the current frame and decoding it. In a recent study [22], performance was improved by proposing optical flow-based loss including geometry consistency, but real-time execution is impossible because of an additional operation that requires learning at test time.…”

Section: Depth Feedback Networkmentioning

confidence: 99%

“…However, the depth obtained Sensors 2021, 21, 2691 2 of 16 by the monocular learning-based method often flickers depending on the scale ambiguity and prediction per single frame [22]. In recent research, recurrent depth estimation using temporal information [23] and multi-view reconstruction using spatial information [24] were proposed for using spatiotemporal information.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Unsupervised Monocular Depth Estimation for Colonoscope System Using Feedback Network

Hwang

Park

Kim

et al. 2021

Sensors

View full text Add to dashboard Cite

A colonoscopy is a medical examination used to check disease or abnormalities in the large intestine. If necessary, polyps or adenomas would be removed through the scope during a colonoscopy. Colorectal cancer can be prevented through this. However, the polyp detection rate differs depending on the condition and skill level of the endoscopist. Even some endoscopists have a 90% chance of missing an adenoma. Artificial intelligence and robot technologies for colonoscopy are being studied to compensate for these problems. In this study, we propose a self-supervised monocular depth estimation using spatiotemporal consistency in the colon environment. It is our contribution to propose a loss function for reconstruction errors between adjacent predicted depths and a depth feedback network that uses predicted depth information of the previous frame to predict the depth of the next frame. We performed quantitative and qualitative evaluation of our approach, and the proposed FBNet (depth FeedBack Network) outperformed state-of-the-art results for unsupervised depth estimation on the UCL datasets.

show abstract

Section: Image Reconstruction Lossmentioning

confidence: 99%

Section: Depth Feedback Networkmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Unsupervised Monocular Depth Estimation for Colonoscope System Using Feedback Network

Hwang

Park

Kim

et al. 2021

Sensors

View full text Add to dashboard Cite

show abstract

“…There is a large body of prior work on depth estimation across multiple frames [17,31,8,30,18,28,19]. Liu et al [17] aggregate per-frame depth estimates across frames using Bayesian filtering.…”

Section: B Multi-frame Depth Estimationmentioning

confidence: 99%

“…Matthies et al [18] use a similar Bayesian approach, but their method is only applied to controlled scenes and restricted camera motion. Other works [31,8,28,19] use RNNs for predicting depth maps at each frame. All of aforementioned works try to predict the full 2D depth map of the environment from monocular images.…”

Section: B Multi-frame Depth Estimationmentioning

confidence: 99%

Active Safety Envelopes using Light Curtains with Probabilistic Guarantees

Ancha¹,

Pathak²,

Narasimhan³

et al. 2021

Robotics: Science and Systems XVII

View full text Add to dashboard Cite

To safely navigate unknown environments, robots must accurately perceive dynamic obstacles. Instead of directly measuring the scene depth with a LiDAR sensor, we explore the use of a much cheaper and higher resolution sensor: programmable light curtains. Light curtains are controllable depth sensors that sense only along a surface that a user selects. We use light curtains to estimate the safety envelope of a scene: a hypothetical surface that separates the robot from all obstacles. We show that generating light curtains that sense random locations (from a particular distribution) can quickly discover the safety envelope for scenes with unknown objects. Importantly, we produce theoretical safety guarantees on the probability of detecting an obstacle using random curtains. We combine random curtains with a machine learning based model that forecasts and tracks the motion of the safety envelope efficiently. Our method accurately estimates safety envelopes while providing probabilistic safety guarantees that can be used to certify the efficacy of a robot perception system to detect and avoid dynamic obstacles. We evaluate our approach in a simulated urban driving environment and a real-world environment with moving pedestrians using a light curtain device and show that we can estimate safety envelopes efficiently and effectively. 1

show abstract

Semantic and Optical Flow Guided Self-supervised Monocular Depth and Ego-Motion Estimation

Fang

Liu

2021

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Self-supervised monocular depth estimation approaches suffer not only from scale ambiguity but also infer temporally inconsistent depth maps w.r.t. scale. While disambiguating scale during training is not possible without some kind of ground truth supervision, having scale consistent depth predictions would make it possible to calculate scale once during inference as a post-processing step and use it over-time. With this as a goal, a set of temporal consistency losses that minimize pose inconsistencies over time are introduced. Evaluations show that introducing these constraints not only reduces depth inconsistencies but also improves the baseline performance of depth and ego-motion prediction.

show abstract

Don’t Forget The Past: Recurrent Depth Estimation from Monocular Video

Cited by 95 publications

References 42 publications

Unsupervised Monocular Depth Estimation for Colonoscope System Using Feedback Network

Unsupervised Monocular Depth Estimation for Colonoscope System Using Feedback Network

Active Safety Envelopes using Light Curtains with Probabilistic Guarantees

Semantic and Optical Flow Guided Self-supervised Monocular Depth and Ego-Motion Estimation

Contact Info

Product

Resources

About