2017
DOI: 10.1109/lra.2017.2657002
|View full text |Cite
|
Sign up to set email alerts
|

Toward Domain Independence for Learning-Based Monocular Depth Estimation

Abstract: Modern autonomous mobile robots require a strong understanding of their surroundings in order to safely operate in cluttered and dynamic environments. Monocular depth estimation offers a geometry-independent paradigm to detect free, navigable space with minimum space, and power consumption. These represent highly desirable features, especially for microaerial vehicles. In order to guarantee robust operation in real-world scenarios, the estimator is required to generalize well in diverse environments. Most of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
46
0
5

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 59 publications
(51 citation statements)
references
References 25 publications
0
46
0
5
Order By: Relevance
“…Another promising approach has been use simulations to get training data for reinforcement or imitation learning tasks, while testing the learned policy in the real world [14], [15], [11]. Clearly, this approach suffers from the domain shift between simulation and reality and might require some realworld data to be able to generalize [11].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Another promising approach has been use simulations to get training data for reinforcement or imitation learning tasks, while testing the learned policy in the real world [14], [15], [11]. Clearly, this approach suffers from the domain shift between simulation and reality and might require some realworld data to be able to generalize [11].…”
Section: Related Workmentioning
confidence: 99%
“…Indeed, we conjecture the network to make decision on the base of the distance to observed objects in the field of view. Convolutional networks are in fact well known to be successful on the task of monocular depth estimation [15]. An interesting question that we would like to answer in future work is how this approach compares to an LSTM [22] based solution, making decisions over a temporal horizon.…”
Section: Drone Controlmentioning
confidence: 99%
“…Out of the 16 layers of the truncated VGG network, the first 8 were kept fixed, while the others were finetuned for the task of depth estimation. In order to accomodate for this task, in [20] two deconvolutional layers were added to the network that bring the neural representation back to the desired depth map resolution.…”
Section: A Monocular Depth Estimationmentioning
confidence: 99%
“…In [20], the FCN was trained on depth maps obtained from various visually highly realistic simulated environments. In the current study, we will train and fine-tune the same layers, but then using sparse stereo-based disparity measurements as supervised targets.…”
Section: A Monocular Depth Estimationmentioning
confidence: 99%
“…Different solutions, typically based on external robotic simulators such as Gazebo [11], V-REP [12], AirSim [13], MORSE [14], are available to this aim. They employ recent advances in computation and computer graphics (e.g., AirSim is a photorealistic environment [7]) in order to simulate physical phenomena (gravity, magnetism, atmospheric conditions) and perception (e.g., providing sensor models) in such a way that the environment realistically reflects the actual world. In some cases those solutions do not have enough features that could allow to create large scale complex environments close to reality.…”
Section: Introductionmentioning
confidence: 99%