2019 IEEE/CVF International Conference on Computer Vision (ICCV)
DOI: 10.1109/iccv.2019.00111

SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation

Abstract: We propose SplitNet, a method for decoupling visual perception and policy learning. By incorporating auxiliary tasks and selective learning of portions of the model, we explicitly decompose the learning objectives for visual navigation into perceiving the world and acting on that perception. We show dramatic improvements over baseline models on transferring between simulators, an encouraging step towards Sim2Real. Additionally, SplitNet generalizes better to unseen environments from the same simulator and tran…

Cited by 60 publications (67 citation statements)
References 33 publications
“…Finally, there are two very recent articles closely related to ours. The first, by Gordon et al. [10], introduced SplitNet, in which the learning scheme is explicitly decomposed: features are learned from a perception task and then used as input to a model-free RL agent. However, their scheme is applied to a completely different task, robot navigation and scene exploration.…”
Section: Auxiliary Tasks and Learning Affordances
confidence: 99%
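The decomposition this citation describes can be sketched in a few lines. The class name, dimensions, and update rule below are illustrative assumptions for the purpose of the sketch, not the authors' code; the key point is that a gradient step touches only the policy head while the perception encoder stays frozen:

```python
import numpy as np

rng = np.random.default_rng(0)

class SplitModel:
    """Hypothetical SplitNet-style model: a perception encoder feeds a
    separately-updated policy head (names and sizes are illustrative)."""

    def __init__(self, obs_dim=8, feat_dim=4, n_actions=3):
        self.W_enc = rng.normal(size=(feat_dim, obs_dim))    # perception encoder
        self.W_pol = rng.normal(size=(n_actions, feat_dim))  # policy head

    def features(self, obs):
        # Perception stage: observation -> feature vector.
        return np.tanh(self.W_enc @ obs)

    def update_policy_only(self, obs, action, advantage, lr=0.01):
        # Selective learning: a policy-gradient step on the policy head
        # only; W_enc is deliberately left untouched (a "frozen" encoder).
        f = self.features(obs)
        z = self.W_pol @ f
        p = np.exp(z - z.max())
        p /= p.sum()  # softmax over actions
        onehot = np.eye(len(p))[action]
        # Gradient ascent on advantage * log p(action):
        self.W_pol += lr * advantage * (onehot - p)[:, None] * f[None, :]

model = SplitModel()
enc_before = model.W_enc.copy()
pol_before = model.W_pol.copy()
obs = rng.normal(size=8)
model.update_policy_only(obs, action=1, advantage=1.0)
```

After the update, `W_pol` has moved but `W_enc` is bit-identical, which is what makes the perception features reusable across simulators and tasks.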
“…Thus, we should have pursued a more effective real-to-sim transfer. However, as indicated by [24], the separation of advanced visual perception and motion planning is advantageous for domain adaptation. For real-time MAV motion planning, where motion blur is introduced, candidate applications are limited to non-agile and non-complex trajectories.…”
Section: Discussion
confidence: 99%
“…Thus, it is not appropriate to use existing sim-to-real datasets, simulators, or conventional methods directly for industrial applications in narrow or confined environments. Previous research [24] has addressed this gap between real-world-based simulators. That study revealed the effectiveness of splitting visual perception and motion control.…”
Section: Sim-to-Real Approaches
confidence: 99%
“…TD-A3C, I2A, and Gated-LSTM-A3C are all first trained via behavioral cloning on ground-truth paths. After pre-training, we update the three policy layers using a shaped reward based on the geodesic distance to the goal, geo(x, g), as described in (Gordon et al. 2019): r_t = geo(x_{t−1}, g) − geo(x_t, g) + ζ, where ζ = −0.01 is a small constant time penalty. More implementation details are provided in the supplemental material.…”
Section: Success Criteria
confidence: 99%
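The shaped reward quoted above can be written as a one-line function. `geo` here stands in for a simulator's geodesic-distance query; the function name is illustrative, but the formula and the ζ = −0.01 penalty come directly from the quoted setup:

```python
ZETA = -0.01  # constant time penalty from the quoted setup

def shaped_reward(geo_prev: float, geo_curr: float, zeta: float = ZETA) -> float:
    """r_t = geo(x_{t-1}, g) - geo(x_t, g) + zeta: positive when the
    agent reduces its geodesic distance to the goal, with a small
    per-step penalty that discourages dawdling."""
    return geo_prev - geo_curr + zeta

# Moving 0.25 units closer to the goal yields 0.25 + (-0.01) ≈ 0.24,
# while standing still costs the bare time penalty of -0.01.
r = shaped_reward(geo_prev=3.00, geo_curr=2.75)
```

The time penalty is what makes shorter paths strictly preferable: two trajectories that close the same total distance accumulate different penalties depending on step count.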