In this paper, we propose a reinforcement learning-based algorithm for trajectory optimization of constrained dynamical systems. The problem is motivated by the fact that the dynamics of most robotic systems are not always known, which makes generating smooth, dynamically feasible trajectories difficult. Sampling-based motion planning algorithms may produce trajectories prone to undesirable control jumps; however, they can usually provide a good reference trajectory that a model-free reinforcement learning algorithm can then exploit to limit the search domain and quickly find a dynamically smooth trajectory. We use this idea to train a reinforcement learning agent to learn a dynamically smooth trajectory in a curriculum learning setting. Furthermore, for generalization, we parameterize the policies with goal locations, so that the agent can be trained for multiple goals simultaneously. We present results in both simulated environments and real experiments, for a 6-DoF manipulator arm operated in position-controlled mode, to validate the proposed idea. We compare our approach against a PID controller used to track a designed trajectory in configuration space. Our experiments show that our RL agent trained with a reference path outperforms a model-free PID controller of the type commonly used on many robotic platforms for trajectory tracking.
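To illustrate the core idea of the abstract, the sketch below shows one plausible way a goal-parameterized observation and a reference-path-shaped reward could be set up. All names, weights, and shapes here are illustrative assumptions, not the paper's actual formulation.

```python
import numpy as np

def make_observation(joint_angles, goal):
    """Goal-parameterized observation: appending the goal to the state lets a
    single policy be trained for multiple goals simultaneously (assumption:
    simple concatenation; the paper may use a different conditioning scheme)."""
    return np.concatenate([joint_angles, goal])

def shaped_reward(joint_angles, reference_path, goal, w_path=1.0, w_goal=1.0):
    """Hypothetical reward that penalizes deviation from the nearest waypoint
    of a kinematic reference path, limiting the RL search domain to a tube
    around the planner's output, plus a goal-distance term."""
    dist_to_path = np.min(np.linalg.norm(reference_path - joint_angles, axis=1))
    dist_to_goal = np.linalg.norm(goal - joint_angles)
    return -(w_path * dist_to_path + w_goal * dist_to_goal)

# Example: a 6-DoF arm configuration with a straight-line stand-in for the
# waypoints a sampling-based planner (e.g., an RRT* variant) might return.
q = np.zeros(6)
goal = np.ones(6)
path = np.linspace(np.zeros(6), goal, num=20)

obs = make_observation(q, goal)   # 12-dimensional goal-conditioned observation
r = shaped_reward(q, path, goal)  # negative; zero only at the goal on the path
```

The reference path only shapes the reward; the agent remains free to deviate from it where the planner's kinematic waypoints are dynamically infeasible.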
Figure 1: Our proposed agent learns an end-to-end reactive planning technique by combining traditional path planning, supervised learning (SL), and reinforcement learning (RL) algorithms in a synergistic way. A deep CNN learns the sequence of waypoints obtained from a kinematic planning algorithm (e.g., a Bidirectional RRT*) given a depth image of the environment. The agent learns to follow arbitrary waypoints using path-conditioned RL, resulting in efficient exploration. We show that our trained agent achieves good sample efficiency and generalizes to novel environments, both in simulation and in the real world. The whole learning process is done in the simulator by learning a Real2Sim transfer function, making training efficient and suitable for robotic systems.