Sim–Real Mapping of an Image-Based Robot Arm Controller Using Deep Reinforcement Learning

Sasaki, Minoru; Muguro, Joseph; Kitano, Fumiya; Njeri, Waweru; Matsushita, Kojiro

doi:10.3390/app122010277

Cited by 5 publications

(5 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…[48,17,19,38] Não utilizou [3,27,24,35,45,20,42,18,47,14,49] Verifica-se também que a variedade de manipuladores robóticos utilizados é ampla, sendo o modelo UR3 da Universal Robots o mais utilizado entre estes em trabalhos com enfoque em: Tarefa peg-in-hole [6,4] e controle de brac ¸o duplo robótico [23]; seguido do PANDA [10,34], UR5 [31,22], RM-X52 [32,33] e IRB 1600 [1,2]. Além disso, 3 trabalhos fizeram o uso de manipuladores produzidos em laboratório, customizados ou com pec ¸as impressas em 3D, implementados em: Controle de articulac ¸ões robóticas [36], planejamento de movimento [46] e mapeamento de controlador de brac ¸o robótico [37]. Já 10,53% dos artigos não informaram o modelo de manipulador implementado no trabalho.…”

Section: A Manipuladores Robóticos E Simuladoresunclassified

“…Outra importante aplicac ¸ão de Q-Learning, em conjunto com DRL foi executada por [37] no mapeamento Sim-Real de um controlador de brac ¸o robótico baseado em imagem, onde foi feita uma comparac ¸ão entre um sistema de transferência de aprendizado convencional DRL e o método com mapeamento proposto. Concluiu-se que o sistema proposto obteve uma taxa de sucesso de 100% na tarefa de preensão, sendo superior ao sistema convencional que obteve entre 15 e 57% de taxa de sucesso a depender da posic ¸ão do objeto.…”

Section: B Técnicas De Ar Utilizadas Nos Trabalhos Analisadosunclassified

See 1 more Smart Citation

Aplicações de Aprendizado por Reforço em Manipuladores Robóticos: Uma Revisão Sistemática

Brito,

Carvalho Ottoni,

Ottoni

2023

Anais Do XVI Congresso Brasileiro De Inteligência Computacional

View full text Add to dashboard Cite

O Aprendizado por Reforço (AR) tem diversas aplicações em tecnologia como um método expoente de resolução de problemas otimizando ações por meio de recompensas. Existe uma gama de estudos aplicados à robótica com o objetivo de aprimorar o estado da arte nesta área. Assim, este trabalho visa discutir as aplicações de AR em manipuladores robóticos através de uma revisão sistemática da literatura, onde foram analisados 38 trabalhos da área publicados entre 2013 e 2023. Desta forma, foram elaboradas 6 perguntas de pesquisa para o desenvolvimento do trabalho. Baseado nestas perguntas, foi possível destacar os resultados da revisão sistemática. Entre as técnicas discutidas, Q-Learning (23,68%), Deep Reinforcement Learning (28,95%), Actor-Critic (26,32%) e Policy Gradient (21,05%) foram as principais. Entre os ambientes e equipamentos para experimentação física e simulada, os mais utilizados foram o simulador Matlab (18,42%) e o manipulador UR3 (7,89%). também foram apresentados resultados sobre o ajuste de hiperparâmetros, sendo que apenas 10,53% dos trabalhos realizaram o ajuste. Além disso, foi realizada uma comparação com outros trabalhos de revisão sistemática do tema proposto. Por fim, foram discutidas as perguntas de pesquisa deste trabalho e apresentadas as principais indicações de trabalhos futuros para promover a continuidade do desenvolvimento de aplicações na área.

show abstract

Section: A Manipuladores Robóticos E Simuladoresunclassified

Section: B Técnicas De Ar Utilizadas Nos Trabalhos Analisadosunclassified

Aplicações de Aprendizado por Reforço em Manipuladores Robóticos: Uma Revisão Sistemática

Brito,

Carvalho Ottoni,

Ottoni

2023

Anais Do XVI Congresso Brasileiro De Inteligência Computacional

View full text Add to dashboard Cite

show abstract

“…However, when implementing these learned policies on the actual machine, there is a possibility of encountering issues related to the motor's ability to follow the desired trajectories. Previous research has indicated that subtle differences between the numerical simulation environment and the real-world experimental environment can significantly impact the learning outcomes in a negative manner, what is referred to as sim-real challenges [18,38,39].…”

Section: Experiments On the Actual Flexible Manipulatormentioning

confidence: 99%

“…In a previous study, we proposed a search algorithm that employed convolutional neural networks to map real-world observations (images) to policy-equivalent images trained by RL in a simulated environment [18]. The system was trained in two steps, involving RL policy and a mapping model, which mitigated the challenges associated with sim-real transfer using solely simulated data.…”

Section: Introductionmentioning

confidence: 99%

Vibration and Position Control of a Two-Link Flexible Manipulator Using Reinforcement Learning

et al. 2023

Self Cite

View full text Add to dashboard Cite

In recent years, industries have increasingly emphasized the need for high-speed, energy-efficient, and cost-effective solutions. As a result, there has been growing interest in developing flexible link manipulator robots to meet these requirements. However, reducing the weight of the manipulator leads to increased flexibility which, in turn, causes vibrations. This research paper introduces a novel approach for controlling the vibration and motion of a two-link flexible manipulator using reinforcement learning. The proposed system utilizes trust region policy optimization to train the manipulator’s end effector to reach a desired target position, while minimizing vibration and strain at the root of the link. To achieve the research objectives, a 3D model of the flexible-link manipulator is designed, and an optimal reward function is identified to guide the learning process. The results demonstrate that the proposed approach successfully suppresses vibration and strain when moving the end effector to the target position. Furthermore, the trained model is applied to a physical flexible manipulator for real-world control verification. However, it is observed that the performance of the trained model does not meet expectations, due to simulation-to-real challenges. These challenges may include unanticipated differences in dynamics, calibration issues, actuator limitations, or other factors that affect the performance and behavior of the system in the real world. Therefore, further investigations and improvements are recommended to bridge this gap and enhance the applicability of the proposed approach.

show abstract

“…The RG refers to the discrepancy between the simulated environment and the real-world environment that the agent will ultimately be deployed in. This discrepancy can lead to inaccuracy in the model used in the simulation, which can affect the optimization of the policy learned by the agent [26].…”

Section: Introductionmentioning

confidence: 99%

Deep reinforcement learning based voltage control revisited

Nematshahi,

Shi,

Wang

et al. 2023

IET Generation Trans & Dist

View full text Add to dashboard Cite

Deep Reinforcement Learning (DRL) has shown promise for voltage control in power systems due to its speed and model‐free nature. However, learning optimal control policies through trial and error on a real grid is infeasible due to the mission‐critical nature of power systems. Instead, DRL agents are typically trained on a simulator, which may not accurately represent the real grid. This discrepancy can lead to suboptimal control policies and raises concerns for power system operators. In this paper, we revisit the problem of RL‐based voltage control and investigate how model inaccuracies affect the performance of the DRL agent. Extensive numerical experiments are conducted to quantify the impact of model inaccuracies on learning outcomes. Specifically, techniques that enable the DRL agent are focused on learning robust policies that can still perform well in the presence of model errors. Furthermore, the impact of the agent's decisions on the overall system loss are analyzed to provide additional insight into the control problem. This work aims to address the concerns of power system operators and make DRL‐based voltage control more practical and reliable.

show abstract

Sim–Real Mapping of an Image-Based Robot Arm Controller Using Deep Reinforcement Learning

Cited by 5 publications

References 30 publications

Aplicações de Aprendizado por Reforço em Manipuladores Robóticos: Uma Revisão Sistemática

Aplicações de Aprendizado por Reforço em Manipuladores Robóticos: Uma Revisão Sistemática

Vibration and Position Control of a Two-Link Flexible Manipulator Using Reinforcement Learning

Deep reinforcement learning based voltage control revisited

Contact Info

Product

Resources

About