2017
DOI: 10.1016/j.robot.2017.02.006

Benchmarking model-free and model-based optimal control

Abstract: Model-free reinforcement learning and nonlinear model predictive control are two different approaches for controlling a dynamic system in an optimal way according to a prescribed cost function. Reinforcement learning acquires a control policy through exploratory interaction with the system, while nonlinear model predictive control exploits an explicitly given mathematical model of the system. In this article, we provide a comprehensive comparison of the performance of reinforcement learning and nonlinear model…
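The distinction the abstract draws can be written down compactly. The following is a generic sketch, with notation and symbols assumed rather than taken from the article: nonlinear model predictive control repeatedly solves a finite-horizon optimal control problem over an explicit model f, applying only the first input of each solution, whereas model-free reinforcement learning searches for a policy that maximizes an expected discounted return estimated from interaction data.

\[
\text{NMPC:}\quad \min_{u_0,\dots,u_{N-1}} \; \sum_{k=0}^{N-1} \ell(x_k, u_k) + V_f(x_N) \quad \text{s.t.}\ \; x_{k+1} = f(x_k, u_k),\ \; x_0 = x(t)
\]
\[
\text{Model-free RL:}\quad \max_{\theta} \; \mathbb{E}_{\pi_\theta}\!\Big[\sum_{k=0}^{\infty} \gamma^{k}\, r(x_k, u_k)\Big]
\]

In a benchmark of this kind the two objectives are typically aligned by choosing the reward as the negative of the stage cost, r = -\ell, so that both controllers are judged against the same criterion.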

Cited by 18 publications (12 citation statements) · References 38 publications
“…The longer prediction horizon used by CAL γ = 0.99 attains a better performance. Another problem of CAL is the slow convergence which is probably caused by the fact that the reward constructed from the quadratic objective function of the nominal controller results in small gradients [2]. This hypothesis is supported by the fact that DPG with a quadratic cost function learns the task extremely slowly.…”
Section: Discussion (mentioning)
confidence: 99%
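The small-gradient argument in the excerpt above can be illustrated with a generic quadratic reward (a sketch under assumed notation, not taken from the cited papers):

\[
r(x, u) = -\,x^{\top} Q\, x - u^{\top} R\, u, \qquad \nabla_{x}\, r(x, u) = -2 Q x \;\rightarrow\; 0 \ \text{as}\ x \rightarrow 0,
\]

so the reward surface flattens near the target state and gradient-based policy updates, as in DPG, receive a progressively weaker learning signal, which is consistent with the slow convergence reported in the excerpt.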
“…Usually, the dynamics of physical systems are known, but various uncertainties do not allow achieving optimal performance with model-based control methods [2]. Whereas for the estimation of parametric uncertainties moving horizon estimation techniques [3] can often be employed, for structural uncertainties, such as backlash, Coulomb friction or wear and tear, this is not easily possible.…”
Section: Introduction (mentioning)
confidence: 99%
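Moving horizon estimation, mentioned in the excerpt above as a way to handle parametric uncertainties, can be sketched as a least-squares problem over the most recent N measurements (a generic formulation with assumed symbols, not the cited papers' notation):

\[
\min_{\hat{x}_{t-N},\,\hat{p},\,w}\ \big\|\hat{x}_{t-N} - \bar{x}_{t-N}\big\|_{P^{-1}}^{2} + \sum_{k=t-N}^{t} \big\|y_k - h(\hat{x}_k)\big\|_{R^{-1}}^{2} + \sum_{k=t-N}^{t-1} \big\|w_k\big\|_{Q^{-1}}^{2} \quad \text{s.t.}\ \; \hat{x}_{k+1} = f(\hat{x}_k, u_k, \hat{p}) + w_k,
\]

where the first (arrival-cost) term summarizes data discarded before the horizon, and the estimated parameters \(\hat{p}\) can be fed back into the model used by the controller. Structural effects such as backlash or Coulomb friction do not enter f through a small parameter vector, which is why the excerpt notes that this route is not easily available for them.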
“…Then, a policy-based reinforcement-learning feedback controller can be used to learn the nonlinear model of a 4D-printed soft robot through experiments and simulation data to compensate for the uncertainties. The self-learning algorithms play a significant role in adaptive 4D-printed systems to optimise the controller commands based on the information acquired from the interaction with the environment via the 3D-printed sensors [138][139][140][141][142].…”
Section: Adaptive 4D-Printed Systems Design (mentioning)
confidence: 99%
“…Nowadays, most robot manipulators need to possess the capacity of accurate and fast trajectory tracking. Trajectory tracking control is a key issue in the field of robot manipulator motion planning [1][2][3]. It aims to enable the joints or links of the robot manipulator to track the desired trajectory with ideal dynamic quality or to stabilize them in the specified position [4].…”
Section: Introduction (mentioning)
confidence: 99%