2021 European Control Conference (ECC)
DOI: 10.23919/ecc54610.2021.9655042
Weights-varying MPC for Autonomous Vehicle Guidance: a Deep Reinforcement Learning Approach

Abstract: Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto-optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver optimal closed-loop control performance when the context of the MPC operating conditions changes during its operation, urging the need to adapt…
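To make the idea concrete, below is a minimal sketch of a weights-varying MPC loop in which a learned policy selects the cost-function weights at each control cycle. It is an illustration under simplifying assumptions (a linear double-integrator surrogate for the lateral dynamics and a finite-horizon LQR solved by Riccati recursion); `weight_policy` is a hypothetical placeholder for the DRL agent described in the paper, not its actual implementation.

```python
import numpy as np

def finite_horizon_lqr(A, B, Q, R, N):
    """Backward Riccati recursion; returns the feedback gain for the first step."""
    P = Q.copy()
    K = np.zeros((B.shape[1], A.shape[0]))
    for _ in range(N):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return K

def weight_policy(curvature):
    """Hypothetical stand-in for the learned DRL policy: maps a context feature
    (here, path curvature) to diagonal state and input weights."""
    q_lat = 1.0 + 10.0 * abs(curvature)      # penalize lateral error more in curves
    return np.diag([q_lat, 1.0]), np.diag([0.1])

# Linear double-integrator surrogate for lateral error dynamics (an assumption,
# not the vehicle model used in the paper).
dt = 0.05
A = np.array([[1.0, dt], [0.0, 1.0]])
B = np.array([[0.0], [dt]])

x = np.array([0.5, 0.0])                     # lateral error and its rate
for step in range(200):
    curvature = 0.02 * np.sin(0.1 * step)    # toy context signal
    Q, R = weight_policy(curvature)          # weights vary with the context
    K = finite_horizon_lqr(A, B, Q, R, N=20)
    u = -K @ x                               # apply only the first move (receding horizon)
    x = A @ x + B @ u
```

The point this sketch illustrates is that the receding-horizon optimization itself stays unchanged; only its weights are re-parameterized from step to step by the learned policy.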

Cited by 14 publications (5 citation statements). References 50 publications.
“…Comparatively, Reinforcement Learning (RL) can modify MPC parameters at each time step to quickly adapt to dynamic environments. Zarrouki et al (2021) and Song and Scaramuzza (2020) both learn a policy that improves the parameters of the MPC's cost function, while Gros and Zanon (2020) and Amos et al (2018) aim to modify both the transition and cost function parameters. Nevertheless, none of these methods consider the non-stationary scenario in an autonomous driving system, which violates the essential Markov property in RL.…”
Section: Related Work (mentioning)
confidence: 99%
“…In [16], a neural network model predictive control (NNMPC) is proposed, which uses a neural network (NN) to learn and predict vehicle dynamics based on measured states and input variables. The concept of using Reinforcement Learning (RL) to learn MPC cost function parameters is introduced in [17], and [18] proposes a weights-varying MPC based on deep reinforcement learning to adjust cost function weights in different situations. The prediction horizon refers to the time range used to predict the system's future behavior during the control process, and is a key parameter affecting the performance and computational burden of the control system in MPC [19].…”
Section: Introduction (mentioning)
confidence: 99%
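For reference, the textbook finite-horizon MPC objective makes these quantities concrete; this generic quadratic form is stated for illustration and is not necessarily the exact cost used in [18] or [19]:

```latex
\min_{u_0,\dots,u_{N-1}} \; \sum_{k=0}^{N-1}\left( x_k^\top Q\, x_k + u_k^\top R\, u_k \right) + x_N^\top Q_N\, x_N
```

Here N is the prediction horizon (a longer horizon improves foresight but raises the per-step computational burden), and the weighting matrices Q, R, and Q_N are the cost-function parameters that a weights-varying MPC adapts online.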
“…Ref. [20] proposes a weights-varying MPC using a deep reinforcement learning (DRL) algorithm to adjust cost function weights in different situations. Ref.…”
Section: Introduction (mentioning)
confidence: 99%
“…Ref. [21] proposes a novel approach that limits DRL actions to a safe learning space; the proposed DRL algorithm can automatically learn context-dependent optimal parameter sets and dynamically adapt them for a weights-varying MPC. Ref.…”
Section: Introduction (mentioning)
confidence: 99%
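As a rough illustration of what constraining DRL actions to a safe learning space can look like, the snippet below maps a raw policy output onto box bounds around a nominal weight tuning. The bounds, names, and log-space mapping are illustrative assumptions, not the exact mechanism of [21].

```python
import numpy as np

W_NOMINAL = np.array([10.0, 1.0, 0.1])   # nominal cost weights (hypothetical values)
W_MIN = 0.1 * W_NOMINAL                  # lower safe bound
W_MAX = 10.0 * W_NOMINAL                 # upper safe bound

def safe_weights(raw_action):
    """Map a raw policy output in [-1, 1]^n onto the safe weight box."""
    a = np.clip(raw_action, -1.0, 1.0)
    # Interpolating in log space keeps the weights positive and covers the
    # box smoothly between W_MIN and W_MAX.
    log_w = np.log(W_MIN) + 0.5 * (a + 1.0) * (np.log(W_MAX) - np.log(W_MIN))
    return np.exp(log_w)

print(safe_weights(np.array([0.0, 1.0, -1.0])))   # nominal, upper, and lower bound
```

Because the policy can only move the weights inside this box, any action it explores still yields a well-posed MPC problem, which is the practical motivation for restricting the learning space in the first place.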