A Mental Simulation Approach for Learning Neural-Network Predictive Control (in Self-Driving Cars)

Lio, Mauro Da; Donà, Riccardo; Papini, Gastone Pietro Rosati; Biral, Francesco; Svensson, Henrik

doi:10.1109/access.2020.3032780

Cited by 21 publications

(17 citation statements)

References 46 publications

(90 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For practical reasons, we reuse the self-driving agent of the Dreams4Cars project. In this work, we describe the novel interaction mechanics and the enabling elements (Section III), but not a comprehensive description of the rest of the agent, which was published in [3] (agent architecture), [4] (offline learning via mental simulations) and [5] (learning cautious behaviors).…”

Section: A What This Paper Is (And Is Not) Aboutmentioning

confidence: 99%

“…The dynamics of a vehicle are in part stochastic because of external disturbances: an action u = {j(t), r(t)} may generate a family of trajectories γ. The stochastic vehicle response {j(t), r(t)} → γ is specified by a probabilistic motion model (in our case, probabilistic motion models were learned with a technique similar to [4]). We hence begin with a mapping u → γ from a generic action u = {j(t), r(t)} to the distribution of generated trajectories γ.…”

Section: B Action Primingmentioning

confidence: 99%

See 1 more Smart Citation

The Biasing of Action Selection Produces Emergent Human-Robot Interactions in Autonomous Driving

Lio

Donà

Papini

et al. 2022

IEEE Robot. Autom. Lett.

Self Cite

View full text Add to dashboard Cite

This paper describes a means to produce emergent collaboration between a human driver and an artificial codriver agent. The work exploits the hypothesis that humanhuman cooperation emerges from a shared understanding of the given context's affordances and emulates the same principle: the observation of one agent's behavior steers another agent's decision-making by favoring the selection of the goals that would produce the observed activity. Specifically, we describe how to steer the decision-making of a special self-driving agent via weighting the agent's action selection process with input from a dummy human driving activity. In this way, human input maps onto the safe and affordable actions recognized by the agent. We demonstrate an emergent and efficient driving, collaboration, and rejection of unsafe human requests.

show abstract

Section: A What This Paper Is (And Is Not) Aboutmentioning

confidence: 99%

Section: B Action Primingmentioning

confidence: 99%

The Biasing of Action Selection Produces Emergent Human-Robot Interactions in Autonomous Driving

Lio

Donà

Papini

et al. 2022

IEEE Robot. Autom. Lett.

Self Cite

View full text Add to dashboard Cite

show abstract

“…In our work, the driver agent is [2], but other realizations may also work, and we release an open access implementation of this work. The agent can drive [3], i.e., it is capable of high-level motor planning and low-level control. Specifically, it predicts the other road users' (pedestrians) trajectories with a mirroring mechanism (see also [4,Section IV.A and Section V.A]) and maneuvers accordingly to avoid collisions.…”

Section: B Driver Agentmentioning

confidence: 99%

“…1) when the RB common and RB rare files are ready to be read, i.e. the DQL middleware is not writing on the files, the DQL core loads the buffers and creates a batch of training data by randomly sampling the two buffers, taking only 5% of the data from the RB rare; 2) the DQL core updates the weights of Q andQ in equation ( 4) using, respectively, the ADAM optimization algorithm and the Polyak averaging (5), and it stores them in the network file; 3) the DQL core updates the value of ε using the rule in equation (3), and it stores it in the network file; 4) the training stops if it reaches the maximum number of epochs, otherwise it restarts from step 1). The simulation process produces the datasets (the pseudo-code of the algorithm is presented in the supplementary materials Alg.…”

Section: A the Training Proceduresmentioning

confidence: 99%

A Reinforcement Learning Approach for Enacting Cautious Behaviours in Autonomous Driving System: Safe Speed Choice in the Interaction With Distracted Pedestrians

Papini

Plebe

Lio

et al. 2022

IEEE Trans. Intell. Transport. Syst.

Self Cite

View full text Add to dashboard Cite

Driving requires the ability to handle unpredictable situations. Since it is not always possible to predict an impending danger, a good driver should preventively assess whether a situation has risks and adopt a safe behavior. Considering, in particular, the possibility of a pedestrian suddenly crossing the road, a prudent driver should limit the traveling speed. We present a work exploiting reinforcement learning to learn a function that specifies the safe speed limit for a given artificial driver agent. The safe speed function acts as a behavioral directive for the agent, thus extending its cognitive abilities. We consider scenarios where the vehicle interacts with a distracted pedestrian that might cross the road in hard-to-predict ways and propose a neural network mapping the pedestrian's context onto the appropriate traveling speed so that the autonomous vehicle can successfully perform emergency braking maneuvers. We discuss the advantages of developing a specialized neural network extension on top of an already functioning autonomous driving system, removing the burden of learning to drive from scratch while focusing on learning safe behavior at a highlevel. We demonstrate how the safe speed function can be learned in simulation and then transferred into a real vehicle. We include a statistical analysis of the network's improvements compared to the original autonomous driving system.

show abstract

“…Most of these studies are based on typical mathematical and control modeling algorithms to ensure smooth car-following such that an autonomous vehicle, defined as the follower, keeps following another vehicle, defined as the leader, while maintaining safety distances [9]- [11]. Recently, few studies have promoted the use of Artificial Intelligence in designing car-following models [12], [13]. Most of them resorted to use Reinforcement Learning (RL) methods to determine navigation decisions for the follower vehicle and hence, design their car-following models based on numerical inputs of the vehicle dynamics, e.g., the lateral position, the speed, and the yaw angle.…”

Section: Introductionmentioning

confidence: 99%

A Reinforcement Learning Framework for Video Frame-Based Autonomous Car-Following

Masmoudi

Friji

Ghazzai

et al. 2021

IEEE Open J. Intell. Transp. Syst.

View full text Add to dashboard Cite

Car-following theory has received considerable attention as a core component of Intelligent Transportation Systems. However, its application to the emerging autonomous vehicles (AVs) remains an unexplored research area. AVs are designed to provide convenient and safe driving by avoiding accidents caused by human errors. They require advanced levels of recognition of other drivers' driving-style. With car-following models, AVs can use their built-in technology to understand the environment surrounding them and make real-time decisions to follow other vehicles. In this paper, we design an end-to-end carfollowing framework for AVs using automated object detection and navigation decision modules. The objective is to allow an AV to follow another vehicle based on Red Green Blue Depth (RGB-D) frames. We propose to employ a joint solution involving the You Look Once version 3 (YOLOv3) object detector to identify the leader vehicle and other obstacles and a reinforcement learning (RL) algorithm to navigate the self-driving vehicle. Two RL algorithms, namely Q-learning and Deep Q-learning have been investigated. Simulation results show the convergence of the developed models and investigate their efficiency in following the leader. It is shown that, with video frames only, promising results are achieved and that AVs can adopt a reasonable car-following behavior.

show abstract

A Mental Simulation Approach for Learning Neural-Network Predictive Control (in Self-Driving Cars)

Cited by 21 publications

References 46 publications

The Biasing of Action Selection Produces Emergent Human-Robot Interactions in Autonomous Driving

The Biasing of Action Selection Produces Emergent Human-Robot Interactions in Autonomous Driving

A Reinforcement Learning Approach for Enacting Cautious Behaviours in Autonomous Driving System: Safe Speed Choice in the Interaction With Distracted Pedestrians

A Reinforcement Learning Framework for Video Frame-Based Autonomous Car-Following

Contact Info

Product

Resources

About