2020 IEEE Intelligent Vehicles Symposium (IV)
DOI: 10.1109/iv47402.2020.9304593

From Simulation to Real World Maneuver Execution using Deep Reinforcement Learning

Abstract: Deep Reinforcement Learning has proved able to solve many control tasks in different fields, but the behavior of these systems is not always as expected when deployed in real-world scenarios. This is mainly due to the lack of domain adaptation between simulated and real-world data, together with the absence of a distinction between training and test datasets. In this work, we investigate these problems in the autonomous driving field, especially for a maneuver planning module for roundabout insertions. In part…

Cited by 7 publications (6 citation statements)
References 16 publications
“…This challenge was also identified by [25] and [26] regarding Digital Twin implementations. Despite the identified challenges, the implementation of a modular Digital Twin architecture in the context of mobile robotics can bring benefits in very different scenarios, such as: faster problem detection, thanks to the capabilities of the technologies used in the Digital Twin's services [27]; fail-safe sensing for cases where robot hardware devices fail [28], thereby creating redundancy of systems and components; multi-AGV management [29,30], when there is a need to manage and optimize the routes of multiple robots [31]; and a digital-first approach that allows testing or calibrating [32,33] different hardware and robot designs, or even validating technologies before creating the physical asset [34]. These potential benefits are described in Table 1.…”
Section: Digital Twin Model Firstmentioning
confidence: 99%
“…Paolo Capasso, Giulio Bacchiani, and Alberto Broggi are with Vislab srl, an Ambarella Inc. company, Parma, Italy (acapasso@ambarella.com, gbacchiani@ambarella.com, broggi@vislab.it). In particular, RL algorithms are widely used in the autonomous driving field for the development of decision-making and maneuver execution systems such as lane changing ([11], [12], [13]), lane keeping ([14], [15]), overtaking maneuvers [16], intersection and roundabout handling ([17], [18]), and many others. Starting from the delayed version of the Asynchronous Advantage Actor-Critic (A3C) algorithm ([19], [20], [21]), we implemented a Reinforcement Learning planner, training agents in a simulator based on High-Definition Maps (HD Maps [22]) developed internally by the research team. In particular, we trained the model to predict continuous actions related to acceleration and steering angle, and tested it on board a real self-driving car over an entire urban area of the city of Parma (Fig.…”
Section: Alessandromentioning
confidence: 99%
“…In this paper we used a delayed version of the original A3C [19] called Delayed-A3C (D-A3C). This algorithm was previously developed and used in [20] and [21], where it was shown to achieve better results than A3C. In the D-A3C configuration, each agent begins the episode with a local copy of the latest version of the global network, while the system collects the contributions of all the actors; each agent updates its local copy of the network at fixed time intervals, but all the updates are sent to the global network only at the end of the episode, whereas in the classical A3C algorithm this exchange is performed at fixed time intervals.…”
Section: Related Workmentioning
confidence: 99%
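The D-A3C update scheme described in the statement above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the toy linear "network", the class names, and the update values are assumptions; only the synchronization pattern (local refresh at fixed intervals, global push at episode end) comes from the text.

```python
import numpy as np

class GlobalNetwork:
    """Shared parameters; in D-A3C it is touched only at episode boundaries."""
    def __init__(self, n_params):
        self.params = np.zeros(n_params)

    def apply(self, accumulated_update):
        # Called once per finished episode in D-A3C
        # (classic A3C would call this at fixed intervals instead).
        self.params += accumulated_update


class Worker:
    """One asynchronous actor with its own local copy of the network."""
    def __init__(self, global_net, update_interval=4):
        self.global_net = global_net
        self.update_interval = update_interval

    def run_episode(self, episode_updates):
        # Begin the episode with the latest global parameters.
        local_params = self.global_net.params.copy()
        accumulated = np.zeros_like(local_params)
        for t, update in enumerate(episode_updates, start=1):
            accumulated += update
            if t % self.update_interval == 0:
                # The local copy is refreshed at fixed time intervals...
                local_params = self.global_net.params + accumulated
        # ...but the global network receives the updates only at episode end.
        self.global_net.apply(accumulated)
        return local_params


g = GlobalNetwork(n_params=3)
w = Worker(g)
w.run_episode([np.ones(3) * 0.1 for _ in range(8)])
print(g.params)  # each parameter advanced by 8 * 0.1 = 0.8
```

With several `Worker` instances running in parallel threads or processes, each episode's accumulated update would land on the global network independently, which is the asynchronous part of the algorithm.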
“…Logistics Research [38]
Science Robotics [39]
International Symposium on Experimental Robotics (ISER) [41]
IEEE/CVF International Conference on Computer Vision (ICCV) [43]
IEEE Transactions on Vehicular Technology [44]
IEEE Robotics and Automation Letters [45]
IEEE Intelligent Vehicles Symposium [46]
International Conference on Informatics in Control, Automation and Robotics [47]
European Conference on Machine Learning [48]
International Joint Conference on Artificial Intelligence [49]
International Conference on Unsupervised and Transfer Learning workshop [50]
International Conference on Neural Information Processing Systems [51]
International Conference on Learning Representations (ICLR) [52,54,55]
International Conference on Machine Learning [56,57]
Journal of Machine Learning Research [58,72]
Springer [59]
Nature [60]
Stanford University AI Lab [62]
ACM Transactions on Intelligent Systems and Technology [66]
IEEE Transactions on Pattern Analysis and Machine Intelligence [67]
AAAI Publications, 2016 AAAI Spring Symposium Series [70]
IEEE International Conference on Data Mining Workshops (ICDMW) [71]
Robotics and Autonomous Systems [73]
Artificial Intelligence [74]
Sensors [78]
Synthesis Lectures on Artificial Intelligence and Machine Learning [79]…”
Section: Publication Channel Papersmentioning
confidence: 99%
“…Additionally, it allows a reduction in the size of the datasets, which in turn reduces the computational requirements. Examples are autonomous MAV flight [41], motion planning [29], domain adaptation for improved robot grasping [30,31,35,45], multi-robot transfer learning [24,32,53,65,66], mobile fulfilment systems [38], and autonomous driving [42][43][44][45][46][47]. The above-mentioned papers use virtual training environments to generate synthetic training data, train a model in the virtual environment, and then use transfer learning techniques to transfer the knowledge to real-world platforms.…”
Section: What Are the Use Cases Of Transfer Learning In The Virtual To Real-world Context?mentioning
confidence: 99%
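The sim-to-real pattern the surveyed papers share (pretrain on plentiful synthetic data, then adapt to scarce real data) can be sketched in miniature. The linear model, the data, and the weight values below are illustrative assumptions, not any specific paper's method; they only show why starting from simulator-trained weights helps when real samples are few.

```python
import numpy as np

rng = np.random.default_rng(0)

def fit_linear(X, y, w_init, lr=0.01, epochs=200):
    """Plain gradient descent on mean-squared error."""
    w = w_init.copy()
    for _ in range(epochs):
        grad = X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w

# "Simulation": abundant synthetic data from an approximate dynamics model.
X_sim = rng.normal(size=(1000, 3))
y_sim = X_sim @ np.array([1.0, -2.0, 0.5])

# "Real world": only a few samples, from slightly shifted dynamics.
X_real = rng.normal(size=(20, 3))
y_real = X_real @ np.array([1.1, -1.9, 0.6])

w_pre = fit_linear(X_sim, y_sim, w_init=np.zeros(3))          # pretrain in sim
w_ft = fit_linear(X_real, y_real, w_init=w_pre, epochs=50)    # fine-tune on real

print(np.round(w_ft, 2))
```

Because the fine-tuning stage starts from the simulator-trained weights instead of from scratch, a short run on the small real dataset already moves the model toward the real-world dynamics, which is the data- and compute-saving effect the statement describes.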