2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS) 2018
DOI: 10.1109/icsess.2018.8663794
Orthogonal Policy Gradient and Autonomous Driving Application

Abstract: A less-addressed issue in deep reinforcement learning is the lack of generalization to new states and new targets. For complex tasks, the agent must both produce a correct strategy and evaluate all possible actions in the current state. Fortunately, deep reinforcement learning has enabled enormous progress on both subproblems: producing the correct strategy and evaluating all actions based on the state. In this paper we present an approach called orthogonal policy gradient descent (OPGD) that can make a…
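The abstract is truncated before the OPGD details are given, so the sketch below shows only the standard policy gradient (REINFORCE) update that such methods build on: a softmax policy over actions whose logits are nudged in the direction of the log-probability gradient, scaled by the observed reward. The two-armed bandit environment, the learning rate, and all variable names here are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over action logits.
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()

# Toy two-armed bandit (illustrative): action 1 pays 1.0, action 0 pays 0.1.
rewards = np.array([0.1, 1.0])

rng = np.random.default_rng(0)
theta = np.zeros(2)   # one logit per action
alpha = 0.5           # learning rate (assumed, not from the paper)

for _ in range(500):
    probs = softmax(theta)
    a = rng.choice(2, p=probs)
    r = rewards[a]
    # REINFORCE: grad of log pi(a) w.r.t. logits is one_hot(a) - probs.
    grad = -probs
    grad[a] += 1.0
    theta += alpha * r * grad

probs = softmax(theta)
print(probs)  # the policy should strongly prefer the higher-paying action 1
```

After training, nearly all probability mass sits on the higher-reward action; OPGD, per the abstract, modifies how this gradient step is taken, but those specifics are not recoverable from the truncated text.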

Cited by 2 publications (2 citation statements)
References 7 publications
“…We do not aim to provide a full in-depth survey of the entire field and only note that despite its long history TORCS is being actively used for research purposes up to this day. In particular, Sallab et al [516,517] use it in their deep reinforcement learning frameworks for lane keeping assist and autonomous driving, Xiong et al [661] add safety-based control on top of deep RL, Wang et al [626] train a deep RL agent for autonomous driving in TORCS, Barati et al [37] use it to add multi-view inputs for deep RL agents, Li et al [352] develop Visual TORCS, a deep RL environment based on TORCS, Ando, Lubashevsky et al [20,381] use TORCS to study the statistical properties of human driving, Glassner et al [202] shift the emphasis to trajectory learning, Luo et al [383] use TORCS as the main test environment for a new variation of the policy gradient algorithm, Liu et al [369] make use of the multimodal sensors available in TORCS for end-to-end learning, Xu et al [576] train a segmentation network and feed segmentation results to the RL agent in order to unify synthetic imagery from TORCS and real data, and so on. In an interesting recent work, Choi et al [114] consider the driving experience transfer problem but consider a transfer not from a synthetic simulator to the real domain but from one simulator (TORCS) to another (GTA V).…”
Section: Urban and Outdoor Environments: Learning To Drivementioning
confidence: 99%
“…In recent years, artificial intelligence technology applied to autonomous driving has developed rapidly, especially reinforcement learning technology [4,5,6,7,8]. The first control example based on reinforcement learning (RL) was inspired by the concept of ALVINN [9].…”
Section: Introductionmentioning
confidence: 99%