2021 IEEE International Intelligent Transportation Systems Conference (ITSC) 2021
DOI: 10.1109/itsc48978.2021.9564464
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Objective End-to-End Self-Driving Based on Pareto-Optimal Actor-Critic Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(7 citation statements)
references
References 12 publications
1
3
0
Order By: Relevance
“…( 4). This intuition is in line with the Pareto optimality discussed in [9], which prescribes to update only when the gradient ascent directions (advantage functions) corresponding to all objectives are the same. Updating in the same gradient ascent direction will discover new undominated points on the Pareto front.…”
Section: B Deep Morlsupporting
confidence: 55%
See 3 more Smart Citations
“…( 4). This intuition is in line with the Pareto optimality discussed in [9], which prescribes to update only when the gradient ascent directions (advantage functions) corresponding to all objectives are the same. Updating in the same gradient ascent direction will discover new undominated points on the Pareto front.…”
Section: B Deep Morlsupporting
confidence: 55%
“…A2C uses a function called the advantage function for policy update to address the high variance problem of its predecessor, the REINFORCE algorithm [10]. We propose a multi-objective A2C algorithm for the considered MORL problem, following the Pareto optimality approach [9]. Fig.…”
Section: B Deep Morlmentioning
confidence: 99%
See 2 more Smart Citations
“…Some researches have explored novel methods to achieve a balance between two conflicting optimization objectives. For example, Reymond et al [34] proposed the Pareto-DQN algorithm to estimate the Pareto front with a high-dimensional state-space and could obtain the ap-proximately real Pareto front. Wang et al [35] proposed the Pareto-optimal actor-critic method to obtain optimal policies by optimizing the coupling objectives, which was not affected by the concavity and convexity of the Pareto front.…”
Section: Uavs Deployment and Chargingmentioning
confidence: 99%