2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561450
|View full text |Cite
|
Sign up to set email alerts
|

Learning robust driving policies without online exploration

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(11 citation statements)
references
References 35 publications
0
11
0
Order By: Relevance
“…In practice, multiple such concerns may simultaneously matter about an option and multiple options may be simultaneously evaluated under the same concern. These can be easily accommodated by introducing the corresponding cumulants and options and, where appropriate, allow the various V s and Q s to share underlying computational implementation (Graves et al, 2020b; Sherstan & Pilarski, 2014; Ugur et al, 2007).…”
Section: Affordances As Gvfsmentioning
confidence: 99%
See 4 more Smart Citations
“…In practice, multiple such concerns may simultaneously matter about an option and multiple options may be simultaneously evaluated under the same concern. These can be easily accommodated by introducing the corresponding cumulants and options and, where appropriate, allow the various V s and Q s to share underlying computational implementation (Graves et al, 2020b; Sherstan & Pilarski, 2014; Ugur et al, 2007).…”
Section: Affordances As Gvfsmentioning
confidence: 99%
“…We see evidence of this in learning affordances that relate to success/failure of bin picking (Zeng et al, 2018), traversability (Ugur et al, 2007), and success/failure of mobile manipulation (Wu et al, 2020) where thousands of predictions structured spatially in a grid pattern are learned. Second, when combined with DL, multiple predictions can be made with the same neural network by sharing earlier layers across different prediction heads, allowing for potentially sublinear scaling in the number of predictions (Graves et al, 2020b; Sherstan & Pilarski, 2014; Zeng et al, 2018). Third, as noted above, for a single prediction, the direct perception architecture operates in constant time regardless of how distant a future the prediction needs to be concerned about.…”
Section: Affordances As Gvfsmentioning
confidence: 99%
See 3 more Smart Citations