Proceedings of the 25th International Conference on Machine Learning - ICML '08 2008
DOI: 10.1145/1390156.1390218
|View full text |Cite
|
Sign up to set email alerts
|

Space-indexed dynamic programming

Abstract: We consider the task of learning to accurately follow a trajectory in a vehicle such as a car or helicopter. A number of dynamic programming algorithms such as Differential Dynamic Programming (DDP) and Policy Search by Dynamic Programming (PSDP), can efficiently compute non-stationary policies for these tasks -such policies in general are well-suited to trajectory following since they can easily generate different control actions at different times in order to follow the trajectory. However, a weakness of the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 14 publications
(1 citation statement)
references
References 14 publications
(11 reference statements)
0
1
0
Order By: Relevance
“…Commonly used loss functions are the mean square error or the cross entropy [24]. The Hessian is defined as H ij = ∂ θ i θ j L and has a wide range of applications in ML: it can be used to adapt gradient update to the current loss landscape in the so called 'Newton' method [25], for pruning [26,27] or for interpretability purposes with the influence function [28]. Furthermore, it can also be used to study the local curvature of the loss for a better understanding of the loss landscape and the convergence of NNs.…”
Section: Loss Landscape Of Nns: a Brief Reviewmentioning
confidence: 99%
“…Commonly used loss functions are the mean square error or the cross entropy [24]. The Hessian is defined as H ij = ∂ θ i θ j L and has a wide range of applications in ML: it can be used to adapt gradient update to the current loss landscape in the so called 'Newton' method [25], for pruning [26,27] or for interpretability purposes with the influence function [28]. Furthermore, it can also be used to study the local curvature of the loss for a better understanding of the loss landscape and the convergence of NNs.…”
Section: Loss Landscape Of Nns: a Brief Reviewmentioning
confidence: 99%