2015 European Control Conference (ECC) 2015
DOI: 10.1109/ecc.2015.7330911
|View full text |Cite
|
Sign up to set email alerts
|

Online learning as an LQG optimal control problem with random matrices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2017
2017
2017
2017

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(5 citation statements)
references
References 6 publications
0
5
0
Order By: Relevance
“…No periodic re-initialization to Σ w of any of the matrices Σ k was performed. of the matrices Σ k , the Frobenius norm 19 of the Kalman gain matrix H k is expected to be small for k large (see formulas (24) and ( 47)), which is confirmed by Figure 8. Hence, even though the norm of the error y k − C k ŵ † k tends to increase when the parameter vector changes, the KF estimate of w at time k is not affected so much by this change (see formula (23)), hence also the OLL estimate does not change so much (see formula (22)).…”
Section: Remark 15mentioning
confidence: 52%
See 4 more Smart Citations
“…No periodic re-initialization to Σ w of any of the matrices Σ k was performed. of the matrices Σ k , the Frobenius norm 19 of the Kalman gain matrix H k is expected to be small for k large (see formulas (24) and ( 47)), which is confirmed by Figure 8. Hence, even though the norm of the error y k − C k ŵ † k tends to increase when the parameter vector changes, the KF estimate of w at time k is not affected so much by this change (see formula (23)), hence also the OLL estimate does not change so much (see formula (22)).…”
Section: Remark 15mentioning
confidence: 52%
“…Online learning problems have been investigated, e.g., in [33,38,43,44,51,52], but without using an approach based on optimal control theory. As suggested by the preliminary results that we obtained in [24], such an approach can provide a strong theoretical foundation to the choice of a specific online learning algorithm, by selecting the parameter updates as the outputs of a sequence of control laws that solve a suitable optimal control problem modeling online learning itself 1 . A distinguishing feature of our study is that we derive online learning algorithms as closed-form optimal solutions to suitable online learning problems.…”
Section: Application Of Machine-learning Techniques To Optimization/o...mentioning
confidence: 99%
See 3 more Smart Citations