2019
DOI: 10.48550/arxiv.1911.01641
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

New Potential-Based Bounds for Prediction with Expert Advice

Vladimir A. Kobzar,
Robert V. Kohn,
Zhilei Wang

Abstract: This work addresses the classic machine learning problem of online prediction with expert advice. We consider the finite-horizon version of this zero-sum, two-person game.Using verification arguments from optimal control theory, we view the task of finding better lower and upper bounds on the value of the game (regret) as the problem of finding better sub-and supersolutions of certain partial differential equations (PDEs). These sub-and supersolutions serve as the potentials for player and adversary strategies… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 10 publications
(16 reference statements)
0
2
0
Order By: Relevance
“…Following this setting, for the case of N = 4 experts in the geometric horizon setting, Bayraktar, Ekren and Zhang [4] showed that the comb strategy is asymptotically optimal by explicitly solving the corresponding nonlinear PDE. And very recently in [14], Kobzar, Kohn and Wang found lower and upper bounds for the optimal regret for finite stopping problem by constructing certain sub-and supersolutions of (1.1) following the method of [16]. Their results are only tight for N = 3 and improved those of [1].…”
Section: Introductionmentioning
confidence: 99%
“…Following this setting, for the case of N = 4 experts in the geometric horizon setting, Bayraktar, Ekren and Zhang [4] showed that the comb strategy is asymptotically optimal by explicitly solving the corresponding nonlinear PDE. And very recently in [14], Kobzar, Kohn and Wang found lower and upper bounds for the optimal regret for finite stopping problem by constructing certain sub-and supersolutions of (1.1) following the method of [16]. Their results are only tight for N = 3 and improved those of [1].…”
Section: Introductionmentioning
confidence: 99%
“…A different class of examples focuses directly on the experts' outcomes, letting those be determined directly (rather than through a time series) by the market. (A PDE-based discussion of one such problem can be found in [9], and a PDE perspective on potential-based strategies can be found in [14,15,21]; PDE methods have also been applied to another class of problems known as "drifting games" [11].) In the present setting the experts' outcomes are highly constrained, since (i) their progress must be consistent with a time series, and (ii) their predictions depend, at a given time, on the past d items in that time series.…”
Section: Introductionmentioning
confidence: 99%