2020
DOI: 10.1080/03605302.2020.1712418
|View full text |Cite
|
Sign up to set email alerts
|

Finite-time 4-expert prediction problem

Abstract: We explicitly solve the nonlinear PDE that is the continuous limit of dynamic programming equation expert prediction problem in finite horizon setting with N = 4 experts. The expert prediction problem is formulated as a zero sum game between a player and an adversary. By showing that the solution is C 2 , we are able to show that the comb strategies, as conjectured in [13], form an asymptotic Nash equilibrium. We also prove the "Finite vs Geometric regret" conjecture proposed in [12] for N = 4, and show that t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(13 citation statements)
references
References 12 publications
1
12
0
Order By: Relevance
“…[Zhu14] first derived such a PDE to characterize the continuous-time limit of LEA, whose arguments were streamlined in [DK20b]. Exact solutions were obtained in special cases [BEZ20a,BEZ20b,DK20b], and more generally, approximate solutions were derived in [Rok17,KKW20a,KKW20b]. Follow-up works considered history-dependent experts [DC20,DK20a] and malicious experts [BPZ20,BEZ21].…”
Section: Differential Equations For Online Learningmentioning
confidence: 99%
“…[Zhu14] first derived such a PDE to characterize the continuous-time limit of LEA, whose arguments were streamlined in [DK20b]. Exact solutions were obtained in special cases [BEZ20a,BEZ20b,DK20b], and more generally, approximate solutions were derived in [Rok17,KKW20a,KKW20b]. Follow-up works considered history-dependent experts [DC20,DK20a] and malicious experts [BPZ20,BEZ21].…”
Section: Differential Equations For Online Learningmentioning
confidence: 99%
“…The comb adversary a c is then defined by Comb adversary a c : At each t ∈ [T ], the adversary assigns probability 1 2 to each of q c and −q c where (q c ) (i) = 1 if i is odd and (q c ) (i) = −1 if i is even. Gravin et al (2016) suggested that a c might be optimal asymptotically in T for any fixed N and Abbasi-Yadkori et al ( 2017) and Bayraktar et al (2019) showed that to be the case for N = 3 and 4 respectively.…”
Section: Heat Potentialsmentioning
confidence: 99%
“…Drenska and Kohn (2020) showed that, for any fixed N , the value function, in the scaling limit, is the unique solution of an associated nonlinear PDE. Bayraktar et al (2019) determined the closed-form solutions of the PDEs for N = 3 and 4.…”
Section: Introductionmentioning
confidence: 99%
“…In the adversarial setting, the advice of experts is chosen by an adversary so as to maximize the regret of the forecaster, and therefore the problem can be viewed as a zero-sum game between the forecaster and the adversary (see e.g. [12] [9] [8] [5] [4]). In the stochastic setting, the losses of each expert are drawn independent and identically distributed (i.i.d.)…”
Section: Introductionmentioning
confidence: 99%