2007
DOI: 10.1007/s10994-007-5016-8
|View full text |Cite
|
Sign up to set email alerts
|

Logarithmic regret algorithms for online convex optimization

Abstract: In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., chooses a sequence of points in Euclidean space, from a fixed feasible set. After each point is chosen, it encounters a sequence of (possibly unrelated) convex cost functions. Zinkevich (ICML 2003) introduced this framework, which models many natural repeated decision-making problems and generalizes many existing problems such as Prediction from Expert Advice and Cover's Universal Portfolios. Zinkevich showed that a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

10
966
1
3

Year Published

2009
2009
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 759 publications
(992 citation statements)
references
References 14 publications
10
966
1
3
Order By: Relevance
“…Therefore, it follows that E [ g(w t ), w t − w * ] is bounded above by the right-hand side of (15). Since the subgradient g(z) defines the supporting hyperplane of the convex function Φ at z, it follows that g(w t ), w t − w * is an upper bound on Φ(w t ) − Φ(w * ).…”
Section: Resultsmentioning
confidence: 97%
See 4 more Smart Citations
“…Therefore, it follows that E [ g(w t ), w t − w * ] is bounded above by the right-hand side of (15). Since the subgradient g(z) defines the supporting hyperplane of the convex function Φ at z, it follows that g(w t ), w t − w * is an upper bound on Φ(w t ) − Φ(w * ).…”
Section: Resultsmentioning
confidence: 97%
“…This result was extended by Flaxman et al [11] to the case where the optimizer instead obtains an unbiased estimator of the gradient. Under additional technical assumptions on the shape of the convex function, a modified algorithm by Hazan et al [15] achieves a faster convergence rate of O (log(T )). The case where the available information is an unbiased estimator of the objective value, not its derivative, has been studied by Flaxman et al [11] and Kleinberg [22].…”
Section: Literature Review and Our Contributions Classical Inventory mentioning
confidence: 99%
See 3 more Smart Citations