Logarithmic regret algorithms for online convex optimization

Hazan, Elad; Agarwal, Amit; Kale, Satyen

doi:10.1007/s10994-007-5016-8

Cited by 759 publications

(992 citation statements)

References 14 publications

Supporting

Mentioning

966

Contrasting

Unclassified

Order By: Relevance

“…Therefore, it follows that E [ g(w t ), w t − w * ] is bounded above by the right-hand side of (15). Since the subgradient g(z) defines the supporting hyperplane of the convex function Φ at z, it follows that g(w t ), w t − w * is an upper bound on Φ(w t ) − Φ(w * ).…”

Section: Resultsmentioning

confidence: 97%

“…This result was extended by Flaxman et al [11] to the case where the optimizer instead obtains an unbiased estimator of the gradient. Under additional technical assumptions on the shape of the convex function, a modified algorithm by Hazan et al [15] achieves a faster convergence rate of O (log(T )). The case where the available information is an unbiased estimator of the objective value, not its derivative, has been studied by Flaxman et al [11] and Kleinberg [22].…”

Section: Literature Review and Our Contributions Classical Inventory mentioning

confidence: 99%

“…In this section, we show by example that this rate can indeed be Θ(1/ √ T ). Hazan et al [15] have established such a lower bound on the convergence rate in an adversarial setting, but not for the stochastic non-adversarial setting.…”

Section: Proof Of the Rate Of Convergence For The Aim Algorithm (Theomentioning

confidence: 99%

“…Observe that most of the arguments in the proofs of Lemma 3 remain valid in this case. In particular, Equation (15) in the proof of Lemma 3 in Appendix A holds. Thus,…”

Section: Theoremmentioning

confidence: 99%

“…If we modify the step size of the original AIM algorithm so that for t = 1/ (α(b + h)t) for all t, then it follows from Theorem 1 in Hazan et al [15] that, for the perishable inventory case,…”

Section: Discrete Demand and Discrete Ordering Quantitiesmentioning

confidence: 99%

See 4 more Smart Citations

A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand

2009

View full text Add to dashboard Cite

We study stochastic inventory planning with lost sales and instantaneous replenishment, where contrary to the classical inventory theory, the knowledge of the demand distribution is not available. Furthermore, we observe only the sales quantity in each period, and lost sales are unobservable, that is, demand data are censored. The manager must make an ordering decision in each period based only on historical sales data. Excess inventory is either perishable or carried over to the next period. In this setting, we propose non-parametric adaptive policies that generate ordering decisions over time. We show that the T -period average expected cost of our policy differs from the benchmark newsvendor cost -the minimum expected cost that would have incurred if the manager had known the underlying demand distribution -by at most O(1/ √ T ). IntroductionThe problem of inventory control and planning has received much interest from practitioners and academics from the early years of operations research. The early literature in this area modeled demand as deterministic and having known quantities, but it soon became apparent that deterministic modeling was often inadequate, and uncertainty needed to be incorporated in modeling future demand. As a result, a majority of the papers on inventory theory during the past fifty years employ stochastic demand models. In these models, future demand is given by a specific exogenous random variable, and the inventory decisions are made with full knowledge of the future demand distribution. In many applications, however, the demand distribution is not known a priori. Even when past data have been collected, the selection of the most appropriate distribution and its parameters remains ambiguous. In the case when excess demand is lost, the information available to the inventory manager is further limited since she does not observe the realized demand but only observes the sales quantity (often referred to as censored demand), which is the smaller of the stocking level and the realized demand. Motivated by these realistic constraints, we develop a non-parametric approach to stochastic inventory planning in the presence of lost sales and censored demand.

show abstract

Section: Resultsmentioning

confidence: 97%

Section: Literature Review and Our Contributions Classical Inventory mentioning

confidence: 99%

Section: Proof Of the Rate Of Convergence For The Aim Algorithm (Theomentioning

confidence: 99%

“…Observe that most of the arguments in the proofs of Lemma 3 remain valid in this case. In particular, Equation (15) in the proof of Lemma 3 in Appendix A holds. Thus,…”

Section: Theoremmentioning

confidence: 99%

Section: Discrete Demand and Discrete Ordering Quantitiesmentioning

confidence: 99%

See 3 more Smart Citations

A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand

2009

View full text Add to dashboard Cite

show abstract

Improve robustness of machine learning via efficient optimization and conformal prediction

Yan

2024

AI Magazine

View full text Add to dashboard Cite

The advance of machine learning (ML) systems in real‐world scenarios usually expects safe deployment in high‐stake applications (e.g., medical diagnosis) for critical decision‐making process. To this end, provable robustness of ML is usually required to measure and understand how reliable the deployed ML system is and how trustworthy their predictions can be. Many studies have been done to enhance the robustness in recent years from different angles, such as variance‐regularized robust objective functions and conformal prediction (CP) for uncertainty quantification on testing data. Although these tools provably improve the robustness of ML model, there is still an inevitable gap to integrate them into an end‐to‐end deployment. For example, robust objectives usually require carefully designed optimization algorithms, while CP treats ML models as black boxes. This paper is a brief introduction to our recent research focusing on filling this gap. Specifically, for learning robust objectives, we designed sample‐efficient stochastic optimization algorithms that achieves the optimal (or faster compared to existing algorithms) convergence rates. Moreover, for CP‐based uncertainty quantification, we established a framework to analyze the expected prediction set size (smaller size means more efficiency) of CP methods in both standard and adversarial settings. This paper elaborates the key challenges and our exploration towards efficient algorithms with details of background methods, notions for robustness measure, concepts of algorithmic efficiency, our proposed algorithms and results. All of them further motivate our future research on risk‐aware ML that can be critical for AI–human collaborative systems. The future work mainly targets designing conformal robust objectives and their efficient optimization algorithms.

show abstract

An accelerated distributed online gradient push‐sum algorithm on time‐varying directed networks

Fang

Shen

et al. 2021

Asian Journal of Control

View full text Add to dashboard Cite

This paper investigates a distributed online optimization problem with convex objective functions on time-varying directed networks, where each agent holds its own convex cost function and the goal is to cooperatively minimize the sum of the local cost functions. To tackle such optimization problems, an accelerated distributed online gradient push-sum algorithm is firstly proposed, which combines the momentum acceleration technique and the push-sum strategy. Then, we specifically analyze the regret for the proposed algorithm. The theoretical result shows that the individual regret of the proposed algorithm achieves a sublinear regret with order of  ( √ T) , where T is the time horizon. Moreover, we implement the proposed algorithm in sensor networks for solving the distributed online estimation problem, and the results illustrate the effectiveness of the proposed algorithm.

show abstract

Logarithmic regret algorithms for online convex optimization

Cited by 759 publications

References 14 publications

A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand

A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand

Improve robustness of machine learning via efficient optimization and conformal prediction

An accelerated distributed online gradient push‐sum algorithm on time‐varying directed networks

Contact Info

Product

Resources

About