2019
DOI: 10.48550/arxiv.1903.07228
Preprint

Optimal Rate of Convergence for Quasi-Stochastic Approximation

Abstract: The Robbins-Monro stochastic approximation algorithm is a foundation of many algorithmic frameworks for reinforcement learning (RL), and often an efficient approach to solving (or approximating the solution to) complex optimal control problems. However, in many cases practitioners are unable to apply these techniques because of an inherent high variance. This paper aims to provide a general foundation for "quasi-stochastic approximation," in which all of the processes under consideration are deterministic, much…
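The distinction the abstract draws can be illustrated with a toy scalar root-finding problem (an illustrative sketch, not the paper's algorithm or analysis): the same Robbins-Monro iteration is driven once by i.i.d. noise and once by a deterministic zero-mean probing signal. All names and the toy mean field f̄(θ) = −(θ − 1) are assumptions made for this example.

```python
import math
import random

def run(noise, theta0=5.0, n=20000, target=1.0):
    """Robbins-Monro iteration theta_{k+1} = theta_k + a_k * f(theta_k, xi_k)
    for the toy mean field f_bar(theta) = -(theta - target)."""
    theta = theta0
    for k in range(1, n + 1):
        a_k = 1.0 / k                # standard diminishing step size
        xi = noise(k)                # exploration/noise signal
        theta += a_k * (-(theta - target) + xi)
    return theta

random.seed(0)
# Classical stochastic approximation: i.i.d. zero-mean noise.
theta_sa = run(lambda k: random.gauss(0.0, 1.0))
# Quasi-stochastic approximation: deterministic zero-mean probing signal.
theta_qsa = run(lambda k: math.sin(0.1 * k))

print(abs(theta_sa - 1.0), abs(theta_qsa - 1.0))
```

With a_k = 1/k the residual error is exactly the running average of the driving signal, so the bounded partial sums of the sinusoid give an O(1/n) error, while the i.i.d. noise averages out only at the O(1/√n) Monte Carlo rate — the variance reduction the abstract refers to.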

Cited by 3 publications (20 citation statements)
References 23 publications
“…While the stylized algorithm (4) and approximation (6) are used here for an illustration of the main ideas, the paper considers a more general convex constrained optimization framework, and develops model-free primal-dual methods to track the optimal trajectories. Using a deterministic exploration approach reminiscent of the quasi-stochastic approximation method [5], this paper provides design principles for the exploration signal ξ, as well as other algorithmic parameters, to ensure stability and tracking guarantees. In particular, we show that under some conditions, the iterates x(k) converge within a ball around the optimal solution of (2).…”
Section: arXiv:1909.13132v1 [math.OC] 28 Sep 2019
confidence: 99%
“…The paper then develops distributed algorithms based on the zero-order approximation of the method of multipliers. In contrast to our paper, [21] considers a stochastic exploration signal for the gradient estimation, and typically requires N > 2 function evaluations to reduce the estimation variance [21, Lemma 1]; see also [5] for the detailed analysis of the advantage of deterministic vs stochastic exploration. Moreover, it considers a static optimization problem.…”
Section: Literature Review
confidence: 99%
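The contrast this excerpt draws — deterministic versus stochastic exploration in zero-order gradient estimation — can be sketched on a toy scalar objective (an illustrative example under assumed names, not the construction of [21] or [5]). Here a single two-point evaluation per step suffices because the probing direction is a deterministic sinusoid rather than a random draw:

```python
import math

def f(x):
    # Toy objective with minimizer x* = 2 (chosen for illustration).
    return (x - 2.0) ** 2

def two_point_grad(x, u, delta=1e-2):
    # Two-point zero-order estimate of f'(x) along probing direction u:
    # only function values of f are used, never its derivative.
    return (f(x + delta * u) - f(x - delta * u)) / (2.0 * delta) * u

x = 0.0
for k in range(1, 5001):
    u = math.cos(0.1 * k)              # deterministic zero-mean probing signal
    x -= (1.0 / k) * two_point_grad(x, u)

print(f"final iterate: {x:.4f}")       # approaches the minimizer 2.0
```

Because cos²(0.1k) averages to 1/2 along the deterministic probe, the estimator's effective gain is predictable and no averaging over N > 2 random evaluations is needed to tame the variance — the advantage the excerpt attributes to deterministic exploration.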