2019
DOI: 10.1109/jiot.2018.2839563

Bandit Convex Optimization for Scalable and Dynamic IoT Management

Abstract: The present paper deals with online convex optimization involving both time-varying loss functions and time-varying constraints. The loss functions are not fully accessible to the learner; instead, only the function values (a.k.a. bandit feedback) are revealed at queried points. The constraints are revealed after making decisions and can be instantaneously violated, yet they must be satisfied in the long term. This setting fits nicely the emerging online network tasks such as fog computing in the Internet-…
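The bandit feedback described in the abstract means the learner never observes a gradient, only loss values at queried points. A standard way to build a gradient surrogate from such queries is the two-point estimator (the "Two-Point Sampling" variant mentioned in the citation excerpts below is of this kind). The following is a generic sketch of that estimator on a toy quadratic loss, not the paper's exact algorithm:

```python
import numpy as np

def two_point_gradient_estimate(f, x, delta, rng):
    """Two-point bandit gradient estimate: query the loss only at
    x + delta*u and x - delta*u for a random unit direction u.
    Generic textbook estimator, not the paper's exact update."""
    d = x.size
    u = rng.standard_normal(d)
    u /= np.linalg.norm(u)          # uniform direction on the unit sphere
    return (d / (2.0 * delta)) * (f(x + delta * u) - f(x - delta * u)) * u

# Toy quadratic loss; its true gradient at x is 2*x.
f = lambda x: float(np.dot(x, x))
rng = np.random.default_rng(0)
x = np.array([1.0, -0.5])

# Averaging many estimates recovers the gradient (for a quadratic loss
# the estimator is unbiased, so the average concentrates around 2*x).
g_hat = np.mean([two_point_gradient_estimate(f, x, 1e-2, rng)
                 for _ in range(20_000)], axis=0)
```

In an online algorithm, a single such estimate per round would replace the true gradient in the descent step; the one-point variant uses a single query per round at the cost of much higher variance.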

Cited by 117 publications (118 citation statements) · References 28 publications
“…This is in agreement with (14b), (20b), and the theoretical results shown in [39], [40]. From the zoomed figures, we can see that the centralized algorithms in [39], [40] achieve smaller expected dynamic regret and constraint violation than our distributed algorithms, which is reasonable. We can also see that Algorithm 2 achieves smaller expected dynamic regret and constraint [Figures: expected dynamic regret and expected constraint violation for Algorithm 1, Algorithm 2, [39] (one-point sampling), [39] (two-point sampling), and [40]]…”
Section: Numerical Simulations (supporting)
confidence: 90%
“…Algorithm 1 can also achieve sublinear expected dynamic regret if V(x*_T) grows sublinearly. In this case, there exists a constant ν ∈ [0, 1), [24], [26]–[29], [39], [41]. Note that these papers did not consider bandit feedback for time-varying inequality constraints, or did not consider time-varying inequality constraints at all.…”
Section: B. Expected Regret and Constraint Violation Bounds (mentioning)
confidence: 99%
“…• Gradient errors: the gradient of the cost κ(i, s) log(1 + z(i, s)) for each exogenous traffic flow is estimated using multi-point bandit feedback [15], [26]; the estimation error depends on the number of function evaluations used in constructing the proxy of the gradient in (24). • Solution dynamics: at each time step, the channel gains of the links are generated using a complex Gaussian random variable with mean 1 + 1 and a given variance v_c for both real and imaginary parts; the transmit power of each node is a Gaussian random variable with mean 1 and variance v_p; the exogenous traffic flows are random with mean [0.2, 0.3, 0.3, 0.4, 0.5, 0.2, 0.1, 0.4] and a given variance; and the cost is perturbed by modifying a_t.…”
Section: Illustrative Numerical Results (mentioning)
confidence: 99%
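The excerpt above estimates the gradient of a cost of the form κ log(1 + z) from function values alone via multi-point bandit feedback. A simple instance of that idea is a coordinate-wise central-difference estimator, which spends two queries per coordinate. The sketch below uses made-up values for κ and the traffic vector purely for illustration; it is not the cited paper's exact estimator:

```python
import numpy as np

def multipoint_gradient(f, z, delta):
    """Coordinate-wise multi-point bandit gradient estimate: one pair
    of function evaluations per coordinate (2n queries in total).
    Illustrative sketch of the multi-point feedback idea."""
    n = z.size
    g = np.zeros(n)
    for i in range(n):
        e = np.zeros(n)
        e[i] = delta
        g[i] = (f(z + e) - f(z - e)) / (2.0 * delta)
    return g

kappa = 2.0                             # hypothetical cost weight
cost = lambda z: float(np.sum(kappa * np.log1p(z)))

z = np.array([0.2, 0.3, 0.3, 0.4])      # hypothetical traffic rates
g_hat = multipoint_gradient(cost, z, 1e-4)
# The true gradient of kappa*log(1+z) is kappa / (1 + z), so g_hat
# should match it up to O(delta^2) finite-difference error.
```

With more query points per coordinate the finite-difference error shrinks, which is the trade-off the excerpt refers to: estimation error decreases as the number of function evaluations per gradient proxy grows.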