This paper considers the problem of distributed bandit online convex optimization with time-varying coupled inequality constraints. This problem can be defined as a repeated game between a group of learners and an adversary. The learners attempt to minimize a sequence of global loss functions and at the same time satisfy a sequence of coupled constraint functions. The global loss and the coupled constraint functions are the sums of local convex loss and constraint functions, respectively, which are adaptively generated by the adversary. The local loss and constraint functions are revealed in a bandit manner, i.e., only the values of the loss and constraint functions at sampled points are revealed to the learners, and the revealed function values are held privately by each learner. We consider two scenarios, one- and two-point bandit feedback, and propose two corresponding distributed bandit online algorithms used by the learners. We show that sublinear expected regret and constraint violation are achieved by these two algorithms, provided the accumulated variation of the comparator sequence also grows sublinearly. In particular, we show that O(T^{θ1}) expected static regret and O(T^{7/4−θ1}) constraint violation are achieved in the one-point bandit feedback setting, and O(T^{max{κ,1−κ}}) expected static regret and O(T^{1−κ/2}) constraint violation in the two-point bandit feedback setting, where θ1 ∈ (3/4, 5/6] and κ ∈ (0, 1) are user-defined trade-off parameters. Finally, these theoretical results are illustrated by numerical simulations of a simple power grid example.
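The one- and two-point bandit feedback settings above typically rely on randomized gradient estimators built from the sampled function values. The following is a minimal sketch of the standard single-point and two-point estimators (in the style of Flaxman et al.), not the paper's distributed algorithms; the function names and the smoothing parameter `delta` are illustrative assumptions.

```python
import numpy as np

def one_point_gradient_estimate(f, x, delta, rng):
    """One-point bandit feedback: estimate the gradient of f at x from a
    single function evaluation. Samples u uniformly on the unit sphere;
    (d/delta) * f(x + delta*u) * u is an unbiased estimate of the gradient
    of a delta-smoothed version of f."""
    d = x.shape[0]
    u = rng.standard_normal(d)
    u /= np.linalg.norm(u)          # uniform direction on the unit sphere
    return (d / delta) * f(x + delta * u) * u

def two_point_gradient_estimate(f, x, delta, rng):
    """Two-point bandit feedback: two queries per round give an estimate of
    the same smoothed gradient with much lower variance, which is what
    enables the tighter regret bounds in that setting."""
    d = x.shape[0]
    u = rng.standard_normal(d)
    u /= np.linalg.norm(u)
    return (d / (2 * delta)) * (f(x + delta * u) - f(x - delta * u)) * u
```

In a distributed algorithm each learner would apply such an estimator to its own local loss and constraint functions at the sampled points.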
This paper aims to address distributed optimization problems over directed, time-varying, and unbalanced networks, where the global objective function consists of a sum of locally accessible convex objective functions subject to a feasible set constraint and coupled inequality constraints whose information is only partially accessible to each agent. For this problem, a distributed proximal-based algorithm, called the distributed proximal primal-dual (DPPD) algorithm, is proposed based on the celebrated centralized proximal point algorithm. It is shown that the proposed algorithm converges to the global optimal solution with a general stepsize, which is diminishing and non-summable but not necessarily square-summable, and the saddle-point running evaluation error vanishes proportionally to O(1/√k), where k > 0 is the iteration number. Finally, a simulation example is presented to corroborate the effectiveness of the proposed algorithm.
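To make the stepsize condition concrete: the centralized proximal point iteration that DPPD builds on can be sketched on a toy quadratic, with alpha_k = 1/sqrt(k) as one example of a diminishing, non-summable stepsize that is not square-summable (both sum of alpha_k and sum of alpha_k^2 diverge). This is only an illustrative sketch of the underlying proximal step, not the DPPD algorithm with its primal-dual and consensus updates; the function names are assumptions.

```python
import numpy as np

def prox_quadratic(x, alpha):
    """Proximal operator of f(y) = 0.5*||y||^2, i.e.
    argmin_y f(y) + ||y - x||^2 / (2*alpha), which is x / (1 + alpha)."""
    return x / (1.0 + alpha)

def proximal_point(x0, num_iters):
    """Centralized proximal point iteration x_{k+1} = prox_{alpha_k f}(x_k)
    with the diminishing, non-summable stepsize alpha_k = 1/sqrt(k)."""
    x = x0
    for k in range(1, num_iters + 1):
        alpha = 1.0 / np.sqrt(k)   # not square-summable: sum 1/k diverges
        x = prox_quadratic(x, alpha)
    return x
```

Because the stepsizes are non-summable, the iterates are driven all the way to the minimizer (here, 0) even though each individual step shrinks.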
The q-discrete two-dimensional Toda lattice equation with self-consistent sources is presented through the source generalization procedure. In addition, Gram-type determinant solutions of the system are obtained. Moreover, a bilinear Bäcklund transformation (BT) for the system is given.
This paper considers the distributed strategy design for Nash equilibrium (NE) seeking in multi-cluster games under a partial-decision information scenario. In the considered game, there are multiple clusters, and each cluster consists of a group of agents. A cluster is viewed as a virtual noncooperative player that aims to minimize its local payoff function, and the agents in a cluster are the actual players that cooperate within the cluster to optimize the payoff function of the cluster through communication via a connected graph. In our setting, agents have only partial-decision information, that is, they only know local information and cannot have full access to opponents' decisions. To solve the NE seeking problem of this formulated game, a discrete-time distributed algorithm, called the distributed gradient tracking algorithm (DGT), is devised based on the inter- and intra-cluster communication. In the designed algorithm, each agent is equipped with strategy variables including its own strategy and estimates of other clusters' strategies. With the help of a weighted Frobenius norm and a weighted Euclidean norm, theoretical analysis is presented to rigorously show the linear convergence of the algorithm. Finally, a numerical example is given to illustrate the proposed algorithm.
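Gradient tracking, the building block behind DGT, can be illustrated in its simplest form: distributed minimization of a sum of local functions, where each agent mixes its iterate with its neighbors' and maintains a tracker of the network-wide average gradient. The sketch below is this generic consensus-optimization variant, not the full multi-cluster NE-seeking algorithm with strategy estimates; the function and matrix names are assumptions.

```python
import numpy as np

def gradient_tracking(grads, W, x0, stepsize, num_iters):
    """Minimal distributed gradient tracking for min_x sum_i f_i(x).

    grads    : list of local gradient functions grad f_i
    W        : doubly stochastic mixing matrix of the communication graph
    Updates, for each agent i:
        x_i^{k+1} = sum_j W_ij x_j^k - stepsize * y_i^k
        y_i^{k+1} = sum_j W_ij y_j^k + grad f_i(x_i^{k+1}) - grad f_i(x_i^k)
    so each y_i tracks the average of the local gradients.
    """
    n = len(grads)
    X = np.tile(np.asarray(x0, dtype=float), (n, 1))
    G = np.array([g(X[i]) for i, g in enumerate(grads)])
    Y = G.copy()                       # initialize trackers with local gradients
    for _ in range(num_iters):
        X_new = W @ X - stepsize * Y
        G_new = np.array([g(X_new[i]) for i, g in enumerate(grads)])
        Y = W @ Y + G_new - G          # tracking update
        X, G = X_new, G_new
    return X
```

The correction term `G_new - G` is what lets the iterates converge linearly with a constant stepsize for smooth, strongly convex objectives, which is the same mechanism exploited in the linear convergence analysis of DGT.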