2020
DOI: 10.1038/s41598-020-62939-3
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures

Abstract: Multi-agent coordination is prevalent in many real-world applications. However, such coordination is challenging due to its combinatorial nature. An important observation in this regard is that agents in the real world often only directly affect a limited set of neighbouring agents. Leveraging such loose couplings among agents is key to making coordination in multi-agent systems feasible. In this work, we focus on learning to coordinate. Specifically, we consider the multi-agent multi-armed bandit framework, i… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(18 citation statements)
references
References 20 publications
1
17
0
Order By: Relevance
“…To ensure scalability of the control optimization process toward large wind farms, we need a learning algorithm that can decompose and exploit the farm's topology. To this end, we propose a new wind farm control algorithm, Set-Point Thompson Sampling (SPTS), which constructs a dependency graph and regimes from wake fields and load profiles, and relies on multi-agent Thompson sampling (MATS) [40,41] to efficiently explore the joint set-point configuration space using the factored representation.…”
Section: Set-point Thompson Samplingmentioning
confidence: 99%
See 1 more Smart Citation
“…To ensure scalability of the control optimization process toward large wind farms, we need a learning algorithm that can decompose and exploit the farm's topology. To this end, we propose a new wind farm control algorithm, Set-Point Thompson Sampling (SPTS), which constructs a dependency graph and regimes from wake fields and load profiles, and relies on multi-agent Thompson sampling (MATS) [40,41] to efficiently explore the joint set-point configuration space using the factored representation.…”
Section: Set-point Thompson Samplingmentioning
confidence: 99%
“…The expected power productions P w s G(w) for each possible local set-point configuration s G(w) are unknown. Similar to MATS [40], SPTS uses a Bayesian formalism, which means users can exert their beliefs over P w s G(w) in the form of a prior. If wind turbines were not affected by wake, the incoming wind speed can be used to predict the expected power production of the wind turbine, which is provided with the turbine design specifications [28].…”
Section: Explorationmentioning
confidence: 99%
“…Distributed control seem to cope well with variations in operating conditions of power systems [133], while multiobjective optimization may offer versatility in short-and long-term economic objectives [134]. Multi-agent artificial intelligence algorithms applied to wind farm throughput optimization seems to be better positioned when data and structural features are integrated [135].…”
Section: F New Energy Demand and Deliverymentioning
confidence: 99%
“…These extensions are motivated by problems arising in various fields, including online recommendation systems (search engines, display ads, etc. ), financial portfolio design [4,5], adaptive network routing [6], web crawling [7], clinical trials [8,1], and, recently, in control theory [9] and system identification [10,11].…”
Section: Introductionmentioning
confidence: 99%