Daniel Szer scite author profile

Daniel Szer

3Publications

62Citation Statements Received

12Citation Statements Given

How they've been cited

How they cite others

Affiliations

French Institute for Research in Computer Science and Automation, Lorraine Research Laboratory in Computer Science and its Applications, University of Massachusetts Dartmouth

Publications

Order By: Most citations

An Optimal Best-First Search Algorithm for Solving Infinite Horizon DEC-POMDPs

Szer

Charpillet

2005

View full text Add to dashboard Cite

Abstract. In the domain of decentralized Markov decision processes, we develop the first complete and optimal algorithm that is able to extract deterministic policy vectors based on finite state controllers for a cooperative team of agents. Our algorithm applies to the discounted infinite horizon case and extends best-first search methods to the domain of decentralized control theory. We prove the optimality of our approach and give some first experimental results for two small test problems. We believe this to be an important step forward in learning and planning in stochastic multi-agent systems.

show abstract

A parallel growing architecture for self-organizing maps with unsupervised learning

Valova¹,

Szer²,

Gueorguieva³

et al. 2005

Neurocomputing

View full text Add to dashboard Cite

Improving coordination with communication in multi-agent reinforcement learning

Szer

Charpillet

View full text Add to dashboard Cite

In the following paper we present a new algorithm for cooperative reinforcement learning in multi-agent systems. We consider autonomous and independently learning agents, and we seek to obtain an optimal solution for the team as a whole while keeping the learning as much decentralized as possible. Coordination between agents occurs through communication, namely the mutual notification algorithm.We define the learning problem as a decentralized process using the MDP formalism. We then give an optimality criterion and prove the convergence of the algorithm for deterministic environments. We introduce variable and hierarchical communication strategies which considerably reduce the number of communications. Finally we study the convergence properties and communication overhead on a small example.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Daniel Szer

An Optimal Best-First Search Algorithm for Solving Infinite Horizon DEC-POMDPs

A parallel growing architecture for self-organizing maps with unsupervised learning

Improving coordination with communication in multi-agent reinforcement learning

Contact Info

Product

Resources

About