Jacopo Castellini scite author profile

Jacopo Castellini

5Publications

57Citation Statements Received

84Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Liverpool, University of Perugia

Publications

Order By: Most citations

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning

Castellini

Oliehoek

Savani

et al. 2021

Auton Agent Multi-Agent Syst

View full text Add to dashboard Cite

Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems, with great empirical success. However, given the lack of theoretical insight, it remains unclear what the employed neural networks are learning, or how we should enhance their learning power to address the problems on which they fail. In this work, we empirically investigate the learning power of various network architectures on a series of one-shot games. Despite their simplicity, these games capture many of the crucial problems that arise in the multi-agent setting, such as an exponential number of joint actions or the lack of an explicit coordination mechanism. Our results extend those in Castellini et al. (Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS’19.International Foundation for Autonomous Agents and Multiagent Systems, pp 1862–1864, 2019) and quantify how well various approaches can represent the requisite value functions, and help us identify the reasons that can impede good performance, like sparsity of the values or too tight coordination requirements.

show abstract

Fake Twitter followers detection by denoising autoencoder

Castellini

Poggioni

Sorbi

2017

View full text Add to dashboard Cite

Difference Rewards Policy Gradients

Castellini¹,

Devlin²,

Oliehoek³

et al. 2020

Preprint

View full text Add to dashboard Cite

The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning

Castellini¹,

Oliehoek²,

Savani³

et al. 2019

Preprint

View full text Add to dashboard Cite

Difference rewards policy gradients

Castellini

Devlin

Oliehoek

et al. 2022

Neural Comput & Applic

View full text Add to dashboard Cite

Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent’s contribution to the overall performance, which is crucial for learning good policies. We propose a novel algorithm called Dr.Reinforce that explicitly tackles this by combining difference rewards with policy gradients to allow for learning decentralized policies when the reward function is known. By differencing the reward function directly, Dr.Reinforce avoids difficulties associated with learning the Q-function as done by counterfactual multi-agent policy gradients (COMA), a state-of-the-art difference rewards method. For applications where the reward function is unknown, we show the effectiveness of a version of Dr.Reinforce that learns an additional reward network that is used to estimate the difference rewards.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.