Felipe Leno da Silva scite author profile

Multiagent Reinforcement Learning (RL) solves complex tasks that require coordination with other agents through autonomous exploration of the environment. However, learning a complex task from scratch is impractical due to the huge sample complexity of RL algorithms. For this reason, reusing knowledge from previous experience or from other agents is indispensable for scaling up multiagent RL algorithms. This survey provides a unifying view of the literature on knowledge reuse in multiagent RL. We define a taxonomy of solutions for the general knowledge reuse problem, providing a comprehensive discussion of recent progress on knowledge reuse in Multiagent Systems (MAS) and of techniques for knowledge reuse across agents (which may or may not be acting in a shared environment). We aim to encourage the community to work towards reusing all the knowledge sources available in a MAS. To that end, we provide an in-depth discussion of current lines of research and open questions.

Agents teaching agents: a survey on inter-agent transfer learning

Silva

Warnell

Costa

et al. 2019

Auton Agent Multi-Agent Syst

Evaluating classification and feature selection techniques for honeybee subspecies identification using wing images

Silva

Sella

Francoy

et al. 2015

Computers and Electronics in Agriculture

Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents

Silva

Hernández-Leal

Kartal

et al. 2020

AAAI

Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in sequential decision-making problems, the sample complexity of RL techniques still represents a major challenge for practical applications. To combat this challenge, whenever a competent policy (e.g., either a legacy system or a human demonstrator) is available, the agent could leverage samples from this policy (advice) to improve sample efficiency. However, advice is normally limited, so it should ideally be directed to states where the agent is uncertain about the best action to execute. In this work, we propose Requesting Confidence-Moderated Policy advice (RCMP), an action-advising framework where the agent asks for advice when its epistemic uncertainty is high for a certain state. RCMP takes into account that the advice is limited and might be suboptimal. We also describe a technique to estimate the agent's uncertainty by making minor modifications to standard value-function-based RL methods. Our empirical evaluations show that RCMP performs better than Importance Advising, receiving no advice, and receiving advice at random states in Gridworld and Atari Pong scenarios.
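The advice-requesting rule described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how uncertainty is computed, so the sketch assumes the agent keeps several Q-value estimates ("heads") per state and treats their disagreement (variance across heads, averaged over actions) as a proxy for epistemic uncertainty. The names `epistemic_uncertainty`, `choose_action`, `advisor`, `budget`, and `threshold` are hypothetical.

```python
from statistics import mean, pvariance


def epistemic_uncertainty(head_q_values):
    """Assumed uncertainty proxy: variance across heads, averaged over actions.

    head_q_values is a list with one entry per head; each entry is a list
    holding that head's Q-value estimate for every action in the state.
    """
    num_actions = len(head_q_values[0])
    return mean(
        pvariance([head[a] for head in head_q_values])
        for a in range(num_actions)
    )


def choose_action(head_q_values, advisor, state, budget, threshold):
    """Return (action, remaining_budget, was_advised).

    Advice is requested only while budget remains and the uncertainty
    proxy reaches the (hypothetical) threshold; otherwise the agent acts
    greedily on the Q-values averaged over heads.
    """
    if budget > 0 and epistemic_uncertainty(head_q_values) >= threshold:
        return advisor(state), budget - 1, True
    num_actions = len(head_q_values[0])
    avg_q = [mean(head[a] for head in head_q_values) for a in range(num_actions)]
    greedy = max(range(num_actions), key=avg_q.__getitem__)
    return greedy, budget, False
```

When the heads agree, the proxy is zero and the agent acts on its own estimate; when they disagree strongly and budget remains, one unit of the limited advice budget is spent on that state. The threshold and the number of heads would need tuning per task.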

Coordination of Electric Vehicle Charging Through Multiagent Reinforcement Learning

Silva

Nishida

Roijers

et al. 2020

IEEE Trans. Smart Grid

Performance Evaluation of Cryptographic Algorithms over IoT Platforms and Operating Systems

Pereira

Alves

Silva

et al. 2017

Security and Communication Networks

The deployment of security services over Wireless Sensor Networks (WSN) and IoT devices brings significant processing and energy consumption overheads. These overheads are mainly determined by algorithmic efficiency, quality of implementation, and operating system. Benchmarks of symmetric primitives exist in the literature for WSN platforms, but they mostly focus on a single platform or a single operating system. Moreover, they are out of date with respect to implementations and operating system versions, both of which have progressed significantly. Herein, we provide time and energy benchmarks of reference implementations for different platforms and operating systems and analyze their impact. We also give the first benchmark results of symmetric cryptography for the Intel Edison IoT platform and describe a methodology for measuring energy consumption on that platform.

A reference process for automating bee species identification based on wing images and digital image processing

Santana

Costa

Truzzi

et al. 2014

Ecological Informatics

Towards Knowledge Transfer in Deep Reinforcement Learning

Glatt

Silva

Costa

2016
