Myungsik Cho scite author profile

Myungsik Cho

2Publications

3Citation Statements Received

35Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning

Kim¹,

Jung²,

Cho³

et al. 2020

Preprint

View full text Add to dashboard Cite

In this paper, we propose a maximum mutual information (MMI) framework for multi-agent reinforcement learning (MARL) to enable multiple agents to learn coordinated behaviors by regularizing the accumulated return with the mutual information between actions. By introducing a latent variable to induce nonzero mutual information between actions and applying a variational bound, we derive a tractable lower bound on the considered MMI-regularized objective function. Applying policy iteration to maximize the derived lower bound, we propose a practical algorithm named variational maximum mutual information multi-agent actor-critic (VM3-AC), which follows centralized learning with decentralized execution (CTDE). We evaluated VM3-AC for several games requiring coordination, and numerical results show that VM3-AC outperforms MADDPG and other MARL algorithms in multi-agent tasks requiring coordination.Preprint. Under review.

show abstract

Robust Imitation Learning against Variations in Environment Dynamics

Chae¹,

Han²,

Jung³

et al. 2022

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Myungsik Cho

A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning

Robust Imitation Learning against Variations in Environment Dynamics

Contact Info

Product

Resources

About