2021
DOI: 10.48550/arxiv.2108.01843
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Model-Based Opponent Modeling

Abstract: When one agent interacts with a multi-agent environment, it is challenging to deal with various opponents unseen before. Modeling the behaviors, goals, or beliefs of opponents could help the agent adjust its policy to adapt to different opponents. In addition, it is also important to consider opponents who are learning simultaneously or capable of reasoning. However, existing work usually tackles only one of the aforementioned types of opponent. In this paper, we propose model-based opponent modeling (MBOM), w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…Grover et al use imitation learning and contrastive learning to predict the next actions of an opponent [17]. Yu et al utilize a model-based approach with neural networks for predicting the next state of the opponents [18]. We differ from prior approaches by 1) only utilizing sparse observations (i.e., we do not have access to the true states of opponents unless detected) and 2) we justify the value of the designed filter by helping train a MARL to complete S&T tasks which could not be done without the filter as shown in §V-B.…”
Section: B Learning-based Nonlinear Filtersmentioning
confidence: 99%
“…Grover et al use imitation learning and contrastive learning to predict the next actions of an opponent [17]. Yu et al utilize a model-based approach with neural networks for predicting the next state of the opponents [18]. We differ from prior approaches by 1) only utilizing sparse observations (i.e., we do not have access to the true states of opponents unless detected) and 2) we justify the value of the designed filter by helping train a MARL to complete S&T tasks which could not be done without the filter as shown in §V-B.…”
Section: B Learning-based Nonlinear Filtersmentioning
confidence: 99%