2020 American Control Conference (ACC) 2020
DOI: 10.23919/acc45564.2020.9147974
|View full text |Cite
|
Sign up to set email alerts
|

Beating humans in a penny-matching game by leveraging cognitive hierarchy theory and Bayesian learning

Abstract: It is a long-standing goal of artificial intelligence (AI) to be superior to human beings in decision making. Games are suitable for testing AI capabilities of making good decisions in non-numerical tasks. In this paper, we develop a new AI algorithm to play the penny-matching game considered in Shannon's "mind-reading machine" (1953) against human players. In particular, we exploit cognitive hierarchy theory and Bayesian learning techniques to continually evolve a model for predicting human player decisions, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…Hence, the latent states' transition model is reduced to P(θ t+1 |θ t , st , σ)=I(θ t+1 =θ t ). In general repeated games, the transition model can be represented as a Markov chain and its parameters σ can be embedded in the POMDP and learned simultaneously as in [41]. High-level robot planning.…”
Section: Reward Functions the Reward Function Of A Car Is A Linear Co...mentioning
confidence: 99%
“…Hence, the latent states' transition model is reduced to P(θ t+1 |θ t , st , σ)=I(θ t+1 =θ t ). In general repeated games, the transition model can be represented as a Markov chain and its parameters σ can be embedded in the POMDP and learned simultaneously as in [41]. High-level robot planning.…”
Section: Reward Functions the Reward Function Of A Car Is A Linear Co...mentioning
confidence: 99%
“…The level-K framework characterizes agents' strategies according to their depth of reasoning, denoted as "level." It models agents' interactions based on the assumption that a level-K player makes optimal decisions while considering the other agents as level-(K−1) players [18]- [20]. In this paper, we employ the level-K framework to design a racing controller for a leading robot that is chased by a following robot who is attempting to overtake [21].…”
Section: Introductionmentioning
confidence: 99%