2021 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom) 2021
DOI: 10.1109/blackseacom52164.2021.9527857
|View full text |Cite
|
Sign up to set email alerts
|

Using Counterfactual Regret Minimization and Monte Carlo Tree Search for Cybersecurity Threats

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…Third, the accuracy to predict opponent movements (i.e., played card sets of other three players) is particularly evaluated for AI 4.0 player. Finally, we conduct experiments to compare the win rates and remaining points of existing AI players (i.e., AI 1.0 [12], [19], AI 2.0 [32], [33], AI 3.0 [34], [35], [36], and RL-PPO [20]) and our developed AI player (i.e., AI 4.0). Furthermore, the games of AI players against human players are performed to compare their win rates and remaining points under different numbers of played games.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Third, the accuracy to predict opponent movements (i.e., played card sets of other three players) is particularly evaluated for AI 4.0 player. Finally, we conduct experiments to compare the win rates and remaining points of existing AI players (i.e., AI 1.0 [12], [19], AI 2.0 [32], [33], AI 3.0 [34], [35], [36], and RL-PPO [20]) and our developed AI player (i.e., AI 4.0). Furthermore, the games of AI players against human players are performed to compare their win rates and remaining points under different numbers of played games.…”
Section: Discussionmentioning
confidence: 99%
“…However, the cards in hand and the card sets played in a real game cannot be fully determined by those expected values that are only based on the number of cards in hand. To make simulated playing more close to real playing in AI 2.0, we further integrate regret minimization [32], [33] into AI 1.0 to dynamically calculate the expected values P 2.0 (instead of P 1.0 ) of card sets to be played based on historical game data. In particular, specific playing strategies can be further learned in AI 2.0 from game-playing data; for example, if there are three remaining cards including Spade 2, Diamond 10, and Club 10, Single of Spade 2 can be played first (which could make P1, P2, and P3 all pass their turns) and then Pair of Diamond 10 and Club 10 can be immediately played (as the new dominant card set) to win more points in the game (because no more cards can be played by P1, P2, and P3).…”
Section: B Ai 20 -Dynamic Weight Adjustmentmentioning
confidence: 99%
“…However, limited by computing resources and storage resources, the solution scale of CFR is not enough for large-scale games. When solving large-scale game problems, the large-scale game problems must be abstracted and expert knowledge is needed for the detailed design, which greatly limits its further application [8,[10][11][12].…”
Section: Introductionmentioning
confidence: 99%