2022
DOI: 10.1016/j.array.2022.100262
|View full text |Cite
|
Sign up to set email alerts
|

Cyberspace attack and defense game based on reward randomization reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(10 citation statements)
references
References 45 publications
(51 reference statements)
0
5
0
Order By: Relevance
“…7: A mixed threshold strategy where σ( θ(1) l ) is the threshold (0.5 in this example); the x-axis indicates the defender's belief state b(1) ∈ [0, 1] and the y-axis indicates the probability prescribed by π1, θ (1) to the stop action S. (32). Similarly, the attacker's best response strategy π2 is parameterized with the vector θ(2) ∈ R 2L (33).…”
Section: Our Self-play Algorithm: T-fpmentioning
confidence: 99%
See 4 more Smart Citations
“…7: A mixed threshold strategy where σ( θ(1) l ) is the threshold (0.5 in this example); the x-axis indicates the defender's belief state b(1) ∈ [0, 1] and the y-axis indicates the probability prescribed by π1, θ (1) to the stop action S. (32). Similarly, the attacker's best response strategy π2 is parameterized with the vector θ(2) ∈ R 2L (33).…”
Section: Our Self-play Algorithm: T-fpmentioning
confidence: 99%
“…Although a growing body of work uses reinforcement learning and game theory to find intrusion response strategies (see Section VII), a direct comparison between the defender strategies learned in our framework and those found in previous work is not feasible for two reasons. First, nearly all of the prior works have developed defender strategies for custom simulations [8], [10], [10], [11], [18]- [27], [33]- [35], [38], [39], [60], [62], [63], [67]- [72], [107]- [119] and there is no obvious way to map their solutions to an emulated environment like ours (see Fig. 1 and Appendix C).…”
Section: A Learning Equilibrium Strategies Through Self-playmentioning
confidence: 99%
See 3 more Smart Citations