2021 IEEE Security and Privacy Workshops (SPW) 2021
DOI: 10.1109/spw53761.2021.00031
|View full text |Cite
|
Sign up to set email alerts
|

RL-GRIT: Reinforcement Learning for Grammar Inference

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…Drawing inspiration from prior work on RL for grammatical inference (RL-GRIT), 26 we leveraged a pairwise policy update approach. This has a number of potential advantages: (1) as with appendix A.2, loss scales according to changes in expected reward, (2) it works the same for any kind of action space, and (3) as discussed in appendix A.6, it lends itself very well to an alternative of MCTS that is faster to compute while conferring similar benefits.…”
Section: A4 Complex Action Spaces Via a Pairwise Policy Updatementioning
confidence: 99%
“…Drawing inspiration from prior work on RL for grammatical inference (RL-GRIT), 26 we leveraged a pairwise policy update approach. This has a number of potential advantages: (1) as with appendix A.2, loss scales according to changes in expected reward, (2) it works the same for any kind of action space, and (3) as discussed in appendix A.6, it lends itself very well to an alternative of MCTS that is faster to compute while conferring similar benefits.…”
Section: A4 Complex Action Spaces Via a Pairwise Policy Updatementioning
confidence: 99%