Exploring the Use of Invalid Action Masking in Reinforcement Learning: A Comparative Study of On-Policy and Off-Policy Algorithms in Real-Time Strategy Games

Hou, Yueqi; Liang, Xiaolong; Zhang, Jiaqiang; Yang, Qisong; Yang, Aiwu; Wang, Ning

doi:10.3390/app13148283

Applied Sciences

2023

DOI: 10.3390/app13148283

|View full text |Cite

Exploring the Use of Invalid Action Masking in Reinforcement Learning: A Comparative Study of On-Policy and Off-Policy Algorithms in Real-Time Strategy Games

Yueqi Hou

Xiaolong Liang

Jiaqiang Zhang

et al.

Abstract: Invalid action masking is a practical technique in deep reinforcement learning to prevent agents from taking invalid actions. Existing approaches rely on action masking during policy training and utilization. This study focuses on developing reinforcement learning algorithms that incorporate action masking during training but can be used without action masking during policy execution. The study begins by conducting a theoretical analysis to elucidate the distinction between naive policy gradient and invalid ac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2023

Publication Types

Select...

Other1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

A New Graph-Based Reinforcement Learning Environment for Targeted Molecular Generation and Optimization✱

Mahmoud,

Alyan,

Elkerdawy

et al. 2023

Proceedings of the 2023 12th International Conference on Software and Information Engineering

View full text Add to dashboard Cite

A New Graph-Based Reinforcement Learning Environment for Targeted Molecular Generation and Optimization✱

Mahmoud,

Alyan,

Elkerdawy

et al. 2023

Proceedings of the 2023 12th International Conference on Software and Information Engineering

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Exploring the Use of Invalid Action Masking in Reinforcement Learning: A Comparative Study of On-Policy and Off-Policy Algorithms in Real-Time Strategy Games

Cited by 1 publication

References 29 publications

A New Graph-Based Reinforcement Learning Environment for Targeted Molecular Generation and Optimization✱

A New Graph-Based Reinforcement Learning Environment for Targeted Molecular Generation and Optimization✱

Contact Info

Product

Resources

About