SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics

Flet-Berliac, Yannis; Basu, Debabrota

doi:10.48550/arxiv.2204.09424

Cited by 2 publications

(2 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Classification. Because of this analysis operation based on sequence features, it is consistent with reinforcement learning applied to sequence decision features 8 . This kind of problem that requires step-by-step operations on sequence data can be solved using policy gradients in reinforcement learning.…”

Section: Introductionmentioning

confidence: 74%

Malware behavior detection method based on reinforcement learning

Cui

Leng

Wang

et al. 2023

International Conference on Computer Application and Information Security (ICCAIS 2022)

View full text Add to dashboard Cite

Malware in the network environment is a serious threat to the security of industrial control systems. With the gradual increase of malware variants, it brings great challenges to the detection and security protection of industrial control system malware. The existing detection methods have limitations such as low intelligence in adaptive detection and recognition. In response to this problem, this paper designs a detection application method framework by combining the use of reinforcement learning, an advanced machine learning algorithm, around the malware objects that threaten the network security of industrial control systems. In the implementation process, according to the actual needs of malware behavior detection, fully combined with intelligent features such as sequential decision-making and dynamic feedback learning of reinforcement learning, the key application modules such as feature extraction network, policy network and classification network are discussed and designed in detail. The application experiments based on the actual malware test data set verify the effectiveness of the method in this paper, which can provide an intelligent decision-making aid for general malware behavior detection.

show abstract