2v2 Air Combat Confrontation Strategy Based on Reinforcement Learning
2023 · DOI: 10.1007/978-981-99-0479-2_125

Cited by 3 publications (3 citation statements)
References 6 publications
“…The barrier function serves as a formal safety certificate associated with a control policy, guaranteeing the state-wise safety of a dynamical system [21,22]. Classical control theory often relaxes the stringent conditions of the barrier function into optimization formulations like linear programs [23,24] and quadratic programs [25,26].…”
Section: State-wise Safe Reinforcement Learning. Citation type: mentioning (confidence: 99%)
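The quadratic-program relaxation mentioned in this statement is commonly written as a pointwise safety filter. A minimal sketch, assuming control-affine dynamics $\dot{x} = f(x) + g(x)u$ and a continuously differentiable barrier function $h$; the notation is the standard CBF-QP convention, not necessarily the exact formulation of the cited works:

$$
u^{*}(x) \;=\; \arg\min_{u \in \mathcal{U}} \;\lVert u - u_{\mathrm{ref}}(x) \rVert^{2}
\quad \text{s.t.} \quad
L_{f}h(x) + L_{g}h(x)\,u + \alpha\bigl(h(x)\bigr) \;\ge\; 0,
$$

where $u_{\mathrm{ref}}$ is a nominal controller, $L_{f}h$ and $L_{g}h$ are Lie derivatives of $h$ along the dynamics, and $\alpha$ is an extended class-$\mathcal{K}$ function. Because the constraint is affine in $u$, the problem is a quadratic program that can be solved at every control step.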
“…Recent research has explored the joint learning of control policies and neural barrier functions to optimize state-wise safety constraints in reinforcement learning [27][28][29]. In the context of autonomous driving, ShieldNN [30] leverages CBF to design a safety filter neural network, providing safety assurances for environments with known bicycle dynamics models.…”
Section: State-wise Safe Reinforcement Learning. Citation type: mentioning (confidence: 99%)
“…Various deep learning methods have addressed constraint learning through techniques such as backpropagation over (in)equality completions [27], differentiable projection layers that map interior points to boundary regions [43], convex programming layers [1], and problem-specific repair mechanisms [22]. In addition, safe reinforcement learning has approached feasibility through constrained MDPs with primal-dual techniques [26], soft barrier functions [79], and safety shields [3].…”
Section: Machine Learning For Optimization Problems. Citation type: mentioning (confidence: 99%)
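The primal-dual treatment of constrained MDPs mentioned in this statement follows a standard Lagrangian scheme. A sketch in conventional notation; the symbols $J_r$, $J_c$, $d$, and the step sizes are generic conventions, not drawn from the quoted works:

$$
\max_{\theta}\;\min_{\lambda \ge 0}\; L(\theta, \lambda) \;=\; J_{r}(\pi_{\theta}) \;-\; \lambda\,\bigl(J_{c}(\pi_{\theta}) - d\bigr),
$$

solved by alternating a gradient ascent step on the policy parameters with a projected ascent step on the multiplier,

$$
\theta \leftarrow \theta + \eta_{\theta}\,\nabla_{\theta} L(\theta, \lambda),
\qquad
\lambda \leftarrow \max\bigl(0,\; \lambda + \eta_{\lambda}\,(J_{c}(\pi_{\theta}) - d)\bigr),
$$

where $J_r$ and $J_c$ are the expected return and expected cumulative cost of policy $\pi_{\theta}$ and $d$ is the cost budget. The multiplier grows while the constraint is violated and decays toward zero once $J_c \le d$, which is what trades feasibility against reward during training.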