2021
DOI: 10.1088/1741-4326/ac121b
|View full text |Cite
|
Sign up to set email alerts
|

Feedforward beta control in the KSTAR tokamak by deep reinforcement learning

Abstract: In this work, we address a new feedforward control scheme for the normalized beta (β N) in tokamak plasmas, using the deep reinforcement learning (RL) technique. The deep RL algorithm optimizes an artificial decision-making agent that adjusts the discharge scenario to obtain a given target β N from the state–action–reward sets explored by its own trial and error in a virtual tokamak environment. The virtual environment for the RL training is constructed using a long short-term memory (LSTM) network that imitat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 38 publications
(33 citation statements)
references
References 44 publications
0
33
0
Order By: Relevance
“…Recently, Seo et al. 49 have developed feedforward signals for beta control using RL, which have then been verified on the KSTAR tokamak.…”
Section: Methodsmentioning
confidence: 99%
“…Recently, Seo et al. 49 have developed feedforward signals for beta control using RL, which have then been verified on the KSTAR tokamak.…”
Section: Methodsmentioning
confidence: 99%
“…The integrated tokamak simulation with multiple physics codes [14] can offer virtual experiments for the RL agent, but it is still computationally expensive for millions of simulations. In this work, instead, we employ a data-driven simulator using long short-term memory (LSTM) [15]-based neural networks, which provides fast enough simulations and experimentally relevant plasma responses [11,16].…”
Section: Modelingmentioning
confidence: 99%
“…Active control of plasma parameters has been widely attempted with various techniques, such as the proportionalintegral-differential (PID) feedback algorithm [8,9] and model predictive control (MPC) [10]. Recently, feedforward control of β N has been conducted by designing the operation scenario with the deep reinforcement learning (RL) technique [11]. The RL method has also been applied for real-time magnetic coil control to form various plasma configurations, requiring sophisticated decision-making in multi-dimensional engineering space based on magneto-hydrodynamics (MHD) [12].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, it is really important building a robust counterfactual generator as well as perfecting the RWM stability forecaster. The implementation of ML-based controllers has been gaining attention in recent years with through the usage of reinforcement learning, for example to determine the optimal control scenario to obtain a constant user-defined β N level in the KSTAR tokamak [54]. The direction we have taken in the present work follows the observation that the disruptive β N is not constant when it comes to RWM stability and the usage of counterfactuals might lay the basis for a control algorithm that takes this variability into account.…”
Section: Ml-informed Safe Scenarios For Potential Real-time Controlmentioning
confidence: 99%