2021
DOI: 10.1016/j.compchemeng.2021.107527
|View full text |Cite
|
Sign up to set email alerts
|

Twin actor twin delayed deep deterministic policy gradient (TATD3) learning for batch process control

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 29 publications
(14 citation statements)
references
References 33 publications
0
11
0
Order By: Relevance
“…This case study involves a nonlinear process in which a non‐isothermal reaction takes place in a batch reactor with reactant A converting into products (see Figure 8). [ 70,108,109 ]…”
Section: Evaluation Of Ac Methodsmentioning
confidence: 99%
“…This case study involves a nonlinear process in which a non‐isothermal reaction takes place in a batch reactor with reactant A converting into products (see Figure 8). [ 70,108,109 ]…”
Section: Evaluation Of Ac Methodsmentioning
confidence: 99%
“…where ãφ A = tanh µ φ A (s) + σ θ (s).ξ and ξ ∼ N(0, 1). Recent works in the literature have explored the deployment of an ensemble of actor networks in the actor-critic RL framework [14]. In complex environments where the best strategy cannot be represented by a single network, multiple actor networks can be used as a potential solution to learn the optimal policy.…”
Section: Maximum Entropy Rl and Sacmentioning
confidence: 99%
“…This kind of model-free RL approach addresses the main limitation of model-based control approaches by eliminating the requirement of a high fidelity process model. Even if an approximate process model is available, it can be used in the offline learning stage and data generation [14], thereby significantly reducing the requirement of data and the risk associated with safety. Thus, the burden on the online computation will be relatively less as the policy obtained through offline learning can be used as the warm start during the online implementation.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations