2018
DOI: 10.1016/j.ifacol.2018.09.583
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of Model Predictive and Reinforcement Learning Methods for Fault Tolerant Control

Abstract: A desirable property in fault-tolerant controllers is adaptability to system changes as they evolve during systems operations. An adaptive controller does not require optimal control policies to be enumerated for possible faults. Instead it can approximate one in real-time. We present two adaptive fault-tolerant control schemes for a discrete time system based on hierarchical reinforcement learning. We compare their performance against a model predictive controller in presence of sensor noise and persistent fa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 15 publications
(14 reference statements)
0
7
0
Order By: Relevance
“…The history of prior states does not affect the value or reward of following states and actions. This is known as the Markov property [7]. In the most exciting and challenging cases, actions may affect not only the immediate reward but also the following situation and, through that, all subsequent rewards.…”
Section: Reinforcement Learning (Rl) Methodsmentioning
confidence: 99%
“…The history of prior states does not affect the value or reward of following states and actions. This is known as the Markov property [7]. In the most exciting and challenging cases, actions may affect not only the immediate reward but also the following situation and, through that, all subsequent rewards.…”
Section: Reinforcement Learning (Rl) Methodsmentioning
confidence: 99%
“…Many researchers have applied this feature to quickly adapt to system failures and external disturbances. In [ 20 , 21 , 22 , 23 ], the authors presented a series of methods based on meta-RL to quickly adapt their control policies to maintain degraded performance when faults occur in the aircraft fuel transfer system. The scheme of FTC methods includes offline metatraining and online metatesting stages.…”
Section: Related Workmentioning
confidence: 99%
“…To verify the control effect of the designed sliding mode controller on the non-linear system, a simulation test is carried out for the rotational speed of the high-pressure rotor. Firstly, through debugging, the parameters p=7 , q=5 , = 2 , = 2.5, =1,2 are set in formula (9); and in the exponential approaching law = 5, = 10; in the low-pass filter = 20. Besides, since the above-mentioned controller is applied to the outer loop in the large closed-loop rotational speed of aero-engine and the output of the outer loop is the given position Lr of the metering valve, the given fuel quantity needs to be converted into Lr and then provided to the small closedloop actuator position.…”
Section: ̇= − ⋅ ( ) − ⋅ (22)mentioning
confidence: 99%
“…For example, Wallhagen et al [7] performed fault diagnosis by comparing the response difference of the engine output between step input and normal control, and on this basis, the fault sensor signal is reconstructed by the resolution margin to realize the fault tolerance of the rotational speed closed-loop control loop; Napolitan et al [8] proposed a sensor fault diagnosis and faulttolerant control method based on online BP(Back Propagation) neural network. In recent years, Ahmed et al [9] designed an adaptive fault-tolerant controller based on the C-130 aircraft fuel tank model; Chen, L et al [10] developed a fault-tolerant integral sliding mode control scheme based on a large civil Boeing 747 engine model, which is through effective use of actuator redundancy to retain the nominal (fault-free) closedloop performance when the actuator fails or faults. Seo et al [11] proposed a design method of single-engined aircraft faulttolerant control system to deal with the problem of the gradual loss of reasoning over time.…”
Section: Introductionmentioning
confidence: 99%