Reinforcement Learning Aided Performance Optimization of Feedback Control Systems 2021
DOI: 10.1007/978-3-658-33034-7_3
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning and Feedback Control

Abstract: In recent years, large language models (LLMs) have had a dramatic impact on various sub-fields of AI, most notably on natural language understanding tasks. However, there is widespread agreement that the logical reasoning capabilities of contemporary LLMs are, at best fragmentary (i.e., may work well on some problem instances but fail dramatically on others). While traditional LLM finetuning approaches (e.g., those that use human feedback) do address this problem to some degree, they suffer from many issues, i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 13 publications
(20 reference statements)
0
4
0
Order By: Relevance
“…In reality, it is important to consistently align the coach responses to user need throughout a conversation and adapt as the dialogue unfolds. Methods such as reinforcement learning with human feedback can help to offer this adaptability (29). Finally, the behaviour model used was a simplistic one that conceptualizes user behaviour only along three axes -future studies could consider using more sophisticated behavior science frameworks, which may help to better target coach actions.…”
Section: Discussionmentioning
confidence: 99%
“…In reality, it is important to consistently align the coach responses to user need throughout a conversation and adapt as the dialogue unfolds. Methods such as reinforcement learning with human feedback can help to offer this adaptability (29). Finally, the behaviour model used was a simplistic one that conceptualizes user behaviour only along three axes -future studies could consider using more sophisticated behavior science frameworks, which may help to better target coach actions.…”
Section: Discussionmentioning
confidence: 99%
“…Unsupervised learning methods use unlabeled data to discover hidden patterns or structures in the data, such as grouping similar EEG signals into clusters or reducing the dimensionality of highdimensional biosignals [44]. Reinforcement learning methods use feedback from the environment to learn an optimal policy or action for a given state, such as controlling a prosthetic limb using EMG signals [45]. For each category, we will introduce some of the most popular and effective methods and their advantages and limitations, as well as some case studies and applications in biosignal processing.…”
Section: Machine Learning Methods For Biosignal Processingmentioning
confidence: 99%
“…In reality, it is important to consistently align the coach responses to user need throughout a conversation and adapt as the dialogue unfolds. Methods such as reinforcement learning with human feedback can help to offer this adaptability [15]. Finally, the behaviour model used was a simplistic one that conceptualizes user behaviour only along three axesfuture studies could consider using more sophisticated behavior science frameworks, which may help to better target coach actions.…”
Section: Plos Digital Healthmentioning
confidence: 99%
“…over-specialization of the model [12]. Knowledge infusion is an active area of research and many other methods exist including customizing training objectives [13], reinforcement learning with human feedback [14,15], in-context learning via prompt engineering or priming [16,17] and many associated prompt design variants [18][19][20][21]. There have also been numerous strategies to ensemble knowledge infusion techniques, including post-hoc re-ranking or summarization of model outputs to further align the model with the task of interest [22,23].…”
Section: Introductionmentioning
confidence: 99%