2021

DOI: 10.1016/j.neucom.2020.12.116

|View full text |Cite

|

Sign up to set email alerts

|

Demonstration actor critic

¹

,

²

,

³

et al.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Related Work1

Citation Types

Supporting

0

Mentioning

1

Contrasting

0

Year Published

2022

2022

2024

2024

Publication Types

Select...

Article4

Other1

Relationship

Self Cite0

Independent5

Authors

Journals

Cited by 5 publications

(1 citation statement)

References 4 publications

Supporting

0

Mentioning

1

Contrasting

0

Order By: Relevance

“…Value-based algorithms learn the optimal value function and then use it to derive the optimal policy, whereas policy-based algorithms learn the optimal policy directly. Actor-critic methods [69,94] are hybrid approaches that use policy-based methods to improve a policy while also evaluating it by estimating its corresponding value function. Several studies, including [22,95], investigated the adaptability of value-based algorithms to environmental changes.…”

Section: Related Workmentioning

confidence: 99%

Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning☆

¹

,

²

,

³

2023

Neurocomputing

View full text Add to dashboard Cite

No abstract

“…Value-based algorithms learn the optimal value function and then use it to derive the optimal policy, whereas policy-based algorithms learn the optimal policy directly. Actor-critic methods [69,94] are hybrid approaches that use policy-based methods to improve a policy while also evaluating it by estimating its corresponding value function. Several studies, including [22,95], investigated the adaptability of value-based algorithms to environmental changes.…”

Section: Related Workmentioning

confidence: 99%

Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning☆

¹

,

²

,

³

2023

Neurocomputing

View full text Add to dashboard Cite

No abstract

Soft imitation reinforcement learning with value decomposition for portfolio management

Dong,

Zheng

2024

Applied Soft Computing

View full text Add to dashboard Cite

No abstract

Autonomous driving policy learning from demonstration using regression loss function

Xiao,

An,

Li

et al. 2024

Knowledge-Based Systems

View full text Add to dashboard Cite

No abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Product

Browser Extension Assistant by scite Citation Statement Search Reference Check Visualizations Dashboards Explore Journals Explore Organizations Explore Funders Embedding Badge Embedding Citation Search Pricing

Resources

Blog Help & FAQ Accessibility Statement API Terms For Universities & Governments For Researchers For Publishers For Corporate, Pharma & Enterprise Author Marketing Become an Affiliate Get an organization trial or quote scite Data & Services

About

News & Press Careers Read our Paper Coverage

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Copyright © 2024 scite LLC. All rights reserved.

Made with 💙 for researchers

Part of the Research Solutions Family.