2020
DOI: 10.1155/2020/4708075

Hybrid Online and Offline Reinforcement Learning for Tibetan Jiu Chess

Abstract: In this study, hybrid state-action-reward-state-action (SARSA(λ)) and Q-learning algorithms are applied to different stages of an upper confidence bound applied to trees (UCT) search for Tibetan Jiu chess. Q-learning is also used to update all the nodes on the search path when each game ends. A learning strategy that combines the SARSA(λ) and Q-learning algorithms with domain knowledge to form feedback functions for the layout and battle stages is proposed. An improved deep neural network based on ResNet18 is used for self-play tr…
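The abstract names two tabular updates: an on-policy SARSA(λ) update used during the search stages, and an off-policy Q-learning backup applied to every node on the search path once a game ends. The sketch below illustrates only those two update rules; it is not the authors' implementation. The state/action encoding, the `successors` helper, and the hyperparameter values are illustrative assumptions.

```python
# Minimal sketch of the two tabular updates named in the abstract.
# Not the paper's code: the state/action encoding, successors(), and the
# hyperparameters below are assumptions made for illustration.
from collections import defaultdict

ALPHA, GAMMA, LAMBDA = 0.1, 0.99, 0.8  # assumed learning rate, discount, trace decay

Q = defaultdict(float)  # Q[(state, action)] -> value estimate
E = defaultdict(float)  # eligibility traces for SARSA(lambda)

def sarsa_lambda_step(s, a, r, s_next, a_next):
    """On-policy SARSA(lambda) update after one transition (s, a, r, s', a')."""
    delta = r + GAMMA * Q[(s_next, a_next)] - Q[(s, a)]
    E[(s, a)] += 1.0  # accumulating trace for the visited pair
    for key in list(E):
        Q[key] += ALPHA * delta * E[key]
        E[key] *= GAMMA * LAMBDA  # decay every trace toward zero

def q_learning_backup(path, terminal_reward, successors):
    """Off-policy Q-learning backup over every (state, action) node on the
    search path, applied leaf-to-root once the game ends."""
    target = terminal_reward
    for s, a in reversed(path):
        Q[(s, a)] += ALPHA * (target - Q[(s, a)])
        # bootstrap the parent's target from the best action at this state
        target = GAMMA * max((Q[(s, b)] for b in successors(s)), default=0.0)

# Toy usage with string states and integer actions (purely illustrative):
sarsa_lambda_step("s0", 1, 0.0, "s1", 2)
q_learning_backup([("s0", 1), ("s1", 2)], terminal_reward=1.0,
                  successors=lambda s: [1, 2])
```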

Cited by 5 publications (1 citation statement)
References 27 publications (35 reference statements)
“…Since there is no interaction between the agent and the physical model of the ADN during training, this method can achieve physical-model-free control. However, it requires a large amount of training data, and distribution mismatch may degrade the performance of the algorithm even when sufficiently large and diverse data are given [33]. Ref.…”
Section: Introduction
Citation type: mentioning (confidence: 99%)