2022
DOI: 10.1049/csy2.12059

A new noise network and gradient parallelisation‐based asynchronous advantage actor‐critic algorithm

Abstract: The asynchronous advantage actor-critic (A3C) algorithm is a commonly used policy optimization algorithm in reinforcement learning, in which "asynchronous" refers to parallel interactive sampling and training, and "advantage" refers to a sampling-based multi-step reward estimation method used for computing weights. To address the low efficiency and insufficient convergence caused by the traditional heuristic exploration of the A3C algorithm in reinforcement learning, this paper proposes an improved A3C algorithm. In this…

Citations: cited by 2 publications (3 citation statements)
References: 16 publications
“…To address this problem, a reinforcement learning approach using Generalized Advantage Estimation (GAE) to enable autonomous vehicles to learn to navigate complex environments is proposed [20], [21]. Reinforcement learning enables agents to learn by interacting with their environment and receiving feedback as rewards or penalties [22].…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…To facilitate the management and retrieval of available information, tag recommendation systems have become an indispensable component of many commerce platforms [1], enabling users to more quickly select products of interest. A large number of applications are gradually gaining attention under the continuous development and popularity of intelligent technology [2, 3]. Many web applications employ tag recommendation systems to help users index resources of interest, such as Flickr, Delicious, LastFm, MovieLens and Bibsonomy.…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…attention under the continuous development and popularity of intelligent technology [2,3]. Many web applications employ tag recommendation systems to help users index resources of interest, such as Flickr, Delicious, LastFm, MovieLens and Bibsonomy.…”
Citation type: mentioning
confidence: 99%