2021
DOI: 10.1109/tnnls.2020.2981377
|View full text |Cite
|
Sign up to set email alerts
|

Actor–Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 22 publications
0
3
0
Order By: Relevance
“…First, the impact of λ , β , θ , the number of SUs and the number of channels on the performance of the proposed scheme is analyzed. Then the performance of the proposed spectrum access algorithm Feilin is compared with DQN+RC [18], Q-learning [11], PG+RDA [17], and MPQ-L+DPG [38]. All results in the following scenarios are the average of 1000 independent experiments.…”
Section: A Experimental Setupmentioning
confidence: 99%
See 1 more Smart Citation
“…First, the impact of λ , β , θ , the number of SUs and the number of channels on the performance of the proposed scheme is analyzed. Then the performance of the proposed spectrum access algorithm Feilin is compared with DQN+RC [18], Q-learning [11], PG+RDA [17], and MPQ-L+DPG [38]. All results in the following scenarios are the average of 1000 independent experiments.…”
Section: A Experimental Setupmentioning
confidence: 99%
“…Based on the deterministic policy [15], the deep deterministic policy gradient (DDPG) was developed for continuous control [16]. To obtain the solution of the minimization problem by learning stochastic and deterministic approximate optimal policies, a regularized dual-averaging policy gradient (RDA-PG) scheme was proposed [17]. However, the learning-based methods mentioned above depend on the centralized model training, which increases transmission overhead and degrades the real-time performance.…”
mentioning
confidence: 99%
“…Making sophisticated statistical methods applicable to the existing market data and to provide better data analysis for investor decision-making have become a hot topic. In particular, in the fields of index tracking, portfolio management, and risk hedging, broad application platforms for feature selection methods arise [4], [5]. Index tracking is a significant investment strategy [2], [3]in fund management that aims to replicate the movements of a specific market index.…”
Section: Introduction a Related Workmentioning
confidence: 99%