2020
DOI: 10.1016/j.asoc.2020.106490
|View full text |Cite
|
Sign up to set email alerts
|

Target tracking strategy using deep deterministic policy gradient

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 33 publications
(15 citation statements)
references
References 24 publications
0
15
0
Order By: Relevance
“…The dynamic interaction process can be defined by the Markov decision process (MDP) and divided into finite discrete time steps in the RL control framework 32 . At each time step k, the agent observes state sk, takes action ak, receives scalar reward rk, and then progresses to the next state sk+1, as indicated by the red dashed line τ{}s0,a0,s1,a1,,sN,aN shown in Figure 2b 33…”
Section: Controller Design For Rths Based On Rlmentioning
confidence: 99%
See 1 more Smart Citation
“…The dynamic interaction process can be defined by the Markov decision process (MDP) and divided into finite discrete time steps in the RL control framework 32 . At each time step k, the agent observes state sk, takes action ak, receives scalar reward rk, and then progresses to the next state sk+1, as indicated by the red dashed line τ{}s0,a0,s1,a1,,sN,aN shown in Figure 2b 33…”
Section: Controller Design For Rths Based On Rlmentioning
confidence: 99%
“… |θμJθμ1Ntruei=kNaQ()sk,μ()skfalse|θQa=μθθμμ()skfalse|θμ, where represents the gradient operation. θμJθμ represents the sampled policy gradient, which is typically optimized using various optimizers based on gradient descent, such as the Adam method 32,39 …”
Section: Controller Design For Rths Based On Rlmentioning
confidence: 99%
“…At present, deep reinforcement learning has been widely applied in unmanned vehicle control, [ 20 ] robot path planning and control, [ 21 ] pursuit and avoidance of targets, [ 22 ] unmanned driving [ 23 , 24 ], and real-time strategy games [ 25 , 26 ]. However, most of the reinforcement learning algorithms used in air combat maneuvering decision making are discrete action space algorithms, which inevitably face the problems of rough flight paths and limited reachable domains.…”
Section: Introductionmentioning
confidence: 99%
“…The image contains a variety of information, so the most basic link to restore the real 3D scene from the image taken by UAV is image matching. At present, image matching is currently used extensively in target tracking [1][2][3][4], 3D reconstruction [5][6][7][8], visual SLAM [9][10][11], UAV obstacle avoidance navigation [12,13], and land surveying and mapping [14]. However, one of the common situations in real life is in a low-light environment photographed by UAV.…”
Section: Introductionmentioning
confidence: 99%