2021
DOI: 10.1109/taes.2021.3057649
Policy Rollout Action Selection in Continuous Domains for Sensor Path Planning

Abstract: Policy rollout is a method for the online computation of future costs in approximate dynamic programming, and has been utilized for various problems including sensor management. In previous work, it has predominately been applied to the selection of actions from discrete sets. In this paper we present methods for action selection from continuous sets and analyze their trade-offs. The methods are evaluated on the problem of sensor path planning, with the intent of minimizing the time to localize an emitter usin…

citations
Cited by 7 publications
(6 citation statements)
references
References 39 publications
“…Since the arrival time of measurements from different targets w.r.t. each sensor is random, existing researches [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26] cannot be applied to the 3-D CTO problem in the passive MTT context. We introduce the fusion time interval for time alignment and implement CTO in each fusion time instant.…”
Section: B Main Contributions (mentioning)
confidence: 99%
“…To tackle this challenge, several TO models and corresponding solution algorithms are proposed for target localization/tracking in [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26]. Tzoreff and Weiss [7] discuss online and offline TO problems for a single UAV in the presence of time of arrival (TOA) measurements, subject to speed and no-fly zone constraints.…”
Section: Introduction (mentioning)
confidence: 99%