2011
DOI: 10.1002/rnc.1686
|View full text |Cite
|
Sign up to set email alerts
|

Approximate dynamic programming with state aggregation applied to UAV perimeter patrol

Abstract: SUMMARYOne encounters the curse of dimensionality in the application of dynamic programming to determine optimal policies for large-scale controlled Markov chains. In this paper, we provide a reward-based aggregation method to construct suboptimal policies for a perimeter surveillance control problem which gives rise to a large scale Markov chain. The novelty of this approach lies in circumventing the need for value iteration over the entire state space. Instead, the state space is partitioned and the value fu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(16 citation statements)
references
References 16 publications
0
16
0
Order By: Relevance
“…Note that the alarm state is zeroed out upon a loiter by one of the UAVs. In addition the model of [4] keeps up with the length of time each alert waits to be serviced. We denote by T i (t) the number of time steps an alert at station i has been waiting to be serviced by a UAV.…”
Section: Defining the Perimeter Patrol Problemmentioning
confidence: 99%
See 2 more Smart Citations
“…Note that the alarm state is zeroed out upon a loiter by one of the UAVs. In addition the model of [4] keeps up with the length of time each alert waits to be serviced. We denote by T i (t) the number of time steps an alert at station i has been waiting to be serviced by a UAV.…”
Section: Defining the Perimeter Patrol Problemmentioning
confidence: 99%
“…To illustrate, we consider the example computed and illustrated in [4]'s paper, with 1 UAV, 4 UGSs, and a perimeter discretized with N=15 locations. For the original information gain/waiting time penalty formulation, we take =0.9, =0.0013.…”
Section: Max-plus Solution Of Perimeter Patrol Planningmentioning
confidence: 99%
See 1 more Smart Citation
“…The distance between the single well in the changqing oil field areas is very long, and the environment for esert and animal husbandry is very complex [4,5,6], and the complex terrain of inspection line, which cause the way of manual inspection is inefficient by walking. Labor patrol also has the risk as personnel injury [7].…”
Section: Introductionmentioning
confidence: 99%
“…The first sub-factory of PCOC is located in the Yishan Slope of the Ordos Basin in Yan'an, all of the region is hilly gully region [7,8,9], most of the oil pipeline is laying along the topography, which is difficult for labor patrol. So you would like to explore drones in the pipeline patrolling, alleviate patrol staff workload.…”
Section: Introductionmentioning
confidence: 99%