2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton) 2018
DOI: 10.1109/allerton.2018.8636086
|View full text |Cite
|
Sign up to set email alerts
|

Policy Design for Active Sequential Hypothesis Testing using Deep Learning

Abstract: Information theory has been very successful in obtaining performance limits for various problems such as communication, compression and hypothesis testing. Likewise, stochastic control theory provides a characterization of optimal policies for Partially Observable Markov Decision Processes (POMDPs) using dynamic programming. However, finding optimal policies for these problems is computationally hard in general and thus, heuristic solutions are employed in practice. Deep learning can be used as a tool for desi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(30 citation statements)
references
References 21 publications
(25 reference statements)
0
30
0
Order By: Relevance
“…It was shown in [10] that maximizing J g N can be formulated as a Partially Observable Markov Decision Problem (POMDP). Using heuristics for solving POMDPs, we can approximately optimize J g N and one such heuristic was presented in [10]. The strategy based on EJS divergence [9] happens to be a one-step greedy policy with respect to this POMDP.…”
Section: Discussion On Strategy Designmentioning
confidence: 99%
“…It was shown in [10] that maximizing J g N can be formulated as a Partially Observable Markov Decision Problem (POMDP). Using heuristics for solving POMDPs, we can approximately optimize J g N and one such heuristic was presented in [10]. The strategy based on EJS divergence [9] happens to be a one-step greedy policy with respect to this POMDP.…”
Section: Discussion On Strategy Designmentioning
confidence: 99%
“…There is a plethora of studies related to active hypothesis testing that explore the model-based approaches in the literature [13]- [17]. Recently, the active hypothesis testing framework is combined with deep learning algorithms to design data-driven anomaly detection algorithms [11], [12], [18], [19]. However, all these algorithms assume a static setting where the states of the processes remain unchanged throughout the decision-making process.…”
Section: A Related Literaturementioning
confidence: 99%
“…We note that this approach is different from the Bayesian log likelihood ratio based-approach in [3], [4], [10]. Therefore, we define the instantaneous objective function that the agent aims to minimize at time k as follows:…”
Section: B Preferencesmentioning
confidence: 99%