Constrained Markov Decision Processes 2021
DOI: 10.1201/9781315140223-3

Markov decision processes


citations
Cited by 257 publications
(489 citation statements)
references
References 0 publications
“…An overview of AI safety methods can be found in Pecka and Svoboda (2014) and García and Fernández (2015). A large body of work in the area of Safe AI focuses on constrained RL (Altman, 1999; Wen and Topcu, 2018). Constrained RL depends on a-priori defined safety constraints which are states or actions that the agent should avoid.…”
Section: AI Safety (mentioning)
confidence: 99%
“…Stochastic Optimization using Markov Decision Processes has very rich roots (Howard, 1960). There have been work in understanding convergence of the algorithm to find optimal policies for known MDPs (Bertsekas and Tsitsiklis, 1996; Altman, 1999). Also, when the MDP is not known, there are algorithms with asymptotic guarantees for learning the optimal policies (Watkins and Dayan, 1992) which maximize an objective without any constraints.…”
Section: Related Work (mentioning)
confidence: 99%
“…Similar to the example above, many applications require to keep some costs low while simultaneously maximizing the rewards (Altman, 1999). Owing to the importance of this problem, in this paper, we consider the problem of constrained Markov Decision Processes (constrained MDP or CMDP).…”
Section: Introduction (mentioning)
confidence: 99%