Perceptive Evaluation for the Optimal Discounted Reward in Markov Decision Processes

Kurano, Masami; Yasuda, Masami; Nakagami, Jun-ichi; Yoshida, Yūji

doi:10.1007/11526018_28

Cited by 2 publications

(3 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The perceptive analysis developed in this paper is related to our previous works. A model of stopping problems is formulated in Kurano et al (2004) and that of Markov decision processes is in Kurano et al (2005a) and Kurano et al (2005b). However the basic assumption implemented in the previous stopping problem is different from this paper.…”

Section: Introductionmentioning

confidence: 94%

A Fuzzy Perceptive Value for Multi-Variate Stopping Problem With a Monotone Rule

Kurano¹,

Yasuda

Nakagami³

et al. 2007

Bulletin of Informatics and Cybernetics

Self Cite

View full text Add to dashboard Cite

show abstract

Section: Introductionmentioning

confidence: 94%

A Fuzzy Perceptive Value for Multi-Variate Stopping Problem With a Monotone Rule

Kurano¹,

Yasuda

Nakagami³

et al. 2007

Bulletin of Informatics and Cybernetics

Self Cite

View full text Add to dashboard Cite

show abstract

“…In [3], a fuzzy total expected reward criterion is analyzed for an MDP with finite state space and with a trapezoidal fuzzy reward function. On the other hand, one of the most studied criteria in the literature is the discounted total expected reward/cost, see, for instance, [2,14,15,16,17] and [25]. In these works, the fuzzy approach is applied either in the reward/cost function ( [2,14,15,25]) or in the dynamic of the system ( [14,16,17]), all of them under finite state and action spaces framework.…”

Section: Introductionmentioning

confidence: 99%

“…On the other hand, one of the most studied criteria in the literature is the discounted total expected reward/cost, see, for instance, [2,14,15,16,17] and [25]. In these works, the fuzzy approach is applied either in the reward/cost function ( [2,14,15,25]) or in the dynamic of the system ( [14,16,17]), all of them under finite state and action spaces framework. In regards to the long-run expected average cost criterion, only the following two works were found: [10] and [13].…”

Section: Introductionmentioning

confidence: 99%

An extended version of average Markov decision processes on discrete spaces under fuzzy environment

Cruz−Suárez¹,

Montes-de-Oca²,

Ortega-Gutiérrez³

2023

Kybernetika

View full text Add to dashboard Cite

The article presents an extension of the theory of standard Markov decision processes on discrete spaces and with the average cost as the objective function which permits to take into account a fuzzy average cost of a trapezoidal type. In this context, the fuzzy optimal control problem is considered with respect to two cases: the max-order of the fuzzy numbers and the average ranking order of the trapezoidal fuzzy numbers. Each of these cases extends the standard optimal control problem, and for each of them the optimal solution is related to a suitable standard optimal control problem, and it is obtained that (i) the optimal policy coincides with the optimal policy of this suitable standard control problem, and (ii) the fuzzy optimal value function is of a trapezoidal shape. Two models: a queueing system and a machine replacement problem are provided in order to examplify the theory given.

show abstract

Perceptive Evaluation for the Optimal Discounted Reward in Markov Decision Processes

Cited by 2 publications

References 15 publications

A Fuzzy Perceptive Value for Multi-Variate Stopping Problem With a Monotone Rule

A Fuzzy Perceptive Value for Multi-Variate Stopping Problem With a Monotone Rule

An extended version of average Markov decision processes on discrete spaces under fuzzy environment

Contact Info

Product

Resources

About