“…In [3], a fuzzy total expected reward criterion is analyzed for an MDP with finite state space and with a trapezoidal fuzzy reward function. On the other hand, one of the most studied criteria in the literature is the discounted total expected reward/cost, see, for instance, [2,14,15,16,17] and [25]. In these works, the fuzzy approach is applied either in the reward/cost function ( [2,14,15,25]) or in the dynamic of the system ( [14,16,17]), all of them under finite state and action spaces framework.…”