“…Few works consider the more complex Partially Observable Markov Decision Process (POMDP) where the observation is just a partial representation of the underlying state. However, POMDP is ubiquitous in real robotics applications [16], [17], such as robot navigation [18], [19], robotic manipulation [20], autonomous driving [21], [22], [23], and planning under uncertainty [24], [25], [26], [27]. Partial observability may be due to limited sensing capability, or an incomplete system model resulting in uncertainty about full observability.…”