“…An interesting problem is how to control the pendulum so that it can keep its equilibrium upward state in various conditions [8][9][10]. Recently, there are many algorithms in controlling the RIP systems such as LQR [6], PID [11,12], sliding mode [1,[13][14][15][16][17], backstepping [18], semi-optimal [8,19], fuzzy logic [9,20,21], and reinforcement learning controllers [22], etc. Due to the unique instability nature, the pendulum will fall instantly if affected by any strong external disturbance.…”