“…To overcome the difficulty, many approximation methods are proposed to obtain optimal tracking control law [2][3][4][5]. Among these approximate approaches, adaptive dynamic programming (ADP) algorithm, proposed by Werbos [6,7], has played an important role in seeking approximate solutions of dynamic programming problems as a way to solve the computational issue forward-in-time [8][9][10][11][12][13][14]. There are several synonyms used for ADP including "adaptive critic designs" [15], "adaptive dynamic programming" [16,17], "approximate dynamic programming" [18,19], "neural dynamic programming" [20], "neuro-dynamic programming" [21], and "reinforcement learning" [22].…”