“…Markov game formalism was extended to continuous state-action space problems by proposing a continuous action variant of Minimax-Q. Continuous action neural Markov game control [31] has been found to outperform other game based and nongame based contemporary control schemes, i.e., RL based robust controller [62], and an H ∞ theory based robust game controller [50]. For the inverted pendulum swing-up, each trial is started from an initial state x 0 = ( 0 , 0 ) where  0 and 0 are selected from a uniform distribution that ranges between 0 and 0.01.…”