“…They have been used in robotics to study environments with multiple autonomous robots [1,14,17,23], and in control systems to study the motion of mobile robots [22,26]. They have also been used in telecommunications, in conjunction with Q-Learning [2,19,20], to enhance routing techniques [18]; in power engineering, to explore the "decisions inherent in engineering multi-agent systems" for power-related applications [12,13]; and in power systems, where multiagent reinforcement learning is used to solve the problems that arise from the nonlinearity of a power system [4].…”