“…There are some papers considering the multi-agent in the sequential decision-making systems including the cooperative setting (Littman, 2001;González-Sánchez and Hernández-Lerma, 2013;Zhang et al, 2018;Perolat et al, 2018;Shi et al, 2022) and competing setting (Littman, 1994;Auer and Ortner, 2006;Zinkevich et al, 2007;Wei et al, 2017;Fiez et al, 2019;Jin et al, 2020). Zhong et al (2021) study the multi-player general-sum Markov games with one of the players designated as the leader and the other players regarded as followers and establish the efficient RL algorithms to achieve the Stackelberg-Nash equilibrium.…”