“…XLand [20] also focuses on the generalization capability of agents and supports multi-agent scenarios, but it is not open-source. Existing Interest This environment has been used as a testbed for RL in research competitions 2 and many researchers have conducted experiments under the environment of Honor of Kings [3,4,11,24,25,26,28,29,27].Though some of them verified the feasibility of reinforcement learning in tackling the game [11,26,28,29], they are more focused on methodological novelty in planning, treesearching, etc. Unlike these papers, this paper focuses on making the environment open-accessible and providing benchmarking results, which could serve as a reference and foundation for future research.…”