“…Assessing the capabilities of Artificial Intelligence (AI) has been an important research direction since the inception of AI and this became more urgent after large language models, especially GPT, attracted popular attention (Bubeck et al, 2023). Most research focuses on cognitive capabilities, such as reasoning (Dasgupta, et al, 2022), induction (Han, et al, 2022), and creativity (Stevenson, et al, 2022; Uludag, 2023). Recently, Bubeck et al (2023) conducted a wide range of tests on GPT-4, the latest model developed by OpenAI, exploring its mathematical abilities, multimodal capabilities, tool usage, and coding.…”