“…In this context, the Windows operating system (OS) stands out as representative platform for LAMs, due to its high market share in the daily use of computer systems Adekotujo et al (2020), the presence of versatile applications and GUIs built upon it Ramler et al (2018), and the complexity of tasks that necessitate long-term planning and interaction across various applications Stallings (2005). The prospect of having a general intelligent agent that can comprehend user requests in natural language, and autonomously interact with the UIs of applications built on Windows is highly appealing.…”