A survey on large language model based autonomous agents

Wang, Lei; Ma, Chen; Feng, Xueyang; Zhang, Zeyu; Yang, Hao; Zhang, Jingsen; Chen, Zhiyuan; Tang, Jiakai; Chen, Xu; Lin, Yankai; Zhao, Wayne Xin; Wei, Zhewei; Wen, Jirong

doi:10.1007/s11704-024-40231-1

Cited by 52 publications

(8 citation statements)

References 47 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To alleviate loneliness among older adults, companion robots can provide users with the opportunity to reconnect with friends and family, thereby, mitigating the risks of over-reliance on interaction with technology. Foundation models capable of utilizing tools for social media, phones, and various devices (see Wang et al (2024) for a survey) that leverage edge computing can enable this functionality (e.g., Dong L. et al, 2023;Shen et al, 2023). Additionally, robots can facilitate new online connections for users by harnessing their social media networks with the assistance of other deep learning architectures (e.g., Ding et al (2017); Chen et al (2020).…”

Section: Social Engagementmentioning

confidence: 99%

“…Semantic understanding, i.e., the relations among entities within visual scenes through object, scene, or action recognition, can be achieved with foundation models to provide advice based on the situational context that extends beyond the capabilities of verbal context ( Bommasani et al, 2022 ). For instance, the robot can suggest the user a recipe based on their preferences, and offer help with cooking verbally or potentially physically if integrated with manipulators, in which foundation models can be used for generating robot plans and actions, by referring to/using the learned locations of the equipment and ingredients (see Wang et al (2024) ; Firoozi et al (2023) for surveys of LLMs and foundation models in robotics for task planning and control).…”

Section: Design Recommendationsmentioning

confidence: 99%

See 1 more Smart Citation

Recommendations for designing conversational companion robots with older adults through foundation models

Irfan,

Kuoppamäki,

Skantze

2024

Front. Robot. AI

View full text Add to dashboard Cite

Companion robots are aimed to mitigate loneliness and social isolation among older adults by providing social and emotional support in their everyday lives. However, older adults’ expectations of conversational companionship might substantially differ from what current technologies can achieve, as well as from other age groups like young adults. Thus, it is crucial to involve older adults in the development of conversational companion robots to ensure that these devices align with their unique expectations and experiences. The recent advancement in foundation models, such as large language models, has taken a significant stride toward fulfilling those expectations, in contrast to the prior literature that relied on humans controlling robots (i.e., Wizard of Oz) or limited rule-based architectures that are not feasible to apply in the daily lives of older adults. Consequently, we conducted a participatory design (co-design) study with 28 older adults, demonstrating a companion robot using a large language model (LLM), and design scenarios that represent situations from everyday life. The thematic analysis of the discussions around these scenarios shows that older adults expect a conversational companion robot to engage in conversation actively in isolation and passively in social settings, remember previous conversations and personalize, protect privacy and provide control over learned data, give information and daily reminders, foster social skills and connections, and express empathy and emotions. Based on these findings, this article provides actionable recommendations for designing conversational companion robots for older adults with foundation models, such as LLMs and vision-language models, which can also be applied to conversational robots in other domains.

show abstract

Section: Social Engagementmentioning

confidence: 99%

Section: Design Recommendationsmentioning

confidence: 99%

Recommendations for designing conversational companion robots with older adults through foundation models

Irfan,

Kuoppamäki,

Skantze

2024

Front. Robot. AI

View full text Add to dashboard Cite

show abstract

“…In recent years, the revolutionary advancements in deep learning technologies, highlighted by the introduction of LLMs such as the GPT series, have empowered LLM-based agents with formidable natural language processing capabilities. Agents 41,42,43,98 , capable of autonomously perceiving their environment, cognitive reasoning, decision-making, and executing actions through tool invocation, have emerged as a highly promising direction in the pursuit of general artificial intelligence. Specifically, an AI agent comprises four modules: the perception module for gathering environmental information, the cognition and decision-making module to analyze inputs and devise action strategies, the memory module to archive knowledge and past behaviors, and the action module to implement decisions by manipulating tools to impact the environment.…”

Section: Literature Reviewmentioning

confidence: 99%

“…1b. Agents, the core components of DII-MAS, employ LLMs as their controlling nucleus and exhibit a high degree of autonomous capability 41,42,43 .…”

Section: B Research Statementmentioning

confidence: 99%

See 1 more Smart Citation

A Data-Intelligence-Intensive Bioinformatics Copilot System for Large-scale Omics Researches and Scientific Insights

Liu,

Shen,

Zhou

et al. 2024

Preprint

View full text Add to dashboard Cite

Advancements in high-throughput sequencing technologies and artificial intelligence offer unprecedented opportunities for groundbreaking discoveries while posing significant analytical challenges. This study introduces a data-intelligence-intensive scientific research paradigm that synergizes human expertise with AI to facilitate hypothesis-free exploratory research in life science. We propose a multi-agent system (DII-MAS) based on large language models (LLMs), enabling efficient human-agent interaction, agent group management, interdisciplinary knowledge empowerment, and continuous learning. This novel framework is demonstrated through the construction of a human lung cell atlas, showcasing its capability to overcome the limitations of standalone AI applications, improve research efficiency, and adapt to complex life science tasks. This study substantiates three key hypotheses: the collective intelligence workflow can significantly propel life science tasks, proactive interactions within DII-MAS mitigate comprehension biases and incomplete information issues, and continuous learning empowers DII-MAS to make optimal decisions and tool selections. The contributions of this study comprise the delineation of a data-intelligence-intensive research paradigm, the development of DII-MAS, and the introduction of novel evaluation metrics for agent performance. This study underscores the potential of integrating AI with expert knowledge to accelerate discoveries and navigate uncharted territories in life sciences.

show abstract

Legally-Guided Automated Decision-Making System Using Language Model Agents for Autonomous Driving

Wang,

Barta,

Hesse

et al. 2024

Lecture Notes in Computer Science

View full text Add to dashboard Cite

A survey on large language model based autonomous agents

Cited by 52 publications

References 47 publications

Recommendations for designing conversational companion robots with older adults through foundation models

Recommendations for designing conversational companion robots with older adults through foundation models

A Data-Intelligence-Intensive Bioinformatics Copilot System for Large-scale Omics Researches and Scientific Insights

Legally-Guided Automated Decision-Making System Using Language Model Agents for Autonomous Driving

Contact Info

Product

Resources

About