2021
DOI: 10.48550/arxiv.2103.04918
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Survey of Embodied AI: From Simulators to Research Tasks

Abstract: There has been an emerging paradigm shift from the era of "internet AI" to "embodied AI", whereby AI algorithms and agents no longer simply learn from datasets of images, videos or text curated primarily from the internet. Instead, they learn through embodied physical interactions with their environments, whether real or simulated. Consequently, there has been substantial growth in the demand for embodied AI simulators to support a diversity of embodied AI research tasks. This growing interest in embodied AI i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 120 publications
0
5
0
Order By: Relevance
“…Artificial agents may be able to obtain these same benefits if they learn in a similar way rather than being trained from a predefined dataset of static images (Smith and Gasser 2005). To facilitate this, various simulation tools have been developed that allow artificial agents to roam and interact in a simulated world to learn about it directly (Duan et al 2021). An example of applying this to object normalization is to learn spatial invariants of object classification by approaching objects in different ways in the simulated world (Caudell et al 2011).…”
Section: Feasibility Of Architecture Componentsmentioning
confidence: 99%
“…Artificial agents may be able to obtain these same benefits if they learn in a similar way rather than being trained from a predefined dataset of static images (Smith and Gasser 2005). To facilitate this, various simulation tools have been developed that allow artificial agents to roam and interact in a simulated world to learn about it directly (Duan et al 2021). An example of applying this to object normalization is to learn spatial invariants of object classification by approaching objects in different ways in the simulated world (Caudell et al 2011).…”
Section: Feasibility Of Architecture Componentsmentioning
confidence: 99%
“…However, those works primarily focused on human intuitions of recognizing or predicting motions. But with the advancement and rise of deep learning, computer graphics, and embodied AI [22,23,24], there has been a paradigm shift towards generating synthetic datasets that range from simple 2D cartoons [25,26,27] to realistic interaction in 3D environments [13,28,14,29,30,15], all with the aim to explore machine perception of physics and causal reasoning on a deeper level. However, only datasets from CLEVRER [15], CoPhy [13], and CATER [14] are most relevant to our work.…”
Section: Related Workmentioning
confidence: 99%
“…The visual quality of the rendered scenes depends both on the features of the 3D engine and on the design of the 3D models that are shared together with the environment itself. Several different tasks are studied using these 3D simulators, such as generic robot navigation, visual recognition, visual QA -see [22] and references therein. Some environments are developed in the context of open projects that might benefit from the contributions of large communities [2], [7], [8], while others are based on closed source solutions [4].…”
Section: A 3d Virtual Environmentsmentioning
confidence: 99%