2020
DOI: 10.48550/arxiv.2010.09948
Preprint

Object Permanence Through Audio-Visual Representations

Fanjun Bu,
Chien-Ming Huang

Abstract: As robots perform manipulation tasks and interact with objects, it is probable that they accidentally drop objects that subsequently bounce out of their visual fields (e.g., due to an inadequate grasp of an unfamiliar object). To enable robots to recover from such errors, we draw upon the concept of object permanence: objects remain in existence even when they are not being sensed (e.g., seen) directly. In particular, we developed a multimodal neural network model, using a partial, observed bounce trajectory and…

Cited by 1 publication
(1 citation statement)
References 13 publications
“…There are works that, given vision, enhance sounds [30,18], fill in missing sounds [42], and generate sounds entirely from video [32,43]. Further, there have been recent works in integrating vision and sound to improve recognition of environmental properties [3,21,8] and object properties, such as geometry and materials [40,39]. Lastly, there have been works in using audiovisual data for representation learning [33,4,28].…”
Section: Related Work
Type: mentioning
confidence: 99%