2022
DOI: 10.48550/arxiv.2202.13330
Preprint

DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following

Xiaofeng Gao,
Qiaozi Gao,
Ran Gong
et al.

Abstract: Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively. We present DialFRED, a dialogue-enabled embodied instruction following benchmark based on the ALFRED benchmark. DialFRED allows an agent to actively ask questions to the human user; the additional information in the user's response is used by the agent …

Cited by 4 publications (17 citation statements)
References 28 publications
“…Meanwhile, for tasks (Thomason et al, 2019b;Padmakumar et al, 2021) that do not provide an oracle agent to answer question in natural language, researchers also need to build a rule-based (Padmakumar et al, 2021) or neural-based (Roman et al, 2020) oracle. DialFRED (Gao et al, 2022) uses a language model as an oracle to answer questions.…”
Section: Asking For Help
Citation type: mentioning
confidence: 99%
“…Language-Active Environment
Room-to-Room (Anderson et al, 2018b) | Matterport3D | ✗ | Indoor
Room-for-Room | Matterport3D | ✗ | Indoor
Room-Across-Room (Ku et al, 2020) | Matterport3D | ✗ | Indoor
Landmark-RxR (He et al, 2021) | Matterport3D | ✗ | Indoor
XL-R2R (Yan et al, 2020) | Matterport3D | ✗ | Indoor
VLNCE (Krantz et al, 2020) | Habitat | ✗ | Indoor
StreetLearn | Google Street View | ✗ | Outdoor
StreetNav (Hermann et al, 2020) | Google Street View | ✗ | Outdoor
TOUCHDOWN | Google Street View | ✗ | Outdoor
Talk2Nav (Vasudevan et al, 2021) | Google Street View | ✗ | Outdoor
LANI (Misra et al, 2018) | - | ✗ | Outdoor
RoomNav (Wu et al, 2018) | House3D | ✗ | Indoor
EmbodiedQA (Das et al, 2018) | House3D | ✗ | Indoor
REVERIE | Matterport3D | ✗ | Indoor
SOON (Zhu et al, 2021a) | Matterport3D | ✗ | Indoor
IQA (Gordon et al, 2018) | AI2-THOR | ✗ | Indoor
CHAI (Misra et al, 2018) | CHALET | ✗ | Indoor
ALFRED (Shridhar et al, 2020) | AI2-THOR | ✗ | Indoor
VNLA | Matterport3D | ✓ | Indoor
HANNA (Nguyen and Daumé III, 2019) | Matterport3D | ✓ | Indoor
CEREALBAR | - | ✓ | Indoor
Just Ask (Chi et al, 2020) | Matterport3D | ✓ | Indoor
CVDN (Thomason et al, 2019b) | Matterport3D | ✓ | Indoor
RobotSlang (Banerjee et al, 2020) | - | ✓ | Indoor
Talk the Walk (de Vries et al, 2018) | - | ✓ | Outdoor
MC Collab (Narayan- | Minecraft | ✓ | Outdoor
TEACh (Padmakumar et al, 2021) | AI2-THOR | ✓ | Indoor
DialFRED (Gao et al, 2022) | AI2-THOR | ✓ | Indoor…”
Section: Name
Citation type: mentioning
confidence: 99%
“…The follower converses with the commander and interacts with the environment to complete various house tasks such as making coffee. DialFRED (Gao et al, 2022) extends ALFRED (Shridhar et al, 2020) dataset by allowing the agent to actively ask questions.…”
Section: Human Dialogue
Citation type: mentioning
confidence: 99%
“…Meanwhile, for tasks (Thomason et al, 2019b;Padmakumar et al, 2021) that do not provide an oracle agent to answer question in natural language, researchers also need to build a rule-based (Padmakumar et al, 2021) or neural-based (Roman et al, 2020) oracle. DialFRED (Gao et al, 2022) uses a language model as an oracle to answer questions.…”
Section: Asking For Help
Citation type: mentioning
confidence: 99%