“…We assume a closed world (A.5), but in general an embodied agent will encounter new people, objects, and parts of an environment. There are active lines of research on environment exploration (Wang et al., 2018), object discovery (Tucker, Aksaray, Paul, Stein, & Roy, 2017), and identifying missing referents via dialog (Amiri, Bajracharya, Goktolga, Thomason, & Zhang, 2019). These strategies are compatible with our current dialog framework, which grounds to all known, relevant referents and asks enumeration questions about them via visual depictions; for a new referent, such a depiction can be obtained by taking a photo.…”