Proceedings of the 24th Conference on Computational Natural Language Learning 2020
DOI: 10.18653/v1/2020.conll-1.33

Continual Adaptation for Efficient Machine Communication

Abstract: To communicate with new partners in new contexts, humans rapidly form new linguistic conventions. Recent neural language models are able to comprehend and produce the existing conventions present in their training data, but are not able to flexibly and interactively adapt those conventions on the fly as humans do. We introduce an interactive repeated reference task as a benchmark for models of adaptation in communication and propose a regularized continual learning framework that allows an artificial agent ini…
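
The abstract describes a regularized continual learning setup: an agent pre-trained on generic data keeps fine-tuning on each round of interaction with a specific partner, while a regularizer discourages it from drifting too far from its initial, broadly trained parameters. The sketch below illustrates that general recipe only; the toy model, cross-entropy loss, L2-to-initial penalty, and hyperparameters are illustrative assumptions, not the paper's exact architecture or objective.

```python
# Minimal sketch of regularized continual adaptation (assumptions noted above):
# after each interaction round, fine-tune on the new data while an L2 penalty
# keeps the adapted parameters close to the initial, generically trained model.
import copy
import torch
import torch.nn as nn

class RefClassifier(nn.Module):
    """Toy stand-in for a pretrained reference model (utterance features -> referent)."""
    def __init__(self, n_features=64, n_referents=4):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(n_features, 32), nn.ReLU(),
                                 nn.Linear(32, n_referents))

    def forward(self, x):
        return self.net(x)

def adapt_round(model, init_model, batch, lam=1.0, lr=1e-3, epochs=5):
    """Fine-tune on one round of interaction data with an L2 pull toward init_model."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    x, y = batch
    for _ in range(epochs):
        opt.zero_grad()
        task_loss = nn.functional.cross_entropy(model(x), y)
        # Regularizer: penalize distance from the frozen initial parameters.
        reg = sum((p - p0).pow(2).sum()
                  for p, p0 in zip(model.parameters(), init_model.parameters()))
        (task_loss + lam * reg).backward()
        opt.step()

if __name__ == "__main__":
    torch.manual_seed(0)
    init_model = RefClassifier()
    model = copy.deepcopy(init_model)      # the adapted copy is trained
    for p in init_model.parameters():
        p.requires_grad_(False)            # the initial model stays frozen
    for round_idx in range(6):             # repeated reference game rounds
        x = torch.randn(8, 64)             # toy utterance features
        y = torch.randint(0, 4, (8,))      # toy referent labels
        adapt_round(model, init_model, (x, y))
```

Swapping the L2 penalty for a KL term between the adapted and initial output distributions, or mixing rehearsal data into each round, are common variants of the same regularization idea.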

Cited by 13 publications (25 citation statements)
References 58 publications

“…The study presented in this paper provides new empirical evidence on language production in dialogue which we believe can directly inform the development of natural language generation models. Our findings suggest that models that take relevant contextual units into account (Takmaz et al., 2020; Hawkins et al., 2020) are better suited for reproducing human patterns of information transmission, and confirm that the use of training objectives that enforce a uniform organisation of information density (Meister et al., 2020; Wei et al., 2021) is a promising avenue for training language models.…”
Section: Discussion (supporting)
confidence: 69%
“…For instance, hierarchical architectures that appropriately incorporate compositionality or incrementality into the speaker's production model may be able to reinforce component parts of longer utterances in the shared history (e.g. Hawkins, Kwon, Sadigh, & Goodman, 2020). Still, such an approach would have more in common with our proposal than with the model-free heuristics in the existing literature.…”
Section: Discussion (mentioning)
confidence: 99%
“…We vary different design decisions, and experiment for seven interaction rounds. We experiment with five system variants: (a) FULL: our full approach described in Section 5; (b) POS-ONLY: use only examples with positive labels y = +1; (c) TC-ONLY: ignore the feedback questions; instead, if the user completes the task according to our task success measure, we add positive examples with both the system plan and user execution, otherwise we add a negative example using the system plan; (d) NO-ENSEMBLE: train and deploy a single model each round, starting from an initial model randomly sampled from those we use for FULL; and (e) FINE-TUNING: train model parameters θ_{r+1} on D_r for N epochs, starting from θ_r, avoiding overfitting with rehearsal (Rebuffi et al., 2017; Hawkins et al., 2020a). In rehearsal, in each batch, half the examples are sampled randomly from the previous datasets D_0,…”
Section: System Variants Study (mentioning)
confidence: 99%
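
The fine-tuning variant quoted above guards against overfitting with rehearsal: each training batch mixes new examples with examples drawn from earlier rounds. Below is a minimal sketch of that batch construction under stated assumptions; the function name rehearsal_batches, the batch size, and the toy data are illustrative and not the cited system's actual code.

```python
import random

def rehearsal_batches(current_data, previous_datasets, batch_size=16):
    """Build batches where half the examples come from the current round's
    data D_r and half are sampled from the pooled previous datasets
    D_0, ..., D_{r-1}, to reduce forgetting during fine-tuning."""
    old_pool = [ex for dataset in previous_datasets for ex in dataset]
    data = list(current_data)
    random.shuffle(data)
    half = batch_size // 2
    for i in range(0, len(data), half):
        new_part = data[i:i + half]
        old_part = (random.sample(old_pool, min(half, len(old_pool)))
                    if old_pool else [])
        batch = new_part + old_part
        random.shuffle(batch)
        yield batch

# Toy usage: round r = 2 fine-tunes on D_2 with rehearsal from D_0 and D_1.
D = [[f"example_{r}_{i}" for i in range(20)] for r in range(3)]
for batch in rehearsal_batches(D[2], D[:2]):
    pass  # feed `batch` to the fine-tuning step for this round
```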