Emily Dinan scite author profile

Chit-chat models are known to have several problems: they lack specificity, do not display a consistent personality and are often not very captivating. In this work we present the task of making chit-chat more engaging by conditioning on profile information. We collect data and train models to (i) condition on their given profile information; and (ii) information about the person they are talking to, resulting in improved dialogues, as measured by next utterance prediction. Since (ii) is initially unknown, our model is trained to engage its partner with personal topics, and we show the resulting dialogue can be used to predict profile information about the interlocutors.

show abstract

Adversarial NLI: A New Benchmark for Natural Language Understanding

Nie¹,

Williams²,

Dinan³

et al. 2020

385

451

View full text Add to dashboard Cite

We introduce a new large-scale NLI benchmark dataset, collected via an iterative, adversarial human-and-model-in-the-loop procedure. We show that training models on this new dataset leads to state-of-the-art performance on a variety of popular NLI benchmarks, while posing a more difficult challenge with its new test set. Our analysis sheds light on the shortcomings of current state-of-theart models, and shows that non-expert annotators are successful at finding their weaknesses. The data collection method can be applied in a never-ending learning scenario, becoming a moving target for NLU, rather than a static benchmark that will quickly saturate.

show abstract

Recipes for Building an Open-Domain Chatbot

Roller¹,

Dinan²,

Goyal³

et al. 2021

290

282

View full text Add to dashboard Cite

Building open-domain chatbots is a challenging area for machine learning research. While prior work has shown that scaling neural models in the number of parameters and the size of the data they are trained on gives improved results, we highlight other ingredients. Good conversation requires blended skills: providing engaging talking points, and displaying knowledge, empathy and personality appropriately, while maintaining a consistent persona. We show that large scale models can learn these skills when given appropriate training data and choice of generation strategy. We build variants of these recipes with 90M, 2.7B and 9.4B parameter models, and make our models and code publicly available. Human evaluations show our best models outperform existing approaches in multi-turn dialogue on engagingness and humanness measurements. We then discuss the limitations of this work by analyzing failure cases of our models.

show abstract

The Second Conversational Intelligence Challenge (ConvAI2)

et al. 2019

View full text Add to dashboard Cite

We describe the setting and results of the ConvAI2 NeurIPS competition that aims to further the state-of-the-art in open-domain chatbots. Some key takeaways from the competition are: (i) pretrained Transformer variants are currently the best performing models on this task, (ii) but to improve performance on multi-turn conversations with humans, future systems must go beyond single word metrics like perplexity to measure the performance across sequences of utterances (conversations) in terms of repetition, consistency and balance of dialogue acts (e.g. how many questions asked vs. answered).

show abstract

Recipes for building an open-domain chatbot

Roller¹,

Dinan²,

Goyal³

et al. 2020

Preprint

202

View full text Add to dashboard Cite

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Zhang

Dinan

Urbanek

et al. 2018

Preprint

149

View full text Add to dashboard Cite

Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack

Dinan¹,

Humeau²,

Chintagunta³

et al. 2019

137

View full text Add to dashboard Cite

The detection of offensive language in the context of a dialogue has become an increasingly important application of natural language processing. The detection of trolls in public forums (Galán-García et al., 2016), and the deployment of chatbots in the public domain (Wolf et al., 2017) are two examples that show the necessity of guarding against adversarially offensive behavior on the part of humans. In this work, we develop a training scheme for a model to become robust to such human attacks by an iterative build it, break it, fix it strategy with humans and models in the loop. In detailed experiments we show this approach is considerably more robust than previous systems. Further, we show that offensive language used within a conversation critically depends on the dialogue context, and cannot be viewed as a single sentence offensive detection task as in most previous work. Our newly collected tasks and methods will be made open source and publicly available.

show abstract

Retrieve and Refine: Improved Sequence Generation Models For Dialogue

Weston¹,

Dinan²,

Miller³

2018

130

111

View full text Add to dashboard Cite

Sequence generation models for dialogue are known to have several problems: they tend to produce short, generic sentences that are uninformative and unengaging. Retrieval models on the other hand can surface interesting responses, but are restricted to the given retrieval set leading to erroneous replies that cannot be tuned to the specific context. In this work we develop a model that combines the two approaches to avoid both their deficiencies: first retrieve a response and then refine it -the final sequence generator treating the retrieval as additional context. We show on the recent CON-VAI2 challenge task our approach produces responses superior to both standard retrieval and generation models in human evaluations.

show abstract

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Emily Dinan

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Adversarial NLI: A New Benchmark for Natural Language Understanding

Recipes for Building an Open-Domain Chatbot

The Second Conversational Intelligence Challenge (ConvAI2)

Recipes for building an open-domain chatbot

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Build it Break it Fix it for Dialogue Safety: Robustness from Adversarial Human Attack

Retrieve and Refine: Improved Sequence Generation Models For Dialogue

Contact Info

Product

Resources

About