Douwe Kiela scite author profile

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, we show how universal sentence representations trained using the supervised data of the Stanford Natural Language Inference datasets can consistently outperform unsupervised methods like SkipThought vectors (Kiros et al., 2015) on a wide range of transfer tasks. Much like how computer vision uses ImageNet to obtain features, which can then be transferred to other tasks, our work tends to indicate the suitability of natural language inference for transfer learning to other NLP tasks. Our encoder is publicly available 1 .

show abstract

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Zhang¹,

Dinan²,

Urbanek³

et al. 2018

860

1,056

View full text Add to dashboard Cite

Chit-chat models are known to have several problems: they lack specificity, do not display a consistent personality and are often not very captivating. In this work we present the task of making chit-chat more engaging by conditioning on profile information. We collect data and train models to (i) condition on their given profile information; and (ii) information about the person they are talking to, resulting in improved dialogues, as measured by next utterance prediction. Since (ii) is initially unknown, our model is trained to engage its partner with personal topics, and we show the resulting dialogue can be used to predict profile information about the interlocutors.

show abstract

Adversarial NLI: A New Benchmark for Natural Language Understanding

Nie¹,

Williams²,

Dinan³

et al. 2020

392

462

View full text Add to dashboard Cite

We introduce a new large-scale NLI benchmark dataset, collected via an iterative, adversarial human-and-model-in-the-loop procedure. We show that training models on this new dataset leads to state-of-the-art performance on a variety of popular NLI benchmarks, while posing a more difficult challenge with its new test set. Our analysis sheds light on the shortcomings of current state-of-theart models, and shows that non-expert annotators are successful at finding their weaknesses. The data collection method can be applied in a never-ending learning scenario, becoming a moving target for NLU, rather than a static benchmark that will quickly saturate.

show abstract

What makes a good conversation? How controllable attributes affect human judgments

See¹,

Roller²,

Kiela³

et al. 2019

187

278

View full text Add to dashboard Cite

A good conversation requires balance -between simplicity and detail; staying on topic and changing it; asking questions and answering them. Although dialogue agents are commonly evaluated via human judgments of overall quality, the relationship between quality and these individual factors is less well-studied. In this work, we examine two controllable neural text generation methods, conditional training and weighted decoding, in order to control four important attributes for chitchat dialogue: repetition, specificity, response-relatedness and question-asking. We conduct a large-scale human evaluation to measure the effect of these control parameters on multi-turn interactive conversations on the PersonaChat task. We provide a detailed analysis of their relationship to high-level aspects of conversation, and show that by controlling combinations of these variables our models obtain clear improvements in human quality judgments.

show abstract

The Second Conversational Intelligence Challenge (ConvAI2)

et al. 2019

View full text Add to dashboard Cite

We describe the setting and results of the ConvAI2 NeurIPS competition that aims to further the state-of-the-art in open-domain chatbots. Some key takeaways from the competition are: (i) pretrained Transformer variants are currently the best performing models on this task, (ii) but to improve performance on multi-turn conversations with humans, future systems must go beyond single word metrics like perplexity to measure the performance across sequences of utterances (conversations) in terms of repetition, consistency and balance of dialogue acts (e.g. how many questions asked vs. answered).

show abstract

Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics

Kiela¹,

Bottou²

2014

178

206

View full text Add to dashboard Cite

We construct multi-modal concept representations by concatenating a skip-gram linguistic representation vector with a visual concept representation vector computed using the feature extraction layers of a deep convolutional neural network (CNN) trained on a large labeled object recognition dataset. This transfer learning approach brings a clear performance gain over features based on the traditional bag-of-visual-word approach. Experimental results are reported on the WordSim353 and MEN semantic relatedness evaluation tasks. We use visual features computed using either ImageNet or ESP Game images.

show abstract

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Zhang

Dinan

Urbanek

et al. 2018

Preprint

150

149

View full text Add to dashboard Cite

Dynamic Meta-Embeddings for Improved Sentence Representations

2018

View full text Add to dashboard Cite

While one of the first steps in many NLP systems is selecting what pre-trained word embeddings to use, we argue that such a step is better left for neural networks to figure out by themselves. To that end, we introduce dynamic meta-embeddings, a simple yet effective method for the supervised learning of embedding ensembles, which leads to stateof-the-art performance within the same model class on a variety of tasks. We subsequently show how the technique can be used to shed new light on the usage of word embeddings in NLP systems.• Multi-domain Standard word embeddings

show abstract

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Douwe Kiela

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Adversarial NLI: A New Benchmark for Natural Language Understanding

What makes a good conversation? How controllable attributes affect human judgments

The Second Conversational Intelligence Challenge (ConvAI2)

Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics

Personalizing Dialogue Agents: I have a dog, do you have pets too?

Dynamic Meta-Embeddings for Improved Sentence Representations

Contact Info

Product

Resources

About