Loïc Barrault scite author profile

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, we show how universal sentence representations trained using the supervised data of the Stanford Natural Language Inference datasets can consistently outperform unsupervised methods like SkipThought vectors (Kiros et al., 2015) on a wide range of transfer tasks. Much like how computer vision uses ImageNet to obtain features, which can then be transferred to other tasks, our work tends to indicate the suitability of natural language inference for transfer learning to other NLP tasks. Our encoder is publicly available 1 .

show abstract

What you can cram into a single $&!#* vector: Probing sentence embeddings for linguistic properties

Conneau¹,

et al. 2018

View full text Add to dashboard Cite

Although much effort has recently been devoted to training high-quality sentence embeddings, we still have a poor understanding of what they are capturing. "Downstream" tasks, often based on sentence classification, are commonly used to evaluate the quality of sentence representations. The complexity of the tasks makes it however difficult to infer what kind of information is present in the representations. We introduce here 10 probing tasks designed to capture simple linguistic features of sentences, and we use them to study embeddings generated by three different encoders trained in eight distinct ways, uncovering intriguing properties of both encoders and training methods.

show abstract

Very Deep Convolutional Networks for Text Classification

Conneau¹,

Schwenk²,

Barrault³

et al. 2017

773

579

View full text Add to dashboard Cite

The dominant approach for many NLP tasks are recurrent neural networks, in particular LSTMs, and convolutional neural networks. However, these architectures are rather shallow in comparison to the deep convolutional networks which have pushed the state-of-the-art in computer vision. We present a new architecture (VD-CNN) for text processing which operates directly at the character level and uses only small convolutions and pooling operations. We are able to show that the performance of this model increases with the depth: using up to 29 convolutional layers, we report improvements over the state-ofthe-art on several public text classification tasks. To the best of our knowledge, this is the first time that very deep convolutional nets have been applied to text processing.

show abstract

Findings of the 2019 Conference on Machine Translation (WMT19)

Barrault¹,

Bojar²,

Costa-jussà³

et al. 2019

296

261

View full text Add to dashboard Cite

This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2019. Participants were asked to build machine translation systems for any of 18 language pairs, to be evaluated on a test set of news stories. The main metric for this task is human judgment of translation quality. The task was also opened up to additional test suites to probe specific aspects of translation.

show abstract

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Conneau¹,

Schwenk²,

Barrault³

et al. 2017

Preprint

140

243

View full text Add to dashboard Cite

Very Deep Convolutional Networks for Text Classification

Conneau¹,

Schwenk²,

Barrault³

et al. 2016

Preprint

177

View full text Add to dashboard Cite

Findings of the Third Shared Task on Multimodal Machine Translation

Barrault¹,

Bougares²,

Specia³

et al. 2018

113

115

View full text Add to dashboard Cite

We present the results from the third shared task on multimodal machine translation. In this task a source sentence in English is supplemented by an image and participating systems are required to generate a translation for such a sentence into German, French or Czech. The image can be used in addition to (or instead of) the source sentence. This year the task was extended with a third target language (Czech) and a new test set. In addition, a variant of this task was introduced with its own test set where the source sentence is given in multiple languages: English, French and German, and participating systems are required to generate a translation in Czech. Seven teams submitted 45 different systems to the two variants of the task. Compared to last year, the performance of the multimodal submissions improved, but text-only systems remain competitive.

show abstract

Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description

Elliott¹,

Frank²,

Barrault³

et al. 2017

134

104

View full text Add to dashboard Cite

We present the results from the second shared task on multimodal machine translation and multilingual image description. Nine teams submitted 19 systems to two tasks. The multimodal translation task, in which the source sentence is supplemented by an image, was extended with a new language (French) and two new test sets. The multilingual image description task was changed such that at test time, only the image is given. Compared to last year, multimodal systems improved, but text-only systems remain competitive.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Loïc Barrault

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

What you can cram into a single $&!#* vector: Probing sentence embeddings for linguistic properties

Very Deep Convolutional Networks for Text Classification

Findings of the 2019 Conference on Machine Translation (WMT19)

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Very Deep Convolutional Networks for Text Classification

Findings of the Third Shared Task on Multimodal Machine Translation

Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description

Contact Info

Product

Resources

About