Matthias Gallé scite author profile

Performance drop due to domain-shift is an endemic problem for NLP models in production. This problem creates an urge to continuously annotate evaluation datasets to measure the expected drop in the model performance which can be prohibitively expensive and slow. In this paper we study the problem of predicting the performance drop of modern NLP models under domain-shift, in the absence of any target domain labels. We investigate three families of methods (H-divergence, reverse classification accuracy and confidence measures), show how they can be used to predict the performance drop and study their robustness to adversarial domain-shifts. Our results on sentiment classification and sequence labeling show that our method is able to predict performance drops with an error rate as low as 2.15% and 0.89% for sentiment analysis and POS tagging respectively.

show abstract

Monolingual Adapters for Zero-Shot Neural Machine Translation

Philip¹,

Bérard²,

Gallé³

et al. 2020

View full text Add to dashboard Cite

We propose a novel adapter layer formalism for adapting multilingual models. They are more parameter-efficient than existing adapter layers while obtaining as good or better performance. The layers are specific to one language (as opposed to bilingual adapters) allowing to compose them and generalize to unseen language-pairs. In this zero-shot setting, they obtain a median improvement of +2.77 BLEU points over a strong 20-language multilingual Transformer baseline trained on TED talks. * Work done during an internship at NAVER LABS Europe.

show abstract

Unsupervised Aspect-Based Multi-Document Abstractive Summarization

Coavoux¹,

Elsahar²,

Gallé³

2019

View full text Add to dashboard Cite

User-generated reviews of products or services provide valuable information to customers. However, it is often impossible to read each of the potentially thousands of reviews: it would therefore save valuable time to provide short summaries of their contents. We address opinion summarization, a multi-document summarization task, with an unsupervised abstractive summarization neural system. Our system is based on (i) a language model that is meant to encode reviews to a vector space, and to generate fluent sentences from the same vector space (ii) a clustering step that groups together reviews about the same aspects and allows the system to generate summary sentences focused on these aspects. Our experiments on the Oposum dataset empirically show the importance of the clustering step.

show abstract

Self-Supervised and Controlled Multi-Document Opinion Summarization

Elsahar¹,

Coavoux²,

Rozen³

et al. 2021

View full text Add to dashboard Cite

We address the problem of unsupervised abstractive summarization of collections of user generated reviews through self-supervision and control. We propose a self-supervised setup that considers an individual document as a target summary for a set of similar documents. This setting makes training simpler than previous approaches by relying only on standard log-likelihood loss and mainstream models. We address the problem of hallucinations through the use of control codes, to steer the generation towards more coherent and relevant summaries. Our benchmarks on two English datasets against graph-based and recent neural abstractive unsupervised models show that our proposed method generates summaries with a superior quality and relevance, as well as a high sentiment and topic alignment with the input reviews. This is confirmed in our human evaluation which focuses explicitly on the faithfulness of generated summaries. We also provide an ablation study showing the importance of the control setup in controlling hallucinations.

show abstract

Searching for smallest grammars on large sequences and application to DNA

Carrascosa

Coste

Gallé

et al. 2012

Journal of Discrete Algorithms

View full text Add to dashboard Cite

Investigating the Effectiveness of BPE: The Power of Shorter Sequences

Gallé¹

2019

View full text Add to dashboard Cite

Byte-Pair Encoding (BPE) is an unsupervised sub-word tokenization technique, commonly used in neural machine translation and other NLP tasks. Its effectiveness makes it a de facto standard, but the reasons for this are not well understood. We link BPE to the broader family of dictionary-based compression algorithms and compare it with other members of this family. Our experiments across datasets, language pairs, translation models, and vocabulary size show that-given a fixed vocabulary size budget-the fewer tokens an algorithm needs to cover the test set, the better the translation (as measured by BLEU).

show abstract

Choosing Word Occurrences for the Smallest Grammar Problem

Carrascosa

Coste

Gallé

et al. 2010

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Matthias Gallé

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

To Annotate or Not? Predicting Performance Drop under Domain Shift

Monolingual Adapters for Zero-Shot Neural Machine Translation

Unsupervised Aspect-Based Multi-Document Abstractive Summarization

Self-Supervised and Controlled Multi-Document Opinion Summarization

Searching for smallest grammars on large sequences and application to DNA

Investigating the Effectiveness of BPE: The Power of Shorter Sequences

Choosing Word Occurrences for the Smallest Grammar Problem

Contact Info

Product

Resources

About