Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Naik, Aakanksha; Lehman, Jill Fain; Rosé, Carolyn P.

doi:10.1162/tacl_a_00500

Cited by 2 publications

(1 citation statement)

References 70 publications

(80 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…While we do not test whether models can memorize long-tail knowledge, we instead test whether models can process long-tail sentences. Naik et al (2022) note that it is challenging to catalogue and evaluate generalization along micro-level dimensions and instead propose benchmarks that vary along macro-level dimensions (such as the language and domain) as a proxy. We hypothesize that LMs learn which micro-level phenomena are rare, as this would improve their overall language modeling objective.…”

Section: Related Workmentioning

confidence: 99%

Benchmarking Long-tail Generalization with Likelihood Splits

Godbole,

Jia

2023

Findings of the Association for Computational Linguistics: EACL 2023

View full text Add to dashboard Cite

In order to reliably process natural language, NLP systems must generalize to the long tail of rare utterances. We propose a method to create challenging benchmarks that require generalizing to the tail of the distribution by re-splitting existing datasets. We create 'Likelihood Splits' where examples that are assigned lower likelihood by a pre-trained language model (LM) are placed in the test set, and more likely examples are in the training set. This simple approach can be customized to construct meaningful traintest splits for a wide range of tasks. Likelihood Splits surface more challenges than random splits: relative error rates of state-of-the-art models increase by 59% for semantic parsing on SPIDER, 93% for natural language inference on SNLI, and 33% for yes/no question answering on BOOLQ, on our splits compared with the corresponding random splits. Moreover, Likelihood Splits create fairer benchmarks than adversarial filtering; when the LM used to create the splits is also employed as the task model, our splits do not unfairly penalize the LM.

show abstract

Section: Related Workmentioning

confidence: 99%

Benchmarking Long-tail Generalization with Likelihood Splits

Godbole,

Jia

2023

Findings of the Association for Computational Linguistics: EACL 2023

View full text Add to dashboard Cite

show abstract

General then Personal: Decoupling and Pre-training for Personalized Headline Generation

Song,

Chen,

Wang

et al. 2023

Transactions of the Association for Computational Linguistics

View full text Add to dashboard Cite

Personalized Headline Generation aims to generate unique headlines tailored to users’ browsing history. In this task, understanding user preferences from click history and incorporating them into headline generation pose challenges. Existing approaches typically rely on predefined styles as control codes, but personal style lacks explicit definition or enumeration, making it difficult to leverage traditional techniques. To tackle these challenges, we propose General Then Personal (GTP), a novel framework comprising user modeling, headline generation, and customization. We train the framework using tailored designs that emphasize two central ideas: (a) task decoupling and (b) model pre-training. With the decoupling mechanism separating the task into generation and customization, two mechanisms, i.e., information self-boosting and mask user modeling, are further introduced to facilitate the training and text control. Additionally, we introduce a new evaluation metric to address existing limitations. Extensive experiments conducted on the PENS dataset, considering both zero-shot and few-shot scenarios, demonstrate that GTP outperforms state-of-the-art methods. Furthermore, ablation studies and analysis emphasize the significance of decoupling and pre-training. Finally, the human evaluation validates the effectiveness of our approaches.1

show abstract

Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Cited by 2 publications

References 70 publications

Benchmarking Long-tail Generalization with Likelihood Splits

Benchmarking Long-tail Generalization with Likelihood Splits

General then Personal: Decoupling and Pre-training for Personalized Headline Generation

Contact Info

Product

Resources

About