Proceedings of the 28th International Conference on Computational Linguistics 2020
DOI: 10.18653/v1/2020.coling-main.285

Task-Aware Representation of Sentences for Generic Text Classification

Abstract: State-of-the-art approaches for text classification leverage a transformer architecture with a linear layer on top that outputs a class distribution for a given prediction problem. While effective, this approach suffers from conceptual limitations that affect its utility in few-shot or zero-shot transfer learning scenarios. First, the number of classes to predict needs to be pre-defined. In a transfer learning setting, in which new classes are added to an already trained classifier, all information contained i…
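For context, the fixed-class setup the abstract criticizes looks roughly like the following. This is a minimal PyTorch sketch, assuming a BERT-style encoder from the transformers library; all names are illustrative, not the paper's code.

```python
import torch.nn as nn

class TransformerTextClassifier(nn.Module):
    """The standard setup described in the abstract: a transformer
    encoder with a linear layer on top that outputs a class distribution."""

    def __init__(self, encoder, hidden_size: int, num_classes: int):
        super().__init__()
        self.encoder = encoder  # e.g. a pre-trained BERT-style model
        # The output dimension is fixed at construction time: adding a new
        # class later means replacing (and retraining) this layer.
        self.head = nn.Linear(hidden_size, num_classes)

    def forward(self, input_ids, attention_mask):
        # Use the first ([CLS]) token's representation as the text embedding.
        hidden = self.encoder(input_ids, attention_mask=attention_mask).last_hidden_state
        return self.head(hidden[:, 0])  # logits over the pre-defined classes
```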

Cited by 36 publications (57 citation statements). References 18 publications.

“…We aim to design a personalization framework that is generalizable to arbitrary classification tasks without requiring modification to the model architecture. To that end, we draw inspiration from (Halder et al., 2020) and formulate the multiclass classification problem as a series of binary classification tasks:…”
Section: Universal Binary Classification (mentioning)
confidence: 99%
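The binary reformulation quoted above can be outlined as follows. This is an illustrative sketch, not the citing paper's actual code; `binary_model` and its `prob_true` method are hypothetical, standing in for a transformer with a single-logit head over a "<label> [SEP] <text>" input.

```python
def predict_multiclass(text, candidate_labels, binary_model):
    # Each label is scored as an independent binary decision, so labels
    # can be added or removed without changing the model architecture.
    scores = {
        label: binary_model.prob_true(f"{label} [SEP] {text}")
        for label in candidate_labels
    }
    # The multiclass prediction is the label judged most likely to apply.
    return max(scores, key=scores.get)
```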
“…We train for 50 epochs (unless noted otherwise) and report F1 scores on the test set. For hyper-parameters, we use a batch size of 16 and a learning rate of 0.02, following the standard in (Halder et al., 2020).…”
Section: Experimental Settings (mentioning)
confidence: 99%
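For reproduction, these hyper-parameters map onto Flair's trainer roughly as below. This is a sketch assuming Flair's TARSClassifier API (details vary across Flair versions); TREC_6 stands in as a placeholder corpus, not the corpus used in the citing paper.

```python
from flair.datasets import TREC_6
from flair.models import TARSClassifier
from flair.trainers import ModelTrainer

# Pre-trained TARS model released alongside Halder et al. (2020).
tars = TARSClassifier.load("tars-base")

# TREC_6 is a placeholder; substitute the corpus under study.
corpus = TREC_6()
label_dict = corpus.make_label_dictionary(label_type="question_class")

# Register the task so TARS can verbalize the label names.
tars.add_and_switch_to_new_task(
    task_name="question classification",
    label_dictionary=label_dict,
    label_type="question_class",
)

# The hyper-parameters quoted above: learning rate 0.02, batch size 16,
# 50 training epochs.
trainer = ModelTrainer(tars, corpus)
trainer.train(
    base_path="resources/taggers/tars_trec",
    learning_rate=0.02,
    mini_batch_size=16,
    max_epochs=50,
)
```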
“…We only require the names of classes rather than manually constructed prompts or templates to convey label semantics. TARS (Halder et al., 2020) also leverages pre-trained language models and label semantics based on binary text classification. However, our method further strengthens generalization ability via a meta-learning framework, especially in cross-domain and fine-grained cases.…”
Section: Using Label Information For Text Classification (mentioning)
confidence: 99%
“…In the manual labeling process, we achieve 77% agreement between two trained annotators and resolve conflicts with a third annotator (the main author of this paper). To classify documents, we use a few-shot learning model based on Task-Aware Representation of Sentences (Halder et al., 2020), implemented in flairNLP (Akbik et al., 2019). We achieve 80.8% accuracy on the holdout set.…”
Section: Classification Of Document Function (mentioning)
confidence: 99%
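Applying such a TARS model in flairNLP typically looks like the sketch below. The candidate labels and sentence are illustrative placeholders, not the citing paper's actual document-function classes.

```python
from flair.data import Sentence
from flair.models import TARSClassifier

# Pre-trained TARS model distributed with Flair.
tars = TARSClassifier.load("tars-base")

sentence = Sentence("The methods section details the sampling procedure.")

# Zero-shot prediction: only the class names are supplied; few-shot use
# additionally fine-tunes on a handful of labeled examples, as above.
tars.predict_zero_shot(sentence, ["introduction", "methods", "results"])
print(sentence.labels)
```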